We are looking for a Senior SRE with a focus on Observability and Performance to work in a critical mission and high-volume environment in the financial segment, directly contributing to observability, monitoring, and application performance initiatives for the card platform.
🎯 Main Activities
• Work on the evolution of observability and monitoring of critical applications;
• Implement and enhance application instrumentation;
• Perform performance analysis and troubleshooting in production;
• Identify bottlenecks and propose architecture and infrastructure improvements;
• Support development teams in optimizing applications and services;
• Participate in resolving critical incidents and root cause analysis;
• Contribute to the continuous evolution of observability practices and performance engineering.
✅ What we are looking for
🔹 Observability and APM
• Practical experience with observability and application instrumentation;
• Creation and analysis of custom metrics;
• Structuring and analysis of logs;
• Distributed tracing using OpenTelemetry or similar tools;
• Experience with APM platforms such as Dynatrace, AppDynamics, New Relic, or equivalents;
• Work in instrumentation, monitoring adjustments, and troubleshooting with APM tools.
🔹 Cloud-Native Architecture
• Experience with microservices-based architectures;
• Development and maintenance of REST APIs;
• Knowledge in messaging and event streaming (Kafka, Pub/Sub, or similar);
• Experience with asynchronous workloads and batch processing.
🔹 Performance Engineering
• Diagnosis and mitigation of performance issues in production;
• Identification of bottlenecks;
• Analysis of CPU, memory, I/O, and infrastructure resource consumption;
• Knowledge of latency, throughput, and behavior under load concepts;
• Evaluation of impacts caused by external dependencies.
🔹 Tuning and Optimization
• SQL query optimization;
• Connection management and tuning;
• Improvement of computational resource utilization;
• Implementation of best practices focused on scalability and operational efficiency.
🔹 Cloud and Automation
• Experience with AWS, Azure, or GCP;
• Knowledge in autoscaling and troubleshooting in cloud environments;
• Experience with observability stacks;
• Experience with CI/CD pipelines;
• Automation of processes related to observability, performance testing, and continuous validations.
⭐ Differentiator
• Experience in high operational criticality environments;
• Experience in financial institutions, payment methods, or card platforms;
• Work in environments with high transactional volume.
🌎 Work Model: Remote
If you have experience in high-availability environments, observability, and critical application performance, we want to know your profile!