This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior SRE DevOps Engineer. In this role, you will have the opportunity to contribute to the development of a satellite communication platform that enables vital voice calls and messaging when traditional connectivity fails. Your expertise will play a crucial part in ensuring reliability and enhancing operational efficiency across our cloud infrastructure. You'll collaborate with a dedicated team to implement innovative solutions that address the complexities of real-time applications and of bridging mobile devices with satellite hardware. This is a remote position that offers flexibility and a chance to make a significant impact in the field of cloud and DevOps engineering.
Accountabilities
Implement SLI/SLO frameworks with error budgets to drive reliability decisions
Design release strategies including blue/green deployments and version tracking
Lead incident response and develop automated runbooks to reduce MTTR
Develop tooling and automation frameworks in TypeScript/Python for enhanced productivity
Write services focused on reliability, such as health checkers and auto-remediation controllers
Maintain production AWS infrastructure using IaC with a focus on microservices orchestration
Establish CI/CD pipelines for backend services and mobile apps
Define and enforce security policies across the infrastructure
Build observability features with OpenTelemetry and distributed tracing
Manage database configurations including PostgreSQL and Redis
Requirements
7+ years of experience in SRE/DevOps/Platform Engineering with a strong software background
Proficient in at least one backend language (TypeScript/Node.js, Python, or Go)
Deep expertise in AWS technologies including ECS, EKS, and RDS
Strong experience with IaC tools like Terraform or CloudFormation
Proven track record in CI/CD pipeline design for both on-prem and cloud environments
Experience in container orchestration with Docker and Kubernetes
Solid understanding of network security and incident response
Experience implementing SLI/SLO frameworks and MTTR reduction strategies
Operations knowledge for PostgreSQL, Redis, and message queues
Strong understanding of distributed systems patterns
Benefits
Build critical communication infrastructure for remote areas
A role merging engineering and operations with significant ownership
Technically challenging environment across cloud, IoT, and satellite systems
Full ownership of infrastructure with direct impact on reliability
Competitive compensation and flexible remote work options
Why Apply Through Jobgether?
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.