Senior Site Reliability Engineer

TL;DR

Design, develop, and maintain scalable software solutions in Azure Cloud while mentoring junior engineers and leading technical projects to enhance software development practices.

Important Information 

Location: Costa Rica

Work Mode: Hybrid

Job Summary 

As a Senior Site Reliability Engineer (19245), you will be responsible for designing, developing, and maintaining high-quality software solutions. You will collaborate with cross-functional teams to understand business requirements and translate them into scalable and efficient software applications. Your role will involve leading technical projects, mentoring junior engineers, and continuously improving software development practices to ensure the delivery of robust and reliable software systems.  

Responsibilities and Duties 

  • Design, implement, and maintain highly available, resilient, and scalable systems in Azure Cloud, ensuring reliability and operational continuity
  • Define and manage SLIs, SLOs, monitoring, alerting, and incident response processes to guarantee system performance and uptime
  • Automate infrastructure and operational processes using Terraform and CI/CD practices to reduce manual effort and improve efficiency
  • Collaborate with development and business teams to improve system architecture, perform capacity planning, and drive continuous improvement initiatives

Qualifications and Skills

  • 5+ years of experience in IT environments and 3+ years in SRE, DevOps, or similar roles
  • Strong knowledge of SRE principles, reliability engineering, and operational best practices
  • Hands-on experience with Microsoft Azure, including cloud architecture, networking, and security fundamentals
  • Proficiency in Terraform and Infrastructure as Code practices
  • Experience with CI/CD pipelines and DevOps processes
  • Strong knowledge of containerization and orchestration technologies such as Docker and Kubernetes
  • Experience with monitoring and observability tools such as Prometheus, Grafana, Datadog, or Azure Dashboards
  • Experience with GitHub, GitHub Flow, and GitHub Actions
  • Strong understanding of networking concepts and protocols
  • Excellent problem-solving skills, critical thinking, and ability to work in fast-paced environments

About Encora 
 
Encora is a global company that offers Software and Digital Engineering solutions. Our practices include Cloud Services, Product Engineering & Application Modernization, Data & Analytics, Digital Experience & Design Services, DevSecOps, Cybersecurity, Quality Engineering, AI & LLM Engineering, among others.  

At Encora, we hire professionals based solely on their skills and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.

Encora provides tailored software engineering and digital product development solutions for fast-growing technology companies. With a global team of over 9,000 experts, we specialize in a wide range of practices, including cloud services, product engineering, and AI engineering, making us a trusted partner for enterprises looking to innovate and modernize their digital infrastructure.
