Site Reliability Engineer (SRE)

AI overview

Leverage over 9 years of experience in Site Reliability Engineering to enhance system reliability and performance using advanced cloud and containerization technologies.

Role: Site Reliability Engineer (SRE)

Location: Miami FL – Onsite

Position Type: Contract

Required Skills & Qualifications

• 9+ years of experience in Site Reliability Engineering, DevOps, or similar role.

• Strong experience with Linux/Unix systems administration and troubleshooting.

• Proficiency in at least one scripting or programming language (e.g., Python, Go, Shell, Ruby).

• Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code tools like Terraform or CloudFormation.

• Expertise with CI/CD tools (e.g., Jenkins, GitLab CI, ArgoCD).

• Strong knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes).

• Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack, Datadog, New Relic).

• Understanding of networking, security best practices, and load balancing.

• Solid problem-solving and incident response skills.

Preferred Qualifications

• Experience with SRE principles: SLAs, SLOs, error budgets, and chaos engineering.

• Familiarity with service mesh technologies (e.g., Istio, Linkerd).

• Certification in a relevant cloud provider (e.g., AWS Certified DevOps Engineer).

Soft Skills

• Excellent communication and documentation skills.

• Strong analytical and problem-solving mindset.

• Ability to work collaboratively across teams and functions.

• Eagerness to learn and improve existing systems and processes.

Role: Site Reliability Engineer (SRE)

Location: Miami FL – Onsite

Position Type: Contract

Required Skills & Qualifications

• 9+ years of experience in Site Reliability Engineering, DevOps, or similar role.

• Strong experience with Linux/Unix systems administration and troubleshooting.

• Proficiency in at least one scripting or programming language (e.g., Python, Go, Shell, Ruby).

• Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code tools like Terraform or CloudFormation.

• Expertise with CI/CD tools (e.g., Jenkins, GitLab CI, ArgoCD).

• Strong knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes).

• Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack, Datadog, New Relic).

• Understanding of networking, security best practices, and load balancing.

• Solid problem-solving and incident response skills.

Preferred Qualifications

• Experience with SRE principles: SLAs, SLOs, error budgets, and chaos engineering.

• Familiarity with service mesh technologies (e.g., Istio, Linkerd).

• Certification in a relevant cloud provider (e.g., AWS Certified DevOps Engineer).

Soft Skills

• Excellent communication and documentation skills.

• Strong analytical and problem-solving mindset.

• Ability to work collaboratively across teams and functions.

• Eagerness to learn and improve existing systems and processes.

Role: Site Reliability Engineer (SRE)

Location: Miami FL – Onsite

Position Type: Contract

Required Skills & Qualifications

• 9+ years of experience in Site Reliability Engineering, DevOps, or similar role.

• Strong experience with Linux/Unix systems administration and troubleshooting.

• Proficiency in at least one scripting or programming language (e.g., Python, Go, Shell, Ruby).

• Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code tools like Terraform or CloudFormation.

• Expertise with CI/CD tools (e.g., Jenkins, GitLab CI, ArgoCD).

• Strong knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes).

• Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack, Datadog, New Relic).

• Understanding of networking, security best practices, and load balancing.

• Solid problem-solving and incident response skills.

Preferred Qualifications

• Experience with SRE principles: SLAs, SLOs, error budgets, and chaos engineering.

• Familiarity with service mesh technologies (e.g., Istio, Linkerd).

• Certification in a relevant cloud provider (e.g., AWS Certified DevOps Engineer).

Soft Skills

• Excellent communication and documentation skills.

• Strong analytical and problem-solving mindset.

• Ability to work collaboratively across teams and functions.

• Eagerness to learn and improve existing systems and processes.

Axiom is a global information technology, consulting and outsourcing company and services provider. Our IT solutions empower organizations and individuals throughout the world to maximize value and quality to succeed in today's challenging business environment. As a fast-growing new economy company, we focus our strengths to offer world-class solutions and services through the convergence of technology, innovation, expertise and experience. We provide software consulting, development and IT-enabled services to clients across the globe. We work towards delivering sustained value creation for customers, employees, industries and society at large. Core offerings include data warehousing, middleware development, product development and web-enablement of legacy applications in verticals like telecom, finance, healthcare, manufacturing, energy & utilities, retail & distribution, enablement of legacy Relentless exploration of technology horizons and a Global Delivery Model that is a judicious combination of onsite, offsite and offshore development, offer a complete range of high-ROI business solutions spanning the consulting, technology, operations and process outsourcing value chain.

View all jobs
Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job