DevOps Automation & Observability Engineer

AI overview

Collaborate closely with cross-functional teams to design and optimize cloud-native infrastructure using cutting-edge tools across AWS and Azure, supporting next-gen AI/ML workloads.

We are looking for experienced DevOps / Cloud Platform Engineer to design, automate, and optimize modern cloud-native infrastructures across AWS, Azure, and GCP. You will work with Kubernetes, Terraform, CI/CD, observability tools, and cloud services to build secure, scalable, and production-ready platforms. In this role, you’ll collaborate closely with development and cross-functional teams to improve delivery workflows, reliability, and performance, while supporting next-generation AI/ML workloads and contributing to best practices and automation across the organization.



What Will You Do

  • Design, build, and maintain cloud-native infrastructures across AWS, Azure, and (optionally) GCP.
  • Implement scalable, secure, and highly available systems using Kubernetes, Terraform, and CI/CD pipelines.
  • Automate cloud provisioning and deployments, improve platform reliability, and ensure cost and performance optimization.
  • Integrate observability tools (Datadog, Grafana, Prometheus, Splunk) into applications and support teams in monitoring and troubleshooting.
  • Collaborate with developers, QA, and cross-functional teams to enable DevOps practices, streamline workflows, and improve delivery processes.
  • Support AI/ML workloads by designing infrastructure for training, inference, and MLOps pipelines (SageMaker, Azure ML, Vertex AI).
  • Maintain documentation, build self-service DevOps tools, and contribute to platform best practices.


Who You Are

  • You have 4+ years of experience in DevOps, SRE, or cloud platform engineering.
  • Strong expertise in AWS or Azure cloud architectures, networking, and security.
  • Skilled in Kubernetes (EKS/AKS), Docker, Helm, and modern infrastructure-as-code (Terraform).
  • Solid understanding of Linux systems, distributed systems, and scalable architecture design.
  • Hands-on experience with CI/CD tools (Jenkins, GitHub Actions, Azure DevOps) and GitOps (ArgoCD).
  • Comfortable with observability tooling (Datadog, Splunk, Prometheus, Grafana).
  • Experience with AI/ML platforms or ML-driven workloads is a strong plus.
  • You work well with cross-functional teams, communicate clearly, and enjoy building reliable, automated, developer-friendly platforms.


We appreciate the interest of all applicants. Please note that only those whose qualifications align closely with the position requirements will be contacted for the next steps in the selection process.


All applications will be handled with confidentiality. 


⋮IWConnect's Privacy Statement for Job Applicants


Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Engineer Q&A's
Report this job
Apply for this job