DevOps Engineer (Cloud AIaaS)

TLDR

Design and maintain infrastructure for scalable AI inference workloads, collaborating with machine learning engineers to enhance system performance and integrate advanced technologies.

As a DevOps Engineer, you will be responsible for designing, deploying, and maintaining infrastructure and services that enable scalable and secure AI inference workloads on-premises.

What You’ll Do

  • Design, develop, and maintain infrastructure for AI inference workloads, including GPU scheduling, model deployment pipelines, and data access patterns in on-prem environments
  • Build and manage monitoring and observability tools for AI inference platforms, including dashboards, alerts, and runbooks for model health and system performance
  • Collaborate with ML engineers and platform teams to design system architecture for AI workloads, integrate inference runtimes, and test performance at scale

What We’re Looking For

Hands-on experience in:

  • Containerization and Container Orchestration: Kubernetes, Helm, Docker/CRI-O
  • Linux and networking
  • Programming and Scripting: Python/Go/Bash
  • Infrastructure as Code (IaC) approach: Ansible, Terraform
  • Creating CI/CD pipelines: GitLab CI/CD, GitHub Actions
  • Experience with Cluster API or any other "Kubeception" technology
  • Deep experience with Kubernetes CNI, CSI, and Operators

Nice to Have

  • Knowledge of Kubernetes-related technologies such as Argo CD and Helmfile
  • Experience with the Prometheus stack
  • Experience with other Cloud Native technologies

Benefits 

At Gcore, we want you to do your best work and enjoy the journey. Our benefits are designed to support your growth, well-being, and life beyond work: 

  • Competitive compensation 
  • Flexible working hours and hybrid or remote options, depending on your role 
  • Work from anywhere in the world for up to 45 days per year 
  • Private medical insurance for you and your family* 
  • Extra paid vacation and sick leave days* 
  • Support for life’s important moments and celebrations 
  • Language courses to help you connect and grow 
  • Modern, welcoming offices with snacks, drinks, and entertainment* 
  • Team sports and social activities* 

*Benefits may vary depending on your location. 

Equal Opportunity Employer 

We provide equal opportunity to all applicants without regard to race, color, religion, sex, sexual orientation, age, gender identity, gender expression, national origin, disability, or any other legally protected characteristics. 

Gcore builds a comprehensive infrastructure and software suite that powers the digital experiences of AI, cloud, network, and security services. Aimed at global enterprises, its solutions enhance everything from real-time communications to secure web applications. With a focus on reliability and performance, Gcore stands out as a go-to partner for businesses navigating the demands of the digital landscape.
