Lightning AI is hiring a

DevOps Engineer

Palo Alto, United States

Who We Are

Lightning AI is the company reimagining the way AI is built. After creating and releasing PyTorch Lightning in 2019, Lightning AI was launched to reshape the development of artificial intelligence products for commercial and academic use.

We are on a mission to simplify AI development, making it accessible to everyone—from solo researchers to large enterprises. By removing the complexity of building and deploying AI tools, we empower innovators to focus on solving real-world problems. Our platform is built to scale with the latest AI advancements while staying intuitive and adaptable, so you can bring your ideas to life.

Lightning AI has offices in New York City, Palo Alto, and London, and is backed by $58.6 million in funding from Coatue, Index Ventures, Bain Capital Ventures, and Firstminute.

Our Values

  • Move Fast: We act with speed and precision, breaking down big challenges into achievable steps.

  • Focus: We complete one goal at a time with care, collaborating as a team to deliver features with precision.

  • Balance: Sustained performance comes from rest and recovery. We ensure a healthy work-life balance to keep you at your best.

  • Craftsmanship: Innovation through excellence. Every detail matters, and we take pride in mastering our craft.

  • Minimal: Simplicity drives our innovation. We eliminate complexity through discipline and focus on what truly matters.

What We're Looking For

We are looking for an experienced DevOps Engineer to design, build, and maintain our cloud infrastructure and scale CI/CD pipelines, ensuring reliability and stability for our enterprise customers. With a primary focus on Golang, you'll play a key role in automating our deployment processes, monitoring system performance, and troubleshooting infrastructure issues.

As part of the Red Squad Team, you’ll work closely with our development teams and report to the Director of Product. This hybrid role is based in Palo Alto, with a two-day in-office requirement. The salary range is $120,000 - $215,000.

What You’ll Do

  • Design, build, and maintain scalable infrastructure for deploying, monitoring, and automating our cloud environments.
  • Collaborate closely with development teams to ensure seamless integration and delivery of new features.
  • Implement and manage CI/CD pipelines to improve deployment frequency and reduce manual intervention.
  • Monitor system performance, identify bottlenecks, and develop strategies to improve reliability and performance.
  • Ensure security best practices are followed across infrastructure and deployment processes.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Stay up to date with the latest industry trends and tools to drive innovation in DevOps practices.

What You’ll Need

  • Proven experience as a DevOps Engineer or in a similar role, with a deep understanding of cloud infrastructure (AWS, GCP, or Azure).
  • Expertise in CI/CD tools such as Jenkins, CircleCI, GitHub, or GitLab.
  • Ability to code in golang
  • Experience with infrastructure as code tools like Terraform, Ansible, or CloudFormation.
  • Familiarity with containerization technologies like Docker and Kubernetes.
  • Knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK stack.
  • A strong security mindset with experience in managing secure cloud environments.
  • Excellent problem-solving skills, attention to detail, and ability to work in a fast-paced, collaborative environment.

Benefits and Perks

We offer competitive base salaries and stock options with a 25% one year cliff and monthly vesting thereafter. For our international employees, we work with Velocity Global to pay you in your local currency and provide equitable benefits across the globe.

In the US, we offer:

  • 90% monthly premium contributions towards medical and 100% monthly premium contributions towards select dental and vision plans for you and your dependents
  • Life and AD&D insurance 
  • Flexible paid time off plus 1 week of winter closure
  • Generous paid family leave benefits
  • $500 monthly meal reimbursement, including groceries & food delivery services
  • $1,000 home office stipend
  • $1,000 annual learning & development stipend 
  • 100% Citibike membership (NYC only)
  • $45/month gym membership 
  • Additional various medical and mental health services

At Lightning AI, we are committed to fostering an inclusive and diverse workplace. We believe that diverse teams drive innovation and create better products. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other protected characteristic. We are dedicated to building a culture where everyone can thrive and contribute to their fullest potential.

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Operations Engineer Q&A's
Report this job
Apply for this job