Staff Software Engineer, Compute

AI overview

Contribute to the development and maintenance of a core Python platform that manages AI workloads and infrastructure while leveraging advanced tools like K8s and Terraform.

You are an experienced software engineer who thrives on building large scale computation platforms. You have deep expertise in backend systems that orchestrate workloads and route requests efficiently, while taking care of capacity and resource constraints. You possess a strong understanding of foundational cloud infrastructure and Linux provisioning and management tools. You know how to achieve reliability and scale with minimum operational load.

Key responsibilities

  • Develop and maintain our core Python platform, which handles routing of requests, orchestration of AI workloads, GPU server capacity management, observability, authentication, rate limiting, and many others

  • Develop and maintain our infrastructure layer where we use Terraform, Ansible, and provider APIs to manage our fleet of GPU workers

  • Own K8s, FluxCD, Nomad, Prometheus, Thanos, Grafana, Loki, distributed networking storage, and other technologies that underpin our platform

  • Create the vision and lay the foundation for where our infrastructure should go in the next 1/2/5 years

Requirements

  • Deep experience building distributed compute platforms, preferably with Python

  • Strong foundation in managing both cloud and bare metal infrastructure

  • Solid understanding of K8s and CI/CD on it

  • Excellent communication

  • Self-starter who executes quickly, takes ownership and constantly seeks improvement

Compensation

  • $180,000-250,000 plus equity

Location

  • San Francisco, CA

What we offer at fal

  • Interesting and challenging work

  • Employee-friendly equity terms (early exercise, extended exercise)

  • A lot of learning and growth opportunities

  • We are currently hiring in downtown San Francisco.

  • We offer visa sponsorship and will help you relocate to San Francisco.

  • Health, dental, and vision insurance (US)

  • Regular team events and offsites

Perks & Benefits Extracted with AI

  • Equity Compensation: Employee-friendly equity terms (early exercise, extended exercise)
  • Health Insurance: Health, dental, and vision insurance (US)
  • Visa Sponsorship: We offer visa sponsorship and will help you relocate to San Francisco.

In the modern era, content is shifting from being human-made and algorithm-distributed to being generated on demand - personalized in real time for every audience, context, and moment. We’re Fal, and we’re building the infrastructure powering this transformation. Our platform is the first of its kind: a generative media stack for developers that enables real-time, AI-generated content across image, video, and audio.   At the core is our serverless Python runtime, purpose-built to run massive ML models across thousands of GPUs with unmatched speed and efficiency. Applications built on Fal already serve millions of users - and we’re just getting started. Founded in 2021, we're scaling fast and backed by top investors including a16z, Bessemer, and Kindred. If you're an ambitious builder who wants to define the future of AI and media, we’d love to meet you.

View all jobs
Salary
$180,000 – $250,000 per year
Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Staff Software Engineer Q&A's
Report this job
Apply for this job