Full Stack Engineer (Serverless)

AI overview

As a Full Stack Engineer, you'll build scalable systems for fal's rapidly growing Serverless platform, impacting thousands of enterprise customers with your technical ownership.

fal is building the fastest and most scalable infrastructure for AI inference. Fal Serverless powers 1,300+ endpoints on the fal Marketplace and handles tens of millions of requests per day across production workloads.

Enterprises use fal Serverless to deploy, operate, and scale custom AI models without managing infrastructure themselves. Autoscaling, observability, and operational complexity are handled end-to-end by fal’s platform and UI.

Serverless began as internal infrastructure built to support fal’s own scale and was released publicly to enterprise customers in early 2025. It is now a core, revenue-driving product with rapidly growing adoption.

fal is one of the fastest-growing AI startups, reaching Series D at a $4.5B valuation with a lean team of ~70 employees. You’ll be joining early, with meaningful ownership and direct impact on a foundational product.

 

About this role:

As a Full Stack Engineer on Serverless, you will build the core product across frontend and backend that powers fal’s Serverless platform. This is a deeply product-focused role. You will work side-by-side with Product and Infrastructure to design and ship reusable, scalable systems that enterprise customers rely on in production every day.

You will be a foundational technical owner of fal Serverless as it scales to thousands of enterprise customers, with real responsibility, autonomy, and impact. This is a chance to help build a new product vertical from the ground up inside a company that is already scaling at rocket-ship speed.

 

What you’ll work on:

  • Build and maintain core Serverless UI features (dashboards, logs, observability, configuration, usage)
  • Design and implement backend APIs that power the Serverless product experience
  • Improve performance, reliability, and scalability of customer-facing systems
  • Work closely with Infrastructure to ensure product features align with platform capabilities
  • Own features end-to-end, from design through production and iteration

What we’re looking for:

  • Strong experience working across both frontend and backend
  • Proficiency with TypeScript, Python, Postgres, and Next.js
  • Experience owning features end-to-end in production systems
    Ability to context switch between UI, backend, and performance work
  • Product-minded engineer who values clean abstractions and long-term maintainability
  • Comfortable working in a fast-moving, low-process environment

Nice to have:

  • Experience building developer platforms or infrastructure-adjacent products
  • Familiarity with observability tooling (logging, metrics, tracing) in production environments
  • Background in distributed systems, container orchestration, or cloud-native architectures
  • Experience with real-time systems, streaming logs, or high-throughput data pipelines
  • Exposure to technologies such as Kubernetes, Prometheus, Datadog, gRPC, or similar systems
  • Entrepreneurial mindset and strong ownership mentality

What we offer at fal:

  • Interesting and challenging work
  • Competitive salary and equity
  • A lot of learning and growth opportunities
  • We offer visa sponsorship and will help you relocate to San Francisco.
  • Health, dental, and vision insurance (US)
  • Regular team events and offsite

Compensation:

  • $150,000 - $230,000 + equity + comprehensive benefits package

Location:

  • We are currently hiring in downtown San Francisco.

Perks & Benefits Extracted with AI

  • Health Insurance: Health, dental, and vision insurance (US)
  • Visa Sponsorship: We offer visa sponsorship and will help you relocate to San Francisco.

In the modern era, content is shifting from being human-made and algorithm-distributed to being generated on demand - personalized in real time for every audience, context, and moment. We’re Fal, and we’re building the infrastructure powering this transformation. Our platform is the first of its kind: a generative media stack for developers that enables real-time, AI-generated content across image, video, and audio.   At the core is our serverless Python runtime, purpose-built to run massive ML models across thousands of GPUs with unmatched speed and efficiency. Applications built on Fal already serve millions of users - and we’re just getting started. Founded in 2021, we're scaling fast and backed by top investors including a16z, Bessemer, and Kindred. If you're an ambitious builder who wants to define the future of AI and media, we’d love to meet you.

View all jobs
Salary
$150,000 – $230,000 per year
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Full-Stack Engineer Q&A's
Report this job
Apply for this job