Forward Deployed Engineer

AI overview

Work directly with enterprise clients to deploy and scale AI workloads, translating their needs into actionable plans while driving product growth and fostering customer success.

fal is building the fastest and most scalable infrastructure for AI inference. Fal Serverless powers 1,300+ endpoints on the fal Marketplace and handles tens of millions of requests per day across production workloads. Enterprises use fal Serverless to deploy, operate, and scale custom AI models without managing infrastructure themselves.

Autoscaling, observability, and operational complexity are handled end-to-end by fal’s platform and UI. Serverless began as internal infrastructure built to support fal’s own scale and was released publicly to enterprise customers in early 2025. It is now a core, revenue-driving product with rapidly growing adoption.

fal is one of the fastest-growing AI startups, reaching Series D at a $4.5B valuation with a lean team of ~70 employees. You’ll be joining early, with meaningful ownership and direct impact on a foundational product.

About this role

As a Forward Deployed Engineer on Serverless, you will work directly with enterprise customers to help them deploy, scale, and operationalize their AI workloads on fal. This is a highly technical, customer-facing role where you’ll act as the bridge between Sales, Product and Infrastructure teams.

You’ll join customer calls, deeply understand their architecture and needs, and translate those into actionable implementation plans and product requirements. You will be responsible for unblocking customer deployments, accelerating onboarding, and ensuring enterprise accounts successfully reach production fast.

This is a role for someone who loves solving real-world engineering problems and wants direct ownership over outcomes that drive revenue and product growth.

What you’ll work on

  • Join enterprise onboarding calls and act as the technical owner for deployments
  • Help customers integrate their models into fal Serverless (APIs, scaling, observability, deployment workflows)
  • Debug customer issues end-to-end across frontend, backend, and infra layers
  • Translate customer feedback into clear product specs, tasks, and engineering priorities
  • Work closely with Product + Infra to ensure enterprise needs are shipped into the platform
  • Build custom proofs-of-concept or lightweight integrations to unblock adoption
  • Identify repeatable patterns across customers and turn them into reusable product features
  • Improve internal tooling, onboarding flows, and docs based on real customer pain points

What we’re looking for

  • Strong engineering background (Proficiency with TypeScript, Python, Postgres, and Next.js)
  • Experience working with customers in a technical capacity (Solutions Engineer, Forward Deployed Engineer, DevRel Engineer, or similar)
  • Comfortable jumping into ambiguous customer problems and finding solutions fast
  • Ability to understand complex systems and communicate clearly with both technical and non-technical stakeholders
  • Strong written communication skills (turning customer conversations into actionable specs/tasks)
  • Experience working across APIs, infrastructure, and cloud environments
  • High ownership mentality: you take responsibility for customer success end-to-end
  • Comfort operating in a fast-moving, low-process environment

Nice to have

  • Experience with serverless platforms, infra products, or developer platforms
  • Familiarity with observability tooling (logs, metrics, tracing)
  • Background in distributed systems, Kubernetes, or cloud-native deployments
  • Experience with AI/ML workloads in production
  • Experience writing documentation, onboarding guides, or customer playbooks

Why join

  • Own the success of fal’s most important enterprise deployments
  • Work on a product used at massive scale with real production workloads
  • Direct influence over product roadmap through customer feedback loops
  • High autonomy and visibility across Product, Infra, and Sales leadership
  • Be a foundational member of a rapidly growing product vertical
  • Work at one of the fastest-growing AI startups, helping shape a new category

What we offer at fal

  • Interesting and challenging work
  • Competitive salary and equity
  • A lot of learning and growth opportunities
  • We offer visa sponsorship and will help you relocate to San Francisco.
  • Health, dental, and vision insurance (US)
  • Regular team events and offsite

Compensation

  • $150,000 - $230,000 + equity + comprehensive benefits package

Location

  • We are currently hiring in downtown San Francisco.

Perks & Benefits Extracted with AI

  • Health Insurance: Health, dental, and vision insurance (US)
  • Visa Sponsorship: We offer visa sponsorship and will help you relocate to San Francisco.

In the modern era, content is shifting from being human-made and algorithm-distributed to being generated on demand - personalized in real time for every audience, context, and moment. We’re Fal, and we’re building the infrastructure powering this transformation. Our platform is the first of its kind: a generative media stack for developers that enables real-time, AI-generated content across image, video, and audio.   At the core is our serverless Python runtime, purpose-built to run massive ML models across thousands of GPUs with unmatched speed and efficiency. Applications built on Fal already serve millions of users - and we’re just getting started. Founded in 2021, we're scaling fast and backed by top investors including a16z, Bessemer, and Kindred. If you're an ambitious builder who wants to define the future of AI and media, we’d love to meet you.

View all jobs
Salary
$150,000 – $230,000 per year
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Forward Deployed Engineer Q&A's
Report this job
Apply for this job