Site Reliability Engineer

New York , United States

AI overview

Design and develop core systems managing hundreds of thousands of AI applications, while collaborating with top engineers to innovate AI agent architectures.

Superblocks is reimagining software development for a billion builders. Our mission is to help every team build, deploy, and manage AI-powered software with full control and flexibility.

Why Join Us?

We’re one of the fastest-growing AI startups, backed by top-tier investors and widely adopted by companies like Instacart, Sofi, Betterment, and Carrier. Our team comes from Uber, Stripe, Datadog, Confluent, Elastic, and Google, and has founded/architected systems like Kafka, Kibana, Debezium, Datadog APM, and more.

Since launching Clark, our AI builder, the response has been overwhelming with strong adoption from enterprises across different industries.

We’re fully in-person at our NYC HQ near Union Square and are looking for exceptional engineers who are passionate about creating great products.

The Role

You’ll play a key role in designing and developing the core systems that power and manage hundreds of thousands of AI applications. If you're interested in building and operating complex infrastructure in production, innovating new AI agent architectures, and building with some of the sharpest engineers, this is the place for you.

Responsibilities:

  • Architect and operate scalable production systems supporting both multi-tenant cloud and on-premise deployments.
  • Design and develop a real-time distributed execution engine that powers all AI applications, workflows, and agents.
  • Build, deploy, and optimize AI agent architecture, guardrails and evals.
  • Partner with product and customers to define the roadmap and bring new builder and AI experiences to life

Must haves:

  • 3+ years of experience managing cloud-based production apps with deep knowledge of containers, VMs, caches, task queues, networking, and OS.
  • Designed and deployed infrastructure in production at scale with containerized solutions like Docker, Kubernetes (k8s), ECS/EKS, Firecracker etc.
  • Strong product sense focused on great user experiences and strategic thinking to meet market and customer needs.

Nice to haves:

  • Built and operated production AI systems and are familiar with AI inference techniques
  • Optimized language runtimes and enable cross-language integration (e.g., Go, Python, C), including customizing or building WASM compilers and runtimes.
  • Experience with machine learning algorithms, platforms, and frameworks like PyTorch and Tensorflow.

Compensation

The base salary ranges between $175,000–$225,000+ USD, plus a generous equity package and benefits. Final comp will be based on experience and skills.

 

If you’re excited to build the core infrastructure powering the next billion AI-powered apps, let’s talk.

Superblocks is the AWS for internal tooling, providing developers with a low-code platform to build custom internal apps quickly and efficiently.

View all jobs
Salary
$175,000 – $225,000 per year
Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job