Site Reliability Engineer

AI overview

Contribute to designing and enhancing infrastructure that boosts developer productivity and ensures product reliability, utilizing cutting-edge tools like Kubernetes and Terraform.
Our Mission

Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers the powerful APIs, SDKs, and tools necessary to build and scale onchain apps and rollups.

Our infrastructure powers 70% of the top web3 teams, 90%+ of web2 companies building in web3 and 100+ million end users. Our customers include top web3 brands like Polymarket, OpenSea, Circle, WorldCoin, as well as major global brands like Shopify and Adobe.

The Alchemy team draws from decades of deep expertise in massively scalable infrastructure, AI, and blockchain from leadership roles at leading companies and universities like Google, Microsoft, Facebook, Stanford, and MIT.

We're backed by the world's leading VCs and institutions, including: Lightspeed, Silver Lake, a16z, Coatue, Pantera, Addition, Stanford University, Coinbase, and Charles Schwab, among others.

About the Role

As an engineer in the Infrastructure department at Alchemy, you will collaborate with our engineering team to design, deploy, and continuously improve the infrastructure supporting our globally used developer platform. Your focus will be on enhancing developer productivity and ensuring product reliability as we scale.

The Infrastructure team’s mission is to provide the infrastructure, tooling and expertise needed to allow Alchemy engineers to ship, scale and operate high quality products to our customers in a fast, safe and cost efficient manner.

Come and help us build, maintain and scale the underlying infrastructure that is required to build products that delight our customers when it comes to reliability, latency and cost.

What You'll Do
  • Architect and operate scalable infrastructure leveraging Kubernetes, Terraform, and cloud-native tools.
  • Build and maintain automation pipelines for deployments, monitoring, and incident response.

  • Lead incident management, conducting root cause analyses and driving continuous reliability enhancements.

  • Develop observability frameworks using Prometheus, Grafana, ELK, and tracing tools like Jaeger or OpenTelemetry.

  • Provide technical leadership and mentorship to elevate the team’s operational capabilities.

What We're Looking For
  • 5+ years of experience as an Infrastructure Engineer focused on reliability (e.g., Site Reliability Engineer, Production Engineer, Platform Engineer).

  • Experience leading and driving company-wide reliability efforts and engineering initiatives.

  • Strong familiarity with observability best practices and tooling like Prometheus, Grafana, and Datadog.

  • Comfortable working with cloud infrastructure (AWS/GCP) and container orchestration (Kubernetes).

  • Skilled with infrastructure-as-code tools such as Terraform or Helm, and eager to improve automation.

  • Able to respond calmly and effectively to incidents, with a focus on learning and improving.

  • Strong communicator who can work well in cross-functional teams.

  • Curious about blockchain technology and distributed systems — not required but a plus.

Benefits and Perks

🩺  Medical, Dental, & Vision
💪  Gym Reimbursement
🖥️  Home Office Build-out Budget
🥙  In-Office Group Meals
🧘‍♂️  Wellbeing & Mental Health Perks
📚 Learning & Development Stipend
🎉  Company Sponsored Conferences & Events
💸  HSA and FSA Plans
🧬  Fertility Benefits

More on the Role

Alchemy is committed to offering competitive compensation, including base salary as well as equity. Additionally, Alchemy offers comprehensive medical, dental, and vision coverage, as well as other benefits such as 401k and unlimited flexible time off.

The base salary range for this position is estimated to be between $135,000 - $240,000 annually. Please note this range reflects base salary only, and does not include bonus, equity, or benefits. Your salary will be determined by various factors, including relevant experience, skill set, qualifications, and other business needs.

Perks & Benefits Extracted with AI

  • Free Meals & Snacks: In-Office Group Meals
  • Health Insurance: Medical, Dental, & Vision
  • Home Office Stipend: Home Office Build-out Budget
  • Learning Budget: Learning & Development Stipend
  • Other Benefit: Fertility Benefits
  • Wellness Stipend: Wellbeing & Mental Health Perks

Whether you're a beginner developer, startup, web3 market leader, or a large enterprise, Alchemy makes multichain web3 development easy with reliable and scalable node infrastructure, enhanced APIs, and developer tools. Get started for free!

View all jobs
Salary
$135,000 – $240,000 per year
Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job