Coupang is hiring a

Staff Back-end Engineer (Site Reliability Engineer)

Mountain View, United States

We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing e-commerce companies that established an unparalleled reputation for being a dominant and reliable force in South Korean commerce.

We are proud to have the best of both worlds — a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the speed we have been since our inception. We are all entrepreneurial surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people that like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.

Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world.

 

About the Role:  

Site Reliability Engineers (SREs) at Coupang is a mission-critical role which combines software and system engineering to build, run and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer facing services are healthy, monitored, automated, and designed to scale. As SRE organization we take pride in handling “operations as an engineering” problem with automation first approach. You will use your background to build best in class infrastructure automation for areas such as Observability, Incident management, Disaster Recovery, Load testing, Capacity engineering and many more. In this role you will work very closely with our product development teams from an early stage of design to all the way helping resolve any production incidents, maintaining SLI/SLA bar for production services and influencing them with SRE principles and best practices. If you take pride in complete ownership, have a passion for solving complex technical challenges for large scale distributed systems and demeanor to work and communicate effectively across team boundaries, this is the role for you!  

 

Key Responsibilities: 

  • Serve as a primary point responsible for the reliability, health, and performance of all Coupang customer-facing services.  

  • Gain deep knowledge of Coupang application workflow and dependencies.  

  • Define and track key performance indicators (KPIs) and service-level objectives (SLOs) related to system availability, performance, and reliability. 

  • Build world class incident management process and automation, including fast incident remediation, incident operational reviews and retrospectives. 

  • Develop and implement best practices for creating and maintaining effective monitoring, alerting, and telemetry systems. 

  • Build automation to execute regular Disaster Recovery testing and load testing to stay ahead of expected growth of Coupang services.  

  • Work closely with product development teams to ensure the products are designed with scale and operability in mind.  

  • Build right guardrails and automation for deploying production changes holding the reliability bar.  

  • Participate in a 24x7 rotation for production issue escalations, functions well in a fast-paced environment.  

  • Communicate effectively with people at all levels of the organization. 

 

Essential Qualifications: 

  • 10+ years of industry experience building and operating large scale distributed systems.  

  • SLO/SLA management and implementation experience 

  • Deep UNIX/Linux systems knowledge and administration background. 

  • Demonstrated programming skills in one or more of: Python, Java, Golang, Ruby. 

  • Strong problem-solving and analytical skills spanning systems, network (TCP/IP) and code, with a focus on data-driven decision-making. 

  • Experience with cloud-based infrastructure, including AWS, Azure, or Google Cloud Platform. 

  • Strong understanding of DevOps and SRE practices, including continuous integration, continuous delivery, and infrastructure as code (IaC). 

  • Experience with containerization and orchestration technologies, such as Docker and Kubernetes. 

  • Excellent communication and collaboration skills, with the ability to work with teams across distinct functions and technical domains. 

  • Knowledge of observability ecosystem including metrics, logging, tracing and tools, such as Prometheus, Grafana, Elastic Stack, Datadog, or New Relic. 

 

Preferred Qualifications: 

  • Bachelor's degree in computer science, Engineering, or a related technical field.  

  • Prior experience working with large scale web-based Java architectures and JVM configuration. 

  • Professional certifications in cloud platforms, monitoring tools, or related technologies. 

  • Previous experience working on a large-scale ecommerce platform.  

 

Pay & Benefits   

Our compensation reflects the cost of labor across several US geographic markets. At Coupang, your base pay is one part of your total compensation. 

 

The base pay for this position ranges from $138,000/year in our lowest geographic market to $297,000/year   in our highest geographic market. Pay is based on several factors including market location and may vary depending on job-related knowledge, skills, and experience.

 

General Description of All Benefits  

  • Medical/Dental/Vision/Life, AD&D insurance  
  • Flexible Spending Accounts (FSA) & Health Savings Account (HSA)
  • Long-term/Short-term Disability
  • Employee Assistance Program (EAP) program
  • 401K Plan with Company Match
  • 18-21 days of the Paid Time Off (PTO) a year based on the tenure
  • 12 Public Holidays
  • Paid Parental leave
  • Pre-tax commuter benefits
  • MTV - [Free] Electric Car Charging Station 

 

  General Description of Other Compensation  

 “Other Compensation” includes, but is not limited to, bonuses, equity, or other forms of compensation that would be offered to the hired applicant in addition to their established salary range or wage scale. 

 

Coupang is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to actual or perceived race (including traits historically associated with race, including but not limited to hair texture and protective hair styles), color, religion, religious creed (including religious dress and grooming practices), sex or gender (including pregnancy, childbirth, breastfeeding, and medical conditions related to pregnancy, childbirth or breastfeeding), gender identity, gender expression, sexual orientation, ,ancestry, national origin (including language use restrictions), age (40 and over), physical or mental disability, medical condition, genetic information, HIV/AIDS or Hepatitis C status, family status (including but not limited to marital or domestic partnership status), military or veteran status, use of a trained dog guide or service animal, political activities or affiliations, ancestry, citizenship, family and medical leave status, status as a victim of any violent crime, or any other characteristic or class protected by the laws or regulations in the locations where we operate. Coupang is also committed to providing a safe work environment for its employees and its consumers.   If you need assistance and/or a reasonable accommodation in the application of recruiting process due to a disability, please contact us at  [email protected]

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job