Senior Staff Back-end Engineer (SRE) [L7-I]

As a Principal Back-End SRE Engineer within the Site Reliability organization, you will work with large-scale cloud infrastructure, handling billions of metrics and petabytes of logs. 

You would leverage this data to help internal teams monitor service reliability and predict/prevent incidents. You have the opportunity to build the next-generation Observability Platform based on Kubernetes and other OSS solutions and build software components from scratch. You would work directly with various engineering teams in Coupang, influence them with SRE principles and best practices, and see your impact directly.

Key Responsibilities:

  • Architect and drive the build-out out Fault injection, chaos engineering practices.
  • Collect requirements, architect, design, and implement the next generation APM Platform on AWS for company-wide teams to improve observability, reliability & service availability.
  • Work with internal teams directly and help them effectively leverage our monitoring infrastructure, as well as evangelize best SRE practices.
  • Write reliable and reusable code with the ability to scale with very large data volumes.

Essential Qualifications:

  • BS or advanced degree in Computer Science, Computer Engineering, or Electrical Engineering
  • 10+ years of software engineering experience
  • Experience in architecting, building and maintaining large-scale service infrastructures
  • Experience in Java, distributed systems, micro services
  • In-depth knowledge in metrics collection/visualization, log collection/aggregation, and tracing
  • Strong AWS Cloud Background on both development and operation
  • A strong team player, ability to quickly triage and troubleshoot complex problems
  • A strong SRE/Devops background.
  • SLO/SLA management and implementation experience

Preferred Qualifications:

  • Building and deploying Kubernetes, K8S or similar technology on AWS
  • Experience in successfully deliver large scale platform services/tools
  • Experiences with various metrics, logging, tracing, and APM tools
  • Publications and/or open-source projects related to service observability and system monitoring

https://coupang.sharepoint.com/:f:/r/sites/PlanYourDays/People%20PolicyGuidelines/Job%20Profile?csf=1&e=UIGecP

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Backend Engineer Q&A's
Report this job
Content missing

This job is no longer available

Enter your email address below to get notified whenever we find a similar job post.

Unsubscribe at any time.