Site Reliability Engineer

About Tamr
Tamr offers an AI-native Master Data Management (MDM) platform that helps enterprises create unified, accurate records across vast, complex data ecosystems. We accelerate the discovery, enrichment, and maintenance of golden records (clean, trusted, and continuously updated profiles) that enable informed decision-making and improved operational efficiency.

About the Role
We're looking for a Site Reliability Engineer to join our team and help build and maintain the platform services that power our infrastructure. You'll work with Go and Python to develop tooling, improve CI/CD pipelines, and manage containerized applications on Kubernetes. This role offers significant growth opportunities: you'll learn advanced SRE practices while contributing meaningfully from day one.

About the Team
You'll join our SRE team in Harvard Square (Cambridge), one of the most important technical hubs in the world. Our team is a close-knit group of professionals who are always willing to teach and eager to learn. We've created a collaborative, supportive, and fun culture while doing serious work. With a small team, every member needs to contribute meaningfully and have a significant impact on the business. When you join, we'll work with you to find a project you're excited to finish, and one we're excited to have.

What to Expect
  • You'll join a team of 4–5 people
  • You'll have a manager who will mentor you in your professional development
  • You'll have a tech lead who will support you with scheduling, architecture, and coding
  • You'll design, break down, and own stories for every sprint
  • Your code will be held to the same standard expected of every engineer

What You'll Do
  • Develop and maintain platform services and tools in Go and Python
  • Build and improve CI/CD pipelines using Jenkins
  • Deploy and manage containerized applications on Kubernetes
  • Write and optimize database queries and schemas in PostgreSQL
  • Collaborate with senior engineers on infrastructure automation
  • Participate in code reviews and contribute to team best practices
  • Monitor and troubleshoot platform services
  • Document systems, processes, and runbooks
  • Participate in on-call rotation

What We're Looking For
  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field
  • Proficiency in Python and Go, including familiarity with best practices and common frameworks
  • Understanding of object-oriented and functional programming concepts
  • Experience with version control using Git and GitHub workflows (pull requests, code reviews, branching strategies)
  • Basic understanding of containerization concepts and Docker
  • Basic knowledge of Linux/Unix command line and shell scripting
  • Familiarity with Jenkins or other CI/CD tools
  • Hands-on experience with at least one major public cloud provider, such as Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure
  • Familiarity with relational databases and SQL (PostgreSQL experience is a plus)
  • Understanding of RESTful APIs and microservices architecture concepts
  • Understanding of monitoring and logging tools

Nice to Have
  • Hands-on experience with Google Cloud Platform services
  • Previous internship, co-op, or 1–2 years of professional software development experience
  • Experience with infrastructure-as-code concepts
  • Exposure to Kubernetes or container orchestration
  • Experience building Kubernetes Operators

Our Tech Stack
  • Cloud: Multi-cloud (GCP, AWS, Azure)
  • Infrastructure: Kubernetes, Helm, Istio, Docker, Terraform
  • CI/CD: Jenkins, FluxCD, Flagger, GitOps
  • Languages: Go, Python, Java, Kotlin
  • Data: PostgreSQL, BigQuery, Spanner, Bigtable, Dataproc/Spark

What You'll Learn
  • Advanced Go programming and design patterns
  • Kubernetes architecture and operations
  • GCP services and cloud-native development
  • Building resilient, scalable platform services
  • Site reliability engineering practices
  • Infrastructure as code and GitOps workflows

Work Authorization
Applicants must be authorized to work for any employer in the United States. We are currently unable to sponsor or assume sponsorship of employment visas.

Additional Information
Tamr provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state, and local laws.


Tamr is the only cloud-native data mastering solution (cloud MDM) that accelerates analytics through machine learning (ML), available on Google Cloud, Azure, and AWS.

Salary
$110,000 – $150,000 per year