Sitetracker

Site Reliability Engineer

CAD $97,000 – CAD $149,200 per year

TLDR

Lead the establishment of a reliability practice at Sitetracker, enhancing engineering standards for AI workloads and empowering the engineering team with proactive reliability measures.

Within 90 Days, You'll
  • Fully onboard and partner with the engineers currently managing reliability to review and revise the existing operational plan.

  • Operationalize high-leverage items to transition the team out of reactive firefighting and into a more stable, proactive state.

  • Establish a baseline for current system behavior by identifying the most critical user journeys that require immediate SLI/SLO definitions.

Within 180 Days, You'll
  • Independently drive the revised reliability plan, ensuring SLIs/SLOs are in place and actively used to guide engineering decisions.

  • Standardize the incident response structure, including severity definitions, Incident Commander roles, and a cadence for blameless postmortems.

  • Measurably reduce paging volume and ensure that every alert that pages an engineer is backed by a clear, effective runbook.

Within 365 Days, You'll
  • Establish a mature reliability practice where production-readiness reviews and error-budget conversations are default parts of the development lifecycle.

  • Define a clear, evidence-based tooling roadmap for the next phase of our evolution, such as Terraform, service mesh, or multi-region expansion.

  • Serve as an organizational multiplier, having built the observability and culture necessary for other engineers to reason about reliability without constant supervision.

Sitetracker builds a comprehensive platform tailored for companies in infrastructure-heavy industries like Telecom and Utilities, helping them manage high-volume distributed projects with real-time collaboration and automated reporting. By optimizing workflows and enhancing operational efficiency, Sitetracker empowers global enterprises to effectively deploy mission-critical infrastructure and streamline their project management across platforms and teams.

Founded: 2013
Employees: 201-500
Industry: Diversified Telecommunication Services
Total raised: $180M