Important Information
Experience: 7+ years
Job Mode: Full-time
Work Mode: Remote
ID: 19995
About the Role
We are seeking an experienced Observability Engineer with deep expertise in New Relic to drive monitoring, performance optimisation, and reliability insight across our application estate.
You will act as the subject matter expert for observability strategy, instrumentation standards, dashboard design, and alerting governance. Working closely with engineering, platform, and service teams, you will ensure that our systems running across Microsoft Azure and on-premises environments are measurable, diagnosable, and resilient.
This is not a passive tool-administration role — it requires a technically capable, forward-thinking engineer who can translate telemetry into actionable reliability improvements.
Key Responsibilities
Observability Strategy & Governance
Own and evolve the organisation’s observability standards.
Define best practices for instrumentation, alerting, and dashboard design.
Ensure consistency of monitoring across multiple applications and services.
New Relic Expertise
Configure and optimise APM agents, logging, and tracing.
Design meaningful dashboards for engineering and leadership audiences.
Tune alerts to reduce noise and improve signal quality.
Analyse transaction performance, error rates, and infrastructure health.
Leverage NRQL to produce actionable insights.
Reliability & Performance Engineering
Support teams in identifying bottlenecks and performance risks.
Improve MTTR through better telemetry design.
Define and monitor SLIs/SLOs.
Identify systemic reliability gaps across the estate.
Cross-Team Enablement
Partner with SRE, DevOps, and engineering teams.
Coach developers on effective instrumentation.
Drive adoption of monitoring standards in CI/CD pipelines.
Support major incident diagnostics when required.
What We’re Looking For
Experience
Proven hands-on experience administering and optimising New Relic.
Background in SRE, DevOps, Platform Engineering, or Production Engineering.
Experience monitoring distributed systems and cloud-based applications.
Exposure to hybrid environments (cloud + on-premises).
Technical Capability
Strong understanding of APM, logging, metrics, and tracing concepts.
Experience with Azure-based architectures.
Ability to analyse logs, performance metrics, and system behaviour.
Comfortable working with APIs and automation where required.
Mindset
Analytical and detail-oriented.
Self-starter who takes ownership of reliability outcomes.
Improvement-driven rather than reactive.
Able to translate technical data into business insight.
Success in This Role
Reduced alert noise and improved signal quality.
Clear, trusted service dashboards.
Faster diagnosis during incidents.
Improved system performance and reliability.
Strong observability maturity across teams.
About Encora
Encora is the preferred digital engineering and modernization partner of some of the world’s leading enterprises and digital native companies. With over 9,000 experts in 47+ offices and innovation labs worldwide, Encora’s technology practices include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering.
At Encora, we hire professionals based solely on their skills and qualifications, and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.
Encora specializes in delivering customized software engineering solutions and digital product development services to fast-growing technology firms, leveraging advanced technologies to foster innovation and growth across various industries.