Operations Team Lead (Production & Reliability)

TLDR

Lead operational excellence and drive reliability engineering, transforming incident management and team dynamics in a hands-on role.

Complexio is a Foundational AI platform that automates business activities by ingesting whole-company data, both structured and unstructured, and making sense of it. Using proprietary models and algorithms, Complexio forms a deep understanding of how humans interact with and use that data. Automation can then replicate and improve these actions independently.

Complexio is a joint venture between Hafnia and Símbolo, in partnership with Marfin Management, C Transport Maritime, Trans Sea Transport and BW Epic Kosan.

We’re looking for an Operations Team Lead to own production.

Not just keep it running, but build a system that scales.

You’ll lead operational excellence across all live customer-facing systems. Your mission: make production reliable, observable, predictable, and continuously improving.

This is a hands-on role. You’ll shape process, lead incidents, build the team, and move us from reactive firefighting to proactive reliability engineering.

What You’ll Own

Production

  • Stability and availability of all live systems
  • Operational readiness for new releases
  • Safe production access and change coordination

Production is a high-discipline environment. You make sure it stays that way.

Incident Management

You own the full lifecycle:

  • High-signal alerting and fast detection
  • Structured incident response
  • Clear internal and customer communication
  • Blameless postmortems
  • Systemic fixes that prevent repeats

Goal: Fast recovery. Fewer recurring incidents.

On-Call

  • Design sustainable rotations
  • Clear escalation paths
  • Defined severity levels
  • Strong runbooks
  • A no-burnout culture

Someone accountable is always reachable. Escalations are fast and predictable.

Monitoring & Reliability

  • Define SLIs/SLOs for critical systems
  • Improve visibility across availability, latency, errors, and saturation
  • Track MTTR, incident frequency, and escalation trends
  • Drive reliability roadmap initiatives

We measure reliability and improve it continuously.

Team Leadership

  • Lead and grow the Operations team
  • Set clear standards and KPIs
  • Build a culture of ownership and accountability
  • Raise the bar on operational discipline

You’re responsible for both system performance and team performance.

Requirements

What We’re Looking For

  • Strong experience in SRE, DevOps, Infrastructure, or Production Engineering
  • Prior experience leading technical teams
  • Deep hands-on incident management experience
  • Strong observability and reliability mindset
  • Calm under pressure, clear in communication
  • Systems thinker who fixes root causes, not symptoms

How We Think

  • Production is sacred.
  • Clear ownership beats ambiguity.
  • Blameless culture, high accountability.
  • Fix systems, not people.
  • Reliability is a product feature.

Complexio builds a Foundational AI platform that automates business processes by comprehensively understanding both structured and unstructured enterprise data. It's designed for companies looking to enhance their productivity through context-aware automation, utilizing proprietary models and orchestration layers to optimize human-computer interactions. What sets Complexio apart is its focus on privacy-first automation, ensuring that businesses can scale their workflows independently while maintaining data integrity.
