Senior Site Reliability Engineer (DevOps)

Gdańsk , Poland
full-time

AI overview

Drive the reliability and scalability of SPOTIO’s cloud-native infrastructure by building automated systems and integrating with development teams for high-performance solutions.

A little about us

SPOTIO is a dynamic, fast-growing American start-up with a 10-year tradition of creating the #1 Sales Engagement Platform. SPOTIO's platform helps field sales teams manage sales activities, increase the productivity of sales representatives and record field sales insights.

We have offices in Dallas, Texas and Gdańsk. In Gdańsk, there is a development, product and QA team totalling 21 people (50 people in total globally).

How do we work?

There's no corporate vibe or dress code here. Instead, there's a friendly, collaborative atmosphere and small teams. On Tuesdays and Thursdays, we meet in the office in Gdańsk - because we want to. On other days, we work remotely because we can.

Requirements

We’re looking for a Senior Site Reliability Engineer who will own and drive the reliability, scalability and operational excellence of our cloud-native infrastructure on Microsoft Azure. You’ll be a key player in building self-healing, automated systems, partnering with development teams and senior leadership to ensure that feature velocity does not compromise stability. You’ll bring deep hands-on experience with Azure, infrastructure-as-code, automation tools and modern operational practices.

Responsibilities

  • Ensure the availability, performance, and scalability of SPOTIO’s production systems and infrastructure.
  • Define, implement and monitor SLIs/SLOs, error budgets, capacity planning, incident response and root-cause workflows.
  • Build and evolve automation tooling: provisioning, deployments (CI/CD), monitoring, alerting, self-healing mechanisms, infrastructure as code.
  • Partner with software engineering teams to help design Highly Available and scalable systems
  • Manage and optimize our cloud stack on Azure: compute, networking, storage, identity, security, cost-optimization, disaster recovery, high-availability.
  • Work with tools such as Azure, Cloudflare, Elasticsearch, Pulumi, Kubernetes, and GitHub
  • Build and maintain robust CI/CD pipelines (automation of builds, tests, releases, rollbacks) to accelerate safe feature delivery.
  • Participate in on-call rotations, perform incident triage, drive post-mortem analyses and remediation.
  • Advocate for reliability culture: mentor engineers, evangelize best practices, create documentation, run disaster recovery exercises.

Skills & Experience:

  • 5+ years of experience in a Site Reliability, Platform, or DevOps role at scale (cloud-native environment).
  • 3+ years of experience with containerization and orchestration (Kubernetes, AKS).
  • 3+ years of experience with Microsoft Azure (IaaS, PaaS, networking, security, monitoring).
  • Hands-on experience with Cloudflare (CDN, DNS, WAF or similar), Elasticsearch (deployment/management/observability) and Pulumi (or similar IaC: Terraform, CDK).
  • Strong automation mindset and tooling experience: build/operate CI/CD pipelines, scripting (Python, .NET, JavaScript, or similar).
  • Experience with version control (Git / GitHub), branching strategies, release management.
  • Excellent troubleshooting skills.  E.G. Strong ability to dig into production issues, latency, MTTR, and drive remediation.
  • Strong communication and collaboration skills w/ experience working across teams.
  • Proactive, self-driven, and comfortable in a rapidly-evolving startup/scale-up environment.

Preferred qualifications

Experience with .NET (C#) and/or JavaScript/TypeScript backend systems.

Experience with observability platforms (Prometheus, Grafana, Datadog, ELK) and building dashboards & alerts.

Experience with chaos engineering or resilience testing.

Familiarity with cost-optimization in cloud environments.

Experience mentoring or leading other engineers (senior or technical lead kind of role).

Benefits

What we can offer you:

  • Interesting work and real impact on the product, organization and technology selection
  • Modern equipment
  • Working with great people ;)
  • Free parking
  • Modern office
  • Medicover sport and health
  • Paid vacation, sick leave
  • Flexible working hours
  • Conference budget 1500 PLN

Location: Poland (R&D Team), Gdańsk

Type of work: hybrid (2 days a week from a CUBE office)

Contact person: Agnieszka Myśliwczyk

Perks & Benefits Extracted with AI

  • Flexible Work Hours: Flexible working hours
  • Health Insurance: Medicover sport and health
  • Conference budget: Conference budget 1500 PLN
  • Paid Time Off: Paid vacation, sick leave

SPOTIO is the #1 field sales software for sales reps and managers to enable insane productivity, increase sales by 23%, and shorten sales cycles.

View all jobs
Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Senior Site Reliability Engineer Q&A's
Report this job
Apply for this job