We are looking for a Senior Platform / DevOps Engineer to join our existing Platform team. We run a modern AWS + Kubernetes platform, but we also integrate with traditional / on-prem infrastructure.
You will help design, build and operate the core platform used by our product teams: Kubernetes clusters, networking, observability, GitOps, and infrastructure as code. You will work closely with other platform engineers and the Lead DevOps Engineer, own specific areas and drive improvements end-to-end.
This role is fully remote within Germany and requires close collaboration with engineers across multiple teams.
In addition, you'll be responsible for the following tasks:
Platform engineering
Design, build and operate our AWS- and Kubernetes-based platform.
Own one or more areas (e.g. Kubernetes, observability, networking, IaC) and act as the go-to person for those topics in the team.
Cloud & Kubernetes
Operate production AWS environments (multi-account, multi-environment).
Operate Kubernetes clusters (upgrades, capacity, reliability, security).
GitOps & delivery
Design and maintain Argo CD–based GitOps workflows (multi-cluster, multi-env).
Contribute to CI/CD patterns and deployment strategies together with the team.
Observability
Evolve and operate our observability stack:
Metrics: Prometheus, Thanos
Logs: Grafana Loki
Traces: Grafana Tempo
Instrumentation: OpenTelemetry
Help define SLOs, dashboards and alerting that are actually useful for teams.
Networking & connectivity
Work on Kubernetes networking, Ingress controllers and traffic routing.
Contribute to designs using Envoy (or Envoy-based components), API gateways and SD-VPN solutions.
Infrastructure as Code
Build and maintain Terraform modules and state layouts for AWS, Kubernetes and related infrastructure.
Help migrate existing/manual infrastructure to Terraform with minimal disruption.
(Bonus) Use Crossplane where it makes sense for Kubernetes-driven provisioning.
Hybrid & on-prem integration
Support connectivity and integration between cloud workloads and on-prem systems.
Collaboration
Work as part of the existing Platform team – participate in design reviews, incident reviews and on-call.
Support product teams as internal customers by providing clear platform interfaces, documentation and examples.
We are searching for a motivated candidate with the following qualifications:
10+ years of experience in infrastructure / operations / platform / DevOps roles.
Strong experience with AWS in production:
Multi-account setups (e.g. separation of dev / test / prod).
Networking (VPC, subnets, routing, security groups, load balancers).
Landing Zones.
IAM and security best practices.
Strong experience with Kubernetes in production:
Operating clusters (EKS or similar).
Workloads, deployments, autoscaling, upgrades and cluster lifecycle.
Solid observability experience with several of:
Prometheus, Thanos, Grafana.
Grafana Loki, Grafana Tempo or similar logging/tracing systems.
OpenTelemetry for metrics/logs/traces instrumentation and pipelines.
Solid networking knowledge:
TCP/IP, DNS, TLS, HTTP, routing, VPNs, firewalls, load balancing.
Experience with Envoy (directly or via service mesh / API gateway), Ingress controllers and API gateways.
Experience with SD-VPN solutions (e.g. AWS VPN, Tailscale or similar).
Strong experience with Terraform and Terragrunt:
Modular design, state management, remote backends.
Managing AWS/Kubernetes infrastructure as code at team or org scale.
Hands-on experience with Argo CD and GitOps principles:
Application structure, app-of-apps patterns, promotion across environments, RBAC and ABAC.
Experience with traditional/on-prem infrastructure (VMs, networks, VPNs, storage, legacy systems) and connecting it to cloud environments.
Comfortable working in a remote-first environment in European time zones (good written communication, async collaboration).
Nice to have:
Experience with Crossplane in production.
Experience in a Platform/Enablement team serving multiple product teams.
Experience with Argo Workflows or similar workflow/orchestration tools.
Experience with AWS Control Tower and Account Factory.
German language skills (B2).