We are looking for multiple Monitoring and Incident Response Specialists on behalf of, and in support of, our client based in The Hague, The Netherlands. Our client is a major European organization operating in a complex, multi-site environment. Its IT department plays a key role in maintaining and evolving systems that support business processes, collaboration, and innovation. Please note that this position is intended to be filled on a 100% off-site basis from anywhere within the EU.
Please note, the successful candidate will be employed by ATG Europe (or one of its subsidiaries). Furthermore, upon selection, they will be subject to a general security screening performed by an external provider (further information will be provided at interview stage).
The successful candidate will be tasked with, but not limited to:
Implement, support and maintain the monitoring setup using modern platforms and tools (e.g. PagerDuty, Terraform, Testkube) for applications and services that require monitoring;
Assist and liaise with Product Teams for smooth and timely implementation of monitoring configuration for related applications or services in accordance with stipulated requirements;
Cooperate with Incident and Problem Managers for prompt and accurate information on incidents, monitor setup of critical applications and possible improvements;
Support and assist the migration of remaining services and applications to the new alerting tool (PagerDuty);
Support and assist the migration of remaining legacy probes (e.g. JMeter and Java probes, URL response checks) from the legacy platform to the new Kubernetes-based one (Testkube);
Ensure the smooth functioning and availability of the platforms and tools in the Monitoring and Observability area (e.g. PagerDuty, Testkube) and provide timely support for any technical investigations to maintain the stipulated availability levels (monitoring tools being Platinum applications);
Create and deliver reports on routine/ad-hoc activities upon request (e.g. querying PagerDuty via API, statistics on PagerDuty users).
ATG is dedicated to diversity and inclusion and is an equal opportunity employer. Regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, age, or any other reason protected by relevant state or municipal legislation, we are pleased to consider all eligible candidates for employment.
Please ensure that your CV clearly explains how you meet the mandatory requirements. A CV that does not align completely with the essential criteria may be rejected.
University Degree in the field of IT (or related field);
In the absence of a University Degree, candidates with 4 years of post-secondary technical education in a computer-related field and 5 years of professional experience in the field of Monitoring and Observability may be considered;
At least 5 years of relevant hands-on experience;
At least 5 years of experience in the field of monitoring, observability and maintenance;
At least 2 years of experience in 2nd Level Expert Support;
Experience with ITIL Incident Management;
Experience with event correlation rules design and optimization;
Hands-on experience with monitoring and observability tools and proven experience in proposing improvement of performance and stability of used platforms;
Experience in dealing with various components of the IT landscape (Windows, Linux, Storage, Networking, Firewalls);
Hands-on experience with current, commonly used technologies such as Ansible, GitHub, Kubernetes, Elasticsearch;
Technical skills:
SQL basics;
PagerDuty;
Testkube;
Grafana Labs K6;
Cloud-native observability tools: Prometheus, AlertManager, Grafana, Elastic, OpenTelemetry tools from Grafana Labs (Loki, Tempo, Mimir);
JMeter;
Terraform;
K8s knowledge – CKAD level or ideally CKA level;
Java Programming:
Building Java applications for K8s;
Building Java applications (to be executed as a monitoring probe or to be run in JBoss);
Scripting skills:
Python;
Perl;
JavaScript (mandatory for K6);
ServiceNow (user level);
Understanding of cloud offerings and virtualization technologies (Kubernetes, GCP, Azure, AWS);
Familiarity with Agile methodologies;
Excellent analytical skills;
Ability to function as an individual contributor and a member of a team;
Good communication skills to engage with the various stakeholders;
Ability to work on separate streams in parallel;
Strong appetite to learn new skills and share knowledge on challenging topics;
Proven success in contributing to a team-oriented environment;
Candidates must be eligible to live and work in the EU;
Fluency in English, both written and spoken;
Knowledge of French and/or German.
Please submit your application in English via the apply button below. Applications submitted in other languages will not be considered.