Salla, a leading Information Technology and Services company, is seeking an experienced Senior DevOps Engineer with a strong background in CI/CD, infrastructure automation, and cloud technologies. As a Senior DevOps Engineer, you will play a crucial role in architecting and implementing robust CI/CD pipelines, managing Kubernetes clusters in AWS, and maintaining monitoring tools for our organization.
Responsibilities:
1/ CI/CD Pipeline Management:
- Design, develop, and maintain scalable and reliable CI/CD pipelines using tools such as Jenkins, Bit Rise, GitHub, and GitHub Actions.
- Automate build, test, and deployment processes to ensure efficient and high-quality software delivery.
- Collaborate with development and QA teams to optimize CI/CD workflows and integrate best practices.
- Infrastructure Automation:
- Utilize infrastructure-as-code (IaC) tools, particularly Terraform Cloud, to provision and manage cloud resources in AWS.
- Implement configuration management and automation solutions to ensure consistent and reproducible infrastructure deployments.
- Continuously improve the scalability, reliability, and security of our infrastructure.
- Good Experience building and optimizing CI/CD pipelines.
2/ Kubernetes Cluster Management:
- Manage and maintain Kubernetes clusters in AWS, ensuring their availability, performance, and scalability.
- Work closely with development teams to optimize application deployments on Kubernetes, leveraging best practices and industry standards.
- Monitor cluster health, diagnose and resolve issues, and implement proactive measures for cluster performance optimization.
3/ Monitoring Tools Maintenance:
- Maintain and enhance monitoring tools infrastructure, including Grafana, Prometheus, and PagerDuty.
- Configure monitoring dashboards, alerts, and notifications to provide real-time visibility into system performance and availability.
- Collaborate with cross-functional teams to identify and address performance bottlenecks and ensure effective incident response.
Requirements
Professional Experience:
- Minimum 5 years of experience as a DevOps Engineer, with a strong focus on CI/CD pipeline management and infrastructure automation.
- Hands-on experience with CI/CD tools such as Jenkins, BitRise, GitHub, and GitHub Actions.
- Solid understanding of infrastructure-as-code (IaC) principles, preferably using Terraform Cloud.
- Proficiency in managing Kubernetes clusters in AWS, including deployment, scaling, and troubleshooting.
Cloud and Infrastructure:
- In-depth knowledge of Amazon Web Services (AWS) and its core services, with a focus on infrastructure automation and management.
- Experience with containerization technologies like Docker and container orchestration platforms such as Kubernetes.
- Familiarity with serverless architecture and services, such as AWS Lambda.
Monitoring and Alerting:
- Strong experience in setting up and maintaining monitoring tools like Grafana, Prometheus, and PagerDuty.
- Proven ability to configure monitoring dashboards, alarms, and alerting mechanisms for proactive system monitoring.
- Good Experience with open source observability tools (Prometheus, Grafana and OpenTelemtery)
Scripting and Automation:
- Strong knowledge of automation an entire DevOps pipeline, including CI/CD cycles, infrastructure and configurations, app performance monitoring.
- Strong programming skills (Python and Golang is highly preferable)
- Strong understanding of Kubernetes and containers.
- Strong understanding of cloud core services (VPCs, IAM, EC2, storage, managed Kubernetes). (AWS is highly preferable)
- Heavy experience with managing large infra with IaC (Terrform is highly preferable)
- Excellent knowledge of DevOps tools, coding, and scripting.
- Experience with service meshes and API Gateways (huge plus).
Collaboration and Communication:
- Strong teamwork and collaboration skills with the ability to work effectively across cross-functional teams.
- Excellent verbal and written communication skills to articulate technical concepts and solutions effectively.
- Strong understanding of networking and operating systems.
- Strong troubleshooting skills.
Benefits
- Training & Development
- performance bounce
- Work from Home