Required skills/experience:
7+ years of experience in DevOps & Cloud Technologies
Expertise in at least one of AWS, Azure, or GCP
Experience handling high-scale production workloads
Strong working knowledge of Kubernetes
Expertise with infrastructure automation tools such as Terraform, Pulumi, CloudFormation, and Helm
Strong debugging/troubleshooting skills
Deep working knowledge of Linux servers and networking
Hands-on knowledge of at least one of Python, Shell, Go, or Java
Experience with monitoring solutions such as Datadog, New Relic, ELK, and Prometheus/Grafana
Familiarity with modern cloud development practices (microservices architectures, REST interfaces, etc.)
Passion for working in an exciting, fast-paced environment
Responsibilities:
Design and implement secure, resilient, and highly scalable infrastructure
Own and automate infrastructure provisioning, demand forecasting, capacity planning, and right-sizing
Build automation tools and frameworks to improve the system's observability, availability, reliability, performance/latency, and monitoring
Ensure the availability, performance, and scalability of applications in line with proven design and architecture best practices
Raise the bar on engineering excellence through design/code/document reviews and self-service automation tools
Practice sustainable incident response and participate in peer reviews and blameless postmortems
Envision, implement, and roll out the best DevOps tooling and automation for all of our services
Build and maintain CI/CD pipelines