SurveyMonkey is hiring a

Senior Site Reliability Engineer - GetFeedback Digital Team

Bengaluru, India

SurveyMonkey is a global leader in online surveys and forms that empowers people with the insights they need to make decisions with speed and confidence. Our fast, intuitive feedback management platform connects millions of users worldwide with real-time AI-powered insights that drive meaningful decisions. We provide answers to more than 20 million questions every day so that people and organizations can attract new audiences, delight customers, create advocates, and extend their competitive advantage in the marketplace. Our vision is to raise the bar for human experiences by amplifying individual voices. Learn more at surveymonkey.com.

What we’re looking for

As a member of the Infrastructure team at Survey Monkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers.

What you’ll be working on

  • Architect, build, and operate AWS environments at scale with well-established industry best practices
  • Automating infrastructure provisioning, DevOps, and/or continuous integration/delivery
  • Support and maintain AWS services, such as EKS
  • Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems
  • Support and partner with other teams on improving our observability systems to monitor site stability and performance
  • Work closely with developers in supporting new features and services.
  • Work in a highly collaborative team environment
  • Serving as a technical mentor
  • Participate in on-call rotation

We’d love to hear from people with

Experience and Background:

  • 5-8+ years of professional experience in relevant fields, with solid engineering expertise.
  • Over 5 years of hands-on experience in production environments using AWS services, including EKS, EC2, IAM, Kafka, Redis, Memcache, and CloudWatch.

Cloud Infrastructure and Tools:

  • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Familiarity with observability tools like Splunk, OpenTelemetry, New Relic, Datadog, and Grafana/Prometheus.

Configuration Management and Automation:

  • Proficient in configuration management and orchestration tools such as Ansible, Terraform, and CloudFormation.
  • Strong expertise in Docker and Kubernetes for containerization and orchestration.
  • Knowledge of GitOps practices and tools like ArgoCD or FluxCD.

Scripting and Programming:

  • Proficient in scripting languages (Bash, Python, YAML) for system automation.
  • Experience instrumenting Python and Node.js applications to send metrics, traces, and logs to third-party observability tools.

Data Management and Analysis:

  • Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Memcached, Redis, and Kafka.
  • Experience with metrics and logging libraries, data aggregation, and visualization tools, particularly Splunk and OpenTelemetry.

Collaboration and Communication:

  • Excellent communication skills, capable of collaborating effectively with both co-located and remote teams.
  • Ability to listen and partner to understand requirements, troubleshoot problems, and promote platform adoption.

Agile and Development Practices:

  • Experience working in agile environments and utilizing JIRA.
  • Familiarity with GitHub and GitHub Actions in software engineering or DevOps contexts.
  • Preferably experienced with secrets management, particularly HashiCorp Vault.

­­Additional Skills:

  • Strong troubleshooting skills across systems, networks, and application code.
  • Genuine interest in the instrumentation and optimization of Kubernetes clusters.
  • Understanding of scalable and high-performance application architecture.

This opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week.

#LI-Hybrid

Why SurveyMonkey? We’re glad you asked 

SurveyMonkey is a place where the curious come to grow.  We’re building an inclusive workplace where people of every background can excel no matter their time zone. At SurveyMonkey, we weave employee feedback and our core values into everything we do to create forward-looking benefits policies, employee programs, and an award-winning culture, including our annual holiday refresh, our annual week of service, learning and development opportunities like Curiosity Week, and our C.H.O.I.C.E Fund

Our commitment to an inclusive workplace

SurveyMonkey is an equal opportunity employer committed to providing a workplace free from harassment and discrimination. We celebrate the unique differences of our employees because that is what drives curiosity, innovation, and the success of our business. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate. Accommodations are available for applicants with disabilities.

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Senior Site Reliability Engineer Q&A's
Report this job
Apply for this job