Site Reliability Engineer (SRE)

We are looking for a Site Reliability Engineer (SRE) to join a team that develops and supports several large trading platforms. You will help the team provide end users in many countries with access to various markets. You will be responsible for maintaining availability, automating the release and deployment process, and providing seamless monitoring and alerting for all solutions.

  • Work closely with developers on prototyping and designing new features as part of the infrastructure,
  • Deploy, install, configure, and maintain sophisticated trading/finance and related software,
  • Configure bare-metal instances using Infrastructure as Code,
  • Build and maintain CI/CD pipelines,
  • Make key decisions on scalability, reliability, and accessibility,
  • Install and manage monitoring systems, both in-house and well-known external ones,
  • Design, deploy, and configure servers and networks: provision servers and storage, configure firewalls, VPN, monitoring, etc.,
  • Administer UNIX infrastructure: installation, configuration, and maintenance,
  • Work with Nexus and Git repositories.
     

It would be great if you have:

  • Experience as an SRE or DevOps engineer,
  • Experience supporting JVM applications (garbage collection, memory leaks),
  • Experience with software development,
  • Strong experience with OS-level administration on Linux and/or UNIX,
  • Hands-on scripting experience with Bash, Python, and/or Groovy,
  • Experience configuring TeamCity CI/CD pipelines,
  • Experience with IaaS solutions using Ansible and/or Terraform,
  • Experience with Docker container orchestration (K8s/OpenShift/HashiCorp),
  • The ability to read and analyse errors,
  • In-depth knowledge of TCP/IP and the ISO/OSI stack,
  • Experience with monitoring and logging tools (Zabbix, Elasticsearch or OpenSearch, Grafana, Kibana, etc.),
  • Experience working with Apache, Nginx, HAProxy, Envoy, etc.,
  • Experience configuring bare-metal instances using Infrastructure as Code,
  • A strong ability to solve problems using code and scripting,
  • An English level of B2 or higher.

    Care for employees is one of Devexperts' core values. For this position, we offer a benefits package designed to ensure the comfort of our new teammate.

    Work Regime Flexibility benefits: 

    • Possibility of hybrid work mode,

    • Flexible working hours.

    Health and recreation benefits: 

    • Fully paid additional wellness days (3 unwell days per year),

    • Medical insurance for the employee,

    • Fitness reimbursement,

    • Meal allowance.

    Facility benefits: 

    • Modern office with new equipment,

    • Special parking allowance,

    • Free drinks and snacks.

    Community benefits: 

    • Teambuilding activities,

    • Corporate parties,

    • Speakers' Club,

    • Free admission to corporate external events,

    • Possibility of joining conferences and professional fairs,

    • Personal branding development support.

    Social benefits: 

    • Parental bonus,

    • Referral bonus,

    • Blood donation paid leave.
