We are looking for a Senior Site Reliability Engineer (SRE) to join a team that develops and supports proprietary trading platforms for large-scale clients. You will help the existing team provide end users in many countries with access to various markets. You will be responsible for maintaining availability, automating the release and deployment process, and providing seamless monitoring and alerting for all solutions.
Responsibilities:
- work closely with developers on prototyping and designing new features as part of the infrastructure
- deploy, install, configure and maintain sophisticated Trading/Finance and related software
- configure bare-metal and cloud instances using Infrastructure as Code
- make key decisions on scalability, reliability and accessibility
- install and manage both in-house and well-known third-party monitoring systems
- design, deploy and configure cloud-based servers and networks: provision servers and storage, configure firewalls, VPN, monitoring, etc.
- administer UNIX/Cloud infrastructure – installation, configuration and maintenance
- work with Nexus and Git repositories
Requirements:
- 5+ years of experience in UNIX/Linux administration
- 5+ years of experience in networking
- experience as an SRE or DevOps engineer
- strong experience with OS-level administration on Linux and/or UNIX
- hands-on scripting experience with Bash, Python, and/or Groovy
- experience configuring TeamCity CI/CD pipelines
- experience building IaaS solutions with Ansible (AWX) and Terraform
- experience orchestrating Docker containers (K8s/OpenShift/HashiCorp)
- ability to read and analyze errors
- in-depth knowledge of the TCP/IP stack and the ISO/OSI model
- experience with monitoring and logging tools (Zabbix, Elasticsearch or OpenSearch, Grafana, Kibana, Dynatrace, Prometheus, etc.)
- experience working with Apache, Nginx, HAProxy, Envoy, etc.
- strong ability to solve problems using code and scripting
- understanding of ITIL processes and routines
- excellent English (written and verbal)
Additional skills considered an advantage:
- experience with SQL-like query languages
- experience with Ansible (AWX)
- knowledge of the Java programming language
- experience using trading, exchange, or risk management software
- experience with Atlassian software (JIRA, Confluence, FishEye, etc.)