Devexperts is hiring a

Site Reliability Engineer (SRE)

Jersey City, United States
Full-Time

We are looking for a Senior Site Reliability Engineer (SRE) to fill the open position in a team that develops and supports proprietary trading platforms for large scale clients. You will help the existing team to ensure access to various markets to end users from a lot of countries. You will be responsible for maintaining availability, automating release/deploy process, seamless monitoring, and alerting of all the solutions.

  • work closely with developers for prototyping, and designing new features as part of the infrastructure
  • deploy, install, configure and maintain sophisticated Trading/Finance and related software
  • configure bare metal & сloud instances by using Infrastructure as Code
  • make key decisions for scalability, reliability and accessibility
  • install and manage in-house developed and external well-known monitoring systems
  • design, deploy and configure cloud-based servers and networks provision servers and storage, configure firewalls, VPN, monitoring, etc.
  • administrate UNIX/Cloud infrastructure – installation, configuration and maintenance
  • work with the Nexus and GIT repositories
  • 5+ years of experience in UNIX/Linux administration
  • 5+ years of experience in Networking
  • experience as an SRE or DevOps
  • strong experience with OS-level administration on Linux and/or UNIX
  • hands-on scripting experience with Bash, Python, and/or Groovy
  • experience with configuring TeamCity CI/CD pipelines
  • IAAS solutions using Ansible (AWX), Terraform
  • experience with Docker containers orchestrating (K8S/OpenShift/Hashicorp)
  • know how to read and analyze errors
  • in-depth knowledge of TCP/IP and ISO/OSI stack
  • experience with monitoring and logging tools (Zabbix, Elasticsearch, or OpenSearch, Grafana, Kibana, Dynatrace, Prometheus, etc.)
  • experience in working with Apache, Nginx, HAproxy, Envoy, etc
  • strong ability to solve problems using code and scripting
  • understanding of ITIL processes and routines
  • Excellent English (written and verbal)

Additional skills considered as an advantage:

  • experience with SQL-like command language
  • experience with Ansible (AWX)
  • knowledge of Java programming language
  • experience with trading/exchange/risk management software usage
  • experience with Atlassian software (JIRA, Confluence, FishEye, etc.)
Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job