NMI is hiring a

Senior DevOps Infrastructure Engineer (US)

Schaumburg, United States
Full-Time
Remote

What is the role?

NMI is seeking a Senior DevOps Infrastructure Engineer with deep Linux, virtualization, and hardware experience who is passionate about running applications in an extremely high-availability environment within our SRE organization. You will work with similarly skilled professionals in a rapidly growing environment, leveling up your observability and automation skills while maintaining a mission-critical, four-nines availability platform and participating in environment modernization.

The SRE team is responsible for the operation of all hardware and software within the production and SDLC environments. This consists of a global network connecting numerous sites, which must be highly available 24x7 with a minimum availability target of 99.99%. As a Senior DevOps Infrastructure Engineer, the successful applicant will be a core member of the SRE team, with the opportunity to work with experts in the infrastructure, networking, and DevOps space.
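
For a sense of scale, a 99.99% target leaves roughly 52.6 minutes of downtime per year, and the 99.95% floor named in the requirements leaves about 262.8 minutes. The short Python sketch below shows that arithmetic. It assumes a 365-day year and that every minute of downtime counts against the budget; the function name is illustrative and does not refer to any NMI tooling.

```python
# Illustrative arithmetic only: downtime budget implied by an availability target.
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes; leap years ignored (assumption)


def downtime_budget_minutes(availability_pct: float,
                            period_minutes: float = MINUTES_PER_YEAR) -> float:
    """Return the maximum downtime, in minutes, allowed over the period."""
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability_pct must be between 0 and 100")
    return period_minutes * (1 - availability_pct / 100)


if __name__ == "__main__":
    # The two availability figures mentioned in this posting.
    for target in (99.95, 99.99):
        minutes = downtime_budget_minutes(target)
        print(f"{target}% -> {minutes:.1f} minutes of downtime per year")
```

Passing a shorter period (for example, the minutes in a 30-day month) gives the budget for that window instead.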

The Ideal Candidate:

  • Has a track record of implementing low-toil solutions to traditionally high-touch operational or administrative tasks.
  • Has a deep technical background and can engage with engineers on the nuances of complex systems, while also being able to zoom out and see the bigger picture.
  • Has a high level of competency implementing hardware projects in data center environments (server & storage installation, troubleshooting, decommissioning).
  • Enjoys being challenged to find creative solutions using both legacy and cutting-edge technology. This is code for: we have a legacy system that has to be maintained and improved while we also look at new technology and tools to improve resiliency, performance, ease of administration, and observability. It’s not all “the fun stuff”.
  • Wants to work with a globally distributed team of similarly skilled professionals, and is comfortable building relationships with teammates up to thousands of miles away.
  • Is as comfortable in a shell or Vim as an accountant is in QuickBooks.
  • Refuses to believe a service or appliance is production ready until they have the metrics and alerts to prove it.

Key duties:

  • Administration - Participate in maintenance and operations of our production environment, including patching, deployment, server administration, and troubleshooting, either using configuration-as-code tooling or manually.
  • Reliability & Performance - Ensure reliability, availability, and performance of services. Respond to incidents and resolve them before they affect customers.
  • Projects - Deliver complex solutions that traverse all layers of the technology stack: Operating System, Virtualization, Network, Storage & Cloud.
  • Data Center - Participate in and coordinate on-site deployments of critical hardware, including servers and storage.
  • Collaboration - Work closely with teammates and with software and security teams to rapidly meet customer, business, and compliance needs.
  • Automation - Drive the automation of operational tasks, and ensure our infrastructure is more like cattle than pets.
  • Observability - Develop and maintain internal and commercial or OSS tools to improve system health, performance, and deployment.
  • Continuous Improvement - Drive never-ending improvement in SRE processes, tools, and methodologies.  Take a leading role in blameless post-mortems to avoid repeat issues or mistakes and clearly document all lessons learned for others.  If you love writing actionable documentation, we’d love to set up an interview.
  • On-Call - Participate in a rotating 24x7 on-call schedule with your team to ensure availability of services across the production environment.

This is a fully remote role (work anywhere in the US); however, if you live within a reasonable commutable distance, we’d love to see you in the office from time to time!  Periodic travel (typically 1-4 times a year) will be required to company colocation facilities, at company expense.

Requirements

Essential Skills & Experience:

  • 5+ years of experience in Site Reliability Engineering, DevOps, System Administration, or similar roles.
  • Deep experience working in colocation facilities – we have a hybrid footprint, and if you have only worked in the public cloud space, this role is not a great fit for you.
  • Experience using Puppet, Ansible, or other common configuration-as-code tooling to deploy and configure systems.
  • Strong familiarity with Linux systems (any distro is fine, but we have a preference for RHEL downstreams).
  • Experience using Proxmox, VMware, or KVM as virtualization platforms for large-scale production environments.
  • Experience administering enterprise-grade SANs and load balancers is necessary to be successful in this role.
  • Demonstrated proficiency in one or more scripting or programming languages (e.g., Python, Go, Bash/Zsh).
  • Multiple years of experience proactively implementing and responding to infrastructure, application, and network alerts using industry-standard or homebrew toolchains.
  • Strong problem-solving skills and experience working in extremely high-availability production environments (99.95% or greater) with high performance requirements.

Preferred Skills and Experience:

  • Experience with F5 BIG-IP LTMs or NetApp SANs is highly desirable.
  • Experience using Grafana, Prometheus, and the ELK stack for observability is highly desirable.
  • Experience with MySQL (any engine variant) will be extremely helpful in this role.
  • Kubernetes experience is a significant plus, as is a burning desire to learn it.
  • Experience working with SaaS-based WAF/DDoS protection services such as Silverline, Cloudflare, or Akamai is preferred.
  • Prior experience on a team following common agile processes such as Kanban or Scrum would be valuable.
  • Experience in the start-up to scale-up space will be very valuable.  We are not a calcified, enormous enterprise, and move quickly.
  • GitLab experience is a plus.

Benefits

As well as being a part of something exciting every day, you will also receive the following benefits:

  • Annual base salary of $120,000 - $155,000 plus bonus
  • Health, Dental and Vision Insurance
  • Life, AD&D, Short-term and Long-term Disability insurance
  • 401(k) matching up to 4% after two months of service
  • Flexible Spending Account/Dependent Care/Transit and Commuting Account
  • Flexible PTO and Sick time
  • Personal growth and advancement opportunities
  • A remote first culture!

Equal Opportunity

NMI is committed to providing equal employment opportunity for all persons regardless of race, color, religion, sex, age, marital status, national origin, sexual orientation or sexual identity, genetic information, citizen status (except those that do not have the legal right to be employed in the United States), disability, military service, service member, veteran status, or any other basis protected by applicable law.

Applicants must be authorized to work in the United States. As part of the selection process this role may require an assessment and professional references to determine suitability. An offer will be subject to financial and criminal background checks.

About us

We enable our partners with choice, and challenge the one-size-fits-all approach to payments. You've probably used NMI in the last 24 hours without even realizing it. We’re the platform that powers success for innovative tech created by SMBs, entrepreneurs and fintech startups. We’re creative problem solvers who help visionaries smash through boundaries and think beyond what’s possible so they can think about what’s next. But we’re not just built for the tech savvy. We democratize the latest payments technology so that everyone can realize the benefits of easy payments across the full spectrum of commerce. We’re all about enabling more payments in more ways and more places.

We believe that having a diverse group of employees strengthens both our work and our workplace. We’re focused on making NMI more diverse and welcoming with initiatives like having a dedicated Diversity, Equity & Inclusion action group, diversity goals for hiring, anonymized resume screening, affinity groups such as our Women's network and LGBTQ+ Network, open forums for discussions on diversity and social justice, and measuring inclusion and belonging as part of our regular employee engagement surveys.

This job is no longer available
