Site Reliability Engineer

TLDR

Implement and manage corporate systems while maintaining infrastructures for cloud environments and automating solutions to enhance reliability and performance.

Build, Deploy, and Maintain AI for an Unpredictable World

Striveworks helps organizations harness the power of artificial intelligence to solve real-world national security and business challenges by serving as the command center between data, models, and business outcomes. Founded by data scientists and engineers, Striveworks set out to make the journey from deployment to ongoing optimization simple and effective.

With Striveworks, organizations aren’t just deploying AI—they’re building systems that remain reliable, adaptable, and ready to scale in an unpredictable world. Mission-critical operations require models that perform where they’re deployed, scale as workloads grow, and adapt rapidly as AI capabilities advance. Striveworks meets these demands, increasing reliability and performance while lowering costs—and enabling confident, data-driven decision-making in dynamic environments.

The Role

As a Site Reliability Engineer at Striveworks, you’ll be challenged—and trusted—on day one to implement and manage all corporate systems. You’ll be exposed to, and gain proficiency with, a wide array of systems and infrastructure automation tools, and you will be given the opportunity to build and/or incorporate additional tools. You’ll be called on to develop solutions that prevent problems from reoccurring in the future, instead of simply mitigating the issue for today. You’ll be highly encouraged to automate solutions to reduce or eliminate “toil.”

Your day-to-day will include:

  • Maintaining and developing infrastructure (as code) within both private (OpenStack) and commercial (AWS, Azure, GCP) cloud environments
  • Maintaining and developing configuration management automation for Windows laptops and Linux servers
  • Providing user support for all corporate systems

This position offers a hybrid/on-site environment at our office in northwest Austin, TX. 

The Right Fit

In addition to the specific skills and expertise detailed below, we are looking for individuals who share our values. Sharing a set of values allows us to move at the speed of trust. 

Collectively, we value a high-trust work environment where people respect each other and use candor kindly and constructively. We value work that intersects passion and perseverance, we geek out about the potential of our contributions, and we find joy in working hard on things that matter. Finally, we value taking ownership, having agency, and feeling individual responsibility for collective results.

Here’s what we’re looking for:

  • 4+ years of experience in any IT-related field
  • Experience deploying infrastructure in a cloud environment such as AWS, Azure, GCP, or OpenStack
  • Experience with virtualization and/or containerization solutions (e.g., OpenStack, Kubernetes, Docker, VMware, KVM, or Hyper-V)
  • Experience with Ansible or another configuration management solution (e.g., Chef, Puppet, or Salt)
  • Programming experience in Python or other programming/scripting languages (e.g., Bash, PowerShell, Go, Java, or JavaScript)
  • Due to the nature of this role, candidates must be a US person (a US citizen, a US national, or a Green Card holder)

The Wish List

We’re very interested in candidates who possess the above qualifications, and we appreciate and consider the addition of:

  • Experience with automation and infrastructure as code, DevSecOps, CI/CD pipelines, or automated security scanning (Windows and Linux)
  • Understanding of US federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST SP 800-171, NIST SP 800-53, NIST RMF, and CMMC
  • Experience with network technologies (e.g., VLANs, switches, routers, firewalls, and VPN)
  • Experience working with GPUs for compute workloads
  • Experience maintaining distributed/clustered systems

The anticipated base pay range for this position is $110,000–$128,000/year. Striveworks’ total compensation package includes a competitive base salary, equity grants, and cash bonuses.

The Benefits

  • Medical/dental/vision insurance
  • Voluntary life, long-term disability, accident, and hospital indemnity insurance
  • HSA and FSA (including dependent care FSA) plans 
  • 401(k) plan
  • Unlimited PTO
  • Paid parental leave

Check us out on Built In!

Striveworks is an Equal Opportunity Employer and does not discriminate in employment on the basis of race, color, religion, belief, sex (including pregnancy and gender identity or expression), national origin, social or ethnic origin, political affiliation, sexual orientation, marital status, disability, genetic information, age, membership in an employee organization, retaliation, parental status, military service, or other non-merit factors. Striveworks will not tolerate discrimination or harassment of any kind.

If you require assistance or a reasonable accommodation in the application process, please contact Operations at [email protected].

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete an employment eligibility verification form upon hire.

Striveworks is a participating employer in the E-Verify program.

Benefits

Health Insurance

Medical/dental/vision insurance

Paid Parental Leave

Paid Time Off

Unlimited PTO

Striveworks builds an advanced MLOps platform that accelerates the machine learning lifecycle, simplifying model development and governance. Our services cater to Fortune 500 companies and public sector leaders, enabling them to effectively utilize AI for tackling national security and complex business challenges.

View all jobs
Salary
$110,000 – $128,000 per year
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job