Operations Engineer

AWL (All Web Leads, Inc.) is one of the most successful customer acquisition marketing companies in the US.  Simply put, we help our customers acquire customers for the US insurance industry.  Our amazing team of nearly 1,000 talented and successful professionals use internet marketing strategies to turn consumer interest in insurance products into policy sales for the world’s largest insurance carriers and more than 30,000 of their agents.   We are Austin-born, and our growth from a 2-person startup in 2005 to a highly profitable business has been a remarkable journey.  We are a tight-knit team with a fast-paced, energetic, and entrepreneurial company culture. We have been ranked an Austin Best Place to Work over 10 times by Austin Business Journal, Austin Statesman, Built-in and Glassdoor!  AWL fosters a vibrant, dynamic work culture built on trust, data, technology, passion, collaboration, and winning, where employees want to engage and be impactful. We provide competitive pay, outstanding benefits, and a fabulous, fun, collaborative environment that allows our people to be their best.

As an Operations Engineer at AWL, you will be responsible for ensuring continuous business operations and system continuity through ownership of our infrastructure, software, and associated systems.   You’ll be part of a highly agile team and will be exposed to a variety of challenges and opportunities, including opportunities in Virtualization, Containerization, Multiple datacenters, and Cloud Computing technologies.  To be successful one must apply proven communication, analytical, and problem-solving skills to help identify, communicate, and resolve issues within a complex system that is highly driven by analytics and metrics with a goal of 24/7/365 up-time.

General Responsibilities:

  • Provide installation/configuration, operation, and maintenance of systems hardware and software and related infrastructure:
    • Bare metal machine physical installation and configuration
    • Windows and Linux OS’.
    • System patching and maintenance
    • Implementation of critical CVE fixes
  • History of writing technical and staff documentation
    • Keeping documentation current
    • Improving documentation based on lessons learned or common problems
  • Ensure the correct design, implementation, and scalability of systems that must operate in a Highly Available manner via assisting staff engineers
    • Couchbase
    • RabbitMQ
    • MySQL
    • Kubernetes
    • Fortinet switches and firewalls
  • Perform daily system monitoring, assist in problem root cause determination across a variety of systems
    • Knowledge or ability to learn Zabbix
    • Attention to detail, ability to document problems, and wiliness to escalate when needed
    • Certificate maintenance
    • System quarterly patching and on-going regular maintenance of systems
  • Implement automated approaches for system administration tasks:
    • Powershell
    • Linux scripting
    • Use of AI automation (any)
    • AWS CLI
  • Manage distributed infrastructure across multiple physical datacenters on a highly virtualized stack running VMWare Enterprise over Kubernetes
  • Manage networking across multiple datacenters, Amazon Web Services, Microsoft Azure/0365, and a fully remote workforce
  • Familiarity with application critical SQL databases running under a 24/7/365 environment.
  • Troubleshooting of a diverse set of systems:
    • Familiarity with collecting logs and data
    • Use of AI to facilitate technical resolution
    • Use of technical services / support structure when appropriate (VMWare, RabbitMQ, Mysql)
  • Provide 24x7 support by participating in an on-call rotation.
  • Participate in a highly collaborative cross functional environment spanning departments.

 

 

Requirements:

  • 2+ years of experience as a systems administrator, network administrator, or Dev/Ops engineer or associated role doing technical administration (Azure/O365) for employees
  • Bachelor’s degree in Information Science, Engineering, or Computer Science or equivalent technical experience.
  • Experience working with both Windows and Linux systems
  • Experience working with application-tier infrastructure / containerization
  • Experience with Enterprise level monitoring tools Zabbix, Grafana, Prometheus, and/or cloud-based monitoring.
  • Experience with automation or scripting to accomplish repetitive tasks.
  • Excellent analytical and creative problem-solving skills.
  • A never give up attitude and a desire to dig in deeper when investigating a problem.

 

Additional Desired Skills:

  • History of using AI to facilitate technical problem resolution
  • Experience administering, supporting, and debugging mission critical database infrastructure (MySQL, MSSQL, or similar database).
  • Experience designing and maintaining Highly Available infrastructure.
  • Certification(s) with Amazon Web Services, Azure, and/or multi-region configuration and deployment.
  • Certification(s) with VMWare.
  • Certification(s) with Fortinet.
  • Windows Network administration integrated with Azure Cloud experience
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Operations Engineer Q&A's
Report this job
Apply for this job