xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity.
The Data Center Implementation Manager is responsible for the physical installation, configuration, and provisioning of the IT equipment in the data center. This position will coordinate closely with engineering leads and the construction/build managers. This position is intended to be transient and move to each construction project prior to handing off to IT site-operations once the installation of IT equipment is complete and functioning as designed.
Responsibilities
- Configure, deploy, and debug network solutions within the data center, including routers, switches, firewalls, and other network devices.
- Coordinate the installation of IT racks, network equipment, and connectivity during bring up of new data center build outs.
- Collaborate with architects and other engineers to ensure the network design aligns with data center goals.
- Troubleshoot and resolve connectivity issues, including network latency, packet loss, and hardware failures.
- Conduct regular assessments of network performance and capacity, recommending upgrades as needed.
- Ensure compliance with industry standards, regulations, and best practices related to data center connectivity.
- Coordinate the physical installation and connectivity of large scale network and GPU deployments.
- Maintain accurate documentation of network configurations, changes, and maintenance activities.
- Prepare reports on network performance, issues, and resolutions for management and stakeholders.
- Work closely with IT teams, data center operations, and third-party vendors to support network-related projects and initiatives.
- Provide technical guidance and support to site operations technicians and managers
- Partake in the hiring of site operations technicians for new data center build outs.
- Train new site operations manager and site operation lead technicians at new data centers before handing over operations to the assigned site operations manager.
- Partake in debugging and troubleshooting of GPU servers, compute servers, storage devices, network switches, and layer 1 network hardware.
- Install and configure servers, networking equipment, and other hardware in accordance with design specifications.
- Manage and organize cabling infrastructure to ensure efficient and reliable connectivity.
- Perform testing and validation of all deployed systems to ensure optimal performance and compliance with design specifications.
- Diagnose and resolve hardware, software, and network issues in a timely manner to minimize downtime.
- Work with network engineers, system administrators, and other stakeholders to ensure seamless integration of new deployments with existing infrastructure.
Basic Qualifications
- Minimum of 3-5 years of experience in network engineering, with a focus on data center environments.
Preferred Skills and Experience
- Experience with network design, implementation, and troubleshooting in a data center setting.
- Proficiency with network hardware such as Cisco, Juniper, or Arista.
- Familiarity with data center infrastructure management (DCIM) tools.
- Proficiency in data center hardware installation and configuration.
- Familiarity with data center monitoring tools and software.
- Ability to lift and move heavy equipment as needed.
- Willingness to work outside regular business hours, including nights and weekends, as required by deployment schedules.
- Relevant certifications such as CCNP, CCDP, or similar are preferred.
- Excellent problem-solving and analytical skills.
Additional Requirements
- Comfortable working in an environment requiring exposure to noise
- Available to work evenings and weekends, as the schedule varies depending on site operational needs; flexibility is required