xAI· Data Center· Memphis, TN
Manager, Operations
Classified Tasks (17)
Automate 0%Augment 65%Human-Only 35%
Augment (11)
AI assists, human decides
Own day-to-day and long-term performance of mission-critical data center operations including power generation, power distribution, cooling, mechanical, electrical, and environmental systems.
operational
Drive reliability and efficiency initiatives to achieve continuous 24/7 infrastructure availability.
operational
Ensure seamless 24/7 uptime for the infrastructure powering AI training.
operational
Manage operation, maintenance, monitoring, and optimization of on-site power generation assets, electrical systems, mechanical/HVAC, liquid cooling, power distribution, UPS, generators, and building management systems.
technical
Oversee design, deployment, maintenance, and expansion of high-speed fiber optic networks, dark fiber, and connectivity infrastructure supporting AI compute clusters and data center interconnects.
technical
Own, track, and report key performance metrics including uptime (targeting 99.999%+), MTTD/MTTR, PUE, WUE, power generation efficiency, and overall infrastructure availability.
analytical
Develop and enforce standard operating procedures (SOPs) for facilities and power generation operations.
operational
Implement and maintain preventive maintenance programs for critical infrastructure and power generation assets.
operational
Develop and enforce incident response protocols and continuous improvement processes to minimize downtime and maximize efficiency.
operational
Manage operational budgets for facilities, power generation, and fiber operations.
administrative
Manage spare parts inventory for mission-critical infrastructure.
administrative
Human-Only (6)
Requires human judgment
Lead and scale the facilities operations and power generation teams responsible for reliable operation of hyperscale AI compute facilities.
leadership
Direct fiber teams responsible for high-capacity networking and connectivity that support supercomputing clusters.
leadership
Build and lead high-performing operations, power generation, and fiber teams.
leadership
Build, mentor, and grow multidisciplinary teams of operations technicians, power generation engineers, and controls specialists.
leadership
Partner with engineering, construction, procurement, and AI hardware teams to support new facility builds, expansions, commissioning, power integration, and handovers to operations.
communication
Manage vendor relationships with maintenance contractors, fiber providers, power generation OEMs, and fuel suppliers.
communication
Job description
ABOUT xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. ABOUT THE ROLE: We are seeking an exceptional Manager, Operations to lead facilities operations and power generation for xAI’s hyperscale AI compute facilities. This role will own the day-to-day and long-term performance of mission-critical data center operations, including power generation, power distribution, cooling, mechanical, electrical, and environmental systems, while also directing the fiber teams responsible for high-capacity networking and connectivity that support our supercomputing clusters. You will build and lead high-performing operations, power generation, and fiber teams, drive relentless reliability and efficiency, and ensure seamless 24/7 uptime for the infrastructure powering xAI’s AI training at unprecedented scale. This high-impact position requires deep expertise in data center or hyperscale operations (including power generation), strong leadership in fast-paced environments, and the ability to deliver world-class performance under aggressive growth timelines. This is a full-time, primarily onsite role with significant travel to sites and vendor locations. RESPONSIBILITIES: Lead and scale the facilities operations and power generation teams responsible for the reliable operation, maintenance, monitoring, and optimization of critical infrastructure including on-site power generation assets, electrical systems, mechanical/HVAC, liquid cooling, power distribution, UPS, generators, and building management systems. Direct the fiber teams overseeing the design, deployment, maintenance, and expansion of high-speed fiber optic networks, dark fiber, and connectivity infrastructure supporting AI compute clusters and data center interconnects. Own key performance metrics such as uptime (targeting 99.999%+), mean time to detect/repair (MTTD/MTTR), power usage effectiveness (PUE), water usage effectiveness (WUE), power generation efficiency, and overall infrastructure availability. Develop and enforce standard operating procedures (SOPs), preventive maintenance programs, incident response protocols, and continuous improvement processes for both facilities and power generation assets to minimize downtime and maximize efficiency. Build, mentor, and grow multidisciplinary teams of operations technicians, power generation engineers and controls specialists while fostering a culture of ownership, safety, and excellence. Partner closely with engineering, construction, procurement, and AI hardware teams to support new facility builds, expansions, commissioning, power integration, and smooth handovers from project to operations. Manage operational budgets, vendor relationships (maintenance contractors, fiber providers, power generation OEMs, fuel suppliers), spare parts inven