Command Center Technician
Confirmed live in the last 24 hours
CoreWeave
Job Description
What You’ll Do:
The Global Data Center Operations team serves as the backbone of our infrastructure, ensuring the seamless performance of our global hyperscale environment. Operating in a high-stakes, 24/7 setting, this team is responsible for safeguarding the availability, stability, and reliability of mission-critical systems that power our most essential services.
About the role:
As a Command Center Technician, you will serve as the front-line mission control for our global data center fleet. In a 24/7 operations environment, you will be responsible for real-time monitoring, coordination, and incident response across critical electrical, mechanical, and environmental systems. Acting as the central point of visibility, you will identify anomalies, coordinate cross-functional response efforts, and drive rapid resolution to ensure maximum uptime, operational continuity, and safety across our infrastructure.
- Provide continuous 24/7 monitoring of global data center infrastructure systems using BMS, EPMS, DCIM, and other monitoring platforms.
- Monitor critical assets including UPS, generators, switchgear, chillers, and fire suppression systems.
- Serve as the first responder for infrastructure alarms, triaging incidents and initiating response actions per SOPs, MOPs, and EOPs.
- Escalate incidents promptly to on-site operations, engineering teams, and leadership based on defined escalation matrices.
- Coordinate incident response activities across multiple teams to minimize risk to production environments.
- Support root cause analysis (RCA) efforts by providing detailed timelines, logs, and incident documentation.
- Act as a central communication hub, providing clear and accurate status updates during incidents and maintenance events.
- Enforce adherence to change management processes and safety requirements within mission-critical environments.
- Maintain detailed event logs and participate in structured shift handovers to ensure operational continuity.
- Make timely, high-quality decisions in high-pressure situations while maintaining a strong focus on safety, uptime, and operational excellence.
Who You Are:
- 2+ years of experience working in mission-critical environments (data centers, utilities, or industrial operations).
- Foundational technical knowledge of data center electrical and mechanical infrastructure systems.
- Proven experience following and executing SOPs, MOPs, and EOPs in high-availability environments.
- Proficiency with monitoring tools, ticketing systems, and operational dashboards.
- Experience in incident management, technical troubleshooting, and structured escalation.
- Ability to work rotating shifts, including nights, weekends, and holidays, in a 24/7 operations environment.
- Availability to work a flexible schedule within a 24/7 environment.
- Excellent time management, organizational, and communication skills.
- Must be able to prioritize tasks and react quickly to issues.
- Work is primarily performed in a climate-controlled data center.
Preferred:
- 4+ years of experience working in mission-critical environments
- Prior experience in a h
Similar Jobs
Scale AI
ML Research Engineer, ML Systems
Scale AI
Manager, Machine Learning Research Scientist, GenAI
Scale AI
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AI
Machine Learning Research Scientist / Research Engineer, Post-Training
Scale AI
Machine Learning Research Scientist / Engineer, Reasoning
Scale AI