About the Role
As a Power & Resource Efficiency Engineer, you'll sit at the intersection of IT and facilities , building the systems, models, and control loops that optimize how we allocate and consume power, cooling, and physical capacity across our TPU/GPU fleet.
You'll own the technical strategy for turning raw data center capacity into reliable, efficient compute, working across power topology, workload scheduling, and real-time telemetry to push utilization as close to the physical envelope as possible while maintaining our availability commitments.
Responsibilities
- Build models that forecast consumption across electrical and mechanical subsystems, informing capacity planning, energy procurement, oversubscription targets and risks, including statistical modeling of cluster utilization, workload profiles, and failure modes.
- Design IT/OT interfaces that bridge compute orchestration with facility controls, enabling real-time telemetry across accelerator hardware, power distribution, cooling, and schedulers.
- Build and operate load management systems that use power and cooling topology to enable load management and power/thermal-aware placement to maximize throughput while meeting SLOs.
- Partner with data center providers to drive design optimizations and hold them accountable to SLA-grade performance standards, providing technical diligence on partner architectures.
What We're Looking For
- Deep knowledge of data center power distribution and cooling architectures, and how they interact with IT load profiles. Experience with reliability engineering, SLA development, and failure-mode analysis.
- Proficiency in statistical modeling and simulation for infrastructure capacity or power utilization.
- Familiarity with SCADA/BMS/EPMS, telemetry pipelines, and control systems. Experience building software that bridges IT and OT.
- Exposure to accelerator deployments and their power management interfaces strongly preferred.
- Demand response, grid interaction, or behind-the-meter generation experience is a plus.
- Ability to translate between infrastructure engineering, software teams, and external partners.
Required Qualifications
- Bachelor's degree in Electrical Engineering, Mechanical Engineering, Power Systems, Controls Engineering, or a related field.
- 5+ years of experience in data center infrastructure or facility engineering.
- Demonstrated experience with data center power distribution and cooling system architectures.
- Experience building or operating software-based power management, load scheduling, or control systems.
- Proficiency in Python or similar languages for statistical modeling, simulation, or automation of data center infrastructure optimizations.
- Familiarity with SCADA, BMS, EPMS, or industrial control systems and associated protocols (Modbus, BACnet, SNMP).
- Track record of cross-functional collaboration across hardware, software, and facilities teams.
Preferred Qualifications
- Master's or PhD in Controls, Power Systems, or related discipline and 3+ years of experience in data center infrastructure or facility engineering.
- Experience with accelerator-class deployments and their power management interfaces.
- Background in control theory, dynamical systems, or cyber-physical systems design.
- Experience with energy storage, microgrid integration, demand response, or behind-the-meter generation.
- Familiarity with reliability engineering methods.
- Experience with SLA development, availability modeling, or service credit frameworks.
- Exposure to ML/optimization techniques applied to infrastructure or energy systems.
Salary
The annual compensation range for this role is $320,000-$405,000 USD.
Benefits
We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with our team.
XML job scraping automation by YubHub