Director, Site Reliability Engineering (SRE)
IonQ
Compensation
$192,979 - $252,659/year
Job Description
About IonQ:
IonQ, Inc. [NYSE: IONQ] is the world’s leading quantum company delivering solutions to solve the world’s most complex problems. IonQ’s newest generation quantum computers, IonQ Tempo and IonQ Forte Enterprise, are the latest in cutting-edge systems that have been helping customers and partners such as Amazon Web Services, AstraZeneca, and NVIDIA achieve 20x performance results. The company achieved 99.99% two-qubit gate fidelity, setting a world record in quantum computing performance in 2025.
The company is accelerating its technology roadmap and intends to deliver the world’s most powerful quantum computers with 2 million qubits by 2030 to accelerate innovation in drug discovery, materials science, financial modeling, logistics, cybersecurity, and defense. IonQ’s advancements in quantum networking position the company as a leader in building the quantum internet.
Location: This role is based onsite at our office in Pleasanton, CA.
Travel: Up to 20%
Job ID: 1457
The Role:
We are looking for a Director of SRE to join a cross-functional team whose mission is to lead IonQ on its journey to build the world's best quantum computers to solve the world's most complex problems.
In this role, you will build and lead SRE/DevOps organizations operating multi-tenant SaaS at scale on AWS, Azure, and GCP. You will be responsible for production ownership of availability, latency, incident response, and capacity management while implementing an SRE operating model using SLOs/SLIs and error budgets. Your leadership will bridge the gap between cloud infrastructure architecture and AI-ready operations to ensure a secure-by-default platform for our product teams.
Responsibilities:
- Build and lead SRE/DevOps organizations operating multi-tenant SaaS at scale on AWS/Azure/GCP, including production ownership for availability, latency, incident response, DR, and capacity management.
- Architect cloud infrastructure focusing on networking (VPC/VNet, routing, private connectivity), compute, containers/orchestration, and data platforms.
- Implement SRE operating models using SLOs/SLIs and error budgets to balance reliability and delivery velocity.
- Drive CI/CD and release engineering leadership, ensuring safe progressive delivery (canary/blue-green), automated rollbacks, and measurable deployment health.
- Scale Infrastructure-as-Code (IaC) and platform automation through "golden pipelines," standardized modules, and secure-by-default guardrails.
- Lead cross-functional execution across Product, Engineering, Security, Support, and Customer Success while setting clear ownership boundaries.
- Own organizational planning, including hiring, team topology, on-call models, budget, and vendor strategy.
- Establish a culture of operational excellence through blameless postmortems, corrective-action tracking, and toil reduction.
Requirements:
- At least 15 years of experience building and leading SRE/DevOps organizations operating multi-tenant SaaS at scale on AWS, Azure, or GCP.
- Deep technical knowledge of cloud infrastructure architecture, networking, containers, and secure-by-default platform guardrails.
- Proven ability to run production for global enterprise/federal customer bases, including tenant isolation and data residency considerations.
Preferred Qualifications:
- AI-ready operations experience for networking SaaS, including streaming telemetry pipelines and closed-loop automation.
- Experience with Juniper Mist AI or similar large-scale networking SaaS platforms is strongly preferred.
- Knowledge of AI-native networking concepts such as service-level expectations (SLEs) and proactive anomaly detection.
- Security and resilience mindset aligned to Zero Trust designs and continuous telemetry policy enforcement.
- Hands-on experience operating SaaS products in the networking and security domains.