About the role
Company
Cox Automotive - USA
Compensation
Compensation includes a base salary in the range of $163,400.00 - $272,300.00. The base salary may vary within the anticipated base pay range based on factors such as the ultimate location of the position and the selected candidate’s knowledge, skills, and abilities. Position may be eligible for additional compensation that may include an incentive program.
Job Description
SENIOR LEAD SITE RELIABILITY & SYSTEMS ENGINEER
Platform Engineering | Infrastructure, Reliability & Systems Architecture
ABOUT THE ROLE
We are seeking a Senior Lead Site Reliability & Systems Engineer: a versatile technical leader who combines deep SRE expertise with broad systems engineering capability. In this hybrid role, you will drive platform reliability, operational excellence, and systems architecture across our infrastructure, ensuring our products are scalable, resilient, and delivered with high velocity. You will partner with engineering, product, and operations teams to embed reliability and sound systems design at every layer of the stack.
KEY RESPONSIBILITIES
Reliability Engineering & Incident Management
Define and drive the SRE strategy, roadmap, and standards across engineering teams
Establish and enforce SLOs, SLIs, and error budgets across all production services
Own the incident management lifecycle — detection, response, resolution, and prevention
Lead blameless postmortems and translate findings into lasting systemic improvements
Manage on-call rotations and aggressively reduce toil through automation
Systems Architecture & Design
Lead the design and evolution of large-scale, distributed systems and platform infrastructure
Define technical standards, architectural patterns, and engineering best practices org-wide
Evaluate and recommend technologies and tooling aligned to business and reliability requirements
Conduct architecture reviews and provide guidance on complex technical trade-offs
Lead capacity planning, performance engineering, and infrastructure scaling strategies
Platform & Infrastructure
Build and maintain highly available, fault-tolerant infrastructure on cloud platforms (AWS/GCP/Azure)
Drive infrastructure-as-code adoption (Terraform) and enforce best practices
Architect and implement observability platforms — metrics, logging, tracing, and alerting
Build and improve CI/CD pipelines, deployment automation, and release engineering workflows
Lead chaos engineering and game day exercises to validate system resilience
Champion automation across provisioning, testing, deployment, and monitoring workflows
Leadership, Mentorship & Collaboration
Mentor and grow a team of SREs, platform engineers, and systems engineers
Partner with DevOps, security, and product teams to align on shared platform goals
Serve as the technical escalation point for critical infrastructure incidents and outages
Communicate complex technical concepts clearly to non-technical stakeholders and leadership
Contribute to build vs. buy evaluations and drive strategic vendor assessments
REQUIRED QUALIFICATIONS
8+ years of experience in SRE, systems engineering, platform engineering, or DevOps roles
3+ years in a senior or lead capacity with ownership of large-scale, distributed systems
Deep expertise in at least one major cloud provider — AWS preferred
Strong proficiency in Python, Go, Bash, Java, or C++
Hands-on experience with Kubernetes, container orchestration, and service mesh technologies
Solid understanding of Linux/Unix internals and networking (TCP/IP, DNS, TLS/SSL, load balancing)
Proficiency with observability tooling: Datadog, Prometheus/Grafana, Splunk, or equivalent
Proven track record defining and operating against SLOs and error budgets
Experience with infrastructure-as-code tools — Terraform required
Strong understanding of distributed systems design, security fundamentals, and data governance
PREFERRED QUALIFICATIONS
Experience with service mesh (Istio, Linkerd) and API gateways (Kong, Apigee)
Background in systems integration across enterprise middleware, ERP, or CRM platforms
Familiarity with FinOps practices and cloud cost optimization
Experience in regulated industries: financial services, automotive, healthcare, or government
Familiarity with compliance frameworks: SOC 2, ISO 27001, or NIST
Track record of leading migrations — legacy-to-cloud or monolith-to-microservices
Relevant certifications: AWS Solutions Architect, CKA/CKAD, GCP Professional, or Red Hat RHCA
WHAT WE OFFER
Compensation & Benefits
Competitive base salary + annual bonus
Comprehensive health, dental, and vision coverage
401(k) with company match
Generous PTO and paid parental leave
Culture & Growth
Flexible hybrid work model
Learning & development budget (conferences, certs, courses)
Engineering-first culture with direct product impact
Collaborative teams and transparent leadership
Aplyr's read
Cox Enterprises is a telecommunications giant known for its diverse service offerings and commitment to innovation, attracting talent in technology and operational roles.
What's promising
- Cox Enterprises is a leader in telecommunications, offering stable career opportunities.
- The company invests in technology, providing roles like Entry Level Software Engineer.
- Cox offers competitive hiring bonuses for mechanic positions, enhancing job appeal.
What to watch
- Limited public information about company culture may deter potential applicants.
- Because Cox is privately owned, it discloses less financial information than public companies do.
- The focus on operational roles may limit opportunities for creative professionals.
Why Cox Enterprises
- Cox Enterprises is a privately owned leader in U.S. telecommunications.
- The company offers substantial hiring bonuses for technical roles, like mechanics.
- Cox's diverse service offerings include digital cable, telecommunications, and home automation.
Aplyr’s read is generated by AI from public sources.
About Cox Enterprises
Cox Communications is a privately owned subsidiary of Cox Enterprises, providing digital cable television, telecommunications, and home automation services in the United States.