Site Reliability Engineer - Storage Engineer

Confirmed live in the last 24 hours

Godaddy

Canada

Hybrid

Posted April 3, 2026

Job Description

Location Details:

At GoDaddy the future of work looks different for each team. Some teams work in the office full-time; others have a hybrid arrangement (they work remotely some days and in the office some days) and some work entirely remotely.

This is a remote position, so you’ll be working remotely from your home. You may occasionally visit a GoDaddy office to meet with your team for events or meetings.

Join Our Team

GoDaddy is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. This role will focus on automating and maintaining our storage infrastructure with a focus on Ceph, ensuring the reliability, scalability, and performance of our systems.

What you'll get to do...

Automate and maintain day-to-day operations of storage systems to support application demands
Develop and maintain tools and automation scripts to streamline storage operations and improve efficiency
Monitor system performance, identify issues, and implement solutions to ensure high availability and reliability
Participate in agile concepts such as daily stand-up meetings, task tracking boards, design and code reviews, automated testing, continuous integration, and deployment
Continuously improve system reliability, performance, and capacity through proactive monitoring, automation, and optimization

Your experience should include...

2+ years of professional experience with Ceph, working in a production environment
2+ years of experience in site reliability engineering or a similar role
2+ years of professional experience with Ceph, including deployment, configuration, and management of Ceph clusters and systems
Experience working on Linux/Unix systems, with a focus on automation and operating at scale
Proficiency in Python or Bash
Experience with Ansible, Terraform, or SaltStack
Experience with Nagios-based monitoring tools, such as Icinga2
Experience with observability tooling, such as Prometheus, Grafana, Mimir, and Loki
Solid understanding of core networking concepts and protocols, particularly in relation to Linux/Unix systems

You might also have...

Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)
Exposure to and experience working with compute platforms (e.g., OpenStack, AWS)
Familiarity with ability to contribute to CI/CD pipelines and automation workflows

We've got your back...  We offer a range of total rewards that may include paid time off, retirement savings (e.g., 401k, pension schemes), bonus/incentive eligibility, equity grants, participation in our employee stock purchase plan, competitive health benefits, and other family-friendly benefits including parental leave. GoDaddy’s benefits vary based on individual role and location and can be reviewed in more detail during the interview process.

We also embrace our diverse culture and offer a range of Employee Resource Groups (Culture). Have a side hustle? No problem. We love entrepreneurs! Most importantly, come as you are and make your own way.

We encourage you to apply even if your experience or skillset doesn’t align perfectly with every requirement. We value a wide range of backgrounds and transferable skills, and we are excited to support learning and growth.

About us...&nbs

pythongoawskubernetesdockeraiiosdataproductdesign

Similar Jobs

Anduril Industries
Senior Site Reliability Engineer
SeniorWashington, District...
GHX (Global Healthcare Exchange)
Sr Site Reliability Engineer
SeniorHyderabad, Telangana...
Carta
Senior Site Reliability Engineer
SeniorSan Francisco, Calif...
Archer Aviation
Senior Site Reliability Engineer (SRE)
SeniorSan Jose, California...
Godaddy
Senior Site Reliability Engineer - Database Services
SeniorUnited Kingdom
Dropbox
Site Reliability Engineer
Mid-LevelRemote - Mexico