Site Reliability Engineer - Storage Engineer
GoDaddy
Job Description
Location Details:
At GoDaddy the future of work looks different for each team. Some teams work in the office full-time; others have a hybrid arrangement (they work remotely some days and in the office some days) and some work entirely remotely.
This is a remote position, so you’ll be working remotely from your home. You may occasionally visit a GoDaddy office to meet with your team for events or meetings.
Join Our Team
GoDaddy is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. This role centers on automating and maintaining our storage infrastructure, with an emphasis on Ceph, to ensure the reliability, scalability, and performance of our systems.
What you'll get to do...
- Automate and maintain day-to-day operations of storage systems to support application demands
- Develop and maintain tools and automation scripts to streamline storage operations and improve efficiency
- Monitor system performance, identify issues, and implement solutions to ensure high availability and reliability
- Participate in agile practices such as daily stand-up meetings, task tracking boards, design and code reviews, automated testing, and continuous integration and deployment
- Continuously improve system reliability, performance, and capacity through proactive monitoring, automation, and optimization
Your experience should include...
- 2+ years of professional experience with Ceph in a production environment, including deployment, configuration, and management of Ceph clusters and systems
- 2+ years of experience in site reliability engineering or a similar role
- Experience working on Linux/Unix systems, with a focus on automation and operating at scale
- Proficiency in Python or Bash
- Experience with Ansible, Terraform, or SaltStack
- Experience with Nagios-based monitoring tools, such as Icinga2
- Experience with observability tooling, such as Prometheus, Grafana, Mimir, and Loki
- Solid understanding of core networking concepts and protocols, particularly in relation to Linux/Unix systems
You might also have...
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)
- Exposure to and experience working with compute platforms (e.g., OpenStack, AWS)
- Familiarity with, and ability to contribute to, CI/CD pipelines and automation workflows
We've got your back...
We offer a range of total rewards that may include paid time off, retirement savings (e.g., 401k, pension schemes), bonus/incentive eligibility, equity grants, participation in our employee stock purchase plan, competitive health benefits, and other family-friendly benefits including parental leave. GoDaddy’s benefits vary based on individual role and location and can be reviewed in more detail during the interview process.
We also embrace our diverse culture and offer a range of Employee Resource Groups. Have a side hustle? No problem. We love entrepreneurs! Most importantly, come as you are and make your own way.
We encourage you to apply even if your experience or skillset doesn’t align perfectly with every requirement. We value a wide range of backgrounds and transferable skills, and we are excited to support learning and growth.
About us...