Senior Infrastructure Engineer - InfraOps
Confirmed live in the last 24 hours
BitGo
Job Description
BitGo is the leading infrastructure provider of digital asset solutions, delivering custody, wallets, staking, trading, financing, and settlement services from regulated cold storage. Since our founding in 2013, we have focused on enabling our clients to securely navigate the digital asset space. With a global presence and multiple Trust companies, BitGo serves thousands of institutions, including many of the industry's top brands, exchanges, and platforms, and millions of retail investors worldwide. As the operational backbone of the digital economy, BitGo handles a significant portion of Bitcoin network transactions and is the largest independent digital asset custodian, and staking provider, in the world. For more information, visit www.bitgo.com.
BitGo is seeking a highly experienced DevOps/SRE Engineer to lead and architect our highly available digital asset infrastructure on Kubernetes. This pivotal role demands a proven leader capable of driving strategic initiatives, guaranteeing robust performance, and ensuring unparalleled reliability across our global operations. The successful candidate will proactively define and implement advanced monitoring and security frameworks, significantly enhancing network integrity, optimizing operational efficiency, and delivering a stable, cost-efficient, and highly scalable platform that empowers our developers and solidifies user trust. This position blends deep expertise in both web2 and web3 technologies, directly contributing to the security and scalability of over $100 billion in digital assets and shaping the future of our infrastructure.
This role is on-site in Palo Alto (CA, US) or San Francisco (CA, US) and requires participation in a 24/7 on-call rotation, including weekend coverage.
Responsibilities:
- Architect, design, and champion the adoption of cutting-edge Infrastructure as Code (IaC) tooling and automation solutions across the organization, setting best practices and driving innovation.
- Lead cross-functional collaborations with engineering and business teams to proactively identify and address complex infrastructure requirements, ensuring the delivery of highly scalable, resilient, and performant solutions that align with strategic business objectives.
- Evaluate, integrate, and strategically deploy advanced open-source and commercial tools to significantly enhance our security posture, infrastructure capabilities, and consistently meet evolving business demands.
Drive cost optimization initiatives across cloud infrastructure including capacity planning, resource right-sizing, and reserved instance strategies.
- Define, own, and execute the technical roadmaps for critical system components, ensuring seamless alignment with organizational strategic objectives and long-term vision.
- Drive operational excellence, reliability, and performance of critical client and internal systems through proactive project leadership, sophisticated incident response, and mentorship within on-call rotations.
Required:
- Extensive and demonstrable experience securing, scaling, and operating multiple complex environments on Kubernetes, coupled with deep expertise in associated tooling (ArgoCD, GitOps, Grafana) and advanced Terraform implementations.
- Strong Linux systems administration skills, including performance tuning, troubleshooting, and security hardening.
- Deep understanding of networking fundamentals: VPCs, security groups, load balancers, CNI plugins, and network policy management.
- Proficiency in at least one high-level programming language, preferably Go, with strong bash scripting capabilities.
- Deep operational expertise with relational and NoSQL databases (including advanced connection maintenance, intricate slow query analysis,index management) as well as large-scale object storage solutions.
- Proven track record with Github Actions and architecting robust CI/CD pipelines.
Similar Jobs
Becton Dickinson
Senior Principal Engineer – Software Development (Medical Devices)
Johnson & Johnson
Staff Engineer, Digital Transformation – MedTech R&D
Apple
Software Systems Engineer - Health Software Team
GE HealthCare
Lead Clinical Applications Engineer
Danaher
Principal Software Engineer (Medical Devices)
WHOOP