Senior Site Reliability Engineer - Azure
Hashgraph (Hedera)
Job Description
About Hashgraph:
Hashgraph is a fast-growing software company committed to supporting, developing and servicing Hedera, an open-source, proof-of-stake platform. Hedera is EVM-compatible and has been specifically built to meet the needs of enterprise and web3 applications, which require speed, security, stability and sustainability. Hedera’s public network is governed by industry-leading organizations spanning 11 sectors and 14 regions, which oversee the development and direction of the decentralized platform.
The role:
We are hiring a Senior Site Reliability Engineer (Azure) to build and scale the Azure infrastructure foundation for HashSphere, a new private DLT network harnessing Hedera's institutional-grade technology, being built by a passionate team of industry leaders. This role exists to ensure that our platform can operate as a secure, scalable, and production-ready system in Azure, supporting complex enterprise use cases and high reliability expectations.
The impact you'll have:
In this role, you will:
- Design and build secure, scalable Azure infrastructure from first principles for a production-grade distributed system
- Develop and own Terraform-based infrastructure as code, enabling repeatable and automated deployments
- Translate product and customer requirements into technical architecture and execution plans
- Build and enhance platform services, APIs, and integrations that extend HashSphere capabilities
- Partner across engineering, security, and product teams to deliver enterprise-ready infrastructure solutions
- Contribute to operational excellence, including reliability, observability, and incident response
- Support customer deployments and production environments through Tier 2 infrastructure support
What success looks like in 6-12 months:
- Azure is a production-ready deployment environment for HashSphere
- Customer deployments are repeatable, scalable, and secure
- Azure achieves feature parity with other supported cloud environments
What you bring:
Core capabilities:
- Proven experience designing and building production-grade systems on Azure
- Ability to turn ambiguous requirements into structured technical solutions, and those solutions into delivered systems
- Strong technical communication skills with both engineering and non-technical stakeholders
- High ownership mindset with a bias for action and accountability
- Collaborative approach with a focus on building durable, scalable solutions
Functional expertise:
- Azure cloud services (networking, compute, identity, security, storage)
- Terraform (infrastructure as code at production scale)
- Programming experience in Go and/or Python
- Experience building greenfield infrastructure environments
- Distributed systems, high-availability architectures, or platform engineering
- CI/CD and automation tooling for infrastructure lifecycle management
Nice to haves:
- Kubernetes and container orchestration
- Observability tooling (Prometheus, Grafana)
- Workflow/orchestration platforms (Argo, Spacelift, or similar)