Staff Site Reliability Engineer

Bugcrowd

Compensation

$151,040 - $188,800/year

Remote - US
Posted March 27, 2026

Job Description

We are Bugcrowd. Since 2012, we’ve been empowering organizations to take back control and stay ahead of threat actors by uniting the collective ingenuity and expertise of our customers and our trusted alliance of elite hackers with our patented data and AI-powered Security Knowledge Platform™. Our network of hackers brings diverse expertise to uncover hidden weaknesses, adapting swiftly to evolving threats, even against zero-day exploits. With unmatched scalability and adaptability, our data and AI-driven CrowdMatch™ technology in our platform finds the perfect talent for your unique fight. We aim to create a new era of modern crowdsourced security that outpaces threat actors. Unleash the ingenuity of the hacker community with Bugcrowd. Visit www.bugcrowd.com. Based in San Francisco and New Hampshire, Bugcrowd is supported by General Catalyst, Rally Ventures, Costanoa Ventures, and others.

Job Summary

We’re seeking a Staff Site Reliability Engineer to serve as a technical leader within our infrastructure organization. In this role, you’ll help shape the reliability strategy across our engineering teams, drive adoption of best practices, and tackle our most complex infrastructure challenges. You’ll be part of an international, highly engaged and technical group that is well-versed in building enterprise-ready and extremely secure software systems. Our core values of “simple is strong, respect is king, build it like you own it and think like a hacker” should resonate with you. 

Essential Duties and Responsibilities

  • Define and drive the technical vision for infrastructure reliability across the organization
  • Architect large-scale, fault-tolerant systems on AWS using Terraform
  • Lead cross-functional initiatives to improve system reliability, scalability, and efficiency
  • Establish standards for infrastructure-as-code, CI/CD, and deployment practices
  • Design and implement solutions for our most complex operational challenges
  • Lead incident response for critical outages and drive systemic improvements
  • Mentor senior engineers and help grow the SRE team’s capabilities
  • Evaluate and introduce new technologies that improve operational excellence
  • Influence engineering culture around reliability, observability, and operational maturity

Education, Experience, Skills, & Abilities

  • 5+ years of experience in SRE, DevOps, or systems engineering, with demonstrated technical leadership
  • Expert-level knowledge of Terraform, including module design, state management, and scaling IaC across teams
  • Deep expertise in AWS architecture and services at scale, with strong focus on ECS
  • Proven experience designing and operating containerized workloads on ECS, including capacity planning, service scaling, and task placement strategies
  • Strong experience designing and implementing CI/CD systems with GitHub Actions or similar tools
  • Track record of leading complex, cross-team technical initiatives
  • Advanced proficiency in Python, Ruby, JavaScript, or similar languages
  • Strong understanding of distributed systems principles
  • Excellent written and verbal communication skills
  • Proven ability to balance long-term technical strategy with immediate operational needs

Preferred Experience

  • Experience building internal developer platforms or self-service infrastructure tooling
  • Knowledge of FedRAMP
  • Background in cost optimization and FinOps practices
  • Contributions to open-source infrastructure projects
  • Experience scaling infrastructure organizations and processes
  • Experience defining and implementing SLO frameworks

Working Conditions

The ideal candidate must be able to complete all physical requirements of the job with or without reasonable accommodation.

Sitting and/or standing - Must be able to remain in a stationary position 50% of the time.

Carrying and/or lifting - Must be able to carry/move a laptop as needed throughout the workday.

Environment - Remote, work-from-home 100% of the time.

ADA Statement

Bugcrow
