Staff Site Reliability Engineer, Security - GCP

Okta

Bengaluru, India
On-site
Posted March 23, 2026

Job Description

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Okta’s Workforce Identity Cloud Security Engineering group is looking for an experienced and passionate Staff Site Reliability Engineer to join a team focused on designing and developing security solutions to harden our cloud infrastructure. We embrace innovation and pave the way to transform bright ideas into excellent security solutions that help run large-scale, critical infrastructure. We encourage you to prescribe defense-in-depth measures, apply industry security standards, and enforce the principle of least privilege to help take our security posture to the next level. Our Infrastructure Security team has a niche skill set that balances security domain expertise with the ability to design, implement, and roll out infrastructure across multiple cloud environments without adding friction to product functionality or performance. We are responsible for meeting the ever-growing need to improve our customers' safety and privacy by providing security services that are coupled with the core Okta product.

This is a high-impact role in a security-centric, fast-paced organization that is poised for massive growth and success. You will act as a liaison between the Security org and the Engineering org to build technical leverage and influence the security roadmap. You will focus on engineering security aspects of the systems used across our services. Join us and be part of a company that is about to change the cloud computing landscape forever.

As a Staff Engineer, you should be able to identify gaps, propose innovative solutions, and contribute to roadmaps while driving alignment across multiple teams within the organization. Additionally, you should serve as a role model, providing technical mentorship to junior team members and fostering a culture of learning and growth.

What are we looking for?
We are looking for a security-first SRE who doesn't just "flag" issues but builds the automation to solve them. You should have a deep-seated intuition for cloud-native security and a proven track record of hardening large-scale GCP and AWS environments. As a technical SME, you will design and build production infrastructure with a "security-at-scale" mindset.

What will you work on?
Security Evangelism: Lead initiatives to strengthen our security posture for critical infrastructure and promote best practices across the engineering organization.
Incident Response & Reliability: Respond to production security incidents, perform root cause analysis, and build automated preventions to ensure high performance and reliability.
Automated Hardening: Identify manual security processes and automate them using custom tooling and CI/CD integrations.
Architecture & Documentation: Develop technical documentation, runbooks, and procedures for a 24x7 online environment.
Platform Evolution: Continuously evolve our monitoring platforms, moving from simple auditing to active, automated prevention.

Minimum Required Knowledge, Skills, & Abilities:
Experience: 8+ years of experience architecting and running complex cloud networking and infrastructure, with at least 7 years specialized in DevSecOps or Cloud Security.
GCP Expertise: Minimum of 3 years of deep, hands-on experience securing GCP (GKE, GCE, Shared VPC, etc.).
Infrastructure as Code (IaC): 10+ years of experience using Terraform and Chef to manage complex cloud resources and OS hardening.
Automation Mastery: Expert-level proficiency in Go, Python, or Ruby for building custom security tooling and automated remediation.
Hardened Containers: Proven track record of securing containerized workloads, including image scanning, K8s RBAC, and runtime security tools (e.g., CrowdStrike Falcon, Falco, or gVisor).
Unflappable Troubleshooting: A "see a problem, fix the problem" mindset with the abil