Lead Observability Engineer
Confirmed live in the last 24 hours
Kobie Marketing
Job Description
About the Team and What We’ll Build Together
You are a Lead Observability Engineer who will drive the strategy, adoption, and evolution of observability across all production and delivery environments. You will play a critical role in ensuring system reliability, performance visibility, and proactive issue resolution across our platforms.
You will operate at the intersection of Engineering, DevOps, and Production Support, bringing structure, standardization, and intelligence to how we monitor and manage systems. You will lead the shift from reactive operations to proactive, AI-driven observability and automated reliability.
In this role, you will:
- Own and evolve the observability platform (e.g., New Relic) to provide end-to-end visibility across applications and infrastructure
- Establish standards for monitoring, alerting, dashboards, and telemetry (logs, metrics, traces)
- Leverage AIOps capabilities to improve anomaly detection, reduce noise, and accelerate root cause analysis
- Drive automation and self-healing workflows to minimize manual intervention and improve system resilience
- Collaborate across teams to ensure systems are observable by design and aligned with reliability goals
- Continuously analyze system behavior and incident patterns to improve performance, scalability, and uptime
You will be part of a team focused on building a highly reliable, data-driven, and scalable operational ecosystem, where observability is a core foundation for engineering excellence.
Similar Jobs
LaunchDarkly
Full Stack Engineer - Observability
Cloudflare
Software Engineer, Workers Observability
Ivalua
Senior Software Engineer - Observability & SRE (H/F)
LangChain
FullStack Engineer, AI Observability & Evals Platform (LangSmith)
LangChain