Back to Search
Overview
Mid-Level

Cloud Software Engineer - Observability Platform

Confirmed live in the last 24 hours

Clickhouse

Clickhouse

Compensation

$141,000 - $230,000/year

United States (remote)
Hybrid
Posted March 27, 2026

Job Description

About ClickHouse

Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.

The company’s sustained, accelerating momentum was recently validated by a $400M Series D financing round. Over the past three months, customers including Capital One, Lovable, Decagon, Polymarket, and Airwallex have adopted the platform or expanded existing deployments. These customers join an established base of AI innovators and global brands such as Meta, Cursor, Sony, and Tesla.

We’re on a mission to transform how companies use data. Come be a part of our journey!

ClickHouse is looking for an experienced engineer to join our Observability team. We build and operate the telemetry platform that powers both internal monitoring and the observability features our customers rely on. Our systems ingest trillions of events per day with sustained throughput in the tens of millions per second. Engineers on the team are hybrid software, systems, and infrastructure engineers who ensure this platform is reliable, scalable, and efficient. We work closely with product and infrastructure teams and play a key role in major engineering initiatives across the company.

We're looking for someone who thrives in fast-paced environments, isn't afraid to get hands-on during incidents, and knows when to automate the pain away. While experience in roles like Software Engineer, SRE, Systems Engineer, or DevOps is valuable, we care most about your problem-solving skills and mindset. If you enjoy tackling complex challenges across system design, infrastructure, automation, and incident response—while helping us scale with confidence—you’ll fit right in.

What you’ll do

  • Design, build, and operate distributed systems that power observability across ClickHouse Cloud
  • Own reliability, performance, and cost-efficiency of our telemetry pipeline and storage systems
  • Take part in the on-call rotation and help drive root-cause resolution and long-term fixes
  • Build tooling and automation to eliminate repetitive operational work
  • Help shape the roadmap for observability by identifying bottlenecks and scaling challenges
  • Collaborate with other engineering teams to improve their observability posture
  • Contribute to design discussions, architecture reviews, and mentor teammates

What we’re looking for

  • Strong bias for action and ownership — you ship, fix, and improve systems proactively
  • Great production debugging skills and a problem-solving mindset
  • Strong communication skills; comfortable working in a remote, async-friendly team
  • Experience balancing system performance, reliability, and cost
  • Ability to iterate quickly: build MVPs, collect feedback, and improve continuously

Requirements

  • 5+ years building and running production systems at scale
  • Proficiency in Golang
  • Experience with Kubernetes, Helm, ArgoCD, and Terraform or similar IaC tools
  • Comfortable working with at least one major cloud provider (AWS, GCP, Azure)
  • Experience with OpenTelemetry, Prometheus, Grafana, or similar tools
  • Experience with ClickHouse preferred

#LI-Remote

The typical starting salary for this role in the US is
$141,000$208,000 USD
The typical starting salary for this role in US Premium Markets is
$157,000$230,000 USD

Compensation

For roles based in

goawsgcpazurekubernetesaidevopsdataanalyticsproduct