Back to Search
Overview
Senior

Senior Site Reliability Engineer

Confirmed live in the last 24 hours

WorldQuant

WorldQuant

Hanoi OR Ho Chi Minh City
On-site
Posted April 11, 2026

Job Description

WorldQuant develops and deploys systematic financial strategies across a broad range of asset classes and global markets. We seek to produce high-quality predictive signals (alphas) through our proprietary research platform to employ financial strategies focused on market inefficiencies. Our teams work collaboratively to drive the production of alphas and financial strategies – the foundation of a balanced, global investment platform.

WorldQuant is built on a culture that pairs academic sensibility with accountability for results. Employees are encouraged to think openly about problems, balancing intellectualism and practicality. Excellent ideas come from anyone, anywhere. Employees are encouraged to challenge conventional thinking and possess an attitude of continuous improvement.

Our goal is to hire the best and the brightest. We value intellectual horsepower first and foremost, and people who demonstrate an outstanding talent. There is no roadmap to future success, so we need people who can help us build it.

Technologists at WorldQuant research, design, code, test and deploy firmwide platforms and tooling while working collaboratively with researchers. Our environment is relaxed yet intellectually driven. We seek people who think in code and are motivated by being around like-minded people.

The Role: We're seeking a Senior Site Reliability Engineer to join the team. You will build and operate the infrastructure and tooling behind WorldQuant's data ingestion pipelines — systems that onboard, validate, and deliver large-scale datasets to the firm's research platform. This is a 70% build / 30% operate role. You'll spend most of your time engineering automation, observability, and developer tooling, while also participating in on-call rotations and incident response for production data pipelines. You'll partner with engineering, analyst, and research teams to ensure reliability at scale — this requires excellent analytical skills, clear communication, and the ability to collaborate across teams.

What You'll Do:

Build (70%):

  • Design and develop automation, monitoring, CI/CD, and reliability features for the data onboarding pipeline
  • Develop and maintain internal infrastructure and services that reduce toil and improve pipeline reliability
  • Build observability solutions — dashboards, alerting, log aggregation — using Grafana, the ELK stack, and Vector
  • Design and implement CI/CD pipelines, test automation, and release management workflowsWrite infrastructure-as-code for provisioning, scaling, and managing platform components: Kubernetes, bare metal hosts
  • Integrate and extend tools such as Redis, Celery, MySQL

Operate (30%): Keep production data pipelines healthy and respond to incidents

  • Participate in on-call rotation, respond to production incidents, and drive post-mortems&l
pythonjavajavascriptgoawsgcpkubernetesdockeraiios