Back to Search
Overview
Senior

Senior Staff Engineer, AI Platform and Infrastructure

Confirmed live in the last 24 hours

OKX

OKX

Singapore, Singapore
On-site
Posted March 16, 2026

Job Description

OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa

Who We Are

At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a trusted brand by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of Blockchain to users around the world, through our leading products OKX, OKX Wallet, OKLink and more.
 

About the opportunity

The AI Engineering team is responsible for integrating AI models with different business lines, across teams such as Compliance, Trading, Financial Products, and Business Intelligence.
 
We are looking for a Senior Staff Engineer to lead the design, development, and evolution of large-scale AI infrastructure that powers mission-critical machine learning and generative AI workloads. In this role, you will operate at the intersection of systems engineering, distributed computing, and applied AI, setting technical direction and building platforms that enable teams across the company to develop, train, deploy, and operate AI models reliably at scale.
 
You will be a hands-on technical leader, shaping long-term platform strategy while also diving deep into architecture, performance, and reliability challenges across compute, data, and ML systems.

What You’ll Be Doing

  • Architect and build large-scale AI infrastructure supporting training, fine-tuning, and deployment of AI models
  • Define platform standards and reference architectures for distributed training, inference, and model lifecycle management
  • Lead the design of scalable systems across compute, storage, networking, and orchestration layers (e.g. GPU/accelerator clusters, schedulers, data pipelines)
  • Drive performance, reliability, and cost optimization for AI workloads at scale
  • Partner with the AI Science team, and relevant business units to translate requirements into robust platform capabilities
  • Set technical direction and best practices for AI platform development, including scalability, reliability, observability, security, and operational excellence
  • Evaluate and integrate emerging technologies in AI infrastructure (frameworks, accelerators, serving stacks, tooling)
  • Mentor senior engineers and act as a subject-matter-expert, influencing architecture beyond your immediate scope

What We Loo

pythonjavagorustmachine learningaidataproductdesign