Systems Engineer, Kernel
CoreWeave
Job Description
What You’ll Do:
CoreWeave is seeking a highly skilled and motivated Systems Kernel Engineer to join our HAVOCK Team, reporting to the Manager of Systems Engineering. In this role, you will be a key contributor to the stability, performance, and evolution of CoreWeave’s Linux-based infrastructure. As a kernel generalist, you will be responsible for debugging kernel-level issues; analyzing and fixing crashes, panics, and dumps; and upstreaming fixes and features that improve the performance and reliability of our stack.
This position is ideal for someone who thrives in low-level systems engineering, understands how modern workloads stress kernels, and is excited to work across a diverse hardware/software ecosystem including CPUs, GPUs, DPUs, networking, and storage.
Kernel Hardware - Acceleration - Virtualization - Operating Systems - Containerization - Kubelet
Our Team’s Stack:
- Python, Go, bash/sh, C
- Prometheus, VictoriaMetrics, Grafana
- Linux Kernel (custom build), Ubuntu
- Intel/AMD/ARM CPUs, NVIDIA GPUs, DPUs, InfiniBand and Ethernet NICs
- Docker, Kubernetes (k8s), KubeVirt, containerd, kubelet
Focus Areas:
- Kernel Debugging – Analyze kernel crashes, oopses, panics, and dumps to identify root causes and propose fixes.
- Upstream Contributions – Develop patches for the Linux kernel and upstream them where applicable (networking, storage, virtualization, GPU/DPU enablement).
- Stack-Wide Support – Ensure kernel support and stability across:
  - Virtualization (KubeVirt, QEMU, VFIO)
  - Container runtimes (containerd, nydus, kubelet)
  - HPC/AI workloads (CUDA, GPUDirect, RoCE/InfiniBand)
- Kernel-Hardware Enablement – Support new hardware bring-up across Intel, AMD, ARM CPUs, NVIDIA GPUs, DPUs, and NICs.
- Performance & Stability – Tune kernel subsystems for latency, throughput, and scalability in distributed HPC/AI clusters.
About the role:
- Triage and fix kernel crashes and performance regressions.
- Develop, test, and upstream kernel patches relevant to CoreWeave’s hardware/software environment.
- Collaborate with hardware vendors and the Linux community on feature enablement.
- Implement diagnostics and tooling for kernel-level observability.
- Work closely with HPC and Fleet teams to ensure kernel readiness for production workloads.
- Provide kernel-level expertise during incident response and root-cause investigations.
Who You Are: