About the role
NVIDIA Cloud Functions (NVCF) is an Open Source Platform that links workloads to GPUs. It lets teams deploy, manage, and serve GPU-accelerated, containerized applications across regions and clusters worldwide. The platform routes inference, streaming, and batch jobs across decentralized GPU clusters. This allows endpoints to scale repeatably, whether hosted on-prem or in the cloud.
We are seeking a Senior Systems Software Engineer to join our team. You will focus on improving the performance, reliability, and scaling behavior of a system that routes AI workloads onto distributed GPU fleets. You will work on a polyglot platform that is now fully open source, with both control plane and edge deployments. The work suits someone with deep experience in systems performance, distributed systems, and Kubernetes-based runtimes. We are looking for engineers who want to learn and grow. Expect to be challenged, in an environment with rapidly shifting priorities, where insight, focus and execution is key.
What you'll be doing:
You'll be working in a distributed team that explores innovative ways to make GPU and DPU accelerated applications easier to develop, deploy, and monitor on the latest and greatest NVIDIA hardware.
Design and ship services in Java, Go and Rust, building in the open on a public repository where your commits, design proposals, and reviews are transparent to the community.
Work on automating and optimizing build, test, integration, and release processes for cloud native.
Partner with engineering teams across NVIDIA so the platform integrates with adjacent NVIDIA technologies, including the KAI Scheduler, NVIDIA NIM, Grove and Dynamo.
Help steward an open-source project. You will triage community issues and pull requests, write docs contributors can build on.
What we need to see:
Bachelor’s or Master’s Degree in Computer Science or equivalent program from an accredited University/College and 8+ years of hands-on software engineering.
Expert level knowledge in a systems programming language (Go, C, Rust) and proven understanding of Data Structures, Algorithms and Distributed Software Architecture
Strong understanding of Container Orchestration Systems (Kubernetes) and Container Technologies with hands-on automation experience in continuous integration frameworks like Gitlab & ArgoCD.
Expertise in a scripting language (Bash, Python) and knowledge and experience working with System internals of Unix/Unix-like kernels such as Linux.
Understanding of performance, security and reliability in complex distributed systems.
Ways to stand out from the crowd:
Background with pub-sub models and message queues
Experience optimizing for high-throughput network paths, with a working understanding of unary versus streaming and bidirectional protocols across HTTP/2 and gRPC.
Experience with developing Kubernetes Custom Resources and Operators deployed in Cloud Service Providers
We have some of the most hard-working and skilled people in the world working for us and our world-class engineering teams are growing fast. If you're a creative and self-motivated engineer with a real passion for technology, we want to hear from you!
Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com
Aplyr's read
NVIDIA is a pioneering force in GPUs and AI, attracting top talent in engineering and innovation-driven roles across various tech domains.
What's promising
- •NVIDIA leads the GPU market, crucial for gaming and AI applications.
- •The company invests heavily in AI and deep learning, driving technological advancements.
- •NVIDIA's strong market position offers stability and growth opportunities for employees.
What to watch
- •High competition in the semiconductor industry can impact market share.
- •Rapid technological changes require constant adaptation and learning.
- •Intense workload and high expectations may affect work-life balance.
Why NVIDIA
- •NVIDIA's GPUs are industry benchmarks in gaming and professional graphics.
- •The company's AI research is at the forefront of deep learning innovation.
- •NVIDIA's culture emphasizes cutting-edge technology and engineering excellence.
Aplyr’s read is generated by AI from public sources. Was it useful?
About NVIDIA
NVIDIA is a leading technology company known for its graphics processing units (GPUs) for gaming and professional markets, as well as its advancements in artificial intelligence and deep learning.
Similar roles
Senior Systems Software Engineer - GPU Performance at Scale
NVIDIA
Senior Software Architect - Data Center Systems
NVIDIA
Systems Engineer
Rolls-Royce
Software Engineer, Intelligence Systems
Anduril Industries
Senior Cloud Software Engineer, Intelligence Systems
Anduril Industries
Site Reliability Engineer, Intelligence Systems
Anduril Industries