About the role
NVIDIA is a global leader in AI, high-performance computing, and visualization, with GPU technology powering everything from modern computers to robots, autonomous systems, and the metaverse. As the pioneer of AI computing, NVIDIA is shaping the future of next-generation multimedia, generative AI, and spatial computing.
We are looking for a visionary Senior Deep Learning Research Engineer to join our team. In this role, you will bridge the gap between Deep learning research and production, pioneering the next generation of audio foundation models. If you are passionate about solving ultra-complex, multi-modal AI problems that will redefine industry standards for entertainment, communication, and digital twins, come join us.
What you’ll be doing:
Pioneer Generative AI - Audio Architecture: Lead the research, design, and development of state-of-the-art deep learning models solving complex problems in audio and speech domains.
Drive Future Technology Roadmaps: Define the technical vision and R&D strategy for NVIDIA’s future Audio, Speech and multimedia DL algorithms, ensuring alignment with hardware advancements.
Scale Massive Foundation Models: Train and optimize large-scale generative models (Diffusion, Transformers, Autoregressive models) using distributed training across massive GPU clusters.
Audio-Speech-Visual Fusion: Develop advanced algorithms for speech transformation, spatial audio, audio enhancements, and real-time video/audio enhancements using Deep Learning Algos.
Cross-Functional Leadership: Collaborate closely with NVIDIA Research, hardware architecture teams, and product groups (such as NeMo, AI4Media, and Broadcast) to productize breakthrough technologies.
Productize and Deploy models on Edge: Develop and productize inference models on NVIDIA GPUs and Nvidia RTX Spark platforms as SDK/Microservices after optimization.
Technical Mentorship: Mentor senior engineers and scientists, foster a culture of technical excellence, and maintain high standards via code and design reviews.
What we need to see:
PhD in Computer Science, Artificial Intelligence, Applied Mathematics, or a related quantitative field.
10+ years of industry or post-doc experience directly developing advanced deep learning models for audio, image, and video processing.
Deep Mastery of Generative AI: Proven track record of working with Diffusion models, GANs, Transformers, VAEs, Neural Radiance Fields (NeRFs) / 3D Gaussian Splatting, GRUs etc.
Strong Programming & Software Architecture: Elite Python coding skills with a solid foundation in production-grade software design, scalability, and data structures.
Framework Proficiency: Expert-level hands-on experience with PyTorch and deep familiarity with multi-modal data processing pipelines (video decoding, audio DSP, spectrogram analysis).
Proven Impact: A strong portfolio of shipped high-impact commercial AI products or a stellar publication record at top-tier AI conferences (CVPR, ICCV, SIGGRAPH, NeurIPS, ICASSP, Interspeech).
Ways to stand out from the crowd:
Deep understanding of Generative Audio/Speech Architecture and Algorithms
Experience with large-scale distributed training frameworks (e.g., Megatron-LM, DeepSpeed, PyTorch FSDP) on cluster architectures.
High proficiency in C++ and low-level GPU optimization tools like CUDA, cuDNN, Triton, or TensorRT.
Experience in building World Models or physics-informed neural networks for video synthesis would be an add-on.
NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
Aplyr's read
NVIDIA is a pioneering force in GPUs and AI, attracting top talent in engineering and innovation-driven roles across various tech domains.
What's promising
- •NVIDIA leads the GPU market, crucial for gaming and AI applications.
- •The company invests heavily in AI and deep learning, driving technological advancements.
- •NVIDIA's strong market position offers stability and growth opportunities for employees.
What to watch
- •High competition in the semiconductor industry can impact market share.
- •Rapid technological changes require constant adaptation and learning.
- •Intense workload and high expectations may affect work-life balance.
Why NVIDIA
- •NVIDIA's GPUs are industry benchmarks in gaming and professional graphics.
- •The company's AI research is at the forefront of deep learning innovation.
- •NVIDIA's culture emphasizes cutting-edge technology and engineering excellence.
Aplyr’s read is generated by AI from public sources. Was it useful?
About NVIDIA
NVIDIA is a leading technology company known for its graphics processing units (GPUs) for gaming and professional markets, as well as its advancements in artificial intelligence and deep learning.
Similar roles
Director of AI Engineering & Research, Frontier Systems
Anduril Industries
Senior Product Manager - AI Workflows (Expert Research)
AlphaSense
Associate Director and Senior Principal, AI Methods, AI for Research (AI4R)
Novartis
Postdoctoral Research Associate - Embodied AI Lab Automation: Vision-Guided Real Time Robotic Manipulation
Brookhaven Lab
Sr. Staff AI Research TLM - AI Systems
Databricks
Principal Research Scientist - AI Scaling & Optimization
Databricks