About the role
We are looking for a Senior Network Engineer to join a dynamic, results-oriented team supporting InfiniBand and Ethernet fabrics in the most demanding data center, AI, storage, and HPC environments. As a Technical Support Engineer, you will be an approachable, proficient communicator who takes ownership of resolving issues while ensuring a high level of customer satisfaction. You will work closely with Engineering, Marketing, and Support teams on complex technical issues.
What you will be doing:
Tier 3 support for InfiniBand and Ethernet fabrics: installing, supporting, and resolving complex technical issues in large AI/HPC and storage clusters
Troubleshooting end-to-end InfiniBand fabrics — subnet management (UFM / OpenSM), routing topology and congestion control.
Developing and presenting comprehensive technical solutions to customer problems, including project management of complex fabric installations
Problem reporting, issue replication, and resolution management
Responding to customer technical inquiries
Site visits and conference calls with customers and partners
Developing and refining internal processes to improve support efficiency and productivity
What we need to see:
B.Sc. in Computer Science, Software Engineering, or Electrical Engineering, or equivalent experience
5+ years providing customer support for hardware & software products
Hands-on InfiniBand experience — OR deep Ethernet/RDMA expertise with a demonstrated ability and strong motivation to ramp quickly on InfiniBand (subnet manager, fabric topologies such as fat-tree, RDMA verbs, diagnostics)
Extensive knowledge of LAN switching/routing (STP, MSTP, MLAG, VPC, VRRP, LACP) and IP routing (OSPF, BGP, PIM), plus virtualization, EVPN, VXLAN
Linux experience and scripting
Automation — Ansible
Ability to work under pressure and support high-level customers
Experience operating and configuring major vendors' switches and routers
Analysis and diagnosis of highly complex networking problems
Strong communication, presentation, and oral skills; excellent verbal and written English
Ways to stand out from the crowd:
Deep InfiniBand expertise: UFM, OpenSM, adaptive routing, congestion control, link-layer troubleshooting, ibdiagnet/perftest, ConnectX/BlueField adapters
RDMA, RoCE, and GPUDirect RDMA in production AI/HPC clusters
Clustering and data-center technologies, including upper-layer protocols (e.g., MPI, NCCL)
Knowledge of Linux/Unix at an administration level
Ethernet (10/40/100+ GigE) and/or InfiniBand at scale
NVIDIA is widely considered one of the technology world's most desirable employers, with some of the most forward-thinking and hardworking people anywhere. If you're creative, results-oriented, and enjoy having fun — what are you waiting for? Apply today!
Aplyr's read
NVIDIA is a pioneering force in GPUs and AI, attracting top talent in engineering and innovation-driven roles across various tech domains.
What's promising
- •NVIDIA leads the GPU market, crucial for gaming and AI applications.
- •The company invests heavily in AI and deep learning, driving technological advancements.
- •NVIDIA's strong market position offers stability and growth opportunities for employees.
What to watch
- •High competition in the semiconductor industry can impact market share.
- •Rapid technological changes require constant adaptation and learning.
- •Intense workload and high expectations may affect work-life balance.
Why NVIDIA
- •NVIDIA's GPUs are industry benchmarks in gaming and professional graphics.
- •The company's AI research is at the forefront of deep learning innovation.
- •NVIDIA's culture emphasizes cutting-edge technology and engineering excellence.
Aplyr’s read is generated by AI from public sources. Was it useful?
About NVIDIA
NVIDIA is a leading technology company known for its graphics processing units (GPUs) for gaming and professional markets, as well as its advancements in artificial intelligence and deep learning.