NVIDIA is looking for a passionate, modern software engineer or junior architect early in their career. The role involves developing and prototyping new scalable training and inference advancements using NVIDIA's Spectrum-X AI fabric.
This role offers a rare opportunity to work on innovative AI and networking technologies, building prototypes that influence the development of large-scale AI systems. You will help improve AI application–network interaction by refining communication, crafting congestion control, contributing to NIC and switch capabilities, and enhancing AI factory performance at scale.
Responsibilities:
- Prototype end-to-end solutions to improve distributed training and disaggregated inference performance.
- Analyze and optimize communication flows across application, transport, and network layers.
- Develop system software spanning communication libraries, drivers, and firmware integrations.
- Collaborate with hardware, firmware, and SDK teams to co-design network features.
- Validate and integrate prototypes into NVIDIA's AI infrastructure and products.
Requirements:
- Bachelor's or Master's Degree in Computer Science or Electrical Engineering
- 0-2 years of experience in relevant fields.
- Programming knowledge in C/C++
- Ability to work closely with architects and R&D teams.
- Passion to learn and innovate independently.
Ways to stand out from the crowd:
- Demonstrated innovation and leadership turning prototypes into impactful product features.
- Understanding of Networking Protocols – Ethernet, InfiniBand is an advantage.
- Ability to quickly adapt to new technology and go deep into new areas.
- Contributions to open-source projects, academic papers, or performance benchmarking tools.
- Background in AI factory architectures, distributed inference, or network telemetry.
XML job scraping automation by YubHub