As a Senior Multi-GPU Signal Processing and System Architecture Engineer, you will design and implement real-time signal-processing subsystems that convert physics-based channel descriptions into received signals for large numbers of emulated devices, across systems of potentially thousands of interconnected GPUs.
You will work on foundational technology for 5G and 6G network simulation, using NVIDIA's world-class compute and interconnect platforms. Your expertise will be crucial in architecting the inter-cell data-flow layer, ensuring that the information each cell needs to model interference from its neighbors is compressed, transported, and consumed within the available NVLink and NIC budgets at scale.
You will collaborate with the propagation engine and RAN stack teams to orchestrate the end-to-end simulation pipeline, ensuring that propagation updates, channel application, and stack execution remain synchronized across hundreds or thousands of GPUs. You will assess design and implementation trade-offs between physical fidelity, latency, and system scalability.
Requirements:
- PhD in high-performance computing, computer architecture, signal processing, or wireless communications (or equivalent experience)
- 12+ years of proven, relevant experience
- Proficiency in CUDA kernel design with attention to memory hierarchy, register pressure, and HBM bandwidth planning, with a track record of writing production-quality GPU code that meets hard real-time deadlines
- Demonstrated ability to build and reason about data flows across multi-device GPU systems (NVLink, NIC/RDMA) with explicit bandwidth and latency accounting
- Working knowledge of OFDM signal processing and the 5G NR physical layer, sufficient to implement and validate a channel-emulation pipeline
- Impactful publications involving GPU-accelerated numerical workloads or real-time system design
Nice to have:
- Experience with GPU-accelerated RAN platforms, L1/L2 software stacks, or channel emulators
- Knowledge of high-bandwidth GPU interconnects (NVLink, NVSwitch) and their scaling properties
- Familiarity with massive MIMO beamformer design and MU-MIMO precoding
If you're eager to help shape the future of telecommunications and meet the above qualifications, we'd love to hear from you. Submit your application and join NVIDIA as we continue to push the boundaries of what's possible.