Job Description
NVIDIA is seeking outstanding Networking Solutions Architects (SA) to help design and deploy large-scale AI Factories across Canada. In this role, you will collaborate with customers to build end-to-end infrastructure and become a trusted technical advisor on exciting projects focused on high-performance networking for generative AI and production AI inference pipelines. You will work with internal engineering, product, and business teams on performance analysis and modeling of large GPU clusters.
**What You Will Be Doing:**
- Collaborating with customers to build, deploy, and optimize large-scale AI training and inference infrastructure using NVIDIA technology.
- Analyzing deployment and performance data to identify product health trends, system bottlenecks, and operational risks.
- Solving challenging technical problems involving GPUs, networking, drivers, containers, firmware, and distributed system interactions.
- Delivering streamlined executive-level communication on status, risks, progress, and required decisions.
- Some travel to customer sites is required, up to 20%.
**What We Need To See:**
- BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields (or equivalent experience).
- 5+ years of Solution Architecture or similar roles.
- Understanding of high-performance networking technologies (e.g., RDMA, congestion control).
- Hands-on experience with NVIDIA GPU platforms and system software stacks (CUDA, NCCL).
- Strong Linux fundamentals and performance analysis skills.
**Ways To Stand Out From The Crowd:**
- Experience deploying or optimizing deep learning training and inference at scale on large GPU clusters.
- Familiarity with NVIDIA hardware and systems technology.