Job Description
NVIDIA is seeking an experienced AI Infrastructure Solutions Architect (SA) to bridge design and deployment of large-scale GPU infrastructure. In this role, you will interact with customers, partners, and internal teams to analyze, define, and implement large-scale AI/HPC projects, while offering recommendations on our product roadmap.
**What you'll be doing:**
- Collaborating with NVIDIA Cloud Partners in Canada on large data center GPU server and networking system deployments.
- Guiding customer discussions on network design and compute/storage, and supporting server/network/cluster deployments.
- Acting as the primary technical driver for customers during the design, development, construction, integration, and production of GPU Cloud infrastructure.
- Conducting regular technical meetings to review the product roadmap, debug cluster issues, and introduce new technology solutions.
- Analyzing and debugging compute/network configuration and performance issues.
- Preparing and delivering technical content including presentations, workshops, and tutorials.
**What we need to see:**
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, Mathematics, or equivalent experience.
- 5+ years in Solution Engineering or similar roles, with direct experience working with partners and customers.
- System-level expertise in CPU/GPU server architecture, NICs, Linux, and system software.
- Knowledge of Ethernet/InfiniBand networking switches and data center infrastructure.
- Familiarity with DevOps/MLOps technologies such as Docker and Kubernetes.
- Excellent presentation, communication, and collaboration skills.