• Deep Learning Performance Architect - China, Shanghai
  • Nvidia in Shanghai, , China
  • jobs
  • 1 month ago

jobs description

Are you passionate about exploring computer architectures for deep learning? Do you like to work at the intersection of hardware and software? NVIDIA is seeking world-class programmers and performance architects who love to squeeze out every cycle of performance from deep learning codes. In this role, you will write code that ships in our deep learning libraries, as well as guide the direction of our future GPU architectures. This position offers the opportunity to have real impact in a fast-moving, technology-focused company. What you'll be doing: Developing production-quality software that ships as part of NVIDIA's AI software stack. Analyzing the performance of important workloads, tuning our current software, and proposing improvements for future software. Working with cross-collaborative teams of deep learning software engineers and GPU architects to develop innovative solutions Adapting to the constantly evolving AI industry by being agile and excited to contribute across the... codebase, including API design, software architecture, testing, and GPU kernel development. What we need to see: MS or PhD in relevant discipline (CS, EE, Math) or equivalent experience Strong background in computer architecture Strong C/C++ programming and software design skill GPU programming experience (CUDA) desired LLVM/MLIR development experience a plus Good communication and organizational skills #deeplearning
Shanghai China


Apply - Deep Learning Performance Architect - China, Shanghai