Senior High-Performance AI Training Engineer

Nvidia

Santa Clara, CA

Job posting number: #7281311 (Ref:JR1987557)

Posted: September 18, 2024

Job Description

We are now looking for a Senior High-Performance AI Training Engineer:

NVIDIA is seeking senior engineers who are obsessed with performance analysis and optimization to help us squeeze every last clock cycle out of AI training, the workload driving the design and construction of the largest and most powerful compute systems in the world. This role offers the opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution.

What you will be doing:

  • Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.

  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.

  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.

  • Build and support NVIDIA submissions to the MLPerf Training benchmark suite.

  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

What we need to see:

  • PhD in CS, EE or CSEE and 5+ years; or MS (or equivalent experience) and 8+ years of relevant meaningful work experience.

  • Strong background in deep learning and neural networks, in particular training.

  • Strong background in computer architecture and familiarity with the fundamentals of GPU architecture.

  • Proven experience analyzing and tuning application performance, as well as experience with processor and system-level performance modelling.

  • Programming skills in C++, Python, and CUDA.

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you are willing to work across all layers of the hardware/software stack - from GPU architecture to the application code - to achieve peak performance, we want to hear from you!

#LI-Hybrid

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.





Apply Now

Please mention to the employer that you saw this ad on CollegeJobs.com

Job posting number:#7281311 (Ref:JR1987557)
Application Deadline:Open Until Filled
Employer Location:Nvidia
Santa Clara,California
United States
More jobs from this employer