We seek a manager to lead all aspects of a team of software engineers tasked with improving and crafting a collection of system stress applications tailored for NVIDIA's forthcoming data center products, operational within customer and partner infrastructures. Our focus lies in crafting software that subjects GPU servers to the most thorough testing scenarios imaginable. Our team collaborates closely with architecture, ASIC, systems engineering, and operations teams to devise methodologies aimed at pushing every hardware component to its limits. Situated at the core of NVIDIA's data center enterprise, from GPU baseboards to standalone servers and entire clusters, we are responsible for developing the comprehensive suite of system stress applications. We partner with NVIDIA operation teams to find efficient balance between product quality, test yield, and manufacturing efficiency. Wouldn't you want to be a key factor of NVIDIA gross margin?
What you will be doing:
Collaborated with multi-functional teams to do NPI project and improve and refine software deployed on our customers' servers and environments, facilitating detailed identification of hardware or software issues.
As the manager, you will run multiple concurrent projects through active prioritization, and communication.
On the engineer management side, we want the manager to continue to groom future technical leaders in the team and recruit new talent.
Constant development is another area of responsibility. We look for candidates who are proactive - seek opportunities to improve NVIDIA product quality and production efficiency.
We also need our candidates to be reactive: be able to drive root cause of critical issues and embrace corrective actions.
Finally, we need our leaders to develop long range strategies for the team to prepare for new challenges and drive execution.
What we need to see:
Bachelor of science in Computer Science, Computer Engineering, Electrical Engineering (or equivalent experience).
8+ overall years of system software experience, deep understanding of software development principles, comfortable working in large code space and deep driver stack with 4+ years of team management experience
Good system design skills
Good programming skills in C/C++, python programming is a plus.
Solid understanding in computer architecture, operating system, kernel driver, device programming.
Experience driving feature development and multi-team debug.
Ways to stand out from the crowd:
Knowledge of GPU compute or server product technologies like BMC (Baseboard Management Controller), Infiniband, PCIE, NVLink.
Extensive experience collaborating with customer software teams
Strong experience to engineer software with consideration of RAS
Comfortable with unknown and change
With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the most desirable employers in the world. We have some of the most brilliant and talented people in the world working for us. If you are creative, autonomous and love a challenge, we want to hear from you. We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
#LI-Hybrid
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Are you ready to take on a leadership role as a Diag Software Manager at NVIDIA in Taipei, Taiwan? We're on the lookout for a talented professional to lead our dynamic team of software engineers who are dedicated to enhancing a collection of system stress applications designed for NVIDIA's groundbreaking data center products. Your expertise will help ensure that our GPU servers undergo rigorous testing to deliver top-notch performance within customer and partner infrastructures. At NVIDIA, we work alongside experts from various teams, including architecture, ASIC, systems engineering, and operations, to develop innovative methodologies that push our hardware components to their limits. As a core player in NVIDIA's data center operations, you'll oversee the development of comprehensive system stress applications. Your role goes beyond management; you will play a crucial part in ensuring product quality, optimizing test yield, and enhancing manufacturing efficiency. We're looking for a manager who can juggle multiple projects while grooming future technical leaders in the team and attracting new talent. If you're proactive in seeking improvements and reactive in driving solutions, your impact will significantly shape our products. With a solid educational background in computer science or engineering and extensive experience in system software and team management, you’ll fit right in our visionary environment. Join us at NVIDIA, where competitive salaries and generous benefits await you, alongside a team of brilliance and creativity!
Join Google as a software engineer to help develop cutting-edge technologies for managing information at scale.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
84 jobsSubscribe to Rise newsletter