Job details

Senior Storage and Data Production Engineer

Get a free resume review

Production engineering is a team that involves designing, building, and maintaining large-scale production systems with high efficiency and availability. It encompasses various areas, including software and systems engineering practices, storage, data management, and services. Production Engineers possess expertise in different domains, such as storage architecture, high-performance distributed storage, data management, systems, networking, coding, database management, capacity planning, continuous delivery, and deployment, as well as open-source cloud-enabling technologies like Kubernetes, containers, and virtualization. Their responsibilities include ensuring reliable, scalable, high-performance storage solutions, optimizing data placement and access patterns, managing large-scale distributed storage systems, and ensuring low-latency data access for high-performance computing (HPC) and AI/ML workloads.

Production Engineers at NVIDIA ensure that our internal and external-facing GPU cloud services have reliability and uptime as promised to the users while enabling developers to make changes to the existing system through careful preparation and planning while keeping an eye on capacity, latency, and performance. This role also requires an approach focused on automating storage operations, improving data access efficiency, and optimizing storage performance. Much of our software development focuses on eliminating manual work through automation, performance tuning, and growing the efficiency of storage and production systems.

What You Will Be Doing:

Design, implement, and support large-scale storage clusters, ensuring scalability, high availability, and data integrity.
Develop and maintain storage monitoring, logging, and alerting systems to ensure proactive detection and resolution of performance issues.
Work with AI/ML workloads to optimize storage architectures for low-latency access, efficient caching, and high-throughput performance. Improve the lifecycle of storage services – from inception and design to deployment, operation, and continuous optimization.
Support storage services before they launch through activities such as system design consulting, developing automation frameworks, capacity management, and launch reviews.
Maintain storage infrastructure once live by monitoring availability, latency, and system health, using predictive analytics and AI-driven automation.
Optimize storage efficiency through compression, duplication, tiering strategies, and intelligent workload placement.
Scale storage systems sustainably using AI/ML-driven automation, policy-based tiering, and dynamic data migration techniques. Ensure data security and compliance by implementing encryption, access controls, and auditing mechanisms for storage systems.
Practice sustainable incident response and blameless postmortems. Be part of an on-call rotation to support storage and production systems.

What We Need To See:

BS degree or equivalent experience in Computer Science, Storage Systems, or a related technical field (e.g., physics, mathematics), and 5+ years of practical experience.
Experience with high-performance storage solutions, including parallel file systems (Lustre, GPFS), distributed storage (Ceph, MinIO), and enterprise-scale object storage (S3, NetApp, Pure Storage, etc.).
Solid understanding of block, file, and object storage technologies, including their performance characteristics and standard methodologies.
Experience with storage networking protocols such as NFS, SMB, iSCSI, Fibre Channel, RDMA, and NVMe over Fabrics.
Expertise in algorithms, data structures, complexity analysis, software design, and maintaining large-scale Linux-based storage systems.
Experience in one or more of the following: C/C++, Java, Python, Go, Perl, or Ruby for storage automation, monitoring, and performance tuning.
Hands-on experience with infrastructure configuration management tools like Ansible, Chef, Puppet, and Terraform for automating storage deployments.
Experience with observability and tracing tools like InfluxDB, Prometheus, and the Elastic stack for monitoring storage system health.

Ways to stand out from the crowd:

Deep understanding of large-scale distributed storage architectures, replication strategies, and erasure coding techniques. Proven experience in capacity planning, performance tuning, and troubleshooting high-throughput storage systems.
Experience with Git, code review, pipelines, and CI/CD for handling infrastructure as code. Interest in analyzing and improving distributed storage system performance at scale. Strong debugging skills with a systematic problem-solving approach to identify complex storage issues. Experience using or running private and public cloud storage solutions based on Kubernetes, OpenStack, or hybrid cloud architectures.
Ability to design and implement automated storage migration, backup, and disaster recovery strategies. Thrive in collaborative environments and enjoy working with various teams to optimize storage performance. Flexible in adapting to different working styles and emerging storage technologies.

At NVIDIA, you’ll be at the forefront of innovative storage technologies, working on high-performance storage solutions that power the next generation of AI, HPC, and cloud computing. NVIDIA is leading in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking, and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you!

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA Glassdoor Company Review

4.6

NVIDIA DE&I Review

No rating

CEO of NVIDIA

Jensen Huang

Approve of CEO

Average salary estimate

$217750 / YEARLY (est.)

min

max

$148000K

$287500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Engineering Lab Manager

NVIDIA Remote US, CA, Santa Clara

VIEW

Posted 12 days ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Join NVIDIA as an Engineering Lab Manager and lead innovative lab operations while supporting cutting-edge GPU and AI technologies.

Senior Network Deployment Engineer, Software Automation - DGX Cloud

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 12 days ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Join NVIDIA as a Senior Network Deployment Engineer to lead innovative automation solutions for DGX Cloud in a dynamic engineering environment.

Design Engineer – Tier II/III

Visa Inc. Hybrid Brecksville, Ohio, United States

VIEW

Posted 2 hours ago

Bring your creativity and engineering skills to AMT as a Design Engineer, focused on advancing enteral devices in a collaborative team environment.

Solutions Designer, Cloud/Containers

Visa Remote Austin, TX, USA

VIEW

Posted 6 days ago

Become a key team member at Visa, helping to drive scalable infrastructure solutions in a hybrid work environment.

 Senior Cloud Engineer – Network Applications

Toyota Hybrid Plano, Texas

VIEW

Posted 11 days ago

At Toyota, we seek a Senior Cloud Engineer passionate about optimizing cloud infrastructure and enhancing user experience in innovative environments.

Bridge Design Engineer

Fisher Associates Remote Albany, New York, United States

VIEW

Posted 7 days ago

Join Fisher Associates as a Bridge Design Engineer to lead exciting infrastructure projects while being supported by a culture of mentorship and collaboration.

Environmental Health & Safety Engineer

SpaceX Hybrid McGregor, TX

VIEW

Posted 4 days ago

Mission Driven

Social Impact Driven

Passion for Exploration

Reward & Recognition

Join SpaceX as an Environmental Health & Safety Engineer, where you'll play a crucial role in maintaining safety standards at our cutting-edge rocket development facility.

Static Timing Analysis Engineer, FullChip/ASIC Implementation

Google Hybrid San Diego, California, United States

VIEW

Posted 14 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Take Risks

Collaboration over Competition

Growth & Learning

Transparent & Candid

Customer-Centric

Social Impact Driven

Rapid Growth

Passion for Exploration

Dare to be Different

Reward & Recognition

Friends Outside of Work

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Conferences Stipend

Bias Training

Employee Resource Groups

401K Matching

Paternity Leave

Maternity Leave

Some Meals Provided

Social Gatherings

Join Google as a Static Timing Analysis Engineer to innovate and deliver high-performance silicon solutions for the next generation of consumer products.

Construction Estimators

Virtual Assist Remote No location specified

VIEW

Posted 12 days ago

We're seeking a detail-oriented Construction Estimator to join our remote team, focusing on delivering precise and competitive construction project bids.

Systems Performance Engineer (all gender)

ALTEN Hybrid Innsbruck, Austria

VIEW

Posted 7 days ago

Join ALTEN as a Systems Performance Engineer, where you'll improve gas engines' performance in a collaborative, innovative team.

Electrical Designer

Arthur Grand Technologies Inc Hybrid Ogden, Utah, United States

VIEW

Posted 7 days ago

Join Arthur Grand Technologies as an Electrical Designer, where you will lead the development of complex electrical systems while collaborating with a dedicated team.

Project Engineer (Korean Bilingual)

Harmonious Hiring LLC Hybrid Fort Lee, New Jersey, United States

VIEW

Posted 12 days ago

Join our Fort Lee-based team as a Project Engineer where your bilingual skills will aid in managing projects for major manufacturing companies and the U.S. Military.

Roadway and Traffic Engineer-In-Training

Ferrovial Hybrid Atlanta, GA

VIEW

Posted 6 days ago

Step into the role of a Roadway and Traffic Engineer-In-Training at Ferrovial, where innovation in infrastructure is at the forefront of your career.

Senior Infrastructure Engineer Nginx

NCS Australia Hybrid Canberra ACT, Australia

VIEW

Posted 13 days ago

Step into a vibrant environment at NCS Australia as a Senior Infrastructure Engineer with Nginx expertise, where your technical skills will drive impactful client projects.

Engineering Technician II

The City of Fort Worth Hybrid Development Services

VIEW

Posted 12 days ago

As an Engineering Technician II with the City of Fort Worth, you'll play a crucial role in reviewing development projects related to water and wastewater improvements.

Get a free resume review

NVIDIA

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

389 jobs

MATCH

VIEW MATCH

BADGES