Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Lead Systems Engineer, High-Performance Computing image - Rise Careers
Job details

Lead Systems Engineer, High-Performance Computing - job 17 of 20

IaaS Systems and Storage & Engineering (ISSE) team is part of the Operations & Infrastructure technology organization. Distributed Compute engineering (DCE) is part of ISSE and High-performance compute platform engineering is part of DCE. Our vision, mission and purpose are summarized as following:

Vision: To become a leading technical engineering professional, pioneering in the design and automation of server infrastructure. We envision creating highly secure and efficient operations environments that drive business success and technological advancement.

Mission: Our mission is to deliver high-quality server infrastructure design and automated implementation. We are committed to operating in complex, highly secure, and highly available environments, while maintaining rigorous operations, security, and procedural models.

Purpose: The purpose of this role is to utilize strong hands-on technical engineering skills to design and automate the implementation of server infrastructure based on business requirements. This role will interact with technology domain experts to maintain high security and availability in complex operational environments, thereby driving business efficiency and security.

Essential Functions:

  • GPU as a Service and High-Performance Compute Platform Support: Expertise in deploying, managing, and optimizing GPU as a Service (GaaS) and high-performance compute platforms to support advanced computational workloads.
  • Extensive Datacenter Experience: Proficient in managing complex, geographically distributed IT infrastructures to ensure high availability and performance.
  • Advanced Technical Knowledge: Profound understanding of high-performance, highly available, and secure computing systems utilizing x86 technologies and protocols (NVME, GPU, PCI-E).
  • Enterprise Server and Component Expertise: In-depth knowledge of server components (storage/network controllers, HBA, SSDs) and their functionalities, essential for maintaining high-performance compute environments.
  • Processor and GPU Systems Proficiency: Strong grasp of Intel/AMD architectures, GPU systems, memory hierarchy, and hardware-level security to enhance system performance and reliability.
  • Out-of-Band, UEFI, and BIOS Expertise: Comprehensive understanding of out-of-band management, UEFI, BIOS settings, and their impact on system performance and security in high-performance computing environments.
  • Hardware Lifecycle Management: Experienced in hardware lifecycle management, including firmware and OS driver certifications, to ensure the longevity and reliability of compute resources.
  • Infrastructure Management and Automation: Proficient in installing, configuring, supporting, and maintaining compute infrastructure management tools, with skills in Ansible for automation to streamline deployment and operational tasks.
  • Performance Benchmarking and Tech Evaluation: Capable of running performance benchmarks and evaluating new technologies for various platforms (Linux, Windows, containerized, and virtualized) to ensure optimal performance.
  • Scripting Proficiency: Advanced skills in scripting languages such as PowerShell and Python to automate and optimize infrastructure tasks.
  • Team and Independent Work: Highly motivated, excellent team player, capable of working independently, with strong analytical and troubleshooting abilities to resolve complex issues and mentor junior staff.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Lead Systems Engineer, High-Performance Computing, Visa

Are you ready to take your career to the next level? We’re searching for a talented Lead Systems Engineer specializing in High-Performance Computing to join our dynamic team at the forefront of innovation in Ashburn. At the IaaS Systems and Storage & Engineering (ISSE) team, you'll collaborate with the Distributed Compute Engineering (DCE) group to enhance our high-performance computing platforms. Your technical expertise will shine as you design and automate server infrastructure that meets complex business requirements while ensuring security and availability. We are committed to delivering top-notch server infrastructure design and automated implementation, and we need your extensive datacenter experience to help us operate smoothly in highly secure environments. You'll utilize your skills in deploying GPU as a Service, managing geographically distributed IT infrastructures, and performance benchmarking across various platforms. Proficiency in scripting languages like PowerShell and Python will allow you to optimize tasks, while your knowledge of server components and processor systems enhances performance and reliability. In this hybrid role, you’ll balance working from the office and remotely, providing flexibility while ensuring business efficiency. Join us, and be part of a mission-driven company focused on pioneering secure and efficient operational environments. Get ready to mentor junior staff, tackle complex issues head-on, and drive technological advancement in high-performance computing!

Frequently Asked Questions (FAQs) for Lead Systems Engineer, High-Performance Computing Role at Visa
What are the responsibilities of a Lead Systems Engineer at the IaaS Systems and Storage & Engineering team?

As a Lead Systems Engineer at the IaaS Systems and Storage & Engineering team, your responsibilities will include designing and automating server infrastructure for high-performance computing, deploying and managing GPU as a Service, and ensuring high availability in complex IT environments. You’ll also collaborate with domain experts, perform performance benchmarking, and utilize your scripting skills to streamline tasks, all while maintaining rigorous security protocols.

Join Rise to see the full answer
What qualifications are required for the Lead Systems Engineer role in High-Performance Computing?

To excel as a Lead Systems Engineer in High-Performance Computing, you should have extensive experience with datacenter operations, a strong understanding of high-performance computing technologies, and proficiency in server component management. Required qualifications include expertise in x86 technologies, knowledge of GPU systems, and experience in automation tools like Ansible, along with strong scripting skills in PowerShell and Python.

Join Rise to see the full answer
How does the hybrid work model benefit the Lead Systems Engineer position?

The hybrid work model for the Lead Systems Engineer position allows for flexibility and a balanced work-life dynamic. Employees will work from the office 2-3 days a week, enabling collaboration with the team while also providing the opportunity to focus on complex tasks from home. This setup fosters innovation and productivity, aligning with our mission to drive technological advancements.

Join Rise to see the full answer
What specific technologies will a Lead Systems Engineer handle at this company?

As a Lead Systems Engineer at our company, you'll handle a range of technologies including GPU as a Service platforms, x86 architectures, NVME and PCI-E protocols, as well as various operating systems like Linux and Windows. Your role will also involve utilizing out-of-band management and BIOS settings to enhance performance and security in high-performance computing environments.

Join Rise to see the full answer
What skills are essential for the Lead Systems Engineer role in High-Performance Computing?

Essential skills for the Lead Systems Engineer role in High-Performance Computing include advanced technical knowledge of computing systems, scripting proficiency in languages like Python and PowerShell, excellent problem-solving and analytical skills, and experience with hardware lifecycle management. Being a strong team player and capable of independent work is also crucial to succeed in this position.

Join Rise to see the full answer
Common Interview Questions for Lead Systems Engineer, High-Performance Computing
What experience do you have with GPU as a Service platforms?

When answering this question, highlight your direct experience with deploying, managing, or optimizing GPU as a Service platforms. Discuss specific projects where you improved performance and efficiency and any relevant tools or technologies you utilized.

Join Rise to see the full answer
Can you explain the importance of high availability in computing environments?

In your response, define high availability and emphasize its critical role in ensuring business continuity. Discuss your strategies for achieving high availability, such as redundant systems or load balancing, and any relevant experiences integrating these solutions.

Join Rise to see the full answer
How do you approach automating infrastructure tasks?

Share your methodology for automation, focusing on the tools you use (like Ansible). Discuss specific examples where you automated a process, the results of that automation, and any challenges you faced in the process.

Join Rise to see the full answer
What steps would you take to manage performance benchmarking across different platforms?

Outline your systematic approach to performance benchmarking by discussing the key metrics you would track, the tools you would use, and how you analyze results to improve system performance. Use examples from your past experiences to illustrate.

Join Rise to see the full answer
Describe your knowledge of server components and their functionalities.

Provide a brief overview of various server components, including storage controllers and SSDs, and their role in a computing environment. Highlight your hands-on experience with these components and how you ensure optimal performance.

Join Rise to see the full answer
What do you understand about hardware lifecycle management?

Explain hardware lifecycle management, including the stages from procurement to end-of-life. Discuss your experience with firmware and OS driver certifications and how these practices impact system reliability and longevity.

Join Rise to see the full answer
How do you ensure security in high-performance computing environments?

Talk about the various security protocols and practices you implement, such as regular updates, access controls, and out-of-band management. Discuss your awareness of the potential risks in high-performance systems and how you proactively mitigate them.

Join Rise to see the full answer
Can you provide an example of a complex issue you've resolved in your past roles?

Share a specific challenge you faced, the steps you took to resolve it, and the outcomes. Highlighting your analytical skills and problem-solving abilities while discussing how you mentored junior staff through the process can demonstrate your leadership potential.

Join Rise to see the full answer
What is your experience with scripting languages in infrastructure management?

Detail your experience with scripting languages such as PowerShell or Python. Provide examples of scripts you’ve written to automate tasks in your previous roles and the improvements those scripts brought to efficiency and performance.

Join Rise to see the full answer
Why do you want to work as a Lead Systems Engineer at our company?

Your answer should reflect your genuine interest in our mission and the specific role as a Lead Systems Engineer. Discuss how your skills align with our objectives and how you envision contributing to our team and technological advancements in high-performance computing.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 14 days ago
Photo of the Rise User
Posted 14 days ago
Photo of the Rise User
Posted yesterday

Join Path Robotics as a Field Service Technician and be part of a team transforming manufacturing through advanced robotic technology.

L3Harris Technologies Hybrid US, El Paso County, CO; Colorado, Colorado Springs, CO
Posted 2 days ago

Become a key player in L3Harris's defense technology solutions as a Senior Specialist in Systems Engineering focused on Astrodynamics Software.

Photo of the Rise User
ServiceNow Hybrid 4810 Eastgate Mall, San Diego, California, United States
Posted 7 hours ago
Inclusive & Diverse
Mission Driven
Rise from Within
Diversity of Opinions
Work/Life Harmony
Empathetic
Feedback Forward
Take Risks
Collaboration over Competition
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Conferences Stipend
Paid Time-Off
Maternity Leave
Equity

Become a pivotal part of ServiceNow’s mission as a Senior Systems Engineer, ensuring the stability and efficiency of our Linux operations.

Photo of the Rise User
Posted 10 days ago

Join AECOM as a Senior Hydraulics Engineer and be a part of a team delivering impactful infrastructure projects.

Photo of the Rise User
Boeing Hybrid US, Saint Louis County, MO; Missouri, Hazelwood, MO
Posted yesterday

Join Boeing's Phantom Works team as a Senior Wire Design & Install Engineer and contribute to cutting-edge defense technology.

Photo of the Rise User
Rolls-Royce Remote US, Marion County, IN; Indiana, Indianapolis, IN
Posted 7 days ago

Join Rolls-Royce as a Senior Control System Engineer and help solve some of the industry's complex challenges while working with a global team.

Dive into the world of energy optimization at Toyota with a focus on carbon neutrality through this enriching internship experience.

Photo of the Rise User
Posted 14 days ago

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

9778 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 3, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!