Lead Systems Engineer, High-Performance Computing

The IaaS Systems and Storage & Engineering (ISSE) team is part of the Operations & Infrastructure technology organization. Distributed Compute engineering (DCE) is part of ISSE, and High-performance compute platform engineering is part of DCE. Our vision, mission, and purpose are summarized as follows:

Vision: To become a leading technical engineering professional, pioneering in the design and automation of server infrastructure. We envision creating highly secure and efficient operations environments that drive business success and technological advancement.

Mission: Our mission is to deliver high-quality server infrastructure design and automated implementation. We are committed to operating in complex, highly secure, and highly available environments, while maintaining rigorous operations, security, and procedural models.

Purpose: The purpose of this role is to utilize strong hands-on technical engineering skills to design and automate the implementation of server infrastructure based on business requirements. This role will interact with technology domain experts to maintain high security and availability in complex operational environments, thereby driving business efficiency and security.

Essential Functions:

  • GPU as a Service and High-Performance Compute Platform Support: Expertise in deploying, managing, and optimizing GPU as a Service (GaaS) and high-performance compute platforms to support advanced computational workloads.
  • Extensive Datacenter Experience: Proficient in managing complex, geographically distributed IT infrastructures to ensure high availability and performance.
  • Advanced Technical Knowledge: Profound understanding of high-performance, highly available, and secure computing systems utilizing x86 technologies and protocols (NVMe, GPU, PCIe).
  • Enterprise Server and Component Expertise: In-depth knowledge of server components (storage/network controllers, HBA, SSDs) and their functionalities, essential for maintaining high-performance compute environments.
  • Processor and GPU Systems Proficiency: Strong grasp of Intel/AMD architectures, GPU systems, memory hierarchy, and hardware-level security to enhance system performance and reliability.
  • Out-of-Band, UEFI, and BIOS Expertise: Comprehensive understanding of out-of-band management, UEFI, BIOS settings, and their impact on system performance and security in high-performance computing environments.
  • Hardware Lifecycle Management: Experienced in hardware lifecycle management, including firmware and OS driver certifications, to ensure the longevity and reliability of compute resources.
  • Infrastructure Management and Automation: Proficient in installing, configuring, supporting, and maintaining compute infrastructure management tools, with skills in Ansible for automation to streamline deployment and operational tasks.
  • Performance Benchmarking and Tech Evaluation: Capable of running performance benchmarks and evaluating new technologies for various platforms (Linux, Windows, containerized, and virtualized) to ensure optimal performance.
  • Scripting Proficiency: Advanced skills in scripting languages such as PowerShell and Python to automate and optimize infrastructure tasks.
  • Team and Independent Work: Highly motivated, excellent team player, capable of working independently, with strong analytical and troubleshooting abilities to resolve complex issues and mentor junior staff.

This is a hybrid position. Hybrid employees can alternate time between remote and office work. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate: $135,000 per year (est.)
Min: $120,000
Max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Lead Systems Engineer, High-Performance Computing, Visa

As a Lead Systems Engineer for High-Performance Computing at our Ashburn location, you will be at the forefront of our cutting-edge IaaS Systems and Storage & Engineering (ISSE) team within the Operations & Infrastructure technology organization. This thrilling role empowers you to leverage your robust hands-on technical engineering experience to design and automate the implementation of advanced server infrastructure tailored to dynamic business requirements. You'll collaborate with technology domain experts, ensuring that we maintain an ultra-secure and highly available operational environment that drives business efficiency and success. Your expertise in GPU as a Service and high-performance compute platforms will support complex workloads, while your extensive datacenter experience will shine as you manage geographically distributed IT infrastructures. With your deep technical knowledge across x86 technologies, processor and component mastery, and proficiency in advanced scripting languages like PowerShell and Python, you will be pivotal in delivering high-quality infrastructure designs and automating their implementation. As a team player who can also tackle challenges independently, you will drive not only the performance and reliability of our computing systems but also mentor junior staff, making a meaningful impact in our mission to innovate and excel in the tech landscape. Embrace the hybrid work model that allows you to balance remote and in-office teamwork, ensuring that you thrive wherever you choose to work.

Frequently Asked Questions (FAQs) for Lead Systems Engineer, High-Performance Computing Role at Visa
What are the responsibilities of a Lead Systems Engineer, High-Performance Computing, in Ashburn?

The Lead Systems Engineer role at our High-Performance Computing facility in Ashburn involves designing and automating server infrastructure based on business needs. You will support GPU as a Service and high-performance computing platforms, manage distributed datacenter environments, and ensure the systems are secure and highly available, driven by your advanced technical knowledge.

What qualifications are necessary to be a Lead Systems Engineer for High-Performance Computing?

To excel as a Lead Systems Engineer for High-Performance Computing, candidates should possess deep technical expertise in x86 technologies, GPU systems, and server components, alongside a strong understanding of out-of-band management and hardware lifecycle management. Proficiency in scripting languages like PowerShell and Python is essential, along with experience in delivering solutions in highly secure and complex environments.

How does a Lead Systems Engineer impact the performance of high-performance computing environments?

A Lead Systems Engineer directly impacts the performance of high-performance computing environments by optimizing GPU as a Service and ensuring the infrastructure supports advanced workloads. By running performance benchmarks and evaluating new technologies, this role plays a crucial part in maintaining high availability and security, thus driving business efficiency.

What kind of experience is beneficial for the Lead Systems Engineer role in High-Performance Computing?

Beneficial experience for the Lead Systems Engineer role in High-Performance Computing includes extensive knowledge of datacenter operations and hardware management, and familiarity with automation tools such as Ansible for infrastructure management. Additionally, experience with both Linux and Windows platforms is advantageous for this position.

Is the Lead Systems Engineer for High-Performance Computing role a hybrid position?

Yes, the Lead Systems Engineer for High-Performance Computing position is hybrid, allowing you to alternate between remote work and on-site presence. Typically, employees are expected to work in the office for 2-3 set days a week, depending on business needs.

Common Interview Questions for Lead Systems Engineer, High-Performance Computing
What experience do you have with high-performance computing systems?

In your response, highlight specific projects or roles where you worked with high-performance computing systems, mentioning technologies and processes you have implemented or optimized.

Can you describe your experience with GPU as a Service?

Discuss your hands-on experience deploying and managing GPU as a Service, focusing on the challenges faced, solutions implemented, and the impacts on system performance.

How do you ensure security in complex operational environments?

Provide examples of security protocols and best practices you've implemented in previous roles, emphasizing your approach to maintaining high levels of security in datacenter operations.

What tools do you use for infrastructure management and automation?

List the tools you are proficient in, such as Ansible, and describe how you have used them to streamline deployment and operational tasks to improve efficiency.

Explain your approach to performance benchmarking.

Discuss the methodology you follow for benchmarking performance, which tools you use, and how the results influence decisions on technology implementations.

Can you discuss your knowledge of server components and their functionalities?

Share your in-depth understanding of server components like storage/network controllers and HBA, explaining how they contribute to overall system performance and reliability.

What scripting languages are you proficient in and how do you apply them?

Mention your proficiency in scripting languages such as PowerShell and Python, providing examples of tasks you've automated that significantly enhanced your team’s productivity.

How do you approach troubleshooting complex system issues?

Explain your systematic approach to troubleshooting, detailing the steps you take and tools you use to diagnose and resolve complex issues effectively.

Tell us about a time you mentored a junior team member.

Discuss the experience, focusing on how you guided the individual, shared knowledge, and helped them develop relevant skills within a team environment.

What challenges have you faced in hardware lifecycle management?

Share specific challenges related to hardware lifecycle management, what steps you took to address them, and the outcomes of those actions to underline your problem-solving skills.

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

Employment type: Full-time, hybrid
Date posted: April 3, 2025