Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Lead Systems Engineer, High-Performance Computing image - Rise Careers
Job details

Lead Systems Engineer, High-Performance Computing - job 9 of 20

IaaS Systems and Storage & Engineering (ISSE) team is part of the Operations & Infrastructure technology organization. Distributed Compute engineering (DCE) is part of ISSE and High-performance compute platform engineering is part of DCE. Our vision, mission and purpose are summarized as following:

Vision: To become a leading technical engineering professional, pioneering in the design and automation of server infrastructure. We envision creating highly secure and efficient operations environments that drive business success and technological advancement.

Mission: Our mission is to deliver high-quality server infrastructure design and automated implementation. We are committed to operating in complex, highly secure, and highly available environments, while maintaining rigorous operations, security, and procedural models.

Purpose: The purpose of this role is to utilize strong hands-on technical engineering skills to design and automate the implementation of server infrastructure based on business requirements. This role will interact with technology domain experts to maintain high security and availability in complex operational environments, thereby driving business efficiency and security.

Essential Functions:

  • GPU as a Service and High-Performance Compute Platform Support: Expertise in deploying, managing, and optimizing GPU as a Service (GaaS) and high-performance compute platforms to support advanced computational workloads.
  • Extensive Datacenter Experience: Proficient in managing complex, geographically distributed IT infrastructures to ensure high availability and performance.
  • Advanced Technical Knowledge: Profound understanding of high-performance, highly available, and secure computing systems utilizing x86 technologies and protocols (NVME, GPU, PCI-E).
  • Enterprise Server and Component Expertise: In-depth knowledge of server components (storage/network controllers, HBA, SSDs) and their functionalities, essential for maintaining high-performance compute environments.
  • Processor and GPU Systems Proficiency: Strong grasp of Intel/AMD architectures, GPU systems, memory hierarchy, and hardware-level security to enhance system performance and reliability.
  • Out-of-Band, UEFI, and BIOS Expertise: Comprehensive understanding of out-of-band management, UEFI, BIOS settings, and their impact on system performance and security in high-performance computing environments.
  • Hardware Lifecycle Management: Experienced in hardware lifecycle management, including firmware and OS driver certifications, to ensure the longevity and reliability of compute resources.
  • Infrastructure Management and Automation: Proficient in installing, configuring, supporting, and maintaining compute infrastructure management tools, with skills in Ansible for automation to streamline deployment and operational tasks.
  • Performance Benchmarking and Tech Evaluation: Capable of running performance benchmarks and evaluating new technologies for various platforms (Linux, Windows, containerized, and virtualized) to ensure optimal performance.
  • Scripting Proficiency: Advanced skills in scripting languages such as PowerShell and Python to automate and optimize infrastructure tasks.
  • Team and Independent Work: Highly motivated, excellent team player, capable of working independently, with strong analytical and troubleshooting abilities to resolve complex issues and mentor junior staff.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Lead Systems Engineer, High-Performance Computing, Visa

As a Lead Systems Engineer specializing in High-Performance Computing at our Ashburn location, you'll be a pivotal member of the IaaS Systems and Storage & Engineering (ISSE) team, which thrives on innovation within the Operations & Infrastructure technology organization. We take pride in our vision to spearhead the design and automation of server infrastructure while ensuring efficiency and security in our operations. Your expertise will be crucial as you'll design and automate server infrastructure that meets the specific demands of our business environment. You’ll leverage your extensive experience in deploying and managing GPU as a Service and high-performance compute platforms, ensuring that our advanced computational workflows are top-notch. Your day-to-day will involve handling complex, geographically distributed IT infrastructures to maintain high availability and performance levels. With a strong foundation in x86 technologies, you'll also have opportunities to engage in hardware lifecycle management and performance benchmarking. Furthermore, we champion a culture of collaboration while recognizing the value of independent problem-solving. In this hybrid role, you’ll balance between working from the office and remotely, participating in a dynamic team environment that nourishes professional growth and encourages innovation. Your journey with us as a Lead Systems Engineer will not only bolster your technical acumen but also play a vital role in shaping our organizational success and technological advancements.

Frequently Asked Questions (FAQs) for Lead Systems Engineer, High-Performance Computing Role at Visa
What responsibilities does a Lead Systems Engineer at High-Performance Computing have?

As a Lead Systems Engineer at High-Performance Computing, your primary responsibilities include designing and automating server infrastructure, deploying and optimizing GPU as a Service, and managing complex IT environments to ensure high performance and availability. You'll work closely with technology experts, engage in hardware lifecycle management, and run performance benchmarks to evaluate new technologies, all the while maintaining top-tier security.

Join Rise to see the full answer
What qualifications are necessary for the Lead Systems Engineer position at High-Performance Computing?

To excel as a Lead Systems Engineer at High-Performance Computing, candidates should possess advanced knowledge in high-performance computing systems, a strong grasp of Intel/AMD architectures, and proficiency in scripting languages like PowerShell and Python. Additionally, having extensive datacenter experience and familiarity with key technologies such as NVME, GPU, PCI-E, and expertise in infrastructure management tools are essential.

Join Rise to see the full answer
What skills are essential for success as a Lead Systems Engineer in High-Performance Computing?

Successful Lead Systems Engineers at High-Performance Computing must demonstrate advanced technical knowledge of computing systems, especially regarding out-of-band management and high-security environments. Key skills include expertise in server components, infrastructure management, performance benchmarking, and strong analytical abilities coupled with excellent team collaboration and mentoring capabilities.

Join Rise to see the full answer
Can you describe the work environment for the Lead Systems Engineer at High-Performance Computing?

The work environment for the Lead Systems Engineer at High-Performance Computing is a dynamic hybrid model, allowing for a mix of remote work and in-office collaboration. Employees can expect to engage interactively with their teams while also having the autonomy to manage their tasks independently. The culture encourages knowledge sharing and collective problem-solving to enhance overall efficiency.

Join Rise to see the full answer
What kind of growth opportunities are available for a Lead Systems Engineer at High-Performance Computing?

A Lead Systems Engineer at High-Performance Computing will find numerous growth opportunities, ranging from hands-on technical challenges, mentorship roles, to leadership positions within the team. With our commitment to fostering technical innovation and operational excellence, employees are supported in pursuing their professional development and career advancement.

Join Rise to see the full answer
Common Interview Questions for Lead Systems Engineer, High-Performance Computing
What is your experience with GPU as a Service in high-performance computing?

Discuss your practical experience in deploying and managing GPU as a Service. Highlight specific projects where you've optimized performance and addressed challenges in deploying high-performance compute platforms.

Join Rise to see the full answer
How do you ensure high availability in geographically dispersed IT infrastructures?

Talk about your strategies for managing high availability. Mention tools, processes, and any experience you have with automating failover and disaster recovery solutions in your previous roles.

Join Rise to see the full answer
Can you explain the importance of out-of-band management?

Outline your understanding of out-of-band management, focusing on how it enhances systems' performance and security. Share your practical experience dealing with BIOS and UEFI settings and their impact on high-performance computing.

Join Rise to see the full answer
What scripting languages are you proficient in and how have you used them?

Identify your proficiency in scripting languages like PowerShell and Python. Provide examples of how you've used scripting to automate infrastructure tasks, resulting in improved efficiency or reduced errors in operations.

Join Rise to see the full answer
Describe your process for running performance benchmarks for new technologies.

Outline the steps you take in running performance benchmarks. Discuss how you evaluate technologies across platforms (Linux, Windows) and what criteria you consider for performance optimization.

Join Rise to see the full answer
How do you handle complex issues in a high-performance computing environment?

Explain your troubleshooting process. Highlight your analytical skills and how your approach can quickly identify and resolve complex issues while maintaining system performance and security.

Join Rise to see the full answer
What are the key considerations for hardware lifecycle management?

Discuss the critical aspects of hardware lifecycle management, including updating firmware, OS driver certifications, and ensuring that systems remain reliable and efficient over time.

Join Rise to see the full answer
Can you provide examples of how you've mentored junior staff in previous roles?

Share specific instances where you've successfully mentored junior engineers. Highlight the skills you helped them develop and any positive outcomes from your mentoring efforts.

Join Rise to see the full answer
What is your approach to team collaboration in IT projects?

Articulate how you work within a team setting, maintaining effective communication and ensuring all team members contribute to project success. Talk about tools or methodologies you use to support collaboration.

Join Rise to see the full answer
What motivates you to excel as a Lead Systems Engineer?

Share your intrinsic motivations, such as a passion for technology and innovation, a desire to drive efficiency in systems engineering, or the satisfaction of solving complex problems in high-performance environments.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Visa Remote Foster City, California, United States
Posted 3 days ago

Join Visa as a Finance Director in Management Reporting, leading a team to enhance financial reporting and insights.

Photo of the Rise User
Visa Remote Foster City, CA
Posted 3 days ago

Join Visa as a Sr. Software Engineer to work with cutting-edge technologies in a hybrid environment.

Photo of the Rise User
Posted 8 days ago
Photo of the Rise User
Boeing Hybrid US, Saint Louis County, MO; Missouri, Berkeley, MO
Posted 12 days ago
Photo of the Rise User
Posted 2 days ago

Join Bone Dry Roofing as a Solar Systems Electrician to enhance your skills in solar energy installation and maintenance.

Photo of the Rise User
The Cigna Group Remote Santa Monica, California, United States
Posted 9 days ago
Photo of the Rise User
WSP Hybrid Austin, Texas, United States
Posted 10 days ago
Photo of the Rise User
Posted 10 days ago

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

8343 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 3, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!