Job details

Lead Systems Engineer, High-Performance Computing

The IaaS Systems and Storage & Engineering (ISSE) team is part of the Operations & Infrastructure technology organization. Distributed Compute Engineering (DCE) is part of ISSE, and High-Performance Compute Platform Engineering is part of DCE. Our vision, mission, and purpose are summarized as follows:

Vision: To become a leading technical engineering professional, pioneering in the design and automation of server infrastructure. We envision creating highly secure and efficient operations environments that drive business success and technological advancement.

Mission: Our mission is to deliver high-quality server infrastructure design and automated implementation. We are committed to operating in complex, highly secure, and highly available environments, while maintaining rigorous operations, security, and procedural models.

Purpose: The purpose of this role is to utilize strong hands-on technical engineering skills to design and automate the implementation of server infrastructure based on business requirements. This role will interact with technology domain experts to maintain high security and availability in complex operational environments, thereby driving business efficiency and security.

Essential Functions:

  • GPU as a Service and High-Performance Compute Platform Support: Expertise in deploying, managing, and optimizing GPU as a Service (GaaS) and high-performance compute platforms to support advanced computational workloads.
  • Extensive Datacenter Experience: Proficient in managing complex, geographically distributed IT infrastructures to ensure high availability and performance.
  • Advanced Technical Knowledge: Profound understanding of high-performance, highly available, and secure computing systems utilizing x86 technologies and protocols (NVMe, GPU, PCIe).
  • Enterprise Server and Component Expertise: In-depth knowledge of server components (storage/network controllers, HBA, SSDs) and their functionalities, essential for maintaining high-performance compute environments.
  • Processor and GPU Systems Proficiency: Strong grasp of Intel/AMD architectures, GPU systems, memory hierarchy, and hardware-level security to enhance system performance and reliability.
  • Out-of-Band, UEFI, and BIOS Expertise: Comprehensive understanding of out-of-band management, UEFI, BIOS settings, and their impact on system performance and security in high-performance computing environments.
  • Hardware Lifecycle Management: Experienced in hardware lifecycle management, including firmware and OS driver certifications, to ensure the longevity and reliability of compute resources.
  • Infrastructure Management and Automation: Proficient in installing, configuring, supporting, and maintaining compute infrastructure management tools, with skills in Ansible for automation to streamline deployment and operational tasks.
  • Performance Benchmarking and Tech Evaluation: Capable of running performance benchmarks and evaluating new technologies for various platforms (Linux, Windows, containerized, and virtualized) to ensure optimal performance.
  • Scripting Proficiency: Advanced skills in scripting languages such as PowerShell and Python to automate and optimize infrastructure tasks.
  • Team and Independent Work: Highly motivated, excellent team player, capable of working independently, with strong analytical and troubleshooting abilities to resolve complex issues and mentor junior staff.

This is a hybrid position. Hybrid employees can alternate between working remotely and in the office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate

$140,000 / yearly (est.)
Min: $120,000
Max: $160,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Lead Systems Engineer, High-Performance Computing, Visa

As a Lead Systems Engineer for High-Performance Computing at our Ashburn facility, you'll be at the forefront of designing and automating server infrastructure that powers our advanced computational workloads. You'll be part of the IaaS Systems and Storage & Engineering team, specifically within Distributed Compute Engineering. Our mission is to deliver highly secure, efficient, and reliable operations that drive technology and business success. Your role will draw on your hands-on technical skills to keep our GPU as a Service (GaaS) and high-performance compute platforms optimized and running reliably. With a deep understanding of datacenter management, including NVMe and GPU technologies, you'll maintain high availability in complex IT infrastructures. Whether it’s working on out-of-band management settings or benchmarking emerging technologies, your expertise will improve our systems' performance and reliability. You'll also use your scripting skills in PowerShell or Python to automate and streamline infrastructure management tasks. This hybrid position offers flexibility: you'll spend part of your week at our office and part working remotely. Join us as we build innovative server infrastructure solutions and advance high-performance computing.

Frequently Asked Questions (FAQs) for Lead Systems Engineer, High-Performance Computing Role at Visa
What are the main responsibilities of a Lead Systems Engineer in High-Performance Computing at this company?

The Lead Systems Engineer in High-Performance Computing is primarily responsible for designing and automating server infrastructures. This includes managing GPU as a Service (GaaS), overseeing complex IT infrastructures, and optimizing high-performance compute systems. Additionally, you'll engage in hardware lifecycle management and automation using tools like Ansible while running performance benchmarks to continuously improve system performance.

What qualifications are required for the Lead Systems Engineer role focused on High-Performance Computing?

Candidates for the Lead Systems Engineer position should have extensive experience in high-performance computing, with a strong understanding of x86 technologies, NVMe, and GPU systems. Proficiency in scripting languages such as PowerShell and Python is essential, along with a proven track record in managing datacenter environments. Deep knowledge of enterprise server components and expertise in hardware lifecycle management will also be highly beneficial.

How important is scripting and automation for the Lead Systems Engineer position in High-Performance Computing?

Scripting and automation play a crucial role in the Lead Systems Engineer position, particularly in streamlining operational tasks and infrastructure management. Proficiency in languages such as PowerShell and Python will enable you to automate deployments, optimize workflows, and manage complex environments effectively, ultimately enhancing performance and efficiency.

What does the hybrid work model look like for the Lead Systems Engineer in High-Performance Computing?

The hybrid work model for the Lead Systems Engineer role allows you to alternate between remote work and in-office days. Employees are expected to be in the office 2-3 set days a week, depending on business needs, which generally provides a balance of collaborative in-person interaction and the flexibility of remote work.

What potential growth opportunities exist for a Lead Systems Engineer in High-Performance Computing at this company?

As a Lead Systems Engineer in High-Performance Computing, you'll not only have opportunities to enhance your technical skills but also to mentor junior staff and lead projects. This position allows for professional growth through innovative projects and collaboration with technology experts, setting the stage for potential advancement into senior leadership roles within the organization.

Common Interview Questions for Lead Systems Engineer, High-Performance Computing
Can you describe your experience with GPU as a Service (GaaS) and its implementation?

Be prepared to discuss specific projects where you've successfully deployed and managed GPU resources. Highlight the technologies used, challenges faced, and the outcomes achieved. Explain your approach to optimizing performance and any automation scripts you may have developed.

What strategies do you use to ensure high availability in a complex IT infrastructure?

Discuss methodologies such as redundancy plans, load balancing, and regular performance assessments. Provide examples from your past where you implemented these strategies successfully to ensure uptime and reliability.

How do you approach performance benchmarking and tech evaluation?

Talk about your process for setting up benchmark tests, the metrics you focus on, and how you analyze results. Include any tools you've used for benchmarking and how those evaluations influenced hardware or software decisions in your past roles.

Can you explain your experience with scripting and how it has helped in infrastructure management?

Mention specific scripting projects where you automated routine tasks. Discuss the programming languages used and the impact these scripts had on efficiency, error reduction, and overall operational capability.

What are the key challenges you anticipate facing in this Lead Systems Engineer role?

Articulate potential challenges such as managing resource allocation among multiple projects, keeping up with rapidly evolving technology, and ensuring security in high-performance environments. Discuss how you plan to address these with both proactive and reactive strategies.

How do you stay updated on the latest trends and technologies in High-Performance Computing?

Share your methods for staying informed, such as attending conferences, participating in webinars, following industry publications, or being part of relevant communities. Highlight specific sources that have proven valuable in your continuous learning.

Can you provide an example of a time you resolved a complex technical issue?

Prepare a specific example that illustrates your analytical and troubleshooting abilities. Discuss the problem, the steps you took to identify the root cause, and the ultimate solution you implemented.

What is your experience with hardware lifecycle management in high-performance environments?

Explain your familiarity with managing hardware from acquisition through retirement. Include specifics about certifications, firmware updates, and how you've ensured long-term reliability of hardware resources.

How do you prioritize tasks in a dynamic and demanding environment?

Discuss your approach to prioritization, such as using tools and frameworks like Agile or Kanban. Provide examples of how you have successfully managed competing deadlines and demands in previous roles.

What do you consider when evaluating new technologies for deployment?

Emphasize factors such as compatibility, performance metrics, cost analysis, and expected ROI. Share insights on your evaluation process and any criteria you utilize to ensure informed decisions.


Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

Employment type: Full-time, hybrid
Date posted: April 3, 2025
