Voltage Park is on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities to seed-stage startups and nonprofits. Seamless access to compute, with transparent pricing and inventory, is the future of GPU access, and we are the only cloud provider offering a platform that shows all available GPUs with transparent, market-based pricing, in addition to long-term reserve contracts for our customers.
We’re in search of a Data Center Site Operations Manager in our data center organization to oversee the operational integrity, maintenance, and efficiency of the data center's infrastructure and technical teams. This role focuses on ensuring that the data center's physical infrastructure runs smoothly and meets performance and availability standards, while aligning with the organization’s broader business objectives.
This role is based onsite at our Sterling, VA data center. We are unable to provide sponsorship for this position.
What you’ll do:
Infrastructure Management: Ensure the data center’s power, cooling, and physical infrastructure (including servers, racks, and networking equipment) are properly maintained and optimized to maximize uptime.
Team Leadership: Oversee and develop a team of technical staff responsible for day-to-day operations, including an onsite asset manager, fostering a culture of accountability, collaboration, and continuous improvement.
Ticketing System Oversight: Monitor and manage break-fix tickets through the organization’s ticketing system, ensuring issues are prioritized, assigned, and resolved in a timely manner by appropriate team members.
Response and Resolution Coordination: Coordinate responses to tickets that involve hardware repairs, component replacements, or network/server troubleshooting. Ensure timely dispatch and effective resolution by qualified personnel.
Tracking and Reporting: Track ticket progress to ensure issues are resolved within agreed Service Level Agreements (SLAs), and provide regular performance reports to senior management, covering metrics such as ticket resolution time and uptime.
Incident and Problem Management: Lead troubleshooting and incident management efforts for technical issues, including power failures, equipment malfunctions, or connectivity problems, aiming for swift resolution and minimal downtime.
Vendor and Asset Management: Manage relationships with external vendors for hardware, software, and facility services; oversee data center assets, from procurement to installation and lifecycle management.
Capacity and Performance Planning: Monitor infrastructure performance to meet current and projected demand, planning for necessary upgrades or expansions, and ensuring resources are allocated efficiently.
Compliance and Security: Ensure data center compliance with industry standards and regulations (e.g., ISO, SOC, HIPAA) and oversee the implementation of security protocols to protect data and systems.
Project Management: Manage and deliver data center projects related to expansions, migrations, and upgrades, coordinating cross-functional teams to meet project goals within schedule and budget.
Qualifications:
Minimum of 5 years of experience in data center operations, with a proven track record in team management, optimizing operations, and meeting uptime and SLA targets.
Strong knowledge of data center infrastructure, including power distribution, HVAC, cabling, networking, and server environments.
Experience with capacity planning, resource allocation, and budget management for efficient, cost-effective operations.
Proven leadership abilities in hiring, training, and developing technical teams, with a focus on fostering accountability and continuous improvement.
Excellent problem-solving and decision-making skills, with the ability to handle critical incidents under pressure to ensure timely resolution.
Strong communication and collaboration skills, with the ability to work effectively across cross-functional teams, stakeholders, and vendors.
Project management experience, particularly in coordinating deployments, decommissioning, and infrastructure upgrades, with a focus on adhering to schedules and budgets.
Metrics and KPIs: Proven experience in managing and achieving operational metrics, including uptime percentage, ticket resolution time, and overall customer satisfaction.
Preferred Certifications: Certifications such as PMP, Data Center Certified Associate (DCCA), or ITIL are a plus, reflecting advanced expertise in data center management practices.
Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.