
Data Center Site Operations Manager

Voltage Park is on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities to seed-stage startups and nonprofits. Providing seamless access to compute with pricing and inventory transparency is the future of access to GPUs, and we are the only cloud provider offering a platform that shows all available GPUs with transparent, market-based pricing, in addition to long-term reserve contracts for our customers.

We’re in search of a Data Center Site Operations Manager in the datacenter organization to oversee the operational integrity, maintenance, and efficiency of the data center's infrastructure and technical teams. This role focuses on ensuring that the data center's physical infrastructure runs smoothly and meets performance and availability standards, while aligning with the organization’s broader business objectives.

This role is based onsite in our Sterling, VA datacenter. We are unable to provide sponsorship for this position.

What you’ll do:

  • Infrastructure Management: Ensure the data center’s power, cooling, and physical infrastructure (including servers, racks, and networking equipment) are properly maintained and optimized to maximize uptime.

  • Team Leadership: Oversee and develop a team of technical staff responsible for day-to-day operations, including an onsite asset manager, fostering a culture of accountability, collaboration, and continuous improvement.

  • Ticketing System Oversight: Monitor and manage break-fix tickets through the organization’s ticketing system, ensuring issues are prioritized, assigned, and resolved in a timely manner by appropriate team members.

  • Response and Resolution Coordination: Coordinate responses to tickets that involve hardware repairs, component replacements, or network/server troubleshooting. Ensure timely dispatch and effective resolution by qualified personnel.

  • Tracking and Reporting: Track ticket progress to ensure issues are resolved within agreed Service Level Agreements (SLAs), and provide regular performance reports to senior management, covering metrics such as ticket resolution time and uptime.

  • Incident and Problem Management: Lead troubleshooting and incident management efforts for technical issues, including power failures, equipment malfunctions, or connectivity problems, aiming for swift resolution and minimal downtime.

  • Vendor and Asset Management: Manage relationships with external vendors for hardware, software, and facility services; oversee data center assets, from procurement to installation and lifecycle management.

  • Capacity and Performance Planning: Monitor infrastructure performance to meet current and projected demand, planning for necessary upgrades or expansions, and ensuring resources are allocated efficiently.

  • Compliance and Security: Ensure data center compliance with industry standards and regulations (e.g., ISO, SOC, HIPAA) and oversee the implementation of security protocols to protect data and systems.

  • Project Management: Manage and deliver data center projects related to expansions, migrations, and upgrades, coordinating cross-functional teams to meet project goals within schedule and budget.

Qualifications:

  • Minimum of 5 years of experience in data center operations, with a proven track record in team management, optimizing operations, and meeting uptime and SLA targets.

  • Strong knowledge of data center infrastructure, including power distribution, HVAC, cabling, networking, and server environments.

  • Experience with capacity planning, resource allocation, and budget management for efficient, cost-effective operations.

  • Proven leadership abilities in hiring, training, and developing technical teams, with a focus on fostering accountability and continuous improvement.

  • Excellent problem-solving and decision-making skills, with the ability to handle critical incidents under pressure to ensure timely resolution.

  • Strong communication and collaboration skills, with the ability to work effectively across cross-functional teams, stakeholders, and vendors.

  • Project management experience, particularly in coordinating deployments, decommissioning, and infrastructure upgrades, with a focus on adhering to schedules and budgets.

  • Metrics and KPIs: Proven experience in managing and achieving operational metrics, including uptime percentage, ticket resolution time, and overall customer satisfaction.

  • Preferred Certifications: Certifications such as PMP, Data Center Certified Associate (DCCA), or ITIL are a plus, reflecting advanced expertise in data center management practices.

Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.

Average salary estimate

$105,000 / year (est.)
Min: $90,000
Max: $120,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Data Center Site Operations Manager, Voltage Park

Looking to make a significant impact at a forward-thinking company? Voltage Park is seeking a talented Data Center Site Operations Manager to join our dynamic team in Sterling, VA. As a key member of our datacenter organization, you will oversee the operational integrity and efficiency of our data center while ensuring that it meets high performance and availability standards. In this exciting role, you’ll manage a dedicated team focused on maintaining our physical infrastructure, from power and cooling systems to networking equipment. Your leadership will cultivate a culture of accountability and continuous improvement among technical staff members, ensuring seamless daily operations. You’ll monitor our ticketing system, coordinate responses to hardware issues, and manage external vendor relationships. You’ll also engage in capacity planning to ensure our infrastructure meets current and future demands. Additionally, ensuring compliance with industry standards and overseeing security protocols will be crucial to maintaining our operations. With a minimum of five years’ experience in data center operations, you’ll leverage your knowledge of infrastructure management and problem-solving skills to tackle challenges head-on. If you're ready to drive results and elevate our operations to new heights, we want to hear from you!

Frequently Asked Questions (FAQs) for Data Center Site Operations Manager Role at Voltage Park
What are the responsibilities of a Data Center Site Operations Manager at Voltage Park?

At Voltage Park, a Data Center Site Operations Manager is responsible for overseeing the operational integrity, maintenance, and efficiency of the data center's infrastructure. This includes managing physical systems like power and cooling, leading technical teams, handling ticketing processes, and coordinating hardware repairs and troubleshooting efforts to ensure maximum uptime.

What qualifications are needed for the Data Center Site Operations Manager position at Voltage Park?

Candidates applying for the Data Center Site Operations Manager role at Voltage Park should have a minimum of five years of experience in data center operations, a strong understanding of data center infrastructure, and proven leadership abilities. Certifications such as PMP or ITIL could be beneficial and showcase advanced expertise in data center management practices.

What skills are important for a Data Center Site Operations Manager at Voltage Park?

Essential skills for a Data Center Site Operations Manager at Voltage Park include strong problem-solving and decision-making abilities, excellent communication and team collaboration skills, and expertise in capacity planning and resource management. Familiarity with compliance requirements and incident management is also crucial for success in this role.

How does Voltage Park ensure compliance within the data center?

Voltage Park ensures compliance by adhering to industry standards and regulations such as ISO, SOC, and HIPAA. The Data Center Site Operations Manager is also responsible for overseeing the implementation of security protocols that protect data and systems, ensuring consistent compliance throughout operations.

What does the team structure look like for a Data Center Site Operations Manager at Voltage Park?

In the Data Center at Voltage Park, the Site Operations Manager leads a team of technical staff responsible for daily operations. This includes an onsite asset manager, and the role involves fostering collaboration and accountability to drive continuous improvement within the team.

Common Interview Questions for Data Center Site Operations Manager
Can you describe your experience managing team operations in a data center?

When answering this question, focus on providing specific examples of your leadership style, how you developed team capabilities, and any metrics you achieved, such as improvement in uptime or ticket resolution times. Highlight situations where you fostered a culture of accountability and what strategies you used for team development.

How do you ensure the infrastructure of a data center is optimized?

Discuss your approach to monitoring infrastructure performance and your strategies for identifying opportunities for optimization. Include examples of specific practices like regular maintenance checks, performance metrics you track, and how you prioritize updates and expansions.

What steps do you take to handle critical incidents in a data center?

Provide a structured response outlining the steps you take during critical incidents, including how you assess problems, coordinate your team, communicate with stakeholders, and ensure swift resolutions while minimizing downtime.

How do you manage external vendors for data center operations?

Explain your approach to vendor management, including how you assess vendor performance, your criteria for selection, and how you maintain relationships to ensure quality service delivery. Notable experiences can elevate your response.

What metrics do you consider most important for data center operations?

Discuss the key performance indicators (KPIs) you focus on, such as uptime percentage, ticket resolution time, and customer satisfaction. Elaborate on how you track these metrics and use them for decision-making and continuous improvement.

How do you handle capacity planning in a data center?

Describe your experience with capacity planning by including your method for forecasting demand, analyzing trends, and planning for necessary upgrades. Provide examples of how you have successfully anticipated and met future needs.

Can you discuss a project you managed related to data center expansions?

Share a detailed account of a specific expansion project you managed, discussing objectives, challenges faced, how you coordinated cross-functional teams, and the final outcomes. Highlight your approach to adhering to schedules and budgets.

What experience do you have with compliance and regulatory standards in data centers?

Detail your familiarity with various compliance standards relevant in the industry and your role in ensuring adherence to these regulations. Provide examples of audits or compliance initiatives that you successfully led.

How do you foster a culture of continuous improvement within your technical team?

Focus on your techniques for encouraging team members to innovate and seek improvement, such as regular feedback sessions, training opportunities, and collaborative problem-solving. Cite specific examples of initiatives you have implemented.

What are the most challenging aspects of being a Data Center Site Operations Manager?

When discussing challenges, be honest about obstacles you have faced, such as managing critical incidents or balancing resource allocation. Share how you handled these situations and what you learned to improve processes in the future.


Voltage Park is building a new class of cloud infrastructure from the ground up. Join us, we're hiring!

Employment type: Full-time, on-site
Date posted: March 17, 2025
