Job details

Director, Site Reliability Engineering

HHAeXchange is the leading technology platform for home and community-based care. Founded in 2008, HHAeXchange was born out of an idea to create a fully comprehensive end-to-end homecare solution to help people who are aging or have disabilities thrive in their homes and communities. Our employees are passionate about transforming the healthcare space by building the only homecare ecosystem that fully connects patients, personal care providers, managed care organizations, and states. 

 

The Director, Site Reliability Engineering is responsible for leading the teams that manage and support all of our hosting services, including colocated hardware and cloud-based services, as well as defining and operating the processes for change management, financial management and incident response.

 

To perform this job successfully, an individual must be able to perform each essential job duty satisfactorily with or without reasonable accommodation.  Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.


Essential Job Duties
  • Ensure high availability, scalability, and security of cloud services across multiple geographies.
  • Implement and improve automation, incident management, and capacity planning practices.
  • Lead and mentor a team of Site Reliability Engineers and leaders. Lead the transformation of the organization to an SRE model.
  • Integrate the technology, practices and policies of disparate organizations into a single cohesive team that supports disparate technologies and platforms with minimal variation in practice.
  • Develop and execute strategic plans for cloud infrastructure and operations to support business growth and acquisitions.
  • Oversee the management and optimization of cloud infrastructure for cost-efficiency.
  • Maintain and improve monitoring, logging, and alerting systems.
  • Collaborate closely with product development teams to facilitate delivery of new functionality and capabilities to our SaaS platform and hosted products.
  • Champion and support the transformation to a DevOps culture.
  • Develop and manage budgets for cloud infrastructure and tooling.
  • Evaluate and implement new technologies and tools to enhance cloud infrastructure and operations.
  • Foster a culture of continuous improvement, collaboration, and innovation.


Other Job Duties
  • Other duties as assigned by supervisor or HHAeXchange leader.


Travel Requirements
  • Travel up to 10%, including overnight travel


Required Education, Experience, Certifications and Skills
  • Bachelor’s or master’s degree in Computer Science, Engineering, or a related field.
  • 10+ years of experience in cloud engineering and operations, with at least 5 years in a leadership role.
  • Proven experience with managing large scale AWS cloud platforms.
  • Deep understanding of modern SRE practices and principles.
  • Experience with cloud infrastructure tools (monitoring, deployment, security).
  • Excellent leadership, communication, and interpersonal skills.
  • Proven experience driving process and culture transformation across organizations.
  • Ability to work effectively with cross-functional teams and stakeholders.
  • Strong problem-solving and decision-making abilities.


The base salary range for this US-based, full-time, exempt position is $185,000-$205,000, not including variable compensation. An employee’s exact starting salary will be based on various factors including but not limited to experience, education, training, merit, location, and the ability to exemplify the HHAeXchange core values.

 

This is a benefits-eligible position. HHAeXchange offers competitive health plans, paid time off, company-paid holidays, a 401(k) retirement program with a Company-elected match, and other company-sponsored programs.


HHAeXchange is an equal-opportunity employer. The Company offers employment opportunities to all applicants and employees without regard to race, color, religion, national origin, sex, sexual orientation, gender identity or expression, age, disability, medical condition, marital status, veteran status, citizenship, genetic information, hairstyles, or any other status protected by local or federal law.

HHAeXchange Glassdoor Company Review: 3.1 stars
HHAeXchange DE&I Review: No rating
CEO of HHAeXchange: Greg Strobel

Average salary estimate

$195,000 / year (est.)
Min: $185,000
Max: $205,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Director, Site Reliability Engineering, HHAeXchange

HHAeXchange is looking for a passionate Director of Site Reliability Engineering to join our dynamic team. If you're someone who thrives in a fast-paced environment and is eager to lead a group dedicated to transforming the healthcare space, this role is perfect for you! As the Director, you will play a pivotal role in overseeing our hosting services, which encompass both colocated hardware and cloud-based solutions, ensuring they remain high-performing and secure. Your main focus will be to enhance our change management, financial management, and incident response processes. Imagine leading a talented team of Site Reliability Engineers while guiding the transition toward a more integrated SRE model. Your expertise in cloud infrastructure will be crucial, as you’ll be responsible for optimizing our services to support business growth. Collaborating closely with product development teams, you'll facilitate the seamless delivery of new functionalities that support our innovative SaaS platform. At HHAeXchange, we foster a culture of continuous improvement, so expect to champion the transformation to a robust DevOps culture as well. With opportunities to evaluate and implement cutting-edge technologies, this position is not just a job; it’s a chance to make a meaningful impact in the lives of those needing home care. We offer competitive salaries and a comprehensive benefits package, because we value what you bring to our mission. If you’re ready to take the next step in your career and help us redefine home and community-based care, apply today!

Frequently Asked Questions (FAQs) for Director, Site Reliability Engineering Role at HHAeXchange
What are the main responsibilities of the Director, Site Reliability Engineering at HHAeXchange?

The Director, Site Reliability Engineering at HHAeXchange is responsible for leading teams that manage and support hosting services and ensuring their high availability, scalability, and security. This includes implementing automation, steering incident management and capacity planning, mentoring Site Reliability Engineers, developing strategic plans for cloud infrastructure, and fostering a culture of continuous improvement.

What qualifications are required for the Director, Site Reliability Engineering position at HHAeXchange?

To qualify for the Director, Site Reliability Engineering position at HHAeXchange, candidates should possess a bachelor's or master's degree in Computer Science, Engineering, or a related field, coupled with over 10 years of experience in cloud engineering and operations, including at least 5 years in a leadership role. Proficiency in managing large-scale AWS platforms and modern SRE practices is essential.

What is the expected salary range for the Director, Site Reliability Engineering role at HHAeXchange?

The base salary range for the Director, Site Reliability Engineering role at HHAeXchange is between $185,000 and $205,000, excluding variable compensation. The exact starting salary will depend on factors such as experience, education, training, merit, and location.

How does HHAeXchange foster a culture of continuous improvement for the Director, Site Reliability Engineering?

At HHAeXchange, we emphasize a culture of continuous improvement, where the Director, Site Reliability Engineering will be tasked with leading transformation initiatives that enhance collaboration, innovation, and operational efficiency. Employees are encouraged to engage with cross-functional teams and share ideas that will help elevate our practices in site reliability engineering.

What are the travel requirements for the Director, Site Reliability Engineering position at HHAeXchange?

The Director, Site Reliability Engineering role at HHAeXchange requires travel of up to 10%, which may include some overnight stays. This occasional travel will support collaboration with teams across various locations as part of the company’s operational strategy.

Common Interview Questions for Director, Site Reliability Engineering
Can you describe your experience with cloud infrastructure management as it relates to the Director, Site Reliability Engineering role?

In your answer, focus on specific projects where you successfully managed cloud infrastructure. Highlight your experience with AWS services and any relevant tools you’ve used for monitoring, deployment, and security. Show how your previous roles have prepared you for leading SRE practices.

What strategies would you employ to transition a team to an SRE model?

Discuss the importance of clear communication, training, and gradual implementation of SRE principles. Emphasize your experience in guiding teams through organizational changes and your approach to mentoring individuals to embrace these practices effectively.

How would you optimize a cloud infrastructure for cost-efficiency while maintaining performance?

Provide concrete examples of initiatives you’ve taken to analyze cloud spending, like implementing auto-scaling, identifying underutilized resources, and setting budgets. Discuss how metrics influence your decision-making process in balancing cost with performance.

How do you ensure high availability and reliability in cloud services?

Speak about implementing redundancy, load balancing, and failover strategies. Explain the tools you’ve used to monitor service health and how they help proactively manage outages. Illustrate the measures you’ve taken to maintain reliability across geographic locations.

What is your approach to incident management within an SRE framework?

Detail your experience with incident response plans and post-mortems. Talk about key performance indicators (KPIs) you track and how you’ve utilized them to refine processes and drive continuous improvement in incident management.

How do you keep up with the latest SRE practices and tools?

Mention specific resources you stay connected with, like conferences, webinars, and community forums. Share how you incorporate emerging technologies into your strategies and how that has positively impacted your previous teams.

Can you give an example of a challenging leadership situation you've faced and how you overcame it?

Focus on a specific incident that showcases your leadership and problem-solving skills. Discuss the strategies you employed to rally your team and turn the situation around, emphasizing your communication and conflict resolution skills.

How do you approach collaboration with product development teams?

Discuss your philosophy on cross-functional collaboration, and provide examples of how you have successfully navigated differing priorities. Emphasize the importance of aligning SRE initiatives with product development goals to foster a seamless release process.

What role does continuous improvement play in site reliability engineering?

Articulate your belief in the importance of continuous improvement within SRE. Discuss methods you’ve implemented to encourage team members to contribute ideas, analyze their impact on operations, and evaluate them regularly to foster an innovative environment.

How would you evaluate new tools for enhancing cloud infrastructure and operations?

Outline a systematic approach for assessing new tools based on criteria like efficiency, usability, scalability, and cost. Share past experiences of how you successfully implemented new technologies that made a significant positive difference in operations.


Our mission is to enable the most effective homecare ecosystem every day.

Employment type: Full-time, on-site
Date posted: March 23, 2025
