Job details

Senior Site Reliability Engineer

The exciting world of scientific research is fueled by people with a passion for solving complex problems. At Cayuse, we are committed to our customers’ success by empowering organizations to conduct globally connected research that advances their impact on science, discovery, and society. We build on that commitment with proven, integrated, and easy-to-use technology that delivers exceptional value, and world-class service and support that accelerates outcomes.

But we are more than just an empowering platform powered by advanced technologies. We are a collaboration of exceptional, highly skilled people with multi-disciplinary expertise, and are building our team to support our ambitious growth plans. Cayuse’s foundational strength comes from our customer- and employee-focused values and commitment to industry-leading solutions. It’s an exciting time to become a key member of our growing team.

As a Senior Site Reliability Engineer, you will be a key technical leader, driving the reliability, scalability, and efficiency of our cloud-based infrastructure and SaaS products. This role combines deep technical expertise with a passion for mentoring and sharing knowledge. You will leverage your deep experience with AWS, SRE principles, automation, and infrastructure management to improve our systems, while also guiding and supporting your colleagues in their technical growth. Your focus will be on hands-on technical work, mentoring, and contributing to the overall improvement of our SRE practices, with a strong emphasis on automation using tools like Terraform and Bitbucket Pipelines.

 

Responsibilities

Technical Leadership and Mentorship

  • Serve as a technical expert and mentor to other engineers, sharing knowledge and best practices.
  • Lead by example, demonstrating strong technical proficiency in SRE principles and practices, specifically within the AWS ecosystem.
  • Contribute to the development and implementation of SRE standards and guidelines, tailored to AWS best practices.
  • Foster a culture of continuous learning and improvement within the team.
  • Help others to grow their automation skillsets.

Infrastructure and Automation

  • Design, build, and maintain robust and scalable infrastructure using Terraform, leveraging AWS services effectively.
  • Develop and optimize CI/CD pipelines using Bitbucket Pipelines, integrating seamlessly with AWS deployment strategies.
  • Implement and maintain monitoring and logging solutions to ensure system observability, utilizing AWS monitoring tools.
  • Automate infrastructure and operational tasks to reduce toil and improve efficiency, with a focus on AWS automation.
  • Contribute to the development and maintenance of automation tools and scripts.
  • Troubleshoot complex infrastructure and application issues within the AWS environment.

Reliability and Incident Management

  • Participate in incident response and root cause analysis, contributing to the resolution of critical issues on AWS.
  • Define and monitor SLOs/SLAs to ensure system reliability, using AWS metrics and monitoring.
  • Contribute to disaster recovery planning and testing, utilizing AWS disaster recovery capabilities.
  • Analyze system performance and identify areas for improvement within AWS.
  • Proactively find and resolve potential issues before they become incidents.

Collaboration and Improvement

  • Collaborate with development, operations, and other teams to ensure smooth and efficient operations on AWS.
  • Contribute to code reviews and technical discussions.
  • Identify and implement process improvements to enhance team efficiency and effectiveness.
  • Document best practices and create knowledge-sharing resources.
  • Participate in agile ceremonies.

 

Qualifications

  • Deep experience with AWS, including core services like EC2, S3, RDS, Lambda, CloudWatch, and EKS, and a solid understanding of AWS networking (VPC, Security Groups) and security fundamentals (IAM).
  • 4+ years of experience working with public cloud technologies (AWS preferred).
  • 4+ years of experience developing monitoring and log analysis tools, including proficiency with Grafana and New Relic.
  • Deep understanding of Site Reliability Engineering (SRE) principles, platforms, and tools.
  • Proven experience with Terraform and Bitbucket Pipelines.
  • Strong understanding of CI/CD pipelines and SDLC.
  • Experience with Docker and Kubernetes.
  • Proficiency in scripting languages (bash, Python).
  • Experience implementing and managing security controls and tools.
  • Understanding of security systems and best practices.
  • Experience with git and code branching/merging strategies.
  • Experience with Agile methodologies (Scrum, Kanban).
  • Strong problem-solving and troubleshooting skills.
  • Excellent communication and collaboration skills.
  • Passion for mentoring and sharing knowledge.
  • Automation-first mindset.
  • Ability to own medium to large technical projects.

 

Benefits

  • Competitive Medical Benefits (PPO + HSA available)
  • Vision, Dental, Short-Term Disability fully covered by Cayuse
  • Unlimited PTO + Holidays + Flexible Work Schedule
  • Remote Work Stipend
  • Equal Paid Parental Leave
  • 401k with Employer Matching
  • Quarterly Wellness Reimbursement
  • Remote Work Environment, supporting the Ultimate Employee Experience 

 

Cayuse does not accept agency resumes. Please do not forward resumes to our jobs alias or any Cayuse employees. Cayuse is not responsible for any fees related to unsolicited resumes.

Our culture is one of inclusion and belonging where everyone feels respected, treated justly, supported and nourished. We all share responsibility for creating and sustaining a work environment where differences are celebrated and we are empowered to strive for excellence. We’re proud to be an equal opportunity employer and actively seek to recruit, develop, and retain a diverse and talented workforce.

Average salary estimate

$135,000 / year (est.)
Min: $120,000
Max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior Site Reliability Engineer, Cayuse

At Cayuse, we're redefining the exciting world of scientific research with our commitment to empowering organizations in their mission for impactful discovery. As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring our cloud-based infrastructure and SaaS products are both reliable and efficient. If you're passionate about driving technical excellence and mentoring fellow engineers, this opportunity is for you! You'll be working in a relaxed, remote environment where your expertise in AWS and SRE principles will shine. Your daily adventures will include designing scalable infrastructure using Terraform, optimizing CI/CD pipelines with Bitbucket, and developing strong monitoring solutions. You will be a beacon of knowledge, guiding your team through best practices while proactively addressing complex infrastructure challenges. Here at Cayuse, collaboration is not just encouraged - it's at the heart of our culture. You’ll work closely with various teams, contribute to code reviews, and help shape a culture focused on continuous improvement. Plus, with our flexible work schedule and commitment to employee well-being, you’ll find a work-life balance that enhances productivity and creativity. This is your chance to lead with impact and help build a company that prides itself on innovation and collective success. Join us in creating technology that advances science and society - we can’t wait to welcome you to our team!

Frequently Asked Questions (FAQs) for Senior Site Reliability Engineer Role at Cayuse
What are the key responsibilities of a Senior Site Reliability Engineer at Cayuse?

As a Senior Site Reliability Engineer at Cayuse, you are responsible for driving the reliability and performance of our cloud infrastructure. You'll engage in technical leadership, mentoring fellow engineers while implementing robust infrastructure designs using Terraform and AWS. Your role will also involve automating tasks, contributing to incident management, and collaborating across departments to enhance overall operations.

What qualifications do I need to apply for the Senior Site Reliability Engineer position at Cayuse?

To be considered for the Senior Site Reliability Engineer role at Cayuse, candidates should have a deep understanding of AWS services and at least 4 years of experience with public cloud technologies. Proficiency with Terraform, CI/CD pipelines, and monitoring tools like Grafana or New Relic is also essential. Strong communication skills and a passion for mentoring are key in fostering an effective team environment.

What kind of experience is required for the Senior Site Reliability Engineer role at Cayuse?

Cayuse looks for candidates with at least 4 years of experience with public cloud technologies (AWS preferred) and at least 4 years developing monitoring and log analysis tools, together with a deep understanding of SRE principles. Hands-on experience with core AWS services, proficiency in scripting languages (bash, Python), and a strong understanding of CI/CD pipelines are also needed to excel in this position.

How does Cayuse support the professional growth of a Senior Site Reliability Engineer?

At Cayuse, professional growth is paramount. As a Senior Site Reliability Engineer, you will not only work on enriching projects but also have opportunities for mentorship and continuous learning. Our environment encourages knowledge sharing, and our culture of inclusion means you will thrive among peers committed to collective excellence.

What benefits can I expect if I become a Senior Site Reliability Engineer at Cayuse?

Cayuse offers a comprehensive benefits package for its Senior Site Reliability Engineers, which includes competitive medical, vision, and dental benefits, unlimited PTO, and a flexible work schedule. Employees also receive a remote work stipend, a 401k with employer matching, and a quarterly wellness reimbursement.

Common Interview Questions for Senior Site Reliability Engineer
Can you describe your experience with AWS services relevant to a Senior Site Reliability Engineer role?

When answering this question, detail your hands-on experience with core AWS services such as EC2, S3, and RDS. Highlight specific projects where you've deployed services and how they contributed to enhancing system reliability or performance. Emphasize any challenges faced and how you overcame them.

How do you approach incident management and resolution in your role as a site reliability engineer?

Discuss your methodical approach to incident response, including how you prioritize incidents, gather data for root cause analysis, and implement solutions. Providing an example of a specific incident you've managed can be beneficial to showcase your experience in real-world scenarios.

What automation tools have you worked with, and how have they improved efficiency in your previous roles?

Share specific examples of automation tools, such as Terraform or Bitbucket Pipelines, that you've used. Explain how you’ve created automated workflows that reduced manual toil, highlighting any metrics that demonstrate the efficiency gained.

Can you discuss your experience with CI/CD processes?

Provide a comprehensive answer focused on your understanding of CI/CD principles. Describe a pipeline you've built, the tools used, and any challenges faced throughout the implementation process. Discuss how you ensured reliability and speed during deployments.

What role does monitoring play in your work as a Senior Site Reliability Engineer?

Articulate how you utilize monitoring tools to ensure system health. Explain how proactive monitoring can prevent incidents, the metrics you focus on, and how you use this data for performance improvement. Examples of your past work with tools like Grafana will strengthen your answer.

How would you mentor a junior engineer on your team?

Explain your mentoring philosophy, emphasizing the importance of guidance and knowledge sharing. Discuss how you would structure mentorship sessions—whether through pair programming, code reviews, or informal discussions—to foster their growth in SRE practices.

How do you prioritize tasks in a fast-paced SRE environment?

Discuss your approach to task prioritization based on impact, urgency, and team needs. Providing a framework or tool that you utilize to keep on track can demonstrate your organizational skills and ability to handle multiple priorities effectively.

What best practices do you follow for documentation in site reliability engineering?

Explain why documentation matters for maintaining consistency and shared knowledge. Describe how you document processes, automation scripts, and incident responses to keep the team aligned and make onboarding easier for new hires.

Describe a time you improved SRE practices in your previous job.

Provide a concrete example of an initiative you led or contributed to that enhanced SRE practices. Be specific about the challenges before your improvement, the strategies implemented, and the positive outcomes resulting from your efforts.

What are your thoughts on the future of site reliability engineering?

Share your insights on trends you see shaping the future of SRE, such as increased automation or the growing importance of observability. Discuss how you plan to stay updated on these advancements and incorporate them into your work.


Cayuse provides an expanding product suite that simplifies growing and preserving the research funding, rankings and reputations of universities, research hospitals, and research institutes. Founded in 1994, the company is recognized for providing...

Employment type: Full-time, remote
Date posted: March 16, 2025
