Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer image - Rise Careers
Job details

Site Reliability Engineer

Who we are

At CarGurus (NASDAQ: CARG), our mission is to give people the power to reach their destination. We started as a small team of developers determined to bring trust and transparency to car shopping. Since then, our history of innovation and go-to-market acceleration has driven industry-leading growth. In fact, we’re the largest and fastest-growing automotive marketplace, and we’ve been profitable for over 15 years.

What we do

The market is evolving, and we are too, moving the entire automotive journey online and guiding our customers through every step. That includes everything from the sale of an old car to the financing, purchase, and delivery of a new one. Today, tens of millions of consumers visit CarGurus.com each month, and ~30,000 dealerships use our products. But they're not the only ones who love CarGurus—our employees do, too. We have a people-first culture that fosters kindness, collaboration, and innovation, and empowers our Gurus with tools to fuel their career growth. Disrupting a trillion-dollar industry requires fresh and diverse perspectives. Come join us for the ride!

Role overview

As a member of the CarGurus reliability team, the site reliability engineer will be responsible for defining, maintaining, and promulgating best practices and tools for SRE and observability.

What you’ll do

  • Linux administration, site reliability best practices, incident management, critical on call.
  • Collaborating with Engineering and Product Managers to define SLOs and monitoring of well-designed SLIs
  • Embedding with Engineering teams and independently addressing issues or collaborating to improve operational excellence
  • Being the primary point of escalation and on the on call rotation for major engineering incidents
  • Owning our Incident Response Process, including conducting blameless Postmortems
  • Partnering with Engineering teams to ensure new services are production-ready
  • Championing our organizational standards for architecting, observing, deploying, and scaling our products
  • Evolving and maintaining our tracing, logging, monitoring, alerting, and other observability systems to increase observability and transparency
  • Educating the company on observability tools and troubleshooting techniques and practices
  • Making Data-Driven decisions to drive continuous improvement
  • Refusing to accept manual work as a solution to areas of weakness

What you’ll bring

  • Linux administration, SRE theory and vocabulary, basic coding and scripting, production experience, incident management experience.
  • A proven background in software engineering with multiple languages and significant relative operational experience running revenue-critical services at scale
  • Understanding of technologies beyond coding such as Load Balancing, Configuration Management, Kubernetes, Terraform and Observability Systems
  • Comfort in dealing with Incidents and Availability Issues under pressure
  • Familiarity and experience working with cloud infrastructure in an AWS environment
  • Familiarity with modern best Site Reliability Engineering practices and theory
  • Comfort and skill in written and verbal communication across teams and organizations
  • Excitement in solving puzzles, discovering how a new service or tool works by identifying the individual components, libraries, and relationships it is built upon
  • A bias for action, but sufficient emotional intelligence to approach colleagues with positive regard and understanding their challenges and decisions
  • Curiosity and the acceptance that there are always ways to learn and grow
  • The desire to be an active contributor in a collaborative and fast-paced environment


Working at CarGurus

We reward our Gurus’ curiosity and passion with best-in-class benefits and compensation, including equity for all employees, both when they start and as they continue to grow with us. Our career development and corporate giving programs, as well as our employee resource groups (ERGs) and communities, help people build connections while making an impact in personally meaningful ways. A flexible hybrid model and robust time off policies encourage work-life balance and individual well-being. Thoughtful perks like daily free lunch, a new car discount, meditation and fitness apps, commuting cost coverage, and more help our people create space for what matters most in their personal and professional lives.

We welcome all

CarGurus strives to be a place to which people can bring the ultimate expression of themselves and their potential—starting with our hiring process. We do not discriminate based on race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. We foster an inclusive environment that values people for their skills, experiences, and unique perspectives. That’s why we hope you’ll apply even if you don’t check every box listed in the job description. We also encourage you to tell your recruiter if you require accommodations to participate in our hiring process due to a disability so we can provide the appropriate support. We want to know what only you can bring to CarGurus. #LI-Hybrid

Average salary estimate

$110000 / YEARLY (est.)
min
max
$90000K
$130000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, CarGurus

At CarGurus, we’re on a mission to revolutionize the automotive marketplace, and we're looking for a passionate Site Reliability Engineer to join our dynamic team in Boston, Massachusetts. As a vital part of our reliability team, you’ll be at the forefront of driving our operational excellence and ensuring the highest availability of our services. In this role, you’ll have the opportunity to define and uphold best practices for SRE and observability, getting involved in exciting projects that impact millions of consumers. You'll engage in Linux administration, incident management, and collaborate closely with Engineering and Product Managers to set Service Level Objectives (SLOs) and monitor Key Performance Indicators (KPIs). Your daily work will involve addressing operational issues, conducting blameless postmortems, and championing organizational standards for deploying and scaling our services. With your strong background in software engineering and familiarity with technologies like Kubernetes and AWS, you'll help us evolve our observability systems, making data-driven decisions that drive continuous improvement. At CarGurus, we celebrate a people-first culture that values collaboration, innovation, and growth, ensuring that our Gurus have all the necessary tools at their disposal. With generous benefits, a flexible work environment, and a commitment to diversity and inclusion, joining CarGurus as a Site Reliability Engineer means not just advancing your career but being part of a team that's making a lasting impact in the industry. Come ride the wave of automotive innovation with us!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at CarGurus
What are the key responsibilities of a Site Reliability Engineer at CarGurus?

As a Site Reliability Engineer at CarGurus, your primary responsibilities will include defining best practices for site reliability, incident management, and collaborating with engineering teams to ensure new services are production-ready. You’ll manage critical on-call duties, define Service Level Objectives (SLOs), and improve the operational excellence of our services.

Join Rise to see the full answer
What qualifications do I need to apply for the Site Reliability Engineer position at CarGurus?

To apply for the Site Reliability Engineer role at CarGurus, you should have experience in Linux administration, incident management, and a proven software engineering background with knowledge of multiple programming languages. Familiarity with cloud infrastructure, particularly AWS, is also essential, alongside understanding of Kubernetes and observability systems.

Join Rise to see the full answer
What is the work environment like for a Site Reliability Engineer at CarGurus?

The work environment for a Site Reliability Engineer at CarGurus is collaborative and supportive, focusing on continuous improvement and innovative solutions. You’ll be part of a flexible hybrid work model and contribute to an inclusive culture that values diverse perspectives and professional growth.

Join Rise to see the full answer
How does CarGurus support the professional growth of its Site Reliability Engineers?

CarGurus supports the professional growth of its Site Reliability Engineers through best-in-class benefits, career development programs, and a culture that encourages learning and innovation. Employees have access to mentorship and opportunities to lead initiatives that increase both their skills and impact.

Join Rise to see the full answer
What tools and technologies will I use as a Site Reliability Engineer at CarGurus?

In the Site Reliability Engineer role at CarGurus, you will work with various tools and technologies including Linux systems, Kubernetes for container orchestration, AWS for cloud services, Terraform for infrastructure as code, and different observability systems to enhance operational transparency.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer
Can you explain your experience with incident management?

In answering this question, focus on specific incidents you have managed, detailing the steps you took to resolve them, how you communicated with the team, and what you learned from the experience. Highlight your contribution to blameless postmortems and how you use these insights to improve processes.

Join Rise to see the full answer
What SRE principles do you follow when managing production services?

Discuss your understanding of SRE principles like service level objectives, incident response, and maintaining availability. Illustrate your answer with examples of how you've applied these principles in previous projects to enhance service reliability and performance.

Join Rise to see the full answer
How do you prioritize tasks under pressure?

Share your methods for prioritizing tasks when faced with multiple critical issues, such as using urgency and impact to guide your decisions. Provide examples of past experiences where effective prioritization led to successful incident resolution.

Join Rise to see the full answer
What experience do you have with cloud infrastructure?

Highlight your experience with cloud platforms, particularly AWS, focusing on the services you've utilized. Discuss any projects where cloud infrastructure played a key role in achieving operational goals or enhancing system reliability.

Join Rise to see the full answer
How do you approach onboarding and educating teams about observability tools?

Discuss your strategies for onboarding teams, emphasizing your ability to communicate complex concepts in a clear and accessible manner. Provide examples of how you’ve successfully trained teams in using observability tools and the positive outcomes that resulted.

Join Rise to see the full answer
Describe a time you improved a system’s reliability. What was the outcome?

When answering, share a specific example of a reliability improvement you initiated or contributed to. Detail the steps taken, the challenges faced, and the positive business impact resulting from this improvement, such as increased uptime or enhanced performance metrics.

Join Rise to see the full answer
What scripting languages are you proficient in?

List the scripting languages you are familiar with and include examples of how you have utilized them in your previous roles to automate tasks, enhance monitoring, or streamline workflows.

Join Rise to see the full answer
How comfortable are you working with load balancing technologies?

Discuss your experience with load balancing, including specific technologies you have used, and how you configured them to optimize service availability and performance in production environments.

Join Rise to see the full answer
Can you explain your familiarity with incident response best practices?

Describe your understanding of incident response best practices, emphasizing the importance of timely communication, documentation, and collaborative resolution efforts. Provide an example of how you've implemented these practices in your work.

Join Rise to see the full answer
What motivates you to work in site reliability engineering?

Share your passion for SRE, focusing on your love for problem-solving, system optimization, and delivering resilient services. Discuss how this motivation translates into your daily work and drives you to achieve excellence.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
CarGurus Remote Boston, Massachusetts, United States
Posted 14 days ago
Photo of the Rise User
CarGurus Remote Boston, Massachusetts, United States
Posted 13 days ago
Photo of the Rise User
Reply Hybrid Atlanta | Detroit area | Kansas City | Philadelphia area
Posted 12 days ago
Photo of the Rise User
Posted 7 days ago
Mission Driven
Social Impact Driven
Passion for Exploration
Reward & Recognition
Photo of the Rise User
InterImage Hybrid Fort Meade, Maryland, United States
Posted 6 days ago
Photo of the Rise User
Posted 12 hours ago
Dental Insurance
Flexible Spending Account (FSA)
Vision Insurance
Health Savings Account (HSA)
Performance Bonus
Family Medical Leave
Paid Holidays
Photo of the Rise User
Boeing Hybrid US, Saint Louis County, MO; Missouri, Berkeley, MO
Posted 4 days ago
FASTBRIDGE FIBER LLC Hybrid Wyomissing, Pennsylvania, United States
Posted 3 days ago

We give people the power to reach their destination.

42 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
March 31, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!