Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Senior Site Reliability Engineer image - Rise Careers
Job details

Senior Site Reliability Engineer

We are expanding our team of motivated engineers with a proven track record of delivering a best in class DBaSS platform – ObjectRocket. You will have the opportunity to work with a strong team of engineers working on large-scale distributed enterprise systems. You will design, implement and support complex architectural design of hardware, software and networking systems.


Lead global SRE team to provide the highest-level reliability to our customers and platform. You will drive improvement through automation and best practices. his includes responding to, mitigating, investigating, and escalating incidents when they occur. You will be responsible for stepping above the day-to-day support, for synthesizing patterns of problems and business needs to the engineering teams. You will be responsible for ensuring that your services operations over time are improving to enhance our business effectiveness.


Key Responsibilities:
  • Ensure completeness of the technical infrastructure to support system performance
  • Stay up to date with emerging technologies and trends in the enterprise hardware, infrastructure and networking industry
  • Partner with the application engineering team to ensure the stability and performance of our technology solutions
  • Continuous identification of problems in the technology stack and processes and their corresponding burndown
  • Follow and execute Rackspace change management processes
  • Participate in systems/code reviews and design sessions
  • Contribute to and organize central store of knowledge
  • Take full ownership of product life cycle
  • Participate in on-call rotation


Qualifications:
  • Bachelor’s degree in Computer Science or equivalent experience
  • 8+ years of information systems design/architecture/development
  • Strong experience in one or more of: Perl, Python, or Bash
  • Strong experience in one or more of: Ansible, Chef, or Salt
  • Strong experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols. Networking: e.g. TCP/IP, UDP, ICMP, etc., MAC addresses, IP packets, DNS, SDN, OSI layers, and load balancing.
  • Experience in designing, analyzing and troubleshooting large-scale distributed systems.
  • Intermediate knowledge of operating systems.
  • Familiarity with algorithms, data structures and complexity analysis.
  • Intermediate experience designing complex SaaS applications for cloud reliability and scalability.
  • Intermediate experience with cloud infrastructure automation and CI/CD pipeline design.
  • Expertise in operational monitoring and management tools (Sensu, Prometheus, Grafana, etc.).
  • Intermediate written & verbal communication skills, both highly technical and non-technical.
  • Ability to work closely with non-technical stakeholders and executives.
  • Systematic problem-solving approach coupled with a strong sense of ownership and drive.
  • RHCE Preferred.
  • Preferred:
  • Experience working with Object Storage systems at Petabyte scale.
  • Experience using and managing one or more relational databases (e.g. MySQL).
  • Experience with non-relational databases (preferably Redis, Mongo)
  • Experience with cloud service providers (AWS, GCP, Azure, etc.)
  • Experience with Docker and container management systems (Swarm, Kubernetes, OpenShift, etc.)


$143,700 - $245,520 a year
The following information is required by pay transparency legislation in the following states: CA, CO, HI, NY and WA. This information applies only to individuals working in these states. 

The anticipated starting pay range for Colorado is: $143,700 - $210,760.

The anticipated starting pay range for Hawaii and New York (not including NYC) is: $153,000 - $224,400.

The anticipated starting pay range for California, New York City and Washington is: $167,400 - $245,520.

Based on eligibility, compensation for the role may include variable compensation in the form of bonus, commissions, or other discretionary payments.

These discretionary payments are based on company and/or individual performance, and may change at any time.

Actual compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. 

Information on benefits offered is here.


#LI-JR1

#LI-Remote

#LI-USA

#rackspace

Average salary estimate

$194610 / YEARLY (est.)
min
max
$143700K
$245520K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior Site Reliability Engineer, Rackspace

Are you ready to take your career to the next level as a Senior Site Reliability Engineer at Rackspace? If you’re passionate about building and maintaining top-notch DBaSS platforms like ObjectRocket, this remote position is an exciting opportunity for you! You’ll be joining a dynamic team of talented engineers committed to delivering large-scale distributed systems that keep our customers happy and our technology solutions running smoothly. Your role will encompass designing and implementing complex architectures while overseeing global SRE initiatives to maximize reliability and performance. You’ll play a crucial part in driving best practices and automation, addressing incidents effectively and continuously improving operational efficiency. Embrace the chance to collaborate with both engineering and application teams to foster a seamless technology environment. With responsibilities that range from problem identification to leading knowledge-sharing initiatives, your expertise will directly impact our organization’s success. To thrive in this role, you should have a strong background in various programming languages like Perl, Python, or Bash, and a solid understanding of Unix/Linux systems, networking, and cloud infrastructure. At Rackspace, we prioritize innovation and growth, so staying updated on the latest industry trends is a must. If you possess problem-solving skills coupled with a sense of ownership and dedication, we want to hear from you! Join Rackspace and help us elevate our systems to new heights while enjoying the benefits of a flexible remote work environment.

Frequently Asked Questions (FAQs) for Senior Site Reliability Engineer Role at Rackspace
What does a Senior Site Reliability Engineer do at Rackspace?

At Rackspace, a Senior Site Reliability Engineer focuses on ensuring the reliability and performance of our DBaSS platform, ObjectRocket. This role involves designing and implementing complex architectural solutions while leading initiatives that enforce best practices and automation to address incidents effectively.

Join Rise to see the full answer
How can I apply for the Senior Site Reliability Engineer position at Rackspace?

You can apply for the Senior Site Reliability Engineer position at Rackspace by visiting our careers page. Ensure your resume highlights relevant experience, especially in managing distributed systems and working with Linux environments.

Join Rise to see the full answer
What qualifications are required for the Senior Site Reliability Engineer role at Rackspace?

The Senior Site Reliability Engineer role at Rackspace requires a Bachelor’s degree in Computer Science or equivalent experience, along with at least 8 years in information systems design or architecture. Candidates should be proficient in scripting languages and possess strong knowledge of cloud infrastructure and networking.

Join Rise to see the full answer
What programming languages should I know for the Senior Site Reliability Engineer position at Rackspace?

Candidates for the Senior Site Reliability Engineer position at Rackspace should be proficient in at least one of the following programming languages: Perl, Python, or Bash. Experience with automation tools is also highly valued.

Join Rise to see the full answer
What is the expected salary for a Senior Site Reliability Engineer at Rackspace?

At Rackspace, the salary for a Senior Site Reliability Engineer ranges from $143,700 to $245,520 per year, depending on factors such as level of experience and specific work location.

Join Rise to see the full answer
What technologies will I work with as a Senior Site Reliability Engineer at Rackspace?

As a Senior Site Reliability Engineer at Rackspace, you'll interact with various technologies, including cloud service providers like AWS and Azure, container management systems, and database solutions like MySQL, Redis, and MongoDB. Your daily work will involve ensuring system reliability and performance across these tools.

Join Rise to see the full answer
Does Rackspace offer remote work for the Senior Site Reliability Engineer role?

Yes, the Senior Site Reliability Engineer position at Rackspace is fully remote, offering flexibility to work from anywhere within the United States, ensuring a great work-life balance.

Join Rise to see the full answer
Common Interview Questions for Senior Site Reliability Engineer
Can you explain how you would approach incident management as a Senior Site Reliability Engineer?

In your response, outline your systematic approach to incident management, emphasizing identification, documentation, and escalation processes. Share examples of past experiences where your actions led to improved response times and reduced downtime.

Join Rise to see the full answer
What experience do you have with large-scale distributed systems?

Discuss your hands-on experience with distributed systems, providing specific examples of projects you've worked on. Include details about the technologies used and the impact of your contributions on system reliability and performance.

Join Rise to see the full answer
How do you ensure the automation of processes within your role?

Talk about the automation tools and scripts you’ve implemented in previous roles. Focus on how you've also inspired others to adopt automation for increased efficiency and reduced human error.

Join Rise to see the full answer
What networking protocols are you most familiar with?

Highlight your knowledge of essential networking protocols such as TCP/IP, DNS, and load balancing. Provide examples of how you've applied this knowledge to troubleshoot or design network systems in past positions.

Join Rise to see the full answer
Describe a challenging technical problem you faced and how you resolved it.

Be prepared to detail the context of the challenge, the steps you took to analyze the situation, and the final solution you implemented. Focus on showcasing your problem-solving skills and technical knowledge.

Join Rise to see the full answer
How do you stay updated on emerging technologies in the industry?

Discuss your commitment to ongoing learning through courses, webinars, or tech communities. Mention how you apply this knowledge to your role and the outcomes of adopting new technologies at work.

Join Rise to see the full answer
What do you consider best practices in cloud infrastructure management?

Explain your understanding of cloud infrastructure best practices such as automation, monitoring, security, and compliance. Share examples of how you've successfully implemented these practices in previous roles.

Join Rise to see the full answer
How would you handle a situation where a stakeholder has non-technical requirements?

Talk about your approach to breaking down complex technical information for non-technical stakeholders. Emphasize the importance of clear communication and your ability to bridge the gap between technical and non-technical teams.

Join Rise to see the full answer
What has been your experience working in on-call rotations?

Share your experiences with being part of an on-call rotation, including challenges faced and how you managed work-life balance. Highlight any strategies you employed to minimize stress during high-pressure situations.

Join Rise to see the full answer
What key metrics do you monitor to ensure system reliability?

List the key performance indicators (KPIs) you typically monitor, like uptime, response time, and error rates. Provide examples of how you have reported on these metrics or used them to drive improvements in past roles.

Join Rise to see the full answer
Similar Jobs
ECP Remote No location specified
Posted 2 days ago
Photo of the Rise User
Spectrum Hybrid Grand View Estates, CO
Posted 11 days ago
Photo of the Rise User
Evolving Web Remote No location specified
Posted 9 days ago
Flexxon Remote No location specified
Posted 13 days ago
Photo of the Rise User
Posted 10 days ago
Inclusive & Diverse
Social Impact Driven
Collaboration over Competition
Growth & Learning
Maternity Leave
Paternity Leave
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching

Founded in 1998, Rackspace provides multi-cloud computing solutions and services. Offering advising to customers based on business challenges, designing solutions, building, and managing solutions. The company is headquartered in San Antonio, Texa...

54 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
November 27, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!