Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer - remote image - Rise Careers
Job details

Site Reliability Engineer - remote

Please Note:

1. If you are a first time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)

2. If you already have a Candidate Account, please Sign-In before you apply.

Job Description:

As a Site Reliability Engineer, you will be responsible for the implementation and operation of cloud infrastructure for a SaaS based network monitoring solution. In this role, you will:

  • Participate in the design, implementation, and operation of our SaaS platform, addressing concerns such as continuous integration, cloud infrastructure, solution deployment, and monitoring & alerting.

  • Partner closely with our other engineering teams to evolve product/service architecture

  • Migrate services into our freshly-minted platform, and collaborate with our dev teams to ensure that new services are designed with operability and observability in mind.

  • Build out, deploy, and maintain our monitoring strategy and technology stack

  • Automate all the things, freeing yourself and others from the tyranny of manual tasks.

  • Contribute to the achievement of our 99.99% monthly availability by participating in our incident management process and quiet on-call rotation.

  • Practice sustainable incident response and coordinate blameless postmortems.

  • Assist in the definition, prioritization, and planning of work through backlog maintenance and collaboration on the product delivery roadmap.


 

Required Education and Experience

  • SRE/DevOps experience in building and operating cloud-based SaaS platforms

  • Familiarity and experience with:

    • AWS and/or GCP

    • Infrastructure-as-code tooling (e.g. Terraform)

    • Containerization (Docker) and orchestration (Kubernetes, helm)

    • CI/CD pipelines, either self-hosted (e.g. Jenkins, TeamCity), or managed (e.g. GitHub Actions, GitLab)

    • Configuration management (Chef, Ansible, Puppet)

    • At least one programming language (Python preferred)

    • Monitoring solutions (e.g. Prometheus, Grafana, Cloudwatch, Stackdriver, ELK)

    • Linux systems, automation, package management

  • Demonstrable aptitude to learn new technologies, and apply that knowledge to solve real problems

  • Strong interpersonal communication skills (listening, speaking, and writing)

  • Experience operating large-scale, distributed systems on top of cloud infrastructure

Bachelors + 5+ years of related experience.

Broadcom Software - Agile Operations Division

Join Broadcom Software (#BroadcomSW), a world leader in business-critical software that modernizes, optimizes, and protects the world’s most complex hybrid environments. With our engineering-centered culture, we are building an extensive portfolio of industry-leading infrastructure and security software. Together, we solve big customer problems with some of the top technical talent in the industry.

In the Agile Operations Division, we offer business-critical software solutions that help the world’s leading companies transform their operating model to be more agile. Our ValueOps, NetOps, and Automation solutions help these organizations drive innovation and achieve operational excellence to realize better business outcomes – and better experiences for their customers.

Our industry success is built on a decades-long track record of delivering transformational solutions to teams who plan, build, test, and operate mission-critical software for the world’s largest and most complex businesses. To do this, we respond quickly and thoughtfully, innovate in the context of customer needs, and collaborate inclusively with customers and internal partners. Our business will nurture your intellect and give you opportunities to expand your skills even further. 

Additional Job Description:

Compensation and Benefits

The annual base salary range for this position is $91,000  - $146,000

This position is also eligible for a discretionary annual bonus in accordance with relevant plan documents, and equity in accordance with equity plan documents and equity award agreements.

Broadcom offers a competitive and comprehensive benefits package: Medical, dental and vision plans, 401(K) participation including company matching, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company paid holidays, paid sick leave and vacation time. The company follows all applicable laws for Paid Family Leave and other leaves of absence.

Broadcom is proud to be an equal opportunity employer.  We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, gender identity, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law.  We will also consider qualified applicants with arrest and conviction records consistent with local law.

If you are located outside USA, please be sure to fill out a home address as this will be used for future correspondence.

Broadcom Glassdoor Company Review
3.7 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
Broadcom DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Broadcom
Broadcom CEO photo
Hock E. Tan
Approve of CEO

Average salary estimate

$118500 / YEARLY (est.)
min
max
$91000K
$146000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer - remote, Broadcom

Broadcom Software is excited to welcome a new Site Reliability Engineer to join our dynamic Agile Operations Division! In this fully remote role, you will play a critical part in implementing and managing cloud infrastructure for our innovative SaaS network monitoring solution. You’ll collaborate with engineering teams to design, deploy, and maintain our platform, focusing on continuous integration and cloud solutions. Imagine being part of a team that prioritizes automating tedious tasks, enhancing availability, and contributing to our commitment to a remarkable 99.99% uptime. If you're passionate about many technologies like AWS, Docker, and Kubernetes, and love tackling problems with a solutions-oriented mindset, we want you on board. You are encouraged to bring your experience with infrastructure-as-code using tools like Terraform and monitoring solutions such as Prometheus and Grafana. This position not only allows you to showcase your technical skills but also to enhance them in an engaging, supportive environment. With a competitive salary range of $91,000 - $146,000 and a comprehensive benefits package, including 401(K) matching and health plans, Broadcom is committed to fostering your growth while solving impactful challenges together. Ready to make a mark? Join us in modernizing, optimizing, and protecting complex hybrid environments!

Frequently Asked Questions (FAQs) for Site Reliability Engineer - remote Role at Broadcom
What are the primary responsibilities of a Site Reliability Engineer at Broadcom Software?

As a Site Reliability Engineer at Broadcom Software, you will be responsible for designing and operating our SaaS platform. Your duties will include automating infrastructure management, developing a robust monitoring strategy, and ensuring high availability through constant collaboration with engineering teams. You’ll also conduct blameless postmortems and participate in incident management, contributing to our goal of achieving 99.99% uptime.

Join Rise to see the full answer
What qualifications are required for the Site Reliability Engineer role at Broadcom Software?

To qualify for the Site Reliability Engineer position at Broadcom Software, candidates should have extensive SRE or DevOps experience with cloud-based SaaS platforms. A Bachelor's degree along with 5+ years of related experience is necessary, along with hands-on expertise in AWS or GCP, deployment automation tools like Terraform, and monitoring solutions such as Prometheus and Grafana. Strong communication skills and programming knowledge, preferably in Python, are also important.

Join Rise to see the full answer
What tools and technologies should a Site Reliability Engineer at Broadcom Software be familiar with?

Site Reliability Engineers at Broadcom Software should be knowledgeable in various tools and technologies, including AWS or GCP for cloud services, AWS EC2, Terraform for Infrastructure as Code, Docker and Kubernetes for containerization, as well as CI/CD pipelines using tools like Jenkins and GitLab. Additionally, familiarity with configuration management tools such as Chef or Puppet is beneficial.

Join Rise to see the full answer
What can I expect from the work culture as a Site Reliability Engineer at Broadcom Software?

At Broadcom Software, the work culture emphasizes collaboration, innovation, and inclusivity. As a Site Reliability Engineer, you will be part of a team that values shared problem-solving, and each contribution counts towards our mission. The Agile Operations Division promotes an engineering-centered culture where continuous learning is encouraged, allowing you to grow your technical skills and knowledge base.

Join Rise to see the full answer
What are the compensation and benefits for the Site Reliability Engineer position at Broadcom Software?

The annual base salary for the Site Reliability Engineer position at Broadcom Software ranges from $91,000 to $146,000, along with a discretionary annual bonus and equity participation. In addition, the benefits package includes comprehensive medical, dental, and vision plans, 401(K) with company matching, and paid time off. Broadcom is committed to providing a supportive working environment for all employees.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer - remote
How do you approach incident management as a Site Reliability Engineer?

When answering this question, discuss your methodology for quickly identifying incidents, categorizing their impact, and ensuring effective communication during outages. Emphasize the importance of teamwork, post-incident reviews, and how you prioritize resolutions to meet service level agreements.

Join Rise to see the full answer
Can you describe your experience with cloud infrastructure management?

Provide examples showcasing your work with cloud tools and services, specifically outlining what you have built or managed. Discuss your familiarity with platforms like AWS or GCP and mention any specific projects that illustrate your ability to enhance performance or reduce costs.

Join Rise to see the full answer
What strategies do you use for automating infrastructure deployment?

Mention tools like Terraform or Ansible, and highlight your experience in defining infrastructure as code (IaC). Talk about how automation improves efficiency and leads to fewer errors, citing successful past projects that highlight these benefits.

Join Rise to see the full answer
How do you ensure a high level of availability and reliability in a SaaS solution?

Discuss your proactive measures for maintaining uptime, including monitoring strategies, automated scaling, and load balancing. Explain how you measure performance metrics and address them promptly to prevent downtime.

Join Rise to see the full answer
What programming languages are you proficient in, and how have you used them in your past roles?

Focus on your primary programming languages relevant to site reliability engineering, like Python. Discuss specific scripts or applications you’ve built, how they solved problems, or improved workflows in your previous roles.

Join Rise to see the full answer
How do you prioritize and manage your backlog as a Site Reliability Engineer?

Illustrate your approach to backlog management, including how you assess the urgency and impact of tasks. Talk about collaboration methods with product management teams to align on priorities and ensure workload balance.

Join Rise to see the full answer
What monitoring tools have you used, and how do you evaluate their effectiveness?

Mention any relevant monitoring tools, like Prometheus or Grafana, and articulate how you set up alerts and dashboards to track system health. Describe how you analyze data trends to enhance system performance.

Join Rise to see the full answer
Can you give an example of a challenging problem you solved in a previous SRE role?

Be prepared to discuss a specific situation where you identified a critical issue, how you approached solving it, and the final results. Focus on the analytical and communication skills you utilized during the process.

Join Rise to see the full answer
How do you handle failure or incidents in your work?

It's important to convey your understanding of blameless postmortems and how to learn from failures. Describe your philosophy on continuous improvement and how it applies to delivering better systems.

Join Rise to see the full answer
What is your experience with CI/CD pipelines?

Talk about your hands-on experience with CI/CD tools like Jenkins or GitLab and how implementing these practices increased deployment frequency and reliability. Highlight specific examples where CI/CD significantly benefited your projects.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 10 days ago

Join Broadcom as a Senior Executive Assistant to manage administrative tasks while supporting the executive team in a fast-paced environment.

Photo of the Rise User
Broadcom Remote United Kingdom-Remote Location
Posted 9 days ago

As an Account Director at Broadcom, you'll play a pivotal role in enhancing customer relationships and driving business success remotely.

Photo of the Rise User

Step into a pivotal role at Fifth Third Bank as a Principal Cyber Threat Analyst, where you'll lead efforts in cybersecurity and incident response.

Photo of the Rise User
Posted 10 days ago

Join Orlando Health as a Clinical Informaticist, enhancing healthcare technologies to deliver better patient care.

Photo of the Rise User
General Dynamics Information Technology Hybrid US, Loudoun County, VA; Virginia, Chantilly, Loudoun County, VA
Posted 6 days ago

As a Cyber Security Analyst Senior Advisor at GDIT, you'll play a critical role in protecting national security through robust cybersecurity measures.

Conscious Talent seeks world-class technology leaders to champion technical strategy and foster a culture of authenticity and growth.

Photo of the Rise User

Join the Judicial Council of California as a Senior Business Systems Analyst and help shape statewide technology initiatives while fostering clear communications.

Photo of the Rise User
GE HealthCare Hybrid IL03-01-Chicago-500 W Monroe St
Posted 4 days ago

Join GE HealthCare as a Senior IT Auditor to shape our internal audit processes and ensure effective risk management in the healthcare technology sector.

Photo of the Rise User
Posted 5 days ago

Lead BMS' cybersecurity efforts as a Senior Manager in Cyber Defense, focusing on Attack Surface Management and enhancing security measures.

Photo of the Rise User
Anduril Industries Hybrid Costa Mesa, California, United States
Posted 8 days ago

Anduril Industries seeks a Hardware Technician I to join their team in Costa Mesa, supporting essential hardware and software functions.

Broadcom harbors broad ambitions for its semiconductors' impact on broadband communications: it wants them to drive every part of the high-speed wired and wireless networks of the future. The core applications for its integrated circuits (ICs) are...

36 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
April 10, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Cleveland just viewed Event Specialist at Marble Room
Photo of the Rise User
18 people applied to SOC Analyst I at CBIZ
Photo of the Rise User
Someone from OH, Youngstown just viewed Director, Clinical Informatics at Ro
Photo of the Rise User
Someone from OH, Dayton just viewed Shopify Specialist at Remote VA
L
Someone from OH, Dayton just viewed Mechanical Design Engineer(s) at LTTS
Photo of the Rise User
14 people applied to Junior Security Engineer at Epic
H
Someone from OH, Akron just viewed Financial Content Writer at Huntington
W
Someone from OH, Columbus just viewed Director of Regulatory Compliance - WEX Bank at WEX Inc