PagerDuty is seeking a Senior Site Reliability Engineer to join our SRE-Platform team. In this role you will be a key contributor to building, maintaining and scaling the Kubernetes platform that powers PagerDuty. We build solutions that accelerate developer productivity, improve reliability and help PagerDuty scale for today and tomorrow. If you're passionate about platform engineering, developer experience and all things Kubernetes, we'd love to hear from you!

PagerDuty is a flexible, hybrid workplace. We embrace and encourage in-person working as an integral part of our culture. Both our employees and external research tell us that co-located collaboration strengthens connections, drives innovation, and accelerates learning.

This role is expected to come into our Atlanta office one day per month, so you can thrive in your new role and fully embrace being a Dutonian!

Key Responsibilities
• You help maintain the overall health of the platform, including triaging and troubleshooting production issues, monitoring system capacity, and working with other technical teams to ensure adherence to compliance and security best practices
• You partner with Engineering stakeholders to design and deliver a reliable, scalable, secure, and performant platform
• You continuously strive to improve the developer experience: full lifecycle support (creation, development, deployment, retirement), observability, flexible connectivity, and monitoring
• You share your expertise with the entire Engineering organization
• You participate in a 24/7 on-call rotation. And yes, we use PagerDuty to manage our on-call schedules

Basic Qualifications
• 5+ years of experience in Platform Engineering, Site Reliability Engineering or DevOps roles
• Experience managing multiple Kubernetes clusters in a production environment
• Experience working on cloud-native infrastructure (e.g. AWS, GCP, Azure)
• Experience deploying web applications on Kubernetes (Helm, ArgoCD)
• Experience with infrastructure as code (e.g. Terraform or CloudFormation)
• Knowledge of a dynamic language (e.g. Ruby or Python)

Preferred Qualifications
• Experience with monitoring, observability and logging platforms (e.g. DataDog, New Relic, SumoLogic, Splunk)
• Knowledge of configuration management systems (e.g. Ansible, Chef, Puppet)
• Experience in automating releases, continuous integration/delivery systems and relevant tools (e.g. Jenkins, CircleCI, Travis CI, Buildkite)

The base salary range for this position is 152,000 - 248,000 USD. This role may also be eligible for bonus, commission, equity, and/or benefits.

Our base salary ranges are determined by role, level, and location. The range, which is subject to change based on primary work location, reflects the minimum and maximum base salary we expect to pay newly hired employees for the position. Within the range, we determine pay for an individual based on a number of factors including market location, job-related knowledge, skills/competencies and experience.

Your recruiter can share more about the specific offerings for this role, as well as the salary range for your primary work location, during the hiring process.