Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Senior AWS Cloud Site Reliability Engineer (SRE) image - Rise Careers
Job details

Senior AWS Cloud Site Reliability Engineer (SRE)

Responsibilities

We are seeking an experienced and motivated Senior AWS Cloud Site Reliability Engineer (SRE) to join our dynamic team. As an AWS Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure on Amazon Web Services (AWS). The ideal candidate will have a strong background in AWS services, a deep understanding of infrastructure as code, and a passion for implementing best practices in site reliability engineering. The AWS Site Reliability Engineer (SRE) will collaborate closely with cross-functional teams, including development, quality assurance, and operations, to ensure seamless software releases and continuous improvement of our release processes.

 

 What you will do: 

 

  • Infrastructure Automation:  Design, implement, and manage infrastructure as code (IaC) solutions using tools like AWS CloudFormation, Terraform or Helm Charts to automate deployment and scaling processes.  Collaborate with development teams to integrate continuous deployment practices and ensure the reliability of applications.
  • Monitoring and Alerting:  Implement robust monitoring and alerting systems to proactively identify and address potential issues before they impact system performance.  Analyze system metrics, logs, and alerts to troubleshoot and resolve issues promptly.
  • Performance Optimization:  Conduct performance analysis and optimization of AWS infrastructure components to enhance system efficiency and reduce latency.  Identify and implement improvements to enhance system reliability and resilience.
  • Incident Response:  Participate in on-call rotations to respond to and resolve incidents promptly.  Conduct post-incident reviews to identify root causes and implement preventive measures.
  • Security and Compliance:  Work closely with security teams to implement and enforce best practices for securing AWS environments.  Ensure compliance with industry standards and regulations related to cloud infrastructure.
  • Communication:  Facilitate clear communication across teams, providing updates on release status, known issues, and any potential impact on stakeholders. Coordinate communication of release schedules and changes to all relevant parties.
  • Release Planning and Coordination:  Collaborate with development, QA, and operations teams to plan and coordinate software releases. Define release scope, schedule, and dependencies to ensure timely and smooth deployments.  Create and submit change records as required for process and audit compliance.  Participation in Technical Change Advisory and Review boards as required.
  • Release Automation:  Develop and maintain automated deployment pipelines using industry-standard tools such as AWS Cl/CD, GitLab CI/CD, Jenkins or similar. Automate and streamline release processes to improve efficiency and reduce manual errors. 
  • Continuous Improvement:  Proactively identify areas for process improvement within the release management lifecycle. Implement feedback loops to capture lessons learned from each release and apply improvements iteratively.  Stay up to date with industry best practices, emerging technologies, and trends related to release management and reliability engineering.
  • Quality Assurance:  Collaborate with QA teams to establish and execute release validation procedures. Ensure releases are thoroughly tested and meet quality standards before deployment.  Drive continuous improvement by analyzing release management trends, identifying recurring issues, and working with teams to implement solutions.  

 

Qualifications

Required Qualifications:

 

  • Bachelor's degree and 8 years of experience. Additional 4 years of experience maybe accepted in lieu of the degree.
  • Proven experience as a Site Reliability Engineer or similar role.
  • In-depth knowledge of AWS services and expertise in managing cloud infrastructure.
  • Advanced level programming and/or scripting in 3 or more of the following languages: Python, Java, Chef, Helm, Playwright, Bash, JavaScript, Terraform.
  • Strong understanding of DevOps principles and continuous integration/continuous deployment (CI/CD) pipelines.
  • Proficiency in CI/CD tools such as AWS CI/CD, GitLab CI/CD, or others.
  • Familiarity with infrastructure as code (IaC) tools like CloudFormation, Terraform, Helm Charts, Morpheus, or similar technologies.
  • Hands-on experience with version control systems (GitLab, AWS CodeCommit, SVN) and branching strategies.
  • Experience with containerization and orchestration tools (e.g., Amazon Elastic Compute Service (ECS), Amazon Elastic Kubernetes Service (EKS), Docker, Kubernetes).
  • Familiarity with monitoring tools (e.g., CloudWatch, Prometheus, Grafana, Datadog, DynaTrace) and log analysis.
  • Solid understanding of Agile methodologies and their application in release management.
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and collaboration skills.
  • Must be a US Citizen
  • Must be able to obtain and maintain the required agency clearance (6C Public Trust)

  

Preferred Qualifications:

 

  • Relevant certifications in DevOps or related fields are a plus.
  • Experience in SRE or Platform Engineering group for high availability/critical platforms/applications
  • Experience managing a distributed container platform including but not limited to deployment/release management, provisioning, capacity management, workload management

 

Peraton Overview

Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can’t be done by solving the most daunting challenges facing our customers. Visit peraton.com to learn how we’re keeping people around the world safe and secure.

Target Salary Range

$104,000 - $166,000. This represents the typical salary range for this position based on experience and other factors.

EEO

EEO: Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.

Average salary estimate

$135000 / YEARLY (est.)
min
max
$104000K
$166000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior AWS Cloud Site Reliability Engineer (SRE), Peraton

Are you ready to take your career to the next level as a Senior AWS Cloud Site Reliability Engineer (SRE) at Peraton? We're excited to welcome an experienced and motivated individual who will be instrumental in ensuring the reliability, scalability, and performance of our AWS cloud infrastructure. In this role, you will collaborate with diverse cross-functional teams, including development, quality assurance, and operations, making a real impact on seamless software releases and improvement processes. If you have a solid background in AWS services, infrastructure as code, and an undying passion for implementing best practices in site reliability engineering, this opportunity is for you! You'll be designing and managing infrastructure automation with tools like Terraform and CloudFormation and employing smart monitoring and alerting systems to proactively tackle issues. Performance optimization, incident response, and security compliance will also fall under your purview as you foster effective communication with stakeholders. Not only will you coordinate releases, but you'll also have a finger on the pulse of continuous improvement, bringing fresh ideas and insights to the table. If you're looking for a role where you can really contribute and grow within a leading national security company, don’t miss out on this chance to join Peraton and help us solve the challenges our customers face every day!

Frequently Asked Questions (FAQs) for Senior AWS Cloud Site Reliability Engineer (SRE) Role at Peraton
What are the key responsibilities of a Senior AWS Cloud Site Reliability Engineer at Peraton?

The Senior AWS Cloud Site Reliability Engineer (SRE) at Peraton is tasked with ensuring AWS infrastructure reliability and performance. This role involves designing and managing infrastructure automation using tools like Terraform, implementing monitoring systems, optimizing performance, and coordinating software releases. You'll also actively participate in incident responses and collaborate with various teams to enhance the release process.

Join Rise to see the full answer
What qualifications are needed for the Senior AWS Cloud Site Reliability Engineer position at Peraton?

Candidates applying for the Senior AWS Cloud Site Reliability Engineer role at Peraton should possess a Bachelor's degree and at least 8 years of experience, along with advanced knowledge of AWS services and expertise in infrastructure management. Proficiency in programming languages, CI/CD tools, and experience with monitoring and containerization technologies is also essential.

Join Rise to see the full answer
What tools and technologies should a Senior AWS Cloud Site Reliability Engineer be familiar with at Peraton?

A Senior AWS Cloud Site Reliability Engineer at Peraton should be adept in tools like AWS CloudFormation, Terraform, and CI/CD platforms such as AWS CI/CD and GitLab CI/CD. Familiarity with container orchestration tools, monitoring technologies, and source control systems is highly beneficial for success in this role.

Join Rise to see the full answer
What is the company culture like for a Senior AWS Cloud Site Reliability Engineer at Peraton?

At Peraton, the culture is collaborative and innovative, promoting growth and teamwork. As a Senior AWS Cloud Site Reliability Engineer, you will engage with diverse cross-functional teams, take on meaningful challenges, and contribute to national security missions. The environment encourages sharing ideas and continuously improving processes for enhanced performance and reliability.

Join Rise to see the full answer
What opportunities for career advancement exist for a Senior AWS Cloud Site Reliability Engineer at Peraton?

Peraton offers various opportunities for career advancement to its Senior AWS Cloud Site Reliability Engineers. You will gain valuable experience while working on challenging projects, allowing you to explore potential leadership roles or specialized positions within the organization, aligning with your career goals and aspirations.

Join Rise to see the full answer
Common Interview Questions for Senior AWS Cloud Site Reliability Engineer (SRE)
Can you explain what Site Reliability Engineering means to you?

Site Reliability Engineering (SRE) integrates development and operations by applying software engineering principles to infrastructure and operations problems. When asked this during your interview for the Senior AWS Cloud Site Reliability Engineer position at Peraton, emphasize the importance of reliability, scalability, and maintaining high availability while meeting customer needs.

Join Rise to see the full answer
How do you approach incident management in your role as a Site Reliability Engineer?

When it comes to incident management as a Senior AWS Cloud Site Reliability Engineer, it’s crucial to have a systematic approach. Discuss how you prioritize incidents based on impact, communicate effectively with stakeholders during a crisis, and utilize post-incident reviews to implement preventive measures, reinforcing your commitment to reliability.

Join Rise to see the full answer
What tools do you use for monitoring AWS infrastructure?

In responding to this question, highlight your experience with monitoring tools such as AWS CloudWatch, Prometheus, or Datadog. Explain how you utilize these tools to track performance metrics, analyze logs, and proactively identify potential issues that could affect system performance, showcasing your skills relevant to the Senior AWS Cloud Site Reliability Engineer role.

Join Rise to see the full answer
What is your experience with infrastructure as code in AWS environments?

Talk about your hands-on experience with tools like Terraform or AWS CloudFormation. Explain how you’ve utilized infrastructure as code to automate deployment, manage configurations, and ensure consistency in your AWS environments, demonstrating your fit for the Senior AWS Cloud Site Reliability Engineer position.

Join Rise to see the full answer
Can you describe a time you improved system performance or reliability?

Use the STAR method (Situation, Task, Action, Result) to outline a specific scenario where you successfully enhanced system performance. Be sure to quantify the results when possible, outlining how your contributions as a Senior AWS Cloud Site Reliability Engineer led to improved efficiency or reliability.

Join Rise to see the full answer
How do you manage CI/CD pipelines in your projects?

Discuss the CI/CD tools you've worked with, such as Jenkins or GitLab CI/CD, and your experience in automating testing and deployment processes. Share examples of how you have streamlined these pipelines and ensured timely releases, aligning with the responsibilities of a Senior AWS Cloud Site Reliability Engineer at Peraton.

Join Rise to see the full answer
What is your experience with container orchestration tools?

Describe your familiarity with container orchestration solutions like Kubernetes or Amazon ECS. Highlight any projects where you deployed and managed containerized applications, showcasing your relevant skill set for the Senior AWS Cloud Site Reliability Engineer role.

Join Rise to see the full answer
How do you ensure compliance and security in cloud environments?

Security and compliance are paramount in the role of a Senior AWS Cloud Site Reliability Engineer. Discuss your experience collaborating with security teams and implementing best practices to safeguard AWS environments while ensuring compliance with regulations, emphasizing your proactive approach.

Join Rise to see the full answer
How do you handle changes to release schedules or scope?

40Being able to adapt to changes is critical in this role. Discuss your strategies for agile release management, effective communication with teams, and how you coordinate adjustments to release schedules and scopes while minimizing disruption.

Join Rise to see the full answer
What do you see as the future of Site Reliability Engineering?

When addressing the future of SRE, you can share insights on trends such as increased automation, the importance of observability, and the integration of AI/ML in optimizing reliability. Show your enthusiasm for being part of these developments as a Senior AWS Cloud Site Reliability Engineer at Peraton.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 10 days ago

Join Peraton as a Digital Forensic Analyst and play a key role in supporting vital criminal investigations using advanced forensic tools and techniques.

Photo of the Rise User
Posted 10 days ago

Join Peraton as a Cloud Database Architect to design leading-edge database systems for a key federal agency and enhance national security efforts.

Photo of the Rise User
Posted yesterday
Open Door Policy
Customer-Centric
Mission Driven
Rapid Growth
Reward & Recognition
Startup Mindset
Inclusive & Diverse
Empathetic
Casual Dress Code
Collaboration over Competition
Work/Life Harmony
Transparent & Candid

PandaDoc is on the lookout for a Senior Python Engineer to join their Customer Value Track and help elevate customer experiences through innovative solutions.

Posted 8 days ago

Join Northrop Grumman's talented team as a Senior Software Engineer focusing on DevOps and Agile methodologies to support groundbreaking technology in the defense sector.

Photo of the Rise User
Visa Remote Bangalore, India
Posted 9 days ago

As a SW Engineer - SDET at Visa, contribute to building next-generation global payment systems within a dynamic hybrid work environment.

Photo of the Rise User
Nexthink Remote Bengaluru, Karnataka, India
Posted 8 days ago

Join Nexthink as a Platform Software Engineer to develop innovative tools that enhance the digital employee experience.

Photo of the Rise User
Posted 10 days ago

Join Lambda, a leading AI computing platform, as a Frontend Software Engineer to innovate on user-centric design and responsive applications.

Posted 8 days ago

Join Dandy as a Senior Full Stack Software Engineer and help transform the dental industry through cutting-edge technology.

Photo of the Rise User
Thaloz Remote No location specified
Posted 3 days ago

A dynamic opportunity for a Full Stack Engineer eager to shape innovative software solutions in a remote setting.

CBRL Group Hybrid 305 Hartmann Drive, Lebanon, Tennessee 37087-2519
Posted 13 days ago

Join Cracker Barrel as a Senior Fullstack Developer, where you'll architect innovative digital solutions that enhance user experiences.

Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)

Our mission is to protect and promote freedom around the world by Securing our future, Connecting our world, Safeguarding our enterprise, Protecting our borders, Enabling commerce, Enhancing human knowledge, and Protecting our citizens.

752 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
April 22, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!