Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Senior SRE Engineer (R3386) image - Rise Careers
Job details

Senior SRE Engineer (R3386)

Founded in 2015, Shield AI is a venture-backed defense technology company with the mission of protecting service members and civilians with intelligent, autonomous systems. Its products include Hivemind Enterprise—EdgeOS, Pilot, Commander, and Forge—as well as V-BAT and Sentient Vision Systems (wide-area motion imaging software). With offices in San Diego, Dallas, Washington, D.C., Abu Dhabi (UAE), Kyiv (Ukraine), and Melbourne (Australia), Shield AI’s technology actively supports U.S. and allied operations worldwide.  For more information, visit www.shield.ai. Follow Shield AI on LinkedIn, X and Instagram.    


Job Description:

As a Site Reliability Engineer at Hivemind, you will play a key role in ensuring the performance, reliability, and scalability of our cloud infrastructure. You’ll be responsible for building and maintaining monitoring and alerting systems for both internal and external services, defining and evolving incident response strategies, and automating operational processes to minimize risk and eliminate toil. Your work will directly impact the stability and resilience of Hivemind’s platform, helping us deliver exceptional experiences to our users. 


What You'll Do:
  • Design, implement, and maintain robust monitoring, logging, and alerting systems 
  • Define incident response procedures and participate in on-call rotations 
  • Identify and resolve reliability and performance issues across services 
  • Develop automation tools to streamline operations and reduce manual interventions 
  • Collaborate with engineering teams to ensure new services are production-ready 
  • Conduct root cause analyses and implement post-incident improvements 
  • Champion a culture of reliability, observability, and operational excellence 


Required Qualifications:
  • 5+ years of experience in Site Reliability Engineering, DevOps, or related roles 
  • Strong experience with AWS services (EC2, ECS/EKS, RDS, IAM, etc.) 
  • Deep understanding of Kubernetes and containerized deployments 
  • Proficiency with monitoring and observability tools (e.g. Prometheus, Grafana, Datadog, ELK) 
  • Strong scripting or programming skills (Python, Go, Bash, etc.) 
  • Experience with infrastructure-as-code (Terraform, CloudFormation, or similar) 
  • Solid understanding of networking, Linux systems, and distributed architectures 


Preferred Qualifications:
  • Experience with service meshes (e.g., Istio or Linkerd) 
  • Familiarity with security best practices in cloud environments 
  • Exposure to GitOps workflows and tools (e.g., ArgoCD or Flux) 


$129,467 - $194,201 a year

#LI-LD1

#LC


Full-time regular employee offer package:

Pay within range listed + Bonus + Benefits + Equity


Temporary employee offer package:

Pay within range listed above + temporary benefits package (applicable after 60 days of employment)


Salary compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. All offers are contingent on a cleared background and possible reference check. Military fellows and part-time employees are not eligible for benefits. Please speak to your talent acquisition representative for more information.


###


Shield AI is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender identity or Veteran status. If you have a disability or special need that requires accommodation, please let us know. 

Shield AI Glassdoor Company Review
3.3 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
Shield AI DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Shield AI
Shield AI CEO photo
Ryan Tseng
Approve of CEO

Average salary estimate

$161834 / YEARLY (est.)
min
max
$129467K
$194201K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior SRE Engineer (R3386), Shield AI

Are you ready to make an impact with your expertise as a Senior SRE Engineer at Shield AI? Based in the vibrant San Diego Metro Area, you will be at the forefront of our mission to protect service members and civilians with innovative technology. Your primary focus will be on ensuring that our cloud infrastructure remains robust, reliable, and scalable. In this role, you'll have the opportunity to design and implement vital monitoring and alerting systems while defining effective incident response strategies that can transform how we operate. Collaborating closely with talented engineering teams, you will champion a culture of operational excellence and reliability. As you resolve performance issues and develop automation tools that streamline our operations, your contribution will be instrumental in enhancing the experience for our users. Your expertise in AWS, Kubernetes, and observability tools will help us minimize risks and eliminate toil. Join us at Shield AI, where your role as a Senior SRE Engineer will mean not just a job, but a chance to contribute significantly to cutting-edge defense technology. Together, we’ll help transform the landscape of autonomous systems and take pride in our work's meaningful impact across the globe.

Frequently Asked Questions (FAQs) for Senior SRE Engineer (R3386) Role at Shield AI
What are the responsibilities of a Senior SRE Engineer at Shield AI?

As a Senior SRE Engineer at Shield AI, you will be responsible for ensuring the performance and reliability of our cloud infrastructure. This includes designing and implementing robust monitoring and alerting systems, defining incident response procedures, and participating in on-call rotations. You'll identify and resolve reliability issues, develop automation tools, and collaborate with engineering teams to ensure new services are production-ready.

Join Rise to see the full answer
What qualifications are required for the Senior SRE Engineer position at Shield AI?

To qualify for the Senior SRE Engineer role at Shield AI, candidates should have over 5 years of experience in Site Reliability Engineering, DevOps, or related roles. Strong knowledge of AWS services, Kubernetes, and experience with monitoring tools like Prometheus and Grafana are essential. Scripting skills in languages like Python and Bash are also required, along with experience in infrastructure-as-code tools.

Join Rise to see the full answer
What tools and technologies do Senior SRE Engineers at Shield AI use?

Senior SRE Engineers at Shield AI utilize a variety of tools and technologies in their work. This includes AWS for cloud services, Kubernetes for container orchestration, and monitoring tools such as Grafana and Datadog. You’ll also work with infrastructure-as-code solutions like Terraform and focus on automation to streamline operations.

Join Rise to see the full answer
What is the work culture like for a Senior SRE Engineer at Shield AI?

At Shield AI, the work culture for a Senior SRE Engineer is collaborative and innovative. You will work alongside a dedicated team passionate about leveraging technology for global defense. The company nurtures a culture of reliability and operational excellence, encouraging engineers to share ideas, implement best practices, and continuously improve systems.

Join Rise to see the full answer
Does Shield AI offer any professional development opportunities for Senior SRE Engineers?

Yes, Shield AI is committed to the professional development of its employees. As a Senior SRE Engineer, you will be provided opportunities to expand your knowledge and skills through training, workshops, and access to the latest technologies. The company supports career growth and encourages continuous learning within the tech field.

Join Rise to see the full answer
Common Interview Questions for Senior SRE Engineer (R3386)
Can you describe your experience with AWS services relevant to the Senior SRE Engineer role?

When answering this question, highlight specific AWS services you have worked with, such as EC2, ECS/EKS, and RDS. Discuss how you've utilized these services in previous roles to enhance performance or reliability, and provide examples of projects that demonstrate your AWS proficiency.

Join Rise to see the full answer
How do you approach incident response as a Senior SRE Engineer?

When addressing your approach to incident response, detail your methodology for identifying, troubleshooting, and resolving incidents. Explain how you define incident response procedures and your experience in participating in on-call rotations, highlighting any examples where your response minimized downtime or improved reliability.

Join Rise to see the full answer
What automation tools have you developed for operations in your previous roles?

For this question, discuss specific automation tools you've developed or implemented in past positions. Describe the challenges they addressed, the technologies used, and the impact they had on streamlining operations. Be prepared to explain how these tools have benefited both teams and end-users.

Join Rise to see the full answer
How do you ensure the scalability of cloud infrastructure?

In your response, outline your strategies for ensuring cloud scalability. Discuss experience with container orchestration using Kubernetes, load balancing techniques, and best practices for resource management within AWS. Include examples of how you've successfully scaled services under varying loads in the past.

Join Rise to see the full answer
What monitoring and observability tools are you familiar with?

Share your experience with monitoring and observability tools such as Prometheus, Grafana, Datadog, or ELK. Discuss how you used these tools to gain insights into system performance, troubleshoot issues, and improve the overall reliability of services.

Join Rise to see the full answer
Describe a time you resolved a significant performance issue. What steps did you take?

When addressing this question, provide a specific example of a performance issue you encountered, detailing the context, diagnosis, and your approach to resolving it. Highlight your analytical skills and the collaborative efforts you undertook with teams to implement solutions that led to improved performance.

Join Rise to see the full answer
How do you keep updated with industry trends and best practices in Site Reliability Engineering?

To respond, mention specific resources you use to stay updated, such as blogs, industry forums, webinars, or professional groups. Discuss how these resources influence your work and how you implement industry best practices in your role as a Senior SRE Engineer.

Join Rise to see the full answer
What role does infrastructure-as-code play in your work?

Explain how you leverage infrastructure-as-code in your projects, discussing tools such as Terraform or CloudFormation. Provide examples of how using these tools has improved your efficiency in deploying and managing infrastructure and the benefits it brings.

Join Rise to see the full answer
Can you give an example of a root cause analysis you've conducted?

When answering, walk through a specific incident where you had to conduct a root cause analysis. Describe the steps you took to identify the root cause, the methods you used for analysis, and how the outcome informed future incident prevention strategies.

Join Rise to see the full answer
How do you foster a culture of reliability within your team?

Discuss your strategies to promote a culture of reliability, such as encouraging open communication around failures and incidents, championing best practices, and conducting regular reviews of system performance. Include examples of past initiatives that resulted in higher reliability awareness among team members.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User

Join Shield AI as a Senior Staff Engineer to lead development in simulation environments for autonomous systems.

Photo of the Rise User

Become a key player in Shield AI's mission by developing sophisticated avionics software for advanced unmanned aerial vehicles.

Photo of the Rise User
Posted 2 hours ago

Lead cloud engineering initiatives at Accenture Federal Services, dedicated to supporting the US federal government in enhancing national security through innovative technology.

A renowned engineering firm in Irving, TX is looking for a Senior Mechanical Systems Engineer with extensive CAD and leadership experience.

Posted 10 days ago

Be a key player at Aura Intelligence as a Forward-Deployed Solutions Engineer, implementing innovative solutions and ensuring client success.

Photo of the Rise User
Illumina Hybrid US - California - San Diego
Posted 3 days ago

Join Illumina as an Engineering Technician to enhance the impact of genomic innovations in health management.

Photo of the Rise User
Posted 2 days ago

Join AbbVie as an Associate Technical Operations Engineer to contribute to innovative healthcare solutions while ensuring regulatory compliance.

SSC HR Solutions Remote No location specified
Posted 14 hours ago

Join a pioneering tech firm as a Senior DevOps Engineer, driving automation and cloud solutions.

Photo of the Rise User
Nidec Hybrid North America/USA/Minnesota/Mankato, MN
Posted 7 days ago

Join Nidec as an Assembler A in Mankato, MN, where you'll be part of a team driving the future of motor technologies.

Photo of the Rise User
Posted 5 days ago

Join Goodwin's Volvo in Topsham, ME, as a skilled automotive technician and enjoy premium pay, no weekend shifts, and a supportive work environment.

Our mission is to protect service members and civilians with intelligent systems.

263 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
April 15, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Columbus just viewed Customer Success Manager, US SLED at Dataminr
Photo of the Rise User
Someone from OH, Greenville just viewed Systems Engineer (Linux & Shell or Python scripting) at Visa
Photo of the Rise User
Someone from OH, Greenville just viewed Help Desk Technician - Youngstown at R.I.T.A.
Photo of the Rise User
Someone from OH, Mount Orab just viewed Backend Developer at G2i Inc.
Photo of the Rise User
7 people applied to Technology Intern at SABIC
Photo of the Rise User
Someone from OH, Cincinnati just viewed Product Marketing Manager at Cast & Crew
Photo of the Rise User
Someone from OH, Cincinnati just viewed Marketing Manager at Cast & Crew
o
Someone from OH, Cincinnati just viewed Administrative Assistant at osu
A
Someone from OH, Cincinnati just viewed Data Entry Clerk at Alphabe Insight Inc
Photo of the Rise User
Someone from OH, Cincinnati just viewed Machine Learning Engineer at Allstate
Photo of the Rise User
Someone from OH, Twinsburg just viewed Data Analyst/Power BI Developer at Datadog
Photo of the Rise User
Someone from OH, Cuyahoga Falls just viewed Small Fleet Underwriter at HDVI
Photo of the Rise User
18 people applied to HVAC Apprentice at DuPont
Photo of the Rise User
Someone from OH, Dublin just viewed Product Designer, Entry Level at Govini