Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer image - Rise Careers
Job details

Site Reliability Engineer

RGi is looking for a passionate Site Reliability Engineer (SRE) to join our dynamic OpenShift PaaS organization. In this exciting role, you’ll play a crucial part in ensuring the availability, performance, and scalability of our OpenShift environments—essential components that keep us at the forefront of innovation. Work alongside talented development, operations, and product teams to elevate our platforms to new heights.

You will have the opportunity to automate processes, create robust monitoring systems, and implement cutting edge solutions that enhance the overall reliability of our systems. If you thrive in a collaborative environment and are eager to make an immediate impact on our infrastructure, we want to hear from you.


Clearance:

Active Top Secret clearance with willingness and ability to obtain an SCI and CI polygraph

US Citizenship required


As a Site Reliability Engineer you will...
  • Design, implement, and maintain highly available OpenShift clusters to support mission-critical applications.
  • Develop and maintain automation scripts and tools to streamline deployment, scaling, and recovery processes using tools like Ansible, Terraform, and Helm.
  • Build and enhance monitoring and alerting systems (e.g., Prometheus, Grafana, ELK).
  • Respond to and resolve incidents, conducting post-mortem analyses to identify root causes.
  • Analyze and optimize system performance, ensuring minimal latency and maximum throughput.
  • Work closely with development teams to implement DevOps best practices, CI/CD pipelines, and platform enhancements.
  • Ensure platforms meet security and compliance requirements by integrating tools for vulnerability scanning, policy enforcement, and logging.


Site Reliability Engineer Qualifications:
  • Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
  • Minimum 5+ years of experience as an SRE, DevOps Engineer, or related role.
  • Expertise in OpenShift or Kubernetes platform administration.
  • Strong knowledge of Linux systems, networking, and containerization technologies (Docker).
  • Proficiency in scripting languages such as Python, Bash, or Go.
  • Experience with CI/CD pipelines (e.g., Jenkins, GitLab CI/CD).
  • Familiarity with monitoring and logging tools like Prometheus, Grafana, ELK, or Splunk.


Additional Skills We Would Like to See:
  • OpenShift certification (e.g., Red Hat Certified Specialist in OpenShift Administration).
  • Experience with cloud platforms (AWS, Azure, or GCP).
  • Knowledge of service mesh technologies (Istio, Linkerd).
  • Strong understanding of microservices and distributed systems architecture.


Who we are:

Reinventing Geospatial, Inc. (RGi) is a fast-paced small business that has the environment and culture of a start-up, with the stability and benefits of a well-established firm. We solve complex problems within geospatial software development and national defense to make an Immediate Impact for our nation’s soldiers and analysts.


We pride ourselves on giving employees an exceptional life experience, where creativity thrives, and challenges are simply part of the fun. We provide truly excellent benefits, including:


·        100% paid employee healthcare & dental insurance

·        Paid parental leave

·        401k with matching

·        Escalating vacation time

·        Referral bonuses

·        Tuition reimbursement

·        Professional development training

·        Free beverages and snacks

·        Weekly catered lunches and breakfast on Fridays

 

Grow to be our next leader:

At RGi, fostering a strong and organic corporate culture is paramount and serves as a compass on the decisions we make and how we operate the company. We believe our culture of camaraderie, innovation, and collaboration reflects the caliber of our employees and their dedication to the mission of providing quality software to our customers. As such, we want our employees to feel empowered to seek growth and leadership opportunities within the company and position us to maintain our culture as we grow. RGi provides opportunities, resources, training, and mentorship to all our employees to let them take control of their careers and become a leader or a crucial member of our company. If this is what you are looking for in a company, then you are what we are looking for in an employee.


Reinventing Geospatial, Inc. is an Equal Opportunity Employer committed to hiring and retaining a diverse workforce. We are an Equal Opportunity Employer, making decisions without regard to race, color, religion, sex, national origin, age, veteran status, disability, or any other protected class. U.S. Citizenship is required for all positions.

Average salary estimate

$110000 / YEARLY (est.)
min
max
$90000K
$130000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, Reinventing Geospatial, Inc. (RGi)

At Reinventing Geospatial, Inc. (RGi), we’re on the lookout for a passionate Site Reliability Engineer (SRE) to join our dynamic OpenShift PaaS organization in Herndon, VA. This isn't just another job; it’s a chance to thrive in an environment that encourages innovation and teamwork. As an SRE, you will play a crucial role in ensuring the availability, performance, and scalability of our OpenShift environments—key components that keep our operations running smoothly. You’ll collaborate with talented development and operations teams to elevate our platforms and automate processes to improve reliability. Imagine having the autonomy to design and maintain highly available OpenShift clusters that support mission-critical applications while deploying and utilizing advanced automation tools like Ansible and Terraform. You’ll also get to hone your skills in monitoring and alerting using systems like Prometheus and Grafana. This position is perfect for someone who enjoys a fast-paced atmosphere and wants to make a tangible impact immediately. We believe that a collaborative mindset will allow you to excel, so if you’re ready to take on challenges that enhance our infrastructure while growing your skills, we want to hear from you! Don't miss the chance to join RGi, where creativity thrives and your hard work is recognized and appreciated—all while working towards a mission that matters!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Reinventing Geospatial, Inc. (RGi)
What responsibilities does a Site Reliability Engineer at RGi have?

As a Site Reliability Engineer at RGi, your primary focus will be the design, implementation, and maintenance of highly available OpenShift clusters to support mission-critical applications. You’ll develop and maintain automation scripts to streamline deployment and optimize system performance. Additionally, you’ll respond to and resolve incidents while enhancing our monitoring and alerting systems, ensuring compliance with security requirements.

Join Rise to see the full answer
What qualifications are needed for the Site Reliability Engineer position at RGi?

To qualify for the Site Reliability Engineer role at RGi, candidates must possess a Bachelor’s degree in Computer Science, Engineering, or a related field. You should have a minimum of 5 years of experience in an SRE or similar role, expertise in OpenShift or Kubernetes administration, and strong proficiency in scripting languages like Python or Bash. Familiarity with CI/CD pipelines and containerization technologies is also essential.

Join Rise to see the full answer
What tools and technologies should a Site Reliability Engineer at RGi be familiar with?

A Site Reliability Engineer at RGi should be well-versed in various tools and technologies including OpenShift, Kubernetes, automation tools like Ansible and Terraform, and monitoring solutions such as Prometheus and Grafana. Proficiency in scripting languages, CI/CD tools, and logging technologies like ELK or Splunk is also vital for this position.

Join Rise to see the full answer
What benefits does RGi offer to Site Reliability Engineers?

RGi provides a range of attractive benefits for its Site Reliability Engineers, including 100% paid employee healthcare and dental insurance, paid parental leave, 401k matching, escalating vacation time, and tuition reimbursement. We also prioritize professional development with training programs, mentorship, and a supportive work culture that encourages growth.

Join Rise to see the full answer
How does RGi foster a collaborative work environment for Site Reliability Engineers?

At RGi, we emphasize a strong corporate culture that encourages camaraderie, innovation, and collaboration. Site Reliability Engineers work alongside talented teams, participate in shared challenges, and are empowered to drive initiatives that improve our infrastructure, fostering an environment where everyone feels valued and motivated to contribute.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer
Can you explain your experience with OpenShift or Kubernetes as a Site Reliability Engineer?

When answering this question, highlight specific projects where you utilized OpenShift or Kubernetes, detailing your responsibilities, your approach to configuration or troubleshooting, and the impact your work had on system reliability and performance. Focus on your technical skills and teamwork in managing these platforms.

Join Rise to see the full answer
What strategies do you use to monitor and optimize system performance?

Discuss the monitoring and alerting tools you’ve used, such as Prometheus or Grafana, and how you leverage them to track key performance metrics. Explain your approach to conducting regular performance reviews and the methods you utilize to analyze data and implement improvements.

Join Rise to see the full answer
How do you handle incidents and post-mortem analysis?

The key here is to talk through your process, from identifying and documenting the incident to resolving it, and conducting a thorough post-mortem analysis. Emphasize your commitment to learning from incidents and implementing changes to prevent future occurrences.

Join Rise to see the full answer
What is your experience with automation in site reliability engineering?

Provide examples of how you've automated processes using tools like Ansible or Terraform. Discuss specific scenarios where automation helped improve deployment efficiency, reduce human error, or enhance system reliability. This showcases your proactive approach to challenges.

Join Rise to see the full answer
How do you ensure security and compliance in your SRE practices?

Address your understanding of security best practices in DevOps, discussing tools you’ve integrated into your workflow for vulnerability scanning and logging. Detail how you implement compliance measures and collaborate with security teams to ensure ongoing adherence to policies.

Join Rise to see the full answer
What role does collaboration play in your work as a Site Reliability Engineer?

Emphasize the importance of teamwork in troubleshooting, developing CI/CD pipelines, and addressing incidents. Discuss how you communicate effectively with development teams and engage stakeholders to ensure the reliability and performance of the systems.

Join Rise to see the full answer
How have you dealt with high-pressure situations in your role as an SRE?

Share an example of a particularly challenging situation, such as a system outage or performance issue, and detail how you maintained composure, prioritized tasks, and collaborated with your team to resolve the problem. This will illustrate your resilience and capability under pressure.

Join Rise to see the full answer
What experience do you have with cloud platforms in relation to site reliability?

Discuss your familiarity with cloud services like AWS, Azure, or GCP, mentioning how you’ve utilized them in your SRE roles. Talk about specific applications or services you’ve migrated, managed, or integrated into your reliability practices.

Join Rise to see the full answer
What do you consider the most important qualities for a Site Reliability Engineer?

Reflect on qualities such as problem-solving skills, adaptability, and strong communication. Share how embodying these traits has contributed to your success and collaborative spirit in previous roles.

Join Rise to see the full answer
How would you familiarize yourself with a new application or system as an SRE?

Mention methods such as documentation review, monitoring logs, and shadowing team members. Discuss strategies for gaining a solid understanding of system architecture and common workflows, ensuring a smooth transition while providing value to the team.

Join Rise to see the full answer
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
LOCATION
No info
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
January 14, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!