
Senior Site Reliability Engineer

About Exygy


Exygy is a digital innovation studio on a mission to build resilient and healthy communities. We enable impact-focused organizations to rethink experiences and create digital products that solve their problems and delight users. Our diverse team brings a breadth of technical expertise, user-centric perspectives, and product strategy to every engagement. As a certified B-Corporation, we are driven by our fierce commitment to the betterment of humanity. Our clients include CARE International, QURE Healthcare, San Francisco Mayor's Office of Housing, and Hopelab.

Exygy has embraced a work-from-home philosophy and is now a remote-first company. This role can be located anywhere in the U.S. and may occasionally require travel for meetings or team-building events. Exygy remains connected and engaged through virtual team events, weekly all-hands meetings, and regular Zoom workshops and trainings.


Summary


Exygy seeks an enthusiastic and experienced Senior SRE who is passionate about making a difference in the world with technology. Join our growing team to build and support a wide variety of high-impact projects across all of Exygy. This is a full-time remote position. As an SRE you’ll spend most of your time working with the CiviForm team to support product development, manage staging and production environments and develop deployment systems and infrastructure so that the platform is secure and dependable.


Who Does This Role Report To: 

Principal Engineer on CiviForm


Supervisory Responsibilities:

This role does not have supervisory responsibility.

This role is at the P3 level, an internal grade level that correlates with our salary bands and compensation philosophy.



Responsibilities
  • Participate in the development of CiviForm products as a service, building upon our existing deployment system and building out a new Kubernetes-based prototype to ensure robust, secure, and scalable production instances.  
  • Manage staging and production environments, being on call to address outages
  • Work with governments on issues related to the service
  • Own and evolve the deployment systems
  • Participate in the development of a new CiviForm SaaS (Software as a Service). 
  • Own development of this deployment system utilizing Kubernetes from prototyping through to delivery.
  • CiviForm’s existing infrastructure is currently defined with Python and Terraform and deployed into AWS and Azure. Improve the flexibility and features of the system to meet the needs of governments deploying CiviForm to their own cloud providers.
  • Define, implement, gather, and analyze metrics from deployments to identify areas for improvement related to cloud configuration
  • Partner with the engineering team to improve services through rigorous testing and release procedures, as well as resolving scaling issues and improving resilience
  • Draft Service Level Objectives and define Service Level Indicators, and implement them
  • Develop playbooks for deployments, including implementing a strategy for monitoring and alerting and how to address issues
  • Identify and mitigate security risks in deployments
  • Contribute to CI/CD implementation and best practices 


Required Skills
  • Extensive expertise building and deploying web apps in AWS, Azure, and GCP
  • Networking
  • Distributed systems
  • Public cloud and container security (RBAC, process isolation, network security, firewalls, certificate management, etc.)
  • Reliability engineering (disaster resilience, multi-zonal deployments, logging practices, SLOs/SLIs, monitoring, deployment strategy, etc.)
  • Kubernetes
  • Docker/containers
  • Terraform
  • Python
  • Version control systems (we use Git/GitHub)
  • Linux
  • DevOps concepts and best practices
  • Authentication technologies such as OIDC, SAML


Bonus Skills
  • Java
  • TypeScript
  • Apps utilizing the Play framework or similar server-side MVC frameworks


Compensation & Benefits

Exygy’s compensation is benchmarked to national industry averages, not geographic location, ensuring equitable pay across all roles. Our hiring targets align with the median of the salary range for this position, with offers based on role requirements and candidate experience. The target range for this role is $120,000 - $125,000 annually, reflecting our commitment to competitive, value-based compensation. Growth within the band is tied to performance, with opportunities for bonuses and comprehensive benefits.

Benefits & Perks

 

Our Values

• Learning and Growth

• Pursuit of Excellence

• Leaning into Fear

• Spirit of Generosity 

• Embrace the Whole Person


Mission Statement

To ensure all communities, especially marginalized communities, have access to basic social needs. We use our expertise in strategic product development, thoughtful design, and tailored technical solutions to identify the greatest barriers and build solutions to overcome them.


Vision Statement

Everyone has equitable access to the basic social needs that encourage them to thrive. These social needs include but are not limited to affordable housing, physical and mental health care, food security, quality education, stable employment opportunities, and public benefits.


Employee Enablement Support: 

Laptop provided

$2,000 annual (per calendar year) remote environment setup budget, which can be used to outfit your home office, pay for co-working spaces or coffee shops, or meet up and collaborate with your teammates.


Wellness Budget

$100 monthly to pay for your wellness item of choice (gym membership, classes, massages, etc.)


Professional Development:

$1,000 annual (per calendar year) stipend toward professional development


Retirement & 401k Plans:

Employees are eligible for a 100% employer match of up to 4% of employee contribution


Medical:

Full benefits package with options up to 100% coverage toward select medical, dental, and vision plans.


Remote First Working Environment:

• Exygy employees may work remotely across the US

• Exygy employees’ main residence must be within the US


Work Life Balance 

Exygy proudly embraces work/life balance and has adopted a 4-day work week (4DWW) policy beginning in 2025, recognizing Monday through Thursday as business days. Full-time employees are expected to work 32-40 hours in a typical week. Fridays are flexible, allowing employees to take the day off when their workload allows.


Collaborative working hours: 

We aim to hold all internal meetings between 10 AM and 3 PM PT. We expect all Exygy staff to be available during these set working hours.


Time Off: 

Flexible paid time off, a minimum of 14 paid holidays, and an org-wide closure from Christmas Day through New Year's Day

Competitive paid parental and family leave


EEO & Commitment to Equity, Diversity, and Inclusion

We are actively seeking to create a diverse and equitable work environment because we believe that creates a stronger team.


Exygy values a diverse workplace and strongly encourages women, people of color, LGBTQIA individuals, people with disabilities, members of ethnic minorities, foreign-born residents, older members of society, and others from minority groups and diverse backgrounds to apply. Exygy is an equal opportunity employer. We will not discriminate against applicants because of race, color, sex (including pregnancy), sexual orientation, gender identity or expression, age, religion, national origin, disability, ancestry, marital status, veteran status, medical condition, or any protected category prohibited by local, state, or federal laws. All employees and contractors of Exygy are responsible for maintaining a work atmosphere free from discrimination and harassment by treating others with dignity and respect.


What You Should Know About Senior Site Reliability Engineer, Exygy

Exygy is on the lookout for a talented Senior Site Reliability Engineer to join our team! If you have a passion for technology and a drive to create a positive impact, this remote U.S. opportunity may be perfect for you. As a part of our innovative digital studio, you’ll dive into the CiviForm project, working closely with our Principal Engineer to develop and manage secure, dependable platforms that serve various governmental organizations. Your main responsibilities will include enhancing our existing deployment systems, managing staging and production environments, and collaborating with engineering teams to streamline processes while implementing effective monitoring and alerting strategies. With Exygy’s commitment to delivering digital solutions that uplift communities, you’ll help us build robust, scalable applications supported by technologies such as Kubernetes, Python, and Terraform. We strive for a healthy work-life balance and foster an inclusive remote culture, meaning you can enjoy flexibility while contributing to meaningful projects. So if you're enthusiastic about squeezing the most value out of cloud infrastructure and have a robust knowledge of distributed systems, come join us at Exygy where your skills will make a difference. Plus, with competitive compensation and a supportive work environment, you'll feel right at home while making a positive impact in the world.

Frequently Asked Questions (FAQs) for Senior Site Reliability Engineer Role at Exygy
What are the responsibilities of a Senior Site Reliability Engineer at Exygy?

As a Senior Site Reliability Engineer at Exygy, you will engage actively in product development for CiviForm, managing both staging and production environments, and ensuring systems are scalable, secure, and resilient. Your role will include evolving deployment systems, addressing issues related to government services, and collaborating with engineering teams on robust testing and release procedures.

What qualifications are needed for the Senior Site Reliability Engineer position at Exygy?

To succeed in the Senior Site Reliability Engineer role at Exygy, candidates should have extensive experience in web app deployments, especially in AWS, Azure, and GCP. Proficiency with Kubernetes, Docker, Terraform, and Python, as well as a firm grasp of reliability engineering practices, is critical. Candidates should also be adept at using various version control systems, and have a solid understanding of authentication technologies.

What is Exygy's commitment to work-life balance for the Senior Site Reliability Engineer?

Exygy prides itself on offering its employees a healthy work-life balance. The company has embraced a remote-first working environment, and from 2025, will implement a 4-Day Work Week (4DWW) policy. Full-time employees can expect to work 32-40 hours a week, with Fridays being flexible, allowing for additional time off when workloads allow, ensuring a supportive and healthy work environment.

What technologies will a Senior Site Reliability Engineer work with at Exygy?

In the Senior Site Reliability Engineer role at Exygy, you'll work primarily with technologies like Kubernetes, Docker, Terraform, AWS, and Azure, alongside Python, which is used with Terraform to define the existing infrastructure. Knowledge of CI/CD practices, Linux, and networking will also be valuable, as you will be enhancing the deployment systems for our services.

How does Exygy support employee growth for the Senior Site Reliability Engineer?

Exygy is devoted to fostering employee development, offering a $1,000 annual stipend for professional growth, alongside a variety of remote team events and workshops. Furthermore, the company provides a wellness budget and an annual allowance to enhance your home office setup, ensuring a supportive and conducive environment for professional advancement.

Common Interview Questions for Senior Site Reliability Engineer
How do you approach managing staging and production environments?

In managing staging and production environments, it's crucial to maintain clear separation between the two, ensuring that testing occurs without affecting live services. I prioritize automation for deployments and employ monitoring tools to track performance, allowing quick identification of any issues during releases.

What is your experience with Kubernetes?

I have extensive experience with Kubernetes, having deployed several applications within it. I focus on ensuring that resources are effectively allocated, leveraging its capabilities for scaling, and utilizing Helm for managing packages. In my previous roles, I've also contributed to designing CI/CD pipelines that incorporate Kubernetes best practices.

How have you contributed to disaster recovery strategies in your previous roles?

I emphasize proactive disaster recovery strategies, ensuring that backups are regular and testing recovery plans frequently. At my last job, I implemented multi-zonal deployments to ensure resilience, allowing us to maintain uptime even during unexpected outages.

Can you describe your experience with Python in SRE work?

Python has been instrumental in my SRE roles, primarily for automating processes and building scripts for monitoring and alerting. I have developed tools that interact with our systems to log metrics and performance data, providing insights for strategic improvements in our infrastructure.

How do you monitor and maintain system reliability?

I employ a combination of monitoring tools to proactively check system health and performance, setting up alerts for anomalies. Additionally, I use metrics such as Service Level Indicators and Objectives to assess reliability, aligning them with business goals for consistent performance.

What strategies do you employ for effective CI/CD implementation?

Effective CI/CD implementation involves automating the build and deployment process, reducing manual errors, and ensuring rapid feedback loops. I focus on creating comprehensive testing suites that run with every build and utilize canary releases to minimize risk when deploying to production.

How do you handle outages when they occur?

In the event of an outage, it's crucial to remain calm and follow established protocols. I immediately communicate with stakeholders, gather data to diagnose the issue, and prioritize restoration efforts. Post-outage, I analyze causes to prevent future occurrences and improve systems.

What is your approach to security in cloud environments?

My approach to security entails strict adherence to best practices such as network security, role-based access controls, and regular audits of the cloud environment. I ensure that configurations meet established security standards and advocate for regular security training for all team members involved in the cloud resource management.

Can you explain the importance of playbooks in SRE?

Playbooks are essential in SRE as they provide clear, step-by-step guidelines for handling common incidents and deployment procedures. They contribute to consistent responses, reduce downtime, and serve as valuable training material for onboarding new team members.

What do you think is the biggest challenge facing Site Reliability Engineers today?

One of the biggest challenges facing Site Reliability Engineers today is balancing rapid development with the reliability of systems. As businesses seek to release features quickly, it can put pressure on reliability, necessitating transparent communication across teams to manage expectations and prioritize system health.


Exygy ( http://exygy.com/ ) is a strategy, design, and engineering firm. Exygy focuses on the for-purpose sector which means work with nonprofits, foundations, HealthTech, CivicTech, EdTech, CleanTech, and more - anywhere we can use our skills to ...

EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
April 16, 2025
