Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer image - Rise Careers
Job details

Site Reliability Engineer - job 1 of 2

Virta Health is on a mission to transform diabetes care and reverse the type 2 diabetes epidemic. Current treatment approaches aren’t working—over half of US adults have either type 2 diabetes or prediabetes. Virta is changing this by helping people reverse type 2 diabetes through innovations in technology, personalized nutrition, and virtual care delivery reinvented from the ground up. We have raised over $350 million from top-tier investors, and partner with the largest health plans, employers, and government organizations to help their employees and members restore their health and live diabetes-free. Join us on our mission to reverse diabetes in 100M.As an SRE on the Infrastructure team at Virta, you will be building the foundation that will help our company move as fast as possible while meeting security and compliance requirements. Key projects for the team over the next two quarters include:• Improving our metrics strategy and developing tooling to create accurate mechanisms of tracking team/service health that feature development teams can use to make data-driven decisions.• Enhancing system observability, reliability, and efficiency using off-the-shelf technology combined with internal tools developed in Python and Go to increase transparency and visibility into our systems.• Build products to simplify service management and implementation for our developers, while ensuring standardization and security best practices.• Improving incident readiness with better tooling and the right hygiene practices such as game days.• Engage with feature development teams in exercises such as toil reduction, capacity planning, load testing, SLO process and other best practices to enhance the scalability and reliability of our systems through analysis and observability improvements.• Develop testing infrastructure to improve application quality and enable developers to speed up common development bottlenecks like slow builds, slow tests, or difficult manual testing.We are in the midst of re-defining what our observability and reliability strategy should be for our next stage of growth. Joining Virta would make you one of the key people defining and driving the future vision of what reliability and observability should look like.Responsibilities• Evangelizing SRE best practices across engineering with the goal of ensuring the high availability and performance of our cloud infrastructure.• Support infrastructure and product teams in analyzing and enhancing the scalability, reliability, and observability of our containerized systems.• Develop and maintain infrastructure automation to manage and scale standardized, repeatable infrastructure.• Develop testing infrastructure to improve application quality.• Drive continuous improvement initiatives within the team and across the engineering organization.• Develop and maintain standardized CI/CD pipelines to ensure smooth, efficient, and quality-assured deployment of applications to production.• Mentor peers to become great developers by recommending best practices and technical guidance.• Collaborate with our InfoSec team to ensure compliance requirements are implemented in the software development lifecycle• Provide technical support and troubleshoot infrastructure-related issues, both during development and in production environments.90 Day PlanWithin your first 90 days at Virta, we expect you will do the following:• Teach and inspire other engineering team members through knowledge sharing, pair programming, and giving feedback during code reviews• Propose and implement one or more process improvements related to reliability and observability to make our engineering team even betterMust-Haves• 5+ years of experience in cloud infrastructure automation using Kubernetes, bonus if that experience includes Anthos on GKE.• Proven expertise in developing and managing non-trivial services using GoLang or Python.• Hands-on experience with scripting languages (e.g., Python, Bash) for automation.• Proficiency in Infrastructure as Code (IaC) tools, specifically Terraform.• Strong knowledge of cloud security and compliance practices, preference towards HIPAA and HITRUST. Ideally, you have previously worked in a regulated environment.• Strong experience implementing observability for containerized applications.• Strong experience CI/CD principles and best practices, with experience implementing and optimizing pipelines with operational standards checks.• Excellent problem-solving and communication skills.• Demonstrated leadership skills and experience mentoring and guiding junior engineers.Values-driven cultureVirta’s company values drive our culture, so you’ll do well if:• You put people first and take care of yourself, your peers, and our patients equally• You have a strong sense of ownership and take initiative while empowering others to do the same• You prioritize positive impact over busy work• You have no ego and understand that everyone has something to bring to the table regardless of experience• You appreciate transparency and promote trust and empowerment through open access of information• You are evidence-based and prioritize data and science over seniority or dogma• You take risks and rapidly iterateIs this role not quite what you're looking for? Join our Talent Community and follow us on Linkedin to stay connected!As part of your duties at Virta, you may come in contact with sensitive patient information that is governed by HIPAA. Throughout your career at Virta, you will be expected to follow Virta's security and privacy procedures to ensure our patients' information remains strictly confidential. Security and privacy training will be provided.Virta has a location based compensation structure. Starting pay will be based on a number of factors and commensurate with qualifications & experience. For this role, the compensation range is [min of $145,885 - $188,409. Information about Virta’s benefits is on our Careers page at: https://www.virtahealth.com/careers.#LI-remote

Average salary estimate

Estimate provided by employer
$167147 / ANNUAL (est.)
min
max
$146K
$188K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, Virta Health

At Virta Health, we’re on an ambitious journey to transform diabetes care, and we're looking for a talented Site Reliability Engineer (SRE) to join our Infrastructure team in Briarcliff Manor, NY. Imagine being part of a mission that’s more than just a job; it’s an opportunity to help millions of people reverse type 2 diabetes through groundbreaking technology and personalized virtual care. As an SRE, you’ll be at the forefront of building a robust infrastructure that fosters rapid innovation while adhering to the highest standards of security and compliance. You’ll refine our metrics strategy, enhance system observability, and create powerful tools in Python and Go. With key projects on the horizon, including incident readiness and capacity planning initiatives, your work will play a crucial role in ensuring our services are reliable and scalable. You'll mentor fellow engineers, sharing knowledge and best practices to elevate team performance. By joining Virta, you won't just be contributing to a company; you'll be defining the future of healthcare technology as we aim to reverse diabetes for 100 million people. If you’re passionate about SRE best practices and eager to make a significant impact, we’d love to have you on board!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Virta Health
What are the main responsibilities of a Site Reliability Engineer at Virta Health?

As a Site Reliability Engineer at Virta Health, your main responsibilities will include supporting infrastructure and development teams in improving system reliability and observability, developing automation tools for cloud infrastructure, and implementing best practices for continuous integration and continuous deployment (CI/CD). You will play a pivotal role in mentoring junior engineers and will be engaged in ensuring our systems meet security and compliance requirements.

Join Rise to see the full answer
What qualifications are required for the Site Reliability Engineer position at Virta Health?

To thrive as a Site Reliability Engineer at Virta Health, you should possess over 5 years of experience in cloud infrastructure automation, particularly with Kubernetes. Proficiency in programming languages such as GoLang or Python, and familiarity with Infrastructure as Code (IaC) tools like Terraform are crucial. Experience in implementing observability for containerized applications and a solid understanding of cloud security practices, especially within regulated environments, are essential for success in this role.

Join Rise to see the full answer
What kind of projects will a Site Reliability Engineer work on at Virta Health?

At Virta Health, a Site Reliability Engineer will engage in pivotal projects focused on enhancing system reliability and observability. This includes developing advanced tooling for service monitoring, improving incident readiness through game day exercises, and refining capacity planning and load testing practices. You will also work on creating and optimizing CI/CD pipelines to streamline software deployment processes.

Join Rise to see the full answer
How does Virta Health support professional development for Site Reliability Engineers?

Virta Health is committed to professional development, offering opportunities for mentorship and continuous learning. As a Site Reliability Engineer, you will have the chance to share knowledge through pair programming and code reviews. You’ll also be encouraged to propose process improvements and participate in initiatives that drive professional growth within the engineering team.

Join Rise to see the full answer
What is the company culture like at Virta Health for Site Reliability Engineers?

The culture at Virta Health emphasizes a values-driven approach that prioritizes people, ownership, and positive impact. As a Site Reliability Engineer, you will be part of a team that values transparency, trust, and collaboration. The environment encourages creativity and evidence-based decision making, where everyone’s input is valued, regardless of their role or experience level.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer
Can you explain the importance of site reliability engineering?

Site reliability engineering (SRE) is crucial as it combines software engineering and systems administration to create scalable and highly reliable software systems. In your response, emphasize how SRE practices enhance operational efficiency and service availability, directly impacting user experience and organizational success.

Join Rise to see the full answer
How do you ensure the reliability of a cloud-based infrastructure?

Discuss specific strategies such as implementing monitoring tools, setting service-level objectives (SLOs), and continuously analyzing system performance. Focus on your experience with automation, incident response protocols, and collaboration with development teams to proactively address potential reliability issues.

Join Rise to see the full answer
What tools do you prefer for infrastructure automation?

Mention tools like Terraform for infrastructure as code, Kubernetes for container orchestration, and configuration management tools. Explain why you prefer these tools based on their ease of use, scalability, and ability to support high availability in cloud environments.

Join Rise to see the full answer
Describe your experience with incident management.

Explain your approach to incident management, including how you identify issues, communicate with stakeholders, and implement postmortem analysis to prevent future occurrences. Your answer should reflect a systematic method for improving incident response and ensuring team readiness.

Join Rise to see the full answer
How can you improve monitoring and observability in a system?

Discuss the significance of comprehensive monitoring frameworks that provide insights at all system levels. Highlight your experience in deploying tools that enhance observability, gathering meaningful metrics, and employing logging practices to ensure quick diagnosis and resolution of issues.

Join Rise to see the full answer
How do you approach scaling infrastructure?

Emphasize the importance of automation, load testing, and performance benchmarking in your approach to scaling infrastructure. Talk about specific methodologies you have used to ensure systems can handle increased loads efficiently and without degradation in performance.

Join Rise to see the full answer
What is your experience with CI/CD pipelines?

Share examples of CI/CD pipelines you designed or optimized, focusing on principles such as continuous integration, testing automation, and deploying strategies that enhance deployment frequency and reduce lead time for changes.

Join Rise to see the full answer
How do you handle security in cloud-based applications?

Discuss strategies you implement to secure cloud applications, including adherence to compliance standards like HIPAA, regular security audits, and implementing best practices in code development to mitigate risks. Emphasize collaboration with security teams for continuous improvements.

Join Rise to see the full answer
How do you mentor junior engineers in your team?

Explain your approach to mentoring junior engineers, including knowledge sharing through code reviews, pair programming, and fostering a culture where questions are encouraged. Share specific outcomes of mentorship programs you initiated or participated in.

Join Rise to see the full answer
Can you give an example of a challenging reliability problem you solved?

Provide a specific example that outlines the problem, the steps you took to address it, and the outcome. This showcases your problem-solving skills and your ability to implement reliable solutions under pressure.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 11 days ago
Photo of the Rise User
Posted 11 days ago
Photo of the Rise User
Customer-Centric
Startup Mindset
Collaboration over Competition
Family Medical Leave
Maternity Leave
Paternity Leave
Flex-Friendly
Social Gatherings
Pet Friendly
Fitness Stipend
Medical Insurance
Dental Insurance
Vision Insurance
Life insurance
Disability Insurance
Learning & Development
Bias Training
Equity
Employee Resource Groups
Unlimited Vacation
Paid Time-Off
Photo of the Rise User
Posted 7 days ago
Mission Driven
Social Impact Driven
Passion for Exploration
Reward & Recognition
Photo of the Rise User
Posted 6 days ago
Customer-Centric
Startup Mindset
Collaboration over Competition
Family Medical Leave
Maternity Leave
Paternity Leave
Flex-Friendly
Social Gatherings
Pet Friendly
Fitness Stipend
Medical Insurance
Dental Insurance
Vision Insurance
Life insurance
Disability Insurance
Learning & Development
Bias Training
Equity
Employee Resource Groups
Unlimited Vacation
Paid Time-Off
Photo of the Rise User
Posted 10 days ago

Virta Health provides remote treatment for type 2 diabetes without medications or surgery. Their approach results extend beyond diabetes reversal to other areas of metabolic and cardiovascular health, including sustained improvements in blood pres...

39 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
December 16, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!