Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer image - Rise Careers
Job details

Site Reliability Engineer

The SRE Team is responsible for managing Neon’s multi-region, multi-cloud deployment in close collaboration with the broader engineering team, as well as improving the reliability of the overall platform. All the features we want to implement can only reach our customers if the changes are delivered in a reliable way, which means the SRE team plays a significant role in defining our pace of development.

Successful candidates will get the opportunity to contribute to the effort of evolving Neon to become multi-cloud so that we can be as close as possible to our customers while also making decisions about how to best utilize different cloud technologies. They will also take part in refining and improving our existing infrastructure so that stability and scalability complement the delivery of new features and services.

Neon's foundations is built on open source software, if you want to take a look into what makes Neon work, feel feel to browse https://github.com/neondatabase/neon (storage layer of databases) and https://github.com/neondatabase/autoscaling (autoscaling of databases), as well as our engineering blog. SREs frequently work with stakeholders in different teams, these repos provide a sneak peek of what the Neon engineering team is capable of producing.

You will

  • Join an experienced team and contribute to the foundation all of Neon is built upon

  • Contribute to building a stable and cost-efficient infrastructure foundation

  • Play a key role in ensuring we are proactive instead of reactive on infrastructure and reliability

  • Coach your fellow engineers on cloud, infrastructure, and reliability topics

  • Be ready to join an on-call rotation

We're looking for someone who has

  • 4+ years experience working in Site Reliability Engineering

  • Experience with cloud infrastructure components in Azure and/or AWS

  • Experience in a complex Linux infrastructure environment

  • Experience focusing on building repeatable and cost-efficient infrastructure

  • Experience building solutions for problems with no answers on Google

  • Experience working with monitoring solutions in the Prometheus ecosystem; Grafana, Loki, Tempo, VictoriaMetrics

  • Experience managing multi-cluster, multi-cloud Kubernetes deployments

  • Nice to have: Familiarity with Go, GitOps (e.g., Flux, ArgoCD), Postgres, Virtualization (QEMU/KVM)

Our stack: AWS, Azure, Terraform, Grafana Cloud, VictoriaMetrics, Flux, EKS/AKS.

 

About Neon

Neon is building open-source cloud-native PostgreSQL. Our architecture separates storage from compute, allowing for stateless and serverless Postgres. We're a well-funded startup with deep knowledge of Postgres internals and decades of experience building databases. We are a systems company; we work on low-level code with strict performance and correctness requirements.

Neon was created by a team of Postgres hackers and led by CEO Nikita Shamgunov (co-founder of SingleStore). Neon is built on open-source principles and is focused on giving back to the Postgres and developer communities.

Our Team

  • We are a distributed team of 100+ people working from 25+ countries (concentrating around North American and European time zones)

  • We are a team built on open-source cultural principles (transparency, contribution, accountability, and proactivity)

  • Team with decades of experience building databases and deep knowledge of Postgres internals. We are deeply technical

  • We have experienced Postgres committers and hackers on the team (check HeikkiAnastasiaArsenyMatthias profiles)

  • We believe in the efficacy of collaborative open-source

  • We aim for a diversity of thoughts and backgrounds

  • We are keen to be a fast-moving, flat org and avoid hierarchical structures

Our Investors

Top-tier investors backed up Neon's vision:

  • We raised $104 million in funding from Menlo Ventures, Notable Capital, Khosla Ventures, General Catalyst, and Founders Fund.

    1. Venture vehicles of Snowflake and Databricks invested in Neon.

    2. Our angel investors are prominent technologists, and ecosystem players. More than 20 awesome angels supported Neon, including Nat Friedman, Elad Gil, Mike Ovitz, Ajeet Singh, Guillermo Rauch, Søren Brammer Schmidt, and Wes McKinney.

    3. Our Board includes Quentin Clark, Glenn Solomon, Joe Morrissey, and Tim Tully.

Our Offer

  • You have an opportunity to be an early employee in the fast-scaling ambitious team

  • You can work 100% remote: we'll handle all formalities to arrange work from your home

  • We grant equity (stock options) for all full-time hires

  • We offer a competitive benefits package in line with all tech companies (top-notch equipment, unlimited vacations, paid parental leaves, and much more)

  • We are distributed, yet make our bonds during regular offsites (the last one was in Lisbon, Portugal)

Average salary estimate

$105000 / YEARLY (est.)
min
max
$90000K
$120000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, Neon Inc.

Join the Neon team as a Site Reliability Engineer in London, where you’ll play a vital role in enhancing our multi-region, multi-cloud deployment. Our SRE team works closely with the broader engineering crew to improve the reliability and performance of our platform, ensuring that every feature reaches our customers seamlessly. If you’re passionate about maintaining and scaling infrastructure, this is your chance to contribute to Neon’s evolution into a multi-cloud powerhouse. You'll be building a stable, cost-effective infrastructure while coaching fellow engineers on cloud technologies and reliability topics. You’ll find yourself diving into open-source software, working with tools like Prometheus, and managing Kubernetes deployments across multiple clouds. This is an opportunity to shape the future of cloud-native PostgreSQL in a fast-paced startup that values transparency and collaboration. Plus, you’ll enjoy the flexibility of remote work and equity opportunities, all while being part of a distributed team that values diverse perspectives and technical expertise. Are you ready to make an impact? Let's build an innovative infrastructure together at Neon!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Neon Inc.
What are the responsibilities of a Site Reliability Engineer at Neon?

As a Site Reliability Engineer at Neon, your responsibilities will include managing our multi-region, multi-cloud deployment, ensuring system reliability, and refining existing infrastructure. You'll be expected to contribute to building a stable and cost-efficient infrastructure and coach fellow engineers on cloud and reliability topics, all while partaking in on-call duties for system stability.

Join Rise to see the full answer
What qualifications are needed for the Site Reliability Engineer position at Neon?

Candidates for the Site Reliability Engineer role at Neon should have at least 4 years of experience in Site Reliability Engineering, proficiency in cloud infrastructure components (preferably Azure and/or AWS), and experience in complex Linux environments. Familiarity with monitoring solutions, building scalable systems, and managing Kubernetes deployments will also be crucial for this position.

Join Rise to see the full answer
What technologies does Neon use for its Site Reliability Engineering?

Neon leverages a stack that includes AWS, Azure, Terraform, and tools like Grafana Cloud and VictoriaMetrics to support its Site Reliability Engineering efforts. Familiarity with Go, monitoring solutions, and Kubernetes deployments will help in contributing effectively to our infrastructure.

Join Rise to see the full answer
What can a Site Reliability Engineer at Neon expect from the work environment?

At Neon, a Site Reliability Engineer can expect a supportive work environment, with a fully distributed team collaborating from over 25 countries. We uphold open-source cultural principles, emphasizing transparency, accountability, and collaboration while avoiding rigid hierarchical structures, creating an atmosphere that empowers engineers.

Join Rise to see the full answer
Are there growth opportunities for Site Reliability Engineers at Neon?

Absolutely! As a Site Reliability Engineer at Neon, you will be part of a fast-scaling startup with ample opportunities for professional growth. You'll be involved in decision-making processes, contributing to innovative solutions, and gaining exposure to cutting-edge technologies, all paving the way for your career advancement.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer
How do you handle outages and incidents as a Site Reliability Engineer?

In handling outages, I follow a structured incident response plan, ensuring that I assess the impact, communicate with stakeholders, and implement temporary fixes while working on a permanent solution. After resolving the incident, I conduct a post-mortem analysis to identify root causes and improve our systems to prevent future occurrences.

Join Rise to see the full answer
Describe your experience with multi-cloud deployments and Kubernetes management.

I have hands-on experience managing multi-cloud Kubernetes deployments, utilizing tools like Flux and ArgoCD for GitOps to streamline deployment processes. I focus on ensuring that my deployments are scalable, reliable, and easy to manage across different cloud environments through best practices in CI/CD.

Join Rise to see the full answer
What monitoring solutions have you used in previous roles?

I have utilized monitoring solutions such as Prometheus, Grafana, and VictoriaMetrics extensively. I believe in setting up comprehensive monitoring and alerting systems that provide insights into application performance and system health, enabling proactive responses to potential issues.

Join Rise to see the full answer
Can you explain your approach to building repeatable and cost-efficient infrastructure?

My approach involves codifying infrastructure management using tools like Terraform, ensuring that I establish clear standards and reusable modules. This not only enhances repeatability but also allows for efficient scaling without incurring unnecessary costs, by optimizing resource usage in cloud environments.

Join Rise to see the full answer
What challenges have you faced while working in a complex Linux infrastructure?

In a complex Linux infrastructure, I've faced challenges such as managing configurations across multiple servers and troubleshooting network issues. I’ve addressed these by implementing configuration management tools, automating processes, and developing detailed documentation to facilitate easier maintenance and onboarding.

Join Rise to see the full answer
What role does collaboration play in your work as a Site Reliability Engineer?

Collaboration is crucial in my role. I work closely with developers, product managers, and other engineering teams to align on reliability goals and ensure smooth feature rollouts. Regular stand-ups, feedback loop meetings, and collaborative troubleshooting sessions foster a culture of shared ownership of system reliability.

Join Rise to see the full answer
Tell us about a time when you built a solution to a unique problem.

Once, I encountered a unique scaling issue that wasn’t documented online. I conducted a series of experiments to identify the root cause and then designed a custom solution involving load balancing across multiple instances. Sharing my findings in a tech meeting helped the team adapt similar strategies across projects.

Join Rise to see the full answer
How do you ensure the security of your systems as an SRE?

I prioritize security by following industry best practices, conducting regular audits, and ensuring all systems are patched promptly. Additionally, I implement monitoring solutions to detect any anomalies, and I advocate for a security-first approach in the development process, involving security considerations from the very start.

Join Rise to see the full answer
What is your experience with open-source technologies?

I have actively contributed to several open-source projects, which has deepened my understanding of collaborative software development. Working with open-source technologies aligns with my belief in shared knowledge creation and continuous improvement, which I find vital in creating robust, reliable systems.

Join Rise to see the full answer
Why are you interested in working at Neon as a Site Reliability Engineer?

I'm excited about the opportunity at Neon because of its commitment to innovation and open-source principles. The chance to contribute to cutting-edge technologies like cloud-native PostgreSQL while being part of a dynamic, talented team aligns perfectly with my career aspirations and passion for reliability engineering.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 11 days ago
Photo of the Rise User
E.L.F. BEAUTY Remote Ahmedabad, Gujarat
Posted 11 days ago
Timmons Group Hybrid 608 Preston Ave, Charlottesville, VA 22903, USA
Posted 3 days ago
Photo of the Rise User
Houston Engineering, Inc. Hybrid 7550 Meridian Cir N suite 120, Maple Grove, MN 55369, USA
Posted 12 days ago
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
LOCATION
No info
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
January 4, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!