Job details

Site Reliability Engineer

Neon aims to be the go-to platform for serverless Postgres with additional features like branching and autoscaling, to name a couple. Currently, we are serving 750k databases and want to grow that number, along with delivering more features, without compromising on reliability and scalability. This is where our SRE team comes into the picture.

The SRE team is responsible for managing Neon’s multi-region, multi-cloud deployment in close collaboration with the broader engineering team, as well as improving the reliability of the overall platform. All the features we want to implement can only reach our customers if the changes are delivered in a reliable way, which means the SRE team plays a significant role in defining our pace of development.

Successful candidates will get the opportunity to contribute to the effort of evolving Neon to become multi-cloud so that we can be as close as possible to our customers while also making decisions about how to best utilize different cloud technologies. They will also take part in refining and improving our existing infrastructure so that stability and scalability complement the delivery of new features and services.

Neon's foundations is built on open source software, if you want to take a look into what makes Neon work, feel feel to browse https://github.com/neondatabase/neon (storage layer of databases) and https://github.com/neondatabase/autoscaling (autoscaling of databases), as well as our engineering blog. SREs frequently work with stakeholders in different teams, these repos provide a sneak peek of what the Neon engineering team is capable of producing.

You will

Join an experienced team and contribute to the foundation all of Neon is built upon
Contribute to building a stable and cost-efficient infrastructure foundation
Play a key role in ensuring we are proactive instead of reactive on infrastructure and reliability
Coach your fellow engineers on cloud, infrastructure, and reliability topics
Be ready to join an on-call rotation

We're looking for someone who has

4+ years experience working in Site Reliability Engineering
Experience with cloud infrastructure components in Azure and/or AWS
Experience in a complex Linux infrastructure environment
Experience focusing on building repeatable and cost-efficient infrastructure
Experience building solutions for problems with no answers on Google
Experience working with monitoring solutions in the Prometheus ecosystem; Grafana, Loki, Tempo, VictoriaMetrics
Experience managing multi-cluster, multi-cloud Kubernetes deployments
Nice to have: Familiarity with Go, GitOps (e.g., Flux, ArgoCD), Postgres, Virtualization (QEMU/KVM)

Our stack: AWS, Azure, Terraform, Grafana Cloud, VictoriaMetrics, Flux, EKS/AKS.

About Neon

Neon is building open-source cloud-native PostgreSQL. Our architecture separates storage from compute, allowing for stateless and serverless Postgres. We're a well-funded startup with deep knowledge of Postgres internals and decades of experience building databases. We are a systems company; we work on low-level code with strict performance and correctness requirements.

Neon was created by a team of Postgres hackers and led by CEO Nikita Shamgunov (co-founder of SingleStore). Neon is built on open-source principles and is focused on giving back to the Postgres and developer communities.

Our Team

We are a distributed team of 100+ people working from 25+ countries (concentrating around North American and European time zones)
We are a team built on open-source cultural principles (transparency, contribution, accountability, and proactivity)
Team with decades of experience building databases and deep knowledge of Postgres internals. We are deeply technical
We have experienced Postgres committers and hackers on the team (check Heikki, Anastasia, Arseny, Matthias profiles)
We believe in the efficacy of collaborative open-source
We aim for a diversity of thoughts and backgrounds
We are keen to be a fast-moving, flat org and avoid hierarchical structures

Our Investors

Top-tier investors backed up Neon's vision:

We raised $104 million in funding from Menlo Ventures, Notable Capital, Khosla Ventures, General Catalyst, and Founders Fund.
1. Venture vehicles of Snowflake and Databricks invested in Neon.
2. Our angel investors are prominent technologists, and ecosystem players. More than 20 awesome angels supported Neon, including Nat Friedman, Elad Gil, Mike Ovitz, Ajeet Singh, Guillermo Rauch, Søren Brammer Schmidt, and Wes McKinney.
3. Our Board includes Quentin Clark, Glenn Solomon, Joe Morrissey, and Tim Tully.

Our Offer

You have an opportunity to be an early employee in the fast-scaling ambitious team
You can work 100% remote: we'll handle all formalities to arrange work from your home
We grant equity (stock options) for all full-time hires
We offer a competitive benefits package in line with all tech companies (top-notch equipment, unlimited vacations, paid parental leaves, and much more)
We are distributed, yet make our bonds during regular offsites (the last one was in Lisbon, Portugal)

Average salary estimate

$135000 / YEARLY (est.)

min

max

$120000K

$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, Neon Inc

As a Site Reliability Engineer at Neon, you will be at the heart of our mission to evolve into the leading platform for serverless Postgres. Your role will encompass managing our multi-region, multi-cloud deployment while working closely with our talented engineering team. You'll be instrumental in ensuring that our infrastructure is not only robust and reliable but also scalable, allowing us to deliver exciting new features to our 750,000 databases effortlessly. The stakes are high, and you’ll play a significant part in determining our pace of development, ensuring that all changes reach our customers smoothly. We’re looking for someone with over 4 years of experience in Site Reliability Engineering, ideally with a strong background in cloud infrastructure components—think Azure or AWS. Your knowledge of complex Linux environments and your knack for formulating solutions to unique problems will be crucial. Familiarity with tools like Prometheus, Grafana, and Kubernetes will also stand you in great stead. At Neon, you'll not only contribute to crafting a stable infrastructure but will also coach fellow engineers on vital topics related to reliability and cloud technology. The opportunity to work remotely is a perk, and you’ll engage with a diverse international team committed to open-source values. If challenging and rewarding work excites you, we’d love for you to join our journey as we push the boundaries of cloud-native PostgreSQL.

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Neon Inc

What are the main responsibilities of a Site Reliability Engineer at Neon?

As a Site Reliability Engineer at Neon, your primary responsibilities will include managing our multi-region and multi-cloud deployments and collaborating closely with various engineering teams to enhance platform reliability. You'll also focus on building a stable infrastructure foundation, participating in an on-call rotation, and proactively improving our existing systems to support new features effectively.