
Site Reliability Engineer

Neon aims to be the go-to platform for serverless Postgres with additional features like branching and autoscaling, to name a couple. Currently, we are serving 750k databases and want to grow that number, along with delivering more features, without compromising on reliability and scalability. This is where our SRE team comes into the picture.

The SRE team is responsible for managing Neon’s multi-region, multi-cloud deployment in close collaboration with the broader engineering team, as well as improving the reliability of the overall platform. All the features we want to implement can only reach our customers if the changes are delivered in a reliable way, which means the SRE team plays a significant role in defining our pace of development.

Successful candidates will get the opportunity to contribute to the effort of evolving Neon to become multi-cloud so that we can be as close as possible to our customers while also making decisions about how to best utilize different cloud technologies. They will also take part in refining and improving our existing infrastructure so that stability and scalability complement the delivery of new features and services.

Neon's foundation is built on open-source software. If you want to see what makes Neon work, feel free to browse https://github.com/neondatabase/neon (storage layer of databases) and https://github.com/neondatabase/autoscaling (autoscaling of databases), as well as our engineering blog. SREs frequently work with stakeholders in different teams, and these repos provide a sneak peek of what the Neon engineering team is capable of producing.

You will

  • Join an experienced team and contribute to the foundation all of Neon is built upon

  • Contribute to building a stable and cost-efficient infrastructure foundation

  • Play a key role in ensuring we are proactive instead of reactive on infrastructure and reliability

  • Coach your fellow engineers on cloud, infrastructure, and reliability topics

  • Be ready to join an on-call rotation

We're looking for someone who has

  • 4+ years of experience working in Site Reliability Engineering

  • Experience with cloud infrastructure components in Azure and/or AWS

  • Experience in a complex Linux infrastructure environment

  • Experience focusing on building repeatable and cost-efficient infrastructure

  • Experience building solutions for problems with no answers on Google

  • Experience working with monitoring solutions in the Prometheus ecosystem: Grafana, Loki, Tempo, VictoriaMetrics

  • Experience managing multi-cluster, multi-cloud Kubernetes deployments

  • Nice to have: Familiarity with Go, GitOps (e.g., Flux, ArgoCD), Postgres, Virtualization (QEMU/KVM)

Our stack: AWS, Azure, Terraform, Grafana Cloud, VictoriaMetrics, Flux, EKS/AKS.

 

About Neon

Neon is building open-source cloud-native PostgreSQL. Our architecture separates storage from compute, allowing for stateless and serverless Postgres. We're a well-funded startup with deep knowledge of Postgres internals and decades of experience building databases. We are a systems company; we work on low-level code with strict performance and correctness requirements.

Neon was created by a team of Postgres hackers and is led by CEO Nikita Shamgunov (co-founder of SingleStore). Neon is built on open-source principles and is focused on giving back to the Postgres and developer communities.

Our Team

  • We are a distributed team of 100+ people working from 25+ countries (concentrating around North American and European time zones)

  • We are a team built on open-source cultural principles (transparency, contribution, accountability, and proactivity)

  • Team with decades of experience building databases and deep knowledge of Postgres internals. We are deeply technical

  • We have experienced Postgres committers and hackers on the team (check the profiles of Heikki, Anastasia, Arseny, and Matthias)

  • We believe in the efficacy of collaborative open-source

  • We aim for a diversity of thoughts and backgrounds

  • We are keen to be a fast-moving, flat org and avoid hierarchical structures

Our Investors

Top-tier investors back Neon's vision:

  • We raised $104 million in funding from Menlo Ventures, Notable Capital, Khosla Ventures, General Catalyst, and Founders Fund.

    1. Venture vehicles of Snowflake and Databricks invested in Neon.

    2. Our angel investors are prominent technologists and ecosystem players. More than 20 awesome angels supported Neon, including Nat Friedman, Elad Gil, Mike Ovitz, Ajeet Singh, Guillermo Rauch, Søren Brammer Schmidt, and Wes McKinney.

    3. Our Board includes Quentin Clark, Glenn Solomon, Joe Morrissey, and Tim Tully.

Our Offer

  • You have an opportunity to be an early employee in a fast-scaling, ambitious team

  • You can work 100% remote: we'll handle all formalities to arrange work from your home

  • We grant equity (stock options) for all full-time hires

  • We offer a competitive benefits package in line with other tech companies (top-notch equipment, unlimited vacation, paid parental leave, and much more)

  • We are distributed, yet we build our bonds during regular offsites (the last one was in Lisbon, Portugal)

 

Average salary estimate

$135,000 per year (est.); range: $120,000 (min) to $150,000 (max)


What You Should Know About Site Reliability Engineer, Neon Inc

As a Site Reliability Engineer at Neon, you will be at the heart of our mission to evolve into the leading platform for serverless Postgres. Your role will encompass managing our multi-region, multi-cloud deployment while working closely with our talented engineering team. You'll be instrumental in ensuring that our infrastructure is not only robust and reliable but also scalable, allowing us to deliver exciting new features to our 750,000 databases effortlessly. The stakes are high, and you’ll play a significant part in determining our pace of development, ensuring that all changes reach our customers smoothly. We’re looking for someone with over 4 years of experience in Site Reliability Engineering, ideally with a strong background in cloud infrastructure components—think Azure or AWS. Your knowledge of complex Linux environments and your knack for formulating solutions to unique problems will be crucial. Familiarity with tools like Prometheus, Grafana, and Kubernetes will also stand you in great stead. At Neon, you'll not only contribute to crafting a stable infrastructure but will also coach fellow engineers on vital topics related to reliability and cloud technology. The opportunity to work remotely is a perk, and you’ll engage with a diverse international team committed to open-source values. If challenging and rewarding work excites you, we’d love for you to join our journey as we push the boundaries of cloud-native PostgreSQL.

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Neon Inc
What are the main responsibilities of a Site Reliability Engineer at Neon?

As a Site Reliability Engineer at Neon, your primary responsibilities will include managing our multi-region and multi-cloud deployments and collaborating closely with various engineering teams to enhance platform reliability. You'll also focus on building a stable infrastructure foundation, participating in an on-call rotation, and proactively improving our existing systems to support new features effectively.

What skills are required for the Site Reliability Engineer position at Neon?

Candidates for the Site Reliability Engineer role at Neon should have at least 4 years of experience in the field, with expertise in cloud infrastructure components, especially in Azure and AWS. A solid background in Linux environments, familiarity with Kubernetes for multi-cluster management, and experience with monitoring tools within the Prometheus ecosystem are also essential for success in this position.

How does the Site Reliability Engineer contribute to Neon's growth?

The Site Reliability Engineer plays a crucial role in Neon's growth by ensuring that all infrastructure changes are delivered reliably and efficiently. This allows Neon to introduce new features and services without compromising on performance. Additionally, your contributions will help shape our multi-cloud strategy, bringing us closer to our customers and reducing latency.

What are the opportunities for professional development as a Site Reliability Engineer at Neon?

At Neon, you'll have plenty of opportunities for professional development as a Site Reliability Engineer. You can engage in coaching your fellow engineers on cloud infrastructure topics, participate in innovative projects that expand your skillset, and work closely with a team of experienced professionals committed to sharing knowledge and fostering growth in the open-source community.

Is remote work an option for the Site Reliability Engineer role at Neon?

Yes, the Site Reliability Engineer role at Neon offers the flexibility to work 100% remotely. This allows you to manage your work-life balance while contributing to a dynamic, international team spread across various time zones, all united by a passion for open-source principles.

Common Interview Questions for Site Reliability Engineer
Can you describe your experience with cloud infrastructure in your previous roles?

In answering this question, be specific about the cloud platforms you've worked with such as AWS or Azure, detailing any projects that required you to manage complex deployments. Highlight any tools or methodologies that you've used, like Terraform for infrastructure as code, to create repeatable and cost-efficient infrastructure.

How have you ensured system reliability in a previous position?

When discussing system reliability, illustrate your approach with examples involving monitoring solutions, log management, and incident response processes. You might mention specific situations where your proactive measures prevented downtime or improved system performance.

Can you explain your experience with monitoring solutions like Prometheus or Grafana?

Demonstrate your familiarity with monitoring by discussing specific use cases where you implemented Prometheus and Grafana successfully. Explain how you set up alerts, visualized data, and used those insights for performance tuning and troubleshooting.

What challenges have you faced while working with Kubernetes deployments?

Share specific challenges you've encountered in managing Kubernetes environments, whether related to scaling issues, troubleshooting deployments, or managing multi-cluster setups. Discuss how you overcame these challenges and what tools or strategies you found effective.

How do you handle on-call duties and incident management?

In your response, emphasize your approach to on-call duties and incident management. Discuss any frameworks or tools you’ve used for incident tracking and response, how you prioritize tasks, and communicate with team members during high-pressure situations.

What is your approach to building cost-efficient infrastructure?

When answering this question, outline your strategies for optimizing cloud resources, such as using auto-scaling and serverless architecture. Mention tools like Terraform for infrastructure management, and how best practices like capacity planning have played a role in your projects.

Can you give an example of a complex problem you've solved that had no readily available solution online?

Here, provide a specific example of a significant challenge you faced that required creative problem-solving. Explain the steps you took, the research you conducted, and how your solution benefited the project or organization.

How do you stay current with emerging technologies in Site Reliability Engineering?

In your response, discuss the various resources you utilize to keep up-to-date, such as attending conferences, participating in forums, following blogs, and contributing to open-source projects. Highlight any relevant certifications you pursue to bolster your expertise.

What do you believe sets a successful Site Reliability Engineer apart from others?

This is an opportunity to emphasize traits such as being proactive, having strong problem-solving skills, and excellent communication abilities. Reflect on the significance of collaboration across teams and how a solid understanding of both development and operations enhances system reliability.

Why are you interested in the Site Reliability Engineer position at Neon?

Convey your genuine interest in Neon by discussing the company's mission to innovate in the cloud-native PostgreSQL space. Reflect on how your background aligns with their goals and how you can contribute to their exciting growth journey while embracing open-source principles.


Neon Software Inc is a company that operates in the Computer Software industry. It employs 11-20 people and has $1M-$5M of revenue. The company is headquartered in Lafayette, California.

Employment type: Full-time, remote
Date posted: December 18, 2024