Senior+ Platform Engineer

Swarmia was founded to help every team achieve visibility into their own ways of working, a culture of small but continuous improvement, and great tooling to help them improve in a way that sticks. Companies like Docker, Miro, and Webflow use Swarmia’s SaaS product.

We are looking for a Senior Platform Engineer with deep infrastructure expertise to help us build and operate production systems that our customers rely on daily. 

Our infrastructure mainly runs on Google Cloud and is managed with Terraform. Since starting the company five years ago, we’ve reached 99.9% uptime every year. Our code is continuously deployed to production, with high automated test coverage. While we’re very experienced with building production systems that scale, you will be the first person to be fully dedicated to the Platform & Infrastructure work with this title.

Examples of things you'll do

  • Assist product teams in running their Kubernetes workloads and automating manual steps

  • Design and implement infrastructure changes using Terraform

  • Collaborate with product teams to optimize application performance and resource utilization

  • Look near: Notice our message queue getting backed up? Dive in, analyze the bottleneck, and implement a solution before it affects our users

  • Look far: See our cloud costs trending up? Analyze usage patterns, identify optimization opportunities, and work with the team to implement efficient solutions

  • Configure VPCs and securely segment production networks

  • Write documentation and playbooks with the team - we prefer collaborative problem-solving over working in silos

  • Set up and fine-tune monitoring and SLO alerting in our observability stack to catch issues before they impact customers

  • Plan when to upsize our PostgreSQL servers or bring more machines to our Kubernetes cluster

  • Go spelunking in Google Security Command Center, review security posture, and implement improvements to keep our platform secure and compliant

  • Design and implement automated disaster recovery procedures, then work with the team to practice them regularly

  • Automate away manual operational tasks - if you find yourself doing something more than twice, it's probably time to automate it

  • Work with the team to implement and maintain compliance requirements while keeping our development workflow smooth

  • Jump into a customer support chat when there's a problem with a customer's data sync

  • Optimize PostgreSQL performance by identifying table and index bloat and tuning pg_repack runs

Tech stack

You don't need to know all of the tools beforehand; we are happy to show you the ropes!

  • Hosting: Google Cloud Platform and Google Kubernetes Engine

  • Messaging systems: Google Cloud Pub/Sub

  • Terraform for infrastructure as code

  • Observability stack (Prometheus, Grafana, GCP logging/tracing/monitoring)

  • Backends: TypeScript/Node.js

  • Databases: PostgreSQL (Google Cloud SQL), Redis

  • Data warehouse: BigQuery, Dataform

  • CI/CD: GitHub Actions

What we offer

  • A highly experienced and motivated team

  • A domain that is very relevant (and hopefully interesting!) to any engineer

  • 70-90k€ annual salary plus a meaningful amount of equity

  • Paid annual vacation, with ten extra days for new employees

  • Flexible model of work - pick your own balance of remote/office

  • Great work/life balance - we're a startup, but we don’t crunch or work at an unsustainable pace (many of us have kids and other responsibilities beyond work)

Average salary estimate

$80,000 / year (est.); min $70,000, max $90,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior+ Platform Engineer, Swarmia

At Swarmia, we’re on a mission to empower teams with the visibility and tools they need for continuous improvement, and we're excited to find a Senior Platform Engineer who shares our vision! Our cutting-edge SaaS product is already making waves with companies like Docker, Miro, and Webflow, and now, we need someone with strong infrastructure expertise to join us in building and running the production systems that our customers rely on daily. You'll be diving deep into Google Cloud, managing it with Terraform while ensuring our commitments to 99.9% uptime continue to shine. As our first dedicated Senior Platform Engineer, you'll collaborate closely with product teams to optimize Kubernetes operations and automate manual processes. Your analytical skills will shine as you identify performance bottlenecks and implement solutions proactively. You will streamline operations by designing necessary infrastructure changes while also fine-tuning our observability stack to monitor for issues before they escalate. Additionally, you’ll be involved in configuring secure VPCs, automating disaster recovery procedures, and continuously tuning PostgreSQL performance. What’s more, we offer a flexible working model, a supportive work-life balance, and a competitive salary package ranging from 70-90k€ plus equity. Join us and let’s make an impact together!

Frequently Asked Questions (FAQs) for Senior+ Platform Engineer Role at Swarmia
What are the primary responsibilities of a Senior Platform Engineer at Swarmia?

As a Senior Platform Engineer at Swarmia, your primary responsibilities include designing and implementing infrastructure changes, assisting product teams with Kubernetes management, optimizing application performance, configuring secure VPCs, writing documentation, and tuning PostgreSQL performance. In this role, you'll play a vital part in maintaining our production systems and ensuring our customers experience seamless service.

What qualifications are required for the Senior Platform Engineer position at Swarmia?

Swarmia requires candidates for the Senior Platform Engineer position to have deep infrastructure expertise, especially in Google Cloud and Terraform. Familiarity with Kubernetes, experience with monitoring and observability tools like Prometheus and Grafana, and a strong understanding of PostgreSQL are also key. Additionally, proficiency in automation and a collaborative mindset will help you thrive within our team.

How does Swarmia support continuous improvement for its Senior Platform Engineers?

At Swarmia, we believe in the value of continuous improvement not just for our teams but for our employees. As a Senior Platform Engineer, you will engage in training sessions and collaborative problem-solving workshops. We encourage knowledge sharing, allowing you to learn new technologies and refine your skills while contributing to significant projects.

What tools and technologies will I be using as a Senior Platform Engineer at Swarmia?

In your role at Swarmia, you'll engage with a diverse tech stack that includes Google Cloud Platform and Google Kubernetes Engine for hosting, Terraform for infrastructure as code, and observability tools like Prometheus and Grafana. Your daily tasks will also involve working with TypeScript/Node.js backends, PostgreSQL databases, and CI/CD tools like GitHub Actions, ensuring you stay at the forefront of technology.

What kind of work-life balance can I expect as a Senior Platform Engineer at Swarmia?

Swarmia promotes a strong work-life balance, ensuring our employees do not experience burnout despite the fast-paced startup environment. As a Senior Platform Engineer, you'll enjoy a flexible remote/office work model while being part of a supportive team culture that respects personal responsibilities and priorities outside of work.

Common Interview Questions for Senior+ Platform Engineer
Can you describe your experience with infrastructure as code, specifically with Terraform?

To answer this question effectively, share specific projects where you've used Terraform to manage cloud resources. Highlight your ability to create modular and reusable Terraform configurations, explain how you've implemented changes through infrastructure as code, and discuss how this improved deployment consistency and efficiency in your previous roles.

How do you approach optimizing Kubernetes performance?

When discussing Kubernetes optimization, emphasize your experience in monitoring cluster performance, analyzing resource usage metrics, and identifying bottlenecks. Provide examples of strategies you’ve implemented, such as adjusting resource requests/limits, using HPA (Horizontal Pod Autoscaler), or optimizing pod placement to maintain high availability and efficiency.

What strategies do you use for database performance tuning, especially with PostgreSQL?

In your response, outline your systematic approach to database performance tuning. Discuss specific techniques you've used, such as analyzing query performance with EXPLAIN, identifying and addressing table bloat, adjusting index usage, or implementing pg_repack runs to optimize performance continuously.

Describe a situation where you had to troubleshoot a complex production issue.

When answering this question, use the STAR method: Situation, Task, Action, Result. Describe the context of the issue, your specific role in resolving it, the steps you took for troubleshooting (e.g., examining logs, increasing monitoring), and highlight the positive outcome for the team and end-users.

How do you ensure the security and compliance of cloud infrastructure?

Discuss your experience with implementing security best practices, such as configuring IAM policies, using firewall rules or security groups to restrict access, and regularly performing security audits. Highlight any tools you've used to enhance security posture and how you've collaborated with teams to maintain compliance with industry standards.

What methods do you use for monitoring cloud infrastructure?

You could explain how you use monitoring tools such as Prometheus and Grafana to track system performance and alert the team to potential issues. Mention the importance of setting up relevant metrics, SLOs, and alerts to catch problems before they impact customers, ensuring a proactive operational environment.

How do you handle multiple stakeholders when working on collaborative projects?

Talk about your communication strategy when working with multiple stakeholders, including setting clear expectations, regular updates, and fostering a collaborative environment. Explain your approach to balancing differing priorities and ensuring all team members stay aligned toward common goals.

Can you share an example of an automated process you’ve implemented?

Discuss a specific automated task you have implemented, whether it's for deployment, backups, or continuous integration. Explain the impact it had on the operational team's efficiency and how you documented the process for team knowledge sharing.

What do you consider critical metrics for assessing cloud infrastructure performance?

Provide a list of critical metrics that are pivotal for assessing performance, like uptime percentage, response time, resource utilization, and error rates. Highlight how you track these metrics for optimization and ensure they align with business service level objectives.

Why do you want to work as a Senior Platform Engineer at Swarmia?

Use this question to convey your enthusiasm for Swarmia's mission, the impact of its product, and your alignment with its values. Mention specific aspects of the role that excite you, such as working on a diverse tech stack, the commitment to a great work-life balance, or the opportunity to make tangible contributions to the platform.

Swarmia is an engineering productivity platform that gives engineering leaders, managers, and teams the insights they need to see what’s slowing them down and the tools to resolve those blockers. It ...connects with the platforms your engineering ...

Employment type: Full-time, hybrid
Date posted: December 30, 2024
