
Site Reliability Engineer

We are Vitesse – the treasury and payment partner of choice for insurance. 

Formed in 2014 by a team of proven FinTech entrepreneurs, we are an FCA-regulated business providing global claim funds management and payment solutions. Operating one of the largest banking and payment settlement networks in the world, we give our customers direct access to 200 countries and currencies. Through a single integration, insurers can use this network to pay claims in as little as 45 seconds and deliver a superior claimant experience. Our market-leading treasury proposition provides insurers with transparency and control over their claim funds, even when delegated to third parties, allowing them to have their money in the right place, at the right time, to make that all-important payment when customers need it most.

With over 175 employees across our London headquarters, Europe, and the US, $93m in Series C funding secured, and more than £10bn in processed transactions, we are only just getting started.

We are collaborative and customer-centric, and we work with integrity, partnering with some of the biggest insurance leaders, including Lloyd’s of London and ManyPets. We take huge pride in our company culture, ensuring that everyone has a part to play, an opportunity to be heard and involved, and the ability to make a real difference. As we continue to scale up, we want like-minded humans to join us on this exciting journey. Are you ready?

As a Site Reliability Engineer (SRE), you will play an important role in designing, building, and maintaining the infrastructure and tools necessary to support our software applications and services. You will collaborate closely with the product engineering squads, technical operations, and security teams to ensure the reliability, scalability, and security of our platform. Your responsibilities will include automating infrastructure provisioning, configuration management, and deployment pipelines, utilizing best practices and modern technologies to streamline processes and improve efficiency. You will also be responsible for monitoring system performance, identifying bottlenecks, and implementing solutions to enhance system reliability and performance.

Key responsibilities

• Cloud Platform Management: Using Azure/AWS to manage and optimize infrastructure components, ensuring scalability, reliability, and cost management.

• Infrastructure Design and Implementation: Designing, building, and maintaining the cloud-based infrastructure that supports our software applications and services.

• System Reliability: Ensuring the reliability, availability, and performance of systems and services by designing, implementing, and maintaining robust infrastructure.

• Infrastructure as Code (IaC): Implementing and maintaining tools for automation, monitoring, and deployment to improve efficiency and reduce manual intervention.

• Collaboration and Support: Working closely with product engineering to ensure efficient workflows and support continuous integration and delivery pipelines (CI/CD).

• Capacity Planning and Scalability: Assessing system capacity requirements and planning for future growth to ensure the system can scale and is cost efficient.

• Incident Response and Management: Monitoring system health, promptly responding to incidents, and assisting with the resolution process.

• Risk Management: Identifying potential risks and vulnerabilities in systems and implementing measures to mitigate these risks effectively.

• Monitoring and Observability: Implementing and overseeing monitoring tools to proactively detect and mitigate issues, ensuring high application and system availability.

• Documentation and Knowledge Sharing: Maintaining documentation and sharing knowledge with the team to ensure transparency and facilitate cross-functional collaboration.

Requirements

• 3+ years of experience in a Site Reliability Engineer, DevOps, Platform Engineer, or similar role.

• Strong knowledge of and experience with cloud platforms; substantial experience with Microsoft Azure is essential.

• Proven track record in designing, implementing, and maintaining highly available and scalable systems.

• Expertise in containerization tools like Docker and orchestration tools such as Kubernetes.

• Experience with infrastructure as code (IaC) tools such as Terraform, Ansible, or Chef for automation and configuration management.

• Strong understanding of monitoring and observability tools such as Prometheus, Grafana, and Azure App Insights for proactive system monitoring and troubleshooting.

• Knowledge of networking, security principles, and best practices in a cloud environment.

• Demonstrated experience with CI/CD tools such as GitHub Actions, GitLab CI/CD, or Azure DevOps for continuous integration and delivery.

• Problem-solving mindset and meticulous attention to detail.

• Strong collaboration and communication skills to work effectively with cross-functional teams.

• Comfortable working in a fast-paced environment, handling incidents, and participating in on-call rotations.

• Adaptability to evolving technologies and eagerness to learn new tools and methodologies.

Benefits

• 25 days' holiday per year (increasing by 1 day per year of service, up to 30 days) + Bank Holidays
• Hybrid working arrangements – The role offers either a hybrid work schedule or a fully remote option, with occasional office visits for team get-togethers or larger product workshops. It demands the ability to thrive in a fast-paced setting, frequently multitasking across various tasks and support requests.
• Contributory pension scheme
• Enhanced parental leave
• Cycle to Work Scheme
• Private Medical Insurance with AXA
• Unlimited access to therapy sessions through our partner, Oliva
• Discounted gym membership through Gympass
• Financial coaching with Octopus Wealth
• 2 days of volunteering leave per year
• Sabbatical after 5 years’ service
• Life Assurance with MetLife (UK employees only)
• Ongoing learning and development to help you reach your career goals

Vitesse at our best – our values 

The Vitesse values are a true reflection of what it takes to thrive in our business, so it’s important to us that any employee who joins us is aligned with these three attributes:

Confident Humility 

We don’t do ego and we know that unless we all win, none of us win. We admit when we’re wrong, ask for help and always think about the wider business before ourselves.

Driven to Succeed 

We see the opportunity ahead of us and we won’t stop until we fulfil the potential we know we have. We hold ourselves to high standards and deliver high quality outcomes for Vitesse and our customers.  

Tenacious Responsibility 

We take ownership for our actions and decisions, and face into the challenges that come our way. We are committed to seeing things through to completion, even in the face of adversity. 

We are an Equal Opportunity Employer

We are committed to creating an inclusive environment that enables everyone to perform at their best, where we recognise the rights of all individuals to mutual respect and where there is an unbiased acceptance of others. Our policies and practices aim to promote an environment that is free from all forms of unfair discrimination and values the diversity of all people. At the heart of our policy, we seek to treat people fairly and with dignity and respect. If you are selected for an interview, please let us know what interview adjustments you would need. You can contact Clara Moretti-Greene at clara.moretti-greene@vitesse.io or, in her absence, our People Team at PeopleTeam@vitessepsp.com.

What You Should Know About Site Reliability Engineer, Vitesse PSP

As a Site Reliability Engineer at Vitesse, you'll step into an invigorating role that blends innovation and responsibility in exciting ways. Founded by a team of FinTech pros in 2014, Vitesse is redefining the treasury and payment landscape for insurance, joining forces with industry giants like Lloyd’s of London. This isn’t just a job; it’s a chance to be part of something impactful! In your role, you will design, build, and maintain the crucial infrastructure that keeps our services running smoothly. You’ll work hand-in-hand with our talented product engineering squads, diving into cloud management, infrastructure design, and system reliability. Get ready to embrace automation through Infrastructure as Code, enhance performance monitoring, and tackle incident response like a pro. With over 175 team members spread across Europe and the US, we foster a collaborative environment that values everyone’s contributions. Plus, our generous benefits package, including hybrid working arrangements and ongoing development opportunities, ensures you’ll thrive both personally and professionally. Are you ready for the challenge of keeping our platform healthy and our users happy? Join us and help us scale up and succeed as we cater to the needs of insurers across the globe. Let’s make a difference together!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Vitesse PSP
What are the responsibilities of a Site Reliability Engineer at Vitesse?

As a Site Reliability Engineer at Vitesse, your main responsibilities include managing cloud platforms like Azure or AWS, designing and implementing infrastructure, ensuring system reliability, and automating deployment processes. You will collaborate closely with product teams to streamline continuous integration and delivery, monitor system health, and manage incidents effectively to maintain an optimal user experience.

What qualifications are required for the Site Reliability Engineer position at Vitesse?

To qualify for the Site Reliability Engineer position at Vitesse, you should have at least 3 years of experience in roles such as SRE, DevOps, or Platform Engineering. You need strong expertise in cloud platforms, particularly with Microsoft Azure, along with knowledge of containerization tools like Docker and orchestration tools such as Kubernetes. Familiarity with Infrastructure as Code tools and CI/CD practices is also essential.

How does Vitesse support the professional development of a Site Reliability Engineer?

At Vitesse, we prioritize ongoing learning and development for our Site Reliability Engineers. You'll have access to various training programs, workshops, and opportunities to learn new technologies, ensuring that you stay current in your field and can grow your career effectively within our dynamic environment.

What technologies should a Site Reliability Engineer be familiar with at Vitesse?

A Site Reliability Engineer at Vitesse should be familiar with cloud technologies and services, particularly in Microsoft Azure. Knowledge of containerization tools like Docker, orchestration platforms such as Kubernetes, and tools for infrastructure automation like Terraform or Ansible is vital. You should also be well-versed in monitoring tools for system performance.

What is the working environment like for a Site Reliability Engineer at Vitesse?

The working environment for a Site Reliability Engineer at Vitesse is collaborative and fast-paced. You will thrive in a culture that emphasizes teamwork and transparency while managing various tasks and support requests. With hybrid work options and a focus on work-life balance, you'll have the flexibility to excel both in the office and remotely.

Common Interview Questions for Site Reliability Engineer
Can you explain what Site Reliability Engineering means to you and its importance?

Site Reliability Engineering is about ensuring systems are reliable and scalable by applying engineering principles to operations. Emphasize how it impacts customer satisfaction, system performance, and uptime, reflecting your understanding of its critical role in modern tech environments.

Describe your experience with cloud platforms like Azure or AWS.

Discuss specific projects where you used Azure or AWS, detailing your role in managing resources, deploying services, and ensuring reliability. Highlight any challenges faced and how you overcame them using best practices in cloud management.

What is Infrastructure as Code (IaC), and why is it important?

IaC allows you to manage infrastructure using code and automation, which increases consistency and efficiency. Discuss tools you’ve used, such as Terraform or Ansible, and give examples of how IaC has improved deployment processes in your previous roles.

How do you approach incident management and response?

Explain your systematic approach to incident management, including monitoring systems, escalation processes, and communication strategies. Share specific examples of incidents you handled and the outcomes, emphasizing your problem-solving skills.

What tools do you use for monitoring system performance?

Discuss the monitoring tools you’re familiar with, such as Prometheus or Grafana. Explain how you’ve implemented these tools to track performance metrics, identify bottlenecks, and ensure high availability, providing examples from your experience.

Can you describe a situation where you improved system reliability?

Share a specific project where your actions led to measurable improvements in reliability. Include the background of the situation, the steps you took, and the resulting benefits, showcasing your impact.

How do you handle scaling system capacity?

Articulate your process for assessing capacity needs, planning for growth, and implementing scalable solutions. Use real-life examples to demonstrate how you successfully anticipated demands or trends to maintain performance.

What role does automation play in your workflow?

Discuss how automation facilitates efficiency in deployments, monitoring, and incident response. Mention specific tools you use that streamline operations and allow you to focus on more strategic tasks.

How do you keep up with the latest technologies and industry trends?

Express your commitment to staying informed through various professional development methods like online courses, tech blogs, and networking with peers in the industry. Share resources you find valuable and any communities you participate in.

What qualities do you believe are essential in a Site Reliability Engineer?

Highlight qualities such as strong problem-solving skills, attention to detail, collaborative mindset, and adaptability to change. Discuss how these traits have contributed to your success in previous positions.


Vitesse PSP provides cross-border payment services to banks and businesses via a globally distributed settlement network. The company is based in the UK.

EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 5, 2025
