Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer - Data Platform image - Rise Careers
Job details

Site Reliability Engineer - Data Platform

Building the Future of Crypto 

Our Krakenites are a world-class team with crypto conviction, united by our desire to discover and unlock the potential of crypto and blockchain technology.

What makes us different?

Kraken is a mission-focused company rooted in crypto values. As a Krakenite, you’ll join us on our mission to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. For over a decade, Kraken’s focus on our mission and crypto ethos has attracted many of the most talented crypto experts in the world.

Before you apply, please read the Kraken Culture page to learn more about our internal culture, values, and mission. We also expect candidates to familiarize themselves with the Kraken app. Learn how to create a Kraken account here.

As a fully remote company, we have Krakenites in 70+ countries who speak over 50 languages. Krakenites are industry pioneers who develop premium crypto products for experienced traders, institutions, and newcomers to the space. Kraken is committed to industry-leading security, crypto education, and world-class client support through our products like Kraken Pro, Desktop, Wallet, and Kraken Futures.

Become a Krakenite and build the future of crypto!

Proof of work

The team

Join our Data Infrastructure team and play a pivotal role in upholding the reliability, scalability, and efficiency of our robust Data platform. As a Senior Site Reliability Engineer (SRE) specialized in Data Infrastructure, you will collaborate closely with diverse cross-functional teams to conceive, execute, and oversee the foundational data infrastructure that empowers our array of applications and services.

As a key member of our Data Infrastructure team, you will:

  • Design the data governance mechanisms that ensure our lakehouse is easy to interact with, secure and in compliance with all applicable regulations.

  • Implement the infrastructure we use to ingest our data, store it, catalog it with the right metadata and capture its lineage.

  • Provide a state-of-the-art suite of BI tools for multiple teams within the company.

  • Guarantee the availability, high performance, scalability and cost efficiency of our data platform.

Your proficiency in cloud technologies, infrastructure as code, automation, monitoring, logging, user and machine AuthNZ, and certificate management will be instrumental in upholding the exceptional operational standards we set for our services.

The opportunity

  • Implement data infrastructure solutions (self service)  that support the needs of 10+ business units and over 100 engineering and data analysts

  • Utilize Infrastructure as Code (IaC) principles to design, provision, and manage both on-premises and cloud (AWS) infrastructure components using tools such as Terraform

  • Develop and maintain automation scripts using bash/shell scripting and to automate operational tasks and deployments.

  • Enhance and manage CI/CD pipelines to facilitate consistent software deployments across the data infrastructure.

  • Implement robust data monitoring and alerting solutions to proactively detect anomalies and performance issues.

  • Manage and implement role-based access control (RBAC) and permissions for a multitude of user groups and machine workflows across different environments

  • Manage and maintain real-time streaming data architecture using technologies like Kafka and Debezium Change Data Capture (CDC).

  • Ensure the timely and accurate processing of streaming data, enabling data analysts and engineers to gain insights from up-to-date information.

  • Utilize Kubernetes to manage containerized applications within the data infrastructure, ensuring efficient deployment, scaling, and orchestration.

  • Implement effective incident response procedures and participate in on-call rotations.

  • Collaborate with data analysts, engineers, and cross-functional teams to understand requirements and implement appropriate solutions.

  • Document architecture, processes, and best practices to enable knowledge sharing and support continuous improvement.

  • Support AI/ML teams with their infra requests

Skills you should HODL

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).

  • Proven experience (5+ years) working as a Site Reliability Engineer, Infrastructure Engineer, or similar roles, with a focus on data infrastructure and security.

  • Experience with real-time data processing technologies, such as Kafka and Debezium

  • Working experience in managing  hybrid systems particularly AWS and (HashiCorp nice to have).

  • Infrastructure as Code tools such as Terraform, Terragrunt and Atlantis

  • Experience with containerization and orchestration tools, particularly Kubernetes and Docker

  • Solid understanding of bash/shell scripting and proficiency in at least one programming language (preferably Python or Rust).

  • Familiarity with CI/CD deployment pipelines and related tools.

  • Strong problem-solving skills and the ability to troubleshoot complex systems.

  • Experience with data-related technologies (databases, data lakes, airflow, spark) is a plus.

#LI-Remote #LI-TK1 #USCANBRUKEU

This job is accepting ongoing applications and there is no application deadline.

Please note, applicants are permitted to redact or remove information on their resume that identifies age, date of birth, or dates of attendance at or graduation from an educational institution.

We consider qualified applicants with criminal histories for employment on our team, assessing candidates in a manner consistent with the requirements of the San Francisco Fair Chance Ordinance.

Kraken is powered by people from around the world and we celebrate all Krakenites for their diverse talents, backgrounds, contributions and unique perspectives. We hire strictly based on merit, meaning we seek out the candidates with the right abilities, knowledge, and skills considered the most suitable for the job. We encourage you to apply for roles where you don't fully meet the listed requirements, especially if you're passionate or knowledgable about crypto!

As an equal opportunity employer, we don’t tolerate discrimination or harassment of any kind. Whether that’s based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws. 

Stay in the know

Follow us on Twitter

Learn on the Kraken Blog

Connect on LinkedIn

Kraken Glassdoor Company Review
4.5 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
Kraken DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Kraken
Kraken CEO photo
Unknown name
Approve of CEO

Average salary estimate

$150000 / YEARLY (est.)
min
max
$120000K
$180000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer - Data Platform, Kraken

Are you ready to join the innovative team at Kraken as a Senior Site Reliability Engineer - Data Platform? Here at Kraken, we’re on a mission to accelerate the global adoption of crypto, and we believe that the talents of dedicated individuals like you are key to achieving that goal. As part of our vibrant Data Infrastructure team, you’ll play an essential role in ensuring that our data platform is reliable, scalable, and efficient. You will work collaboratively with various cross-functional teams, designing and implementing robust data infrastructures that enhance our applications and services. By creating seamless data governance mechanisms and architecting solutions for data ingestion and storage, you’ll help us maintain the high availability and performance our users expect. Your expertise with cloud technologies, Infrastructure as Code, and real-time data processing will be pivotal in allowing our teams to access accurate, timely insights. If you’re passionate about transforming the future of crypto and have more than 5 years of experience in a Site Reliability Engineering role, we’d love to welcome you to our remote team of Krakenites from around the globe. From utilizing cutting-edge technologies like Kubernetes to automating CI/CD pipelines, this is an exciting opportunity to make a significant impact. Join us, and together we can build a future where everyone can achieve financial freedom and inclusion through crypto!

Frequently Asked Questions (FAQs) for Site Reliability Engineer - Data Platform Role at Kraken
What are the main responsibilities of a Senior Site Reliability Engineer - Data Platform at Kraken?

As a Senior Site Reliability Engineer - Data Platform at Kraken, you will be responsible for designing and implementing scalable data governance mechanisms, handling data ingestion, storage, and metadata management. Additionally, you will ensure high performance and availability of the data platform while supporting multiple business units and enhancing CI/CD pipelines.

Join Rise to see the full answer
What qualifications are required for a Senior Site Reliability Engineer - Data Platform at Kraken?

Kraken requires candidates for the Senior Site Reliability Engineer - Data Platform position to hold a Bachelor’s degree in Computer Science or a related field, along with over 5 years of experience in site reliability or infrastructure engineering roles. Proficiency in cloud technologies, Infrastructure as Code tools like Terraform, and real-time data processing technologies such as Kafka is essential.

Join Rise to see the full answer
What tools and technologies should a Senior Site Reliability Engineer - Data Platform be familiar with at Kraken?

Candidates should be well-versed in cloud platforms, particularly AWS, and must have strong skills in Infrastructure as Code using Terraform or similar tools. Experience with Kubernetes, Docker, bash/shell scripting, and data processing tools will also be beneficial for the role at Kraken.

Join Rise to see the full answer
What kind of work environment does Kraken offer for a Senior Site Reliability Engineer - Data Platform?

Kraken offers a fully remote work environment where you can connect with as many as 70+ countries while speaking multiple languages. The culture at Kraken promotes collaboration, continual learning, and a shared mission of accelerating global crypto adoption.

Join Rise to see the full answer
How does a Senior Site Reliability Engineer - Data Platform contribute to Kraken's mission?

The Senior Site Reliability Engineer - Data Platform helps maintain an efficient and reliable data infrastructure that supports Kraken’s crypto applications. By enhancing data access and availability, this role directly contributes to Kraken’s overarching goal of achieving financial freedom and inclusion for all.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer - Data Platform
Can you explain your experience with cloud technologies in relation to site reliability?

When answering this question, focus on specific examples of projects where you've utilized cloud technologies. Highlight your proficiency with platforms like AWS and any tools you’ve used, especially in a data infrastructure context.

Join Rise to see the full answer
How do you ensure the high availability of a data platform?

Discuss your experience with monitoring solutions, automated failovers, and load balancing techniques. It’s crucial to convey your proactive approach in identifying and resolving potential issues before they impact availability.

Join Rise to see the full answer
What strategies do you use for incident response and management?

Outline your approach to incident response, such as establishing clear protocols, analyzing incidents post-resolution to gather learning points, and your collaborative work with other teams to implement preventive measures.

Join Rise to see the full answer
What is Infrastructure as Code, and how have you implemented it?

Explain Infrastructure as Code (IaC) as a method of managing and provisioning infrastructure through code rather than manual processes. Share your experiences in using tools like Terraform to automate infrastructure deployments.

Join Rise to see the full answer
Describe your experience with real-time data processing technologies.

Talk about specific projects where you've utilized technologies like Kafka or Debezium. Explain how you’ve implemented them to ensure seamless data processing and timely insights for various stakeholders.

Join Rise to see the full answer
How do you approach continuous integration and continuous deployment (CI/CD)?

Discuss your familiarity with CI/CD pipelines, emphasizing any tools you’ve worked with. Share how you’ve enhanced deployment processes and ensured consistent quality across environments.

Join Rise to see the full answer
What role does monitoring play in your work as an SRE?

Emphasize the critical nature of monitoring in identifying performance bottlenecks and ensuring system reliability. Talk about the tools you’ve used for monitoring infrastructure and how they help in maintaining operational standards.

Join Rise to see the full answer
What challenges have you faced in a site reliability engineering role, and how did you overcome them?

Discuss a specific challenge you faced, the steps you took to address it, and the outcome. Focus on your problem-solving skills and any learning points for future improvement.

Join Rise to see the full answer
How do you prioritize tasks as a Site Reliability Engineer?

Explain your methods for task prioritization, such as the impact of issues on availability, user experience, and business needs. It’s important to convey your approach to balancing urgent issues with long-term improvements.

Join Rise to see the full answer
Can you provide an example of how you collaborated with cross-functional teams?

Share a specific situation in which you worked alongside engineers, data analysts, or business units to achieve a common goal. Explain your role in facilitating communication and driving towards successful project outcomes.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Kraken Remote No location specified
Posted 13 days ago
Photo of the Rise User
Kraken Remote No location specified
Posted 13 days ago
Photo of the Rise User
Posted 2 days ago
Photo of the Rise User
2Brains Remote Latinoamérica
Posted 2 days ago
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Brillio Remote Chicago, Illinois, United States
Posted 13 days ago
Photo of the Rise User
Posted 14 days ago
Photo of the Rise User
Vanta Remote No location specified
Posted 11 days ago
Inclusive & Diverse
Growth & Learning
Customer-Centric
Collaboration over Competition
Medical Insurance
Maternity Leave
Flex-Friendly
401K Matching
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
March 9, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!