Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer (SRE) image - Rise Careers
Job details

Site Reliability Engineer (SRE)

Company Description

BHFT is a proprietary algorithmic trading firm. Our team manages the full trading cycle, from software development to creating and coding strategies and algorithms.
Our trading operations cover key exchanges. The firm trades across a broad range of asset classes, including equities, equity derivatives, options, commodity futures, rates futures, etc. We employ a diverse and growing array of algorithmic trading strategies, utilizing both High-Frequency Trading (HFT) and Medium-Frequency Trading (MFT) approaches.

Looking ahead, we are expanding into new markets and products. As a dynamic company, we continuously experiment with new markets, tools, and technologies.
We’ve got a team of 200+ professionals, with a strong emphasis on technology—70% are technical specialists in development, infrastructure, testing, and analytics spheres. The remaining part of the team supports our business operations, such as Risks, Compliance, Legal, Operations and more.

With a strong focus on innovation and performance, BHFT is actively expanding its presence in traditional financial markets. We value a results-driven culture, emphasizing collaboration, transparency, and constant improvement, all while offering the flexibility of remote work and a globally distributed team.

Job Description

We are looking for a Site Reliability Engineer who will be responsible for ensuring the reliable operation of our platform, working with metrics to improve production process efficiency, and participating in testing new product versions.

Responsibilities:

  • Production Stability Management: Ensure continuous compliance with external regulatory requirements and internal standards, including risk, security, technology, and trader needs. Support and automate validation and monitoring processes for adherence to necessary standards.

  • Incident Monitoring & Management: Develop and improve monitoring and alerting systems to detect anomalies in key production metrics. Implement rapid response mechanisms and efficient solutions to maintain strategy performance.

  • Release & Change Management: enforce standards for managing releases and changes to minimize deployment risks. Implement strict acceptance testing for all releases.

  • Process Management: Develop and maintain Standard Operating Procedures (SOPs) for the team, manage task queues, and organize shift schedules to ensure continuous support and high availability of trading strategies.

  • Integration Projects: Lead initiatives to connect with new exchanges, brokers, and trading platforms, ensuring smooth and secure service integration.

  • Technical Performance Optimization: Continuously improve system availability, resilience (MTTR, MTBF), and latency reduction while optimizing data exchange performance and order routing to maximize profitability.

Qualifications

Requirements:

  • Deep understanding of trading processes and market microstructure, including colocation trading on native exchange protocols and algorithmic trading.
  • Experience in monitoring, alerting systems, and incident management for high-load environments.
  • Knowledge of regulatory compliance and security standards.
  • Proficiency in monitoring and incident management tools such as Grafana, ClickHouse, Prometheus, Opsgenie, Grafana OnCall, PagerDuty, etc.
  • Experience developing and managing SOPs and KPIs for service teams.
  • Experience managing integration projects with brokers and exchanges.

Strong technical skill set, including:

  • Linux systems administration and optimization.
  • TCP/UDP multicast networking.
  • FIX-based and native exchange protocols
  • Colocation infrastructure setup and management.
  • Python scripting for automation and monitoring.
  • English proficiency at C1 level or higher.
What You Should Know About Site Reliability Engineer (SRE), BHFT

Join our innovative team at BHFT as a Site Reliability Engineer (SRE) and play a pivotal role in the seamless operation of our trading platform right from the heart of Dubai! At BHFT, we're not just about algorithmic trading; we're about optimizing the entire trading cycle with the most cutting-edge strategies and technologies. As an SRE, you will focus on production stability management while ensuring compliance with both internal standards and external regulations. Your expertise will shine as you develop monitoring systems to quickly identify anomalies in our production metrics, streamlining incident management as well as release management. You'll lead projects integrating with new exchanges and brokers, optimizing our technical performance, and enhancing system resilience. Your work will directly contribute to maximizing profitability by improving data exchange performance and order routing strategies. Here at BHFT, we encourage a collaborative culture that emphasizes continuous improvement and innovation within a flexible, remote work environment. We're excited to grow our competitive team of tech specialists, and we're looking for passionate individuals ready to make a significant impact!

Frequently Asked Questions (FAQs) for Site Reliability Engineer (SRE) Role at BHFT
What are the main responsibilities of a Site Reliability Engineer at BHFT?

As a Site Reliability Engineer (SRE) at BHFT, your main responsibilities will revolve around ensuring the stability and reliability of our trading platform. This includes managing production stability, developing incident monitoring systems, and enforcing standards for release and change management. You will also lead integration projects with exchanges and brokers, alongside optimizing system performance and resilience.

Join Rise to see the full answer
What qualifications are required for the Site Reliability Engineer position at BHFT?

To excel as a Site Reliability Engineer at BHFT, candidates should possess a deep knowledge of trading processes and market microstructure, experience in high-load monitoring and incident management, and proficiency in industry-standard tools. Additionally, strong technical skills, especially in Linux systems administration, networking, and Python scripting for automation, are essential.

Join Rise to see the full answer
How does BHFT support continuous improvement in the Site Reliability Engineer role?

BHFT encourages a culture of continuous improvement for its Site Reliability Engineers by promoting collaboration, transparency, and experimentation with new technologies and processes. SREs play a key role in developing and maintaining Standard Operating Procedures, ensuring the team can efficiently operate and innovate within high-pressure environments.

Join Rise to see the full answer
What tools and technologies does a Site Reliability Engineer at BHFT work with?

As a Site Reliability Engineer (SRE) at BHFT, you’ll utilize a wide array of tools to manage incident monitoring and alerting, including Grafana, Prometheus, Opsgenie, and more. Your experience with TCP/UDP protocols and various exchange API integrations will also be valuable in ensuring streamlined operations across multiple trading platforms.

Join Rise to see the full answer
What is the work environment like for a Site Reliability Engineer at BHFT?

BHFT fosters a dynamic work environment for its Site Reliability Engineers, emphasizing flexibility, collaboration, and remote work opportunities. You will join a diverse team of over 200 professionals dedicated to pushing the boundaries of algorithmic trading while actively participating in an innovative culture that embraces constant improvement.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer (SRE)
Can you describe your experience with incident management in high-load environments?

When answering this question, highlight specific tools you've used, such as Grafana or PagerDuty, and provide examples of how you've developed monitoring systems or resolved incidents to maintain service continuity under pressure.

Join Rise to see the full answer
How do you ensure the compliance of production operations with internal standards?

Discuss your familiarity with regulatory compliance protocols and how you've automated validation processes. Include specific actions you've taken to align operations with both internal and external requirements in previous roles.

Join Rise to see the full answer
What approaches do you take to optimize system performance and latency?

Focus on examples where you improved system resilience or reduced latencies—mention any specific methodologies or metrics you've employed and how they contributed to the overall success of past projects.

Join Rise to see the full answer
How do you handle deployments and changes in a sensitive trading environment?

Reflect on your experience with change management frameworks, describing how you've enforced testing protocols and minimized risks during deployments based on past situations, ensuring that you maintain strategy performance and operational integrity.

Join Rise to see the full answer
What steps do you take to integrate new exchanges or brokers into existing systems?

Detail the process you typically follow for integration projects, emphasizing collaboration with stakeholders and techniques you employ to ensure successful and secure connections with new systems while maintaining business continuity.

Join Rise to see the full answer
Discuss a challenging production incident you managed. What was your approach?

Share a specific example that showcases your problem-solving and leadership skills during a high-pressure situation. Outline the steps you took during the incident, the outcome, and any lessons learned that improved future incident response.

Join Rise to see the full answer
What experience do you have with automation in your previous SRE roles?

Talk about the automation tools you've used and the processes you've been able to automate. Share examples of how automation led to improved efficiency and reliability in your past operations and how it may apply to BHFT.

Join Rise to see the full answer
How do you stay informed about the latest trends and technologies in Site Reliability Engineering?

Describe your approach to ongoing learning—whether through online courses, attending conferences, subscribing to industry publications, or participating in professional networks. Mention any relevant communities or forums you engage with.

Join Rise to see the full answer
What metrics do you consider crucial for monitoring system health?

Identify specific metrics that are important for trade systems and SRE operations, such as MTTR (Mean Time To Recovery), MTBF (Mean Time Between Failures), and system latency. Explain why these metrics matter in a trading context.

Join Rise to see the full answer
How would you describe the ideal work culture for an SRE like you?

Articulate your values around collaboration, transparency, and innovation, and explain how these elements not only improve team dynamics but also enhance the functionality and reliability of the trading systems you support.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Deel Remote São Paulo
Posted 2 days ago
Inclusive & Diverse
Collaboration over Competition
Fast-Paced
Growth & Learning
Empathetic
Photo of the Rise User
Posted 11 days ago
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Learning & Development
Equity
Paid Holidays
Paid Time-Off
WFH Reimbursements
Child Care stipend
Maternity Leave
Paternity Leave
Photo of the Rise User
Turing Remote Remote - India
Posted 8 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Photo of the Rise User
Tomorrow Water Remote No location specified
Posted 2 days ago
Photo of the Rise User
Vast Hybrid Long Beach, California, United States
Posted 14 days ago
Photo of the Rise User
OmniVision Hybrid No location specified
Posted 5 days ago
Photo of the Rise User
Transdev Hybrid No location specified
Posted 5 days ago
Photo of the Rise User
Posted 7 days ago

Innovative algorithmic trading company

7 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
March 19, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Cleveland just viewed Junior Data Analyst at Arkana Laboratories
Photo of the Rise User
Someone from OH, Cleveland just viewed BI Analyst, Junior at Emi Labs
Photo of the Rise User
Someone from OH, Cleveland just viewed Data Analyst at Qloo
Photo of the Rise User
Someone from OH, Bellbrook just viewed Accounting Co-Op (Part-Time) at Avery Dennison
Photo of the Rise User
Someone from OH, Cincinnati just viewed Senior Compliance officer (AML) at Visa
E
Someone from OH, North Ridgeville just viewed Call Center Representative, Nexa Healthcare at EverService
Photo of the Rise User
Someone from OH, Solon just viewed Senior Technical writer at BlackStone eIT
Photo of the Rise User
Someone from OH, Cleveland just viewed Amazon Expediting Fleet Specialist at MSX International
R
Someone from OH, Cincinnati just viewed Sales development representative at Remote Recruitment
Photo of the Rise User
Someone from OH, Cincinnati just viewed Laboratory Technologist I - 2nd Shift at Eurofins
Photo of the Rise User
Someone from OH, Independence just viewed Analyst - Customer Master Data at AECOM
Photo of the Rise User
33 people applied to REMOTE Sr Piping Designer at Kelly
Photo of the Rise User
Someone from OH, Mount Vernon just viewed Assistant Buyer - Nursery. 12 Months FTC at The Very Group
Photo of the Rise User
15 people applied to Internship summer 2025 at Boeing
Photo of the Rise User
Someone from OH, Fairborn just viewed Marketing Project Manager at MasterClass
Photo of the Rise User
Someone from OH, Fairborn just viewed (US) Associate Project Manager, Marketing at PointClickCare
Photo of the Rise User
Someone from OH, Willoughby just viewed 2024 Accounting & Finance Intern at Lincoln Electric
Photo of the Rise User
Someone from OH, Dayton just viewed Researcher at NielsenIQ
Photo of the Rise User
Someone from OH, Dayton just viewed Consumer Insights Researcher at NielsenIQ
Photo of the Rise User
Someone from OH, Morrow just viewed Junior IT Systems Administrator at NFQ
Photo of the Rise User
Someone from OH, Cleveland just viewed Automation Specialist - East Region at Jacobs
Photo of the Rise User
6 people applied to Assembly Mechanic at Boeing
Photo of the Rise User
10 people applied to GIS Specialist II at AECOM