Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer - Americas image - Rise Careers
Job details

Site Reliability Engineer - Americas

BforeAI is an innovative and rapidly expanding scale-up dedicated to deterring cybercrime through cutting-edge predictive and pre-emptive technologies. We harness the power of prescriptive AI to revolutionize the way we tackle cyber threats, particularly in the realm of brand protection.

Named by Gartner in 26 reports over the last 2 years, BforeAI is the industry’s fastest, most accurate solution for automated protection against online fraud.

We are like weather forecasts for cyber threats. Join us in the fight for a safer cyberspace!

✨ What’s cool about this job

As an SRE at BforeAI, you will be a critical part of our technology team, responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. Your expertise in Kubernetes, Networking, Security, Cloud environments  and database optimization will be essential for maintaining our high-traffic, data-intensive systems. 

Please note, this job can be anywhere in Americas - we have to select a country for job boards.

📣 What you’ll be doing

  • Architect, deploy, and manage Kubernetes clusters, ensuring high availability, scalability, and reliability to meet organizational demands.
  • Drive performance improvements for database systems through advanced query optimization, indexing strategies, and efficient caching mechanisms.
  • Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, or equivalent technologies to enable consistent, automated, and scalable deployments.
  • Implement and manage robust monitoring and alerting systems to proactively maintain system health and ensure optimal performance.
  • Enforce cloud environment best practices for security, access control, and compliance with regulatory standards.
  • Establish, maintain and be responsible for our Incident management procedures.
  • Partner with engineering teams to support their infrastructure needs, ensuring alignment with SRE practices and system requirements.
  • Make sure our infrastructure and products are resilient and recoverable by establishing and maintaining resiliency and recovery best practices and procedures.
  • Establish and maintain SRE best practices and remove any blocker to enable the reliability of the system.
  • Create and maintain detailed documentation for configurations, processes, and procedures to ensure transparency and knowledge sharing across teams.

💥 You’ll be a great fit if 

  • You have 8+ years of experience in SRE, system administration, or similar roles.
  • You are an expert in Kubernetes, including hands-on experience in cluster setup, management, and maintenance with certifications such as Certified Kubernetes Administrator (CKA) and/or certified Kubernetes Security Specialist (CKSS).
  • You are proficient in database performance optimization and administration such as PostgreSQL, MySQL, or similar.
  • You have experience with Infrastructure as Code (IaC) tools such as Terraform (with certification like HashiCorp Terraform Certification), Ansible, or similar.
  • You have experience with monitoring and logging tools such as Splunk, Prometheus, Grafana, Datadog, ELK, Logstash, Fluentd, etc.).
  • You have experience with Incident response tools such as PagerDuty, OpsGenie, etc.
  • You have experience with cloud platforms, such as AWS, Azure, or GCP, ideally supported by an architect-level certification from at least one provider.
  • You have experience in secrets management tools such as Hashicorp Vault, CyberArk Conjur, AWS Secrets manager, etc.
  • You have strong problem-solving and troubleshooting skills.
  • You are a strong communicator with the ability to collaborate across multi-disciplinary global teams.
  • You have RHCSA (Red Hat Certified System Administrator) and/or RHCE (Red Hat Certified Engineer) certification.

Don't meet every single requirement? Don't count yourself out just yet. Studies show some individuals are less likely to apply to jobs unless they meet every qualification. At BforeAI, we're dedicated to building a diverse workplace based on merit, work ethics, and character, and we believe everyone deserves a fair shot at success!

If you're excited about this role but your past experience doesn't align perfectly with every qualification, we hope you’ll still consider applying!

We use an Employee of Record service to facilitate seamless global hiring processes and offer benefits tailored to the country where you will be working! For countries not supported by our EOR partner, talk to us about being a contractor. In all cases, you will need to be authorized to work in the country you’re based in.

We offer a compensation package up to $110,000 USD per year in CTC (Cost to Company). Cost to Company represents our total investment, which includes all benefits and employer contributions. The final take-home pay will differ due to local tax regulations, selected benefits, and mandatory deductions. The actual offer will be based on the role level, skills, and experience of the candidate. Our compensation structure is thoughtfully designed to align with the expertise and impact potential of each individual.

🚀 Why it’s great to work here

We are a location independent company – no physical office required – and we operate as a fully distributed team. We deeply believe in the value of diversity and inclusivity within our workplace, understanding that these principles lead to a happier team and ultimately a superior product. We offer an intellectually stimulating company environment and you’ll be working with a bright, dedicated team from across the globe. 

If you possess a high level of autonomy and self-organization, and feel you can thrive at BforeAI, we’d love to hear from you! 

💡 Want to know more about BforeAI? 

What You Should Know About Site Reliability Engineer - Americas, BforeAI

Join BforeAI as a Site Reliability Engineer in the Americas, where you'll play a pivotal role in our mission to combat cybercrime using advanced predictive technologies. At BforeAI, we are committed to redefining the landscape of online protection with our prescriptive AI solutions, which have earned us recognition from Gartner over 26 times in the past two years. As a key member of our technology team, you'll ensure the reliability, scalability, and performance of our cloud infrastructure and applications. Your strong expertise in Kubernetes, networking, security, cloud environments, and database optimization will help us maintain our high-traffic systems. Your day-to-day will involve architecting and managing Kubernetes clusters, optimizing database performance, and developing Infrastructure as Code to ensure seamless deployments. You’ll implement robust monitoring and alerting systems, enforce cloud best practices, and collaborate with engineering teams to meet their infrastructure needs. You’ll have the flexibility to work remotely from anywhere in the Americas, and your contributions will help establish best practices that maintain system reliability and resilience. If you’re an experienced SRE with a passion for cyber defense, we can’t wait for you to join our diverse and talented team dedicated to making the internet a safer place!

Frequently Asked Questions (FAQs) for Site Reliability Engineer - Americas Role at BforeAI
What are the primary responsibilities of a Site Reliability Engineer at BforeAI?

As a Site Reliability Engineer at BforeAI, your key responsibilities will include architecting and managing Kubernetes clusters, optimizing database performance, developing Infrastructure as Code, implementing monitoring solutions, and collaborating across teams to ensure system reliability. You'll also be responsible for incident management and documentation, helping to build a robust and efficient infrastructure.

Join Rise to see the full answer
What qualifications do I need to become a Site Reliability Engineer at BforeAI?

To qualify for the Site Reliability Engineer position at BforeAI, you should have over 8 years of experience in SRE or a similar role. Expertise in Kubernetes is essential, along with proficiency in database optimization and experience with IaC tools like Terraform. Certifications such as CKA and HashiCorp Terraform Certification are highly valued. Strong communication skills and a collaborative mindset are also crucial.

Join Rise to see the full answer
Is it necessary to meet all the qualifications to apply for the Site Reliability Engineer role at BforeAI?

At BforeAI, we encourage all qualified candidates to apply, even if they don’t meet every single qualification. We value merit, work ethic, and character, and believe that diverse backgrounds contribute to our success. If you have a passion for cyber defense and the willingness to learn, we would love to consider your application.

Join Rise to see the full answer
How does the remote work structure function for a Site Reliability Engineer at BforeAI?

BforeAI operates as a fully distributed team, which means our Site Reliability Engineers can work from anywhere in the Americas. This flexibility allows for a diverse and inclusive workplace, enabling you to balance work with personal commitments while still contributing effectively from your location.

Join Rise to see the full answer
What kind of support does BforeAI provide for professional development for Site Reliability Engineers?

BforeAI is committed to the professional growth of our Site Reliability Engineers. We offer access to ongoing training resources, encourage obtaining certifications, and foster an environment of mentorship where team members can share knowledge and skills. Your growth is important to us, as it aligns with our goal of empowering talented individuals in the fight against cybercrime.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer - Americas
Can you describe your experience with Kubernetes clusters?

When answering this question, highlight specific projects you've worked on involving Kubernetes. Discuss your role in setting up, managing, and maintaining clusters, and any relevant challenges you faced. Mention any certifications you hold, like CKA, to demonstrate your expertise.

Join Rise to see the full answer
How do you approach incident management and response?

In your response, emphasize your methodical approach to incident management. Discuss tools you've used like PagerDuty, your experience with setting up alerts, and how you prioritize incidents. Providing a specific example of a past incident and how you handled it will also help showcase your skills.

Join Rise to see the full answer
What strategies have you employed for database performance optimization?

When addressing this question, outline specific techniques you've implemented, such as query optimization, indexing, and caching strategies. It's helpful to share metrics of improvement where applicable to quantify your impact on database performance.

Join Rise to see the full answer
How do you ensure security and compliance in cloud environments?

Discuss your understanding of cloud security best practices, including the tools and measures you've utilized, such as identity management and regulatory compliance checks. Emphasizing your experience with secrets management tools like HashiCorp Vault can enhance your answer.

Join Rise to see the full answer
Can you explain your approach to Infrastructure as Code?

In your response, showcase your experience with IaC tools like Terraform or Ansible. Discuss how you use these technologies to automate deployments and manage infrastructure. Providing examples of how you've implemented IaC in previous roles will strengthen your answer.

Join Rise to see the full answer
How do you maintain system health and performance?

Outline your strategies for monitoring and logging systems. Discuss the tools you've used, such as Prometheus and Grafana, and how you've set up alerts to handle potential issues proactively. Specific examples of monitoring setups you've created will strengthen your response.

Join Rise to see the full answer
What is your experience with cloud platforms, and which do you prefer?

Share your experience with various cloud platforms like AWS, Azure, or GCP. Discuss why you prefer one over the others and how your experience aligns with what BforeAI uses. Highlight any architect-level certifications to show your proficiency.

Join Rise to see the full answer
How do you stay updated with the latest trends in site reliability engineering?

Discuss your methods for staying informed, such as following industry blogs, attending webinars, or participating in online forums. Mention any relevant communities or groups you're part of that help you connect with other SRE professionals.

Join Rise to see the full answer
What role does collaboration play in your work as an SRE?

Emphasize the importance of working with cross-functional teams. Discuss how collaboration enhances problem-solving and improves infrastructure alignment with development needs. Providing a specific example of a successful team project can illustrate this point well.

Join Rise to see the full answer
How would you handle an unexpected outage in production?

In addressing this, convey your systematic approach to problem-solving—identify the root cause, communicate with stakeholders, and implement a solution. Stress the importance of a post-mortem process for learning and future prevention.

Join Rise to see the full answer
Similar Jobs
BforeAI Remote No location specified
Posted yesterday
BforeAI Remote No location specified
Posted 23 hours ago
Photo of the Rise User
Posted 7 days ago
Photo of the Rise User
Posted 5 days ago
Photo of the Rise User
Posted 6 days ago
Passion for Exploration
Dare to be Different
Customer-Centric
Diversity of Opinions
Inclusive & Diverse
Photo of the Rise User
Posted 14 days ago
Photo of the Rise User
Posted 10 days ago
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
LOCATION
No info
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
December 25, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!