Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
DevOps SRE Engineer image - Rise Careers
Job details

DevOps SRE Engineer

This role is for one of the Weekday's clients

We are looking for a proactive DevOps Engineer with a strong focus on automation, resilience, and stability. In this role, you will play a key part in ensuring system reliability and robustness while enhancing development efficiency. In the short term, you will drive improvements to developer experience and system uptime, while in the long term, you will help establish best practices in MLOps, Infrastructure as Code (IaC), and advanced database management.

Key Responsibilities

System Resilience, Stability & Developer Experience

  • System Resilience & Stability
    • Monitor and enhance system resilience across multi-cloud environments (AWS, Azure, GCP).
    • Develop incident response, disaster recovery, and failover strategies to minimize downtime.
    • Deploy automation and monitoring tools to detect and resolve performance issues proactively.
  • Developer Experience
    • Optimize CI/CD pipelines using GitLab to streamline deployments and improve efficiency.
    • Collaborate with development teams to identify and resolve pain points within the software development lifecycle (SDLC).
    • Implement automated testing frameworks to maintain high code quality.

MLOps, Infrastructure as Code & Database Management

  • MLOps Initiatives
    • Implement efficient MLOps practices for model development, deployment, and retraining.
    • Evaluate frameworks that support technologies such as Azure OpenAI models.
  • Infrastructure as Code (IaC) & Cloud Automation
    • Design and implement IaC solutions using Terraform or CloudFormation for consistent cloud resource management.
    • Automate routine tasks to ensure secure, scalable, and reproducible environments.
  • Database Management
    • Manage and scale MongoDB for optimized NoSQL operations.
    • Maintain Qdrant to support vector search and ML-driven data operations.
    • Ensure automation, monitoring, and alignment of database solutions with application requirements.

Containerization & Orchestration

  • Utilize Docker and Kubernetes to containerize applications and manage scalable, resilient deployments.
  • Collaborate with development teams to design and refine microservices architectures.

Monitoring, Logging & Quality Assurance

  • Establish robust monitoring, logging, and alerting systems to proactively address issues.
  • Analyze operational metrics to drive continuous improvements in reliability and system stability.
  • Champion best practices in operational resilience and automation.

Collaboration & Best Practices

  • Work closely with development teams to foster a stability-focused culture.
  • Document processes, architectures, and best practices for effective communication and onboarding.
  • Advocate for DevOps best practices, ensuring security, automation, and scalability are prioritized.

What You Should Know About DevOps SRE Engineer, Weekday

Are you ready to step into the exciting world of DevOps as a Site Reliability Engineer? In this role at Weekday's client, you will be the backbone of not just reliability but also automation and stability within multi-cloud environments like AWS, Azure, and GCP. Your journey begins with enhancing developer experiences by optimizing CI/CD pipelines in GitLab and streamlining deployments. You’ll tackle system resilience head-on by creating strategies for incident response and disaster recovery while deploying cutting-edge automation tools that proactively resolve performance issues. But that's just the start! Long-term, you’ll dive deep into MLOps practices, ensuring efficient model deployment and retraining while managing infrastructure through Infrastructure as Code (IaC) using tools like Terraform or CloudFormation. Additionally, your expertise will shine through as you manage MongoDB for NoSQL operations and maintain Qdrant for ML-driven data, all while championing a culture of continuous improvement and operational excellence. You’ll also be instrumental in deploying containerization with Docker and Kubernetes to manage scalable applications. This isn’t just a job; it’s an opportunity to contribute to the future of technology and establish best practices that prioritize security and scalability. If you’re excited about making a meaningful impact in a dynamic tech environment, we’d love to hear from you!

Frequently Asked Questions (FAQs) for DevOps SRE Engineer Role at Weekday
What are the key responsibilities of a DevOps SRE Engineer at Weekday's client?

A DevOps SRE Engineer at Weekday's client will focus on ensuring system reliability and stability. Key responsibilities include monitoring and optimizing multi-cloud environments, developing robust incident response strategies, enhancing developer experience by optimizing CI/CD pipelines, and implementing automated testing frameworks. You'll be responsible for MLOps practices, Infrastructure as Code (IaC) with Terraform or CloudFormation, and managing database solutions like MongoDB and Qdrant to support operational requirements.

Join Rise to see the full answer
What qualifications are needed for the DevOps SRE Engineer role at Weekday's client?

To be considered for the DevOps SRE Engineer position at Weekday's client, candidates should have a strong background in systems engineering and experience with cloud platforms such as AWS, Azure, and GCP. Proficiency in tools like GitLab for CI/CD, Terraform for IaC, and containerization technologies like Docker and Kubernetes is essential. Experience with MLOps frameworks and database management, especially MongoDB, will also give candidates an advantage in this role.

Join Rise to see the full answer
How does a DevOps SRE Engineer at Weekday's client improve system resilience?

A DevOps SRE Engineer at Weekday's client enhances system resilience by monitoring performance across multi-cloud environments and designing incident response and disaster recovery strategies that minimize downtime. The engineer will also deploy automation and monitoring tools to proactively identify and resolve issues before they impact system stability, ensuring that services remain operational and resilient.

Join Rise to see the full answer
Why is collaboration important for a DevOps SRE Engineer at Weekday's client?

Collaboration is key for a DevOps SRE Engineer at Weekday's client because it fosters a culture dedicated to stability and efficiency. The engineer will work closely with development teams to identify pain points in the software development lifecycle and advocate for best practices in automation and operational resilience. Documenting processes and sharing knowledge will also be crucial for effective communication and onboarding.

Join Rise to see the full answer
What technologies will a DevOps SRE Engineer be working with at Weekday's client?

In the role of DevOps SRE Engineer at Weekday's client, you'll work with a range of modern technologies, including cloud platforms like AWS, Azure, and GCP, automation tools for Infrastructure as Code such as Terraform and CloudFormation, CI/CD pipelines through GitLab, and container orchestration with Docker and Kubernetes. Additionally, you'll manage databases like MongoDB and implement MLOps practices to support the development and deployment of machine learning models.

Join Rise to see the full answer
Common Interview Questions for DevOps SRE Engineer
Can you explain what Infrastructure as Code means in the context of a DevOps SRE Engineer?

Infrastructure as Code (IaC) is a crucial concept for a DevOps SRE Engineer, as it allows for the management and provisioning of infrastructure using code rather than manual processes. When preparing for this question, discuss tools like Terraform and CloudFormation that enable automated resource management, ensuring consistency, and reducing configuration drift. Highlight how IaC contributes to scalability, security, and quick recovery during incidents.

Join Rise to see the full answer
How do you approach incident response planning as a DevOps SRE Engineer?

When answering this question, outline your method for assessing potential incidents and defining recovery strategies. Discuss the importance of creating runbooks, setting up monitoring and alerting systems, and conducting regular drills to ensure readiness. Emphasize that effective incident response minimizes downtime and maintains system stability, two key responsibilities of a DevOps SRE Engineer.

Join Rise to see the full answer
What experience do you have with containerization technologies like Docker and Kubernetes?

For this question, detail your hands-on experiences with Docker for containerizing applications and Kubernetes for orchestrating container deployments. Explain how these technologies can improve scalability and resilience in applications, and provide examples from past projects where you've effectively used them to manage deployment or upgrade processes.

Join Rise to see the full answer
How would you optimize CI/CD pipelines in a DevOps role?

In response to this question, talk about identifying bottlenecks in the existing CI/CD process, implementing parallel execution of deployments, and integrating automated testing frameworks. Discuss tools you’ve used, such as GitLab, and provide specific examples where your optimizations resulted in shorter deployment times and improved developer productivity.

Join Rise to see the full answer
Can you describe a challenge you faced in ensuring system stability?

Provide a specific example of a challenge, such as a system outage or performance issue, and explain how you diagnosed the problem. Discuss the steps you took to resolve it, including how you implemented monitoring solutions or made architectural changes to enhance resilience. Emphasize the importance of learning from challenges to improve future system stability.

Join Rise to see the full answer
What role does automation play in the DevOps SRE Engineer position?

When discussing automation, highlight its critical role in reducing manual processes, improving developer experience, and enhancing system resilience. Provide examples of how you've used automation tools to streamline repetitive tasks, such as automating deployments or implementing monitoring solutions, to ensure proactive management of system performance.

Join Rise to see the full answer
How do you monitor and improve system reliability?

For this question, emphasize the importance of establishing key performance indicators (KPIs) to monitor system reliability. Discuss your experience in setting up logging, monitoring, and alerting systems to track performance metrics, and how analyzing these metrics leads to informed decisions for system improvements, thus driving continuous reliability enhancement.

Join Rise to see the full answer
Describe your experience with database management as a DevOps SRE Engineer.

Discuss the types of databases you’ve managed, such as MongoDB and how you optimized their performance in a cloud environment. Highlight your experience in scaling databases, ensuring operational alignment with applications, and implementing automated backup and recovery strategies. Provide any metrics that illustrate the impact of your database management skills.

Join Rise to see the full answer
What is your strategy for engaging with development teams in a DevOps environment?

Engagement with development teams is crucial for a DevOps SRE Engineer. Explain your approach to fostering collaboration by regularly communicating about challenges and updates, conducting joint planning sessions, and gathering feedback. Share how you’ve implemented practices that enhance developer experience, such as automating testing and deployment processes.

Join Rise to see the full answer
How would you implement MLOps practices in your role?

In response to this question, provide a well-rounded explanation of MLOps, focusing on the entire lifecycle of machine learning development, from model training to deployment. Discuss your strategies for automating model retraining and evaluation and how effective collaboration with data scientists can streamline the MLOps workflow, ensuring reliability and performance of AI-driven applications.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Weekday Remote No location specified
Posted 2 days ago
Photo of the Rise User
Weekday Remote No location specified
Posted 2 days ago
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Posted 7 days ago
Photo of the Rise User
Posted 9 days ago
Photo of the Rise User
Pepperstone Remote No location specified
Posted yesterday
Photo of the Rise User
Posted 9 days ago
Photo of the Rise User
Posted 3 days ago
Photo of the Rise User
Merge API Hybrid San Francisco, CA
Posted 7 days ago

Founded in 2002, Weekday currently ships to 97 online markets and has stores in 14 countries, offering a unique retail experience and a carefully curated mix of external brands, limited edition collaborations and a carefully curated selection of s...

55 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
March 14, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Cleveland just viewed Accounting Co-Op (Part-Time) at Avery Dennison
Photo of the Rise User
Someone from OH, North Ridgeville just viewed Product Manager at ShiftCare
Photo of the Rise User
Someone from OH, North Ridgeville just viewed Product Operations at Binance
Photo of the Rise User
Someone from OH, Mentor just viewed Sales & Service Lead - Pinecrest at Alo Yoga
Photo of the Rise User
8 people applied to Excel Developer at Valcre
Photo of the Rise User
Someone from OH, Mason just viewed Marketing & Communications Intern at Per Scholas
Photo of the Rise User
Someone from OH, Lakewood just viewed Recruiter (Talent Sourcing), 6 month contract at Jerry
Photo of the Rise User
Someone from OH, Westerville just viewed Director Change Management at Discover