Letโ€™s get started
By clicking โ€˜Nextโ€™, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer image - Rise Careers
Job details

Site Reliability Engineer

Come on board with Neo Group! Here's your chance to stir things up in the scene with us. We're not just expanding; we're revolutionising the entire game, mastering profitability with every new venture. But you know what truly fuels our drive? It's people like you. Join us as we embark on a journey to redefine gaming on a global scale.

Neo Group is on the lookout for a Site Reliability Engineer to join our PMO Department.

Key Responsibilities:

  • Maintain and enhance monitoring and logging infrastructure.
  • Improve observability processes and implement predictive failure analysis.
  • Optimize alerting systems: reduce noise, fine-tune critical metrics.
  • Define key monitoring parameters and enhance visibility.
  • Support and improve both cloud-based and on-premise environments.
  • Automate processes and configuration management using Infrastructure as Code (IaC) principles.
  • Train and mentor 24/7 App Support staff.
  • Develop Runbooks, documentation, and troubleshooting guides.
  • Analyze incidents, identify patterns, and drive proactive monitoring improvements.
  • Establish and support the Monitoring & Diagnostics group within App Support.
  • Develop intelligent troubleshooting instructions for faster incident resolution.
  • Optimize existing monitoring by reducing unnecessary alerts and adding meaningful metrics.
  • Enhance reliability through structured incident management and post-mortem analysis.
  • Implement GitOps best practices for managing infrastructure and configuration.
  • Advanced Linux user with strong command-line and diagnostic skills.
  • 4+ years of experience as an SRE/Monitoring Engineer.
  • Strong understanding of monitoring, logging, and observability in production environments.
  • Experience optimizing alerting systems and implementing predictive analytics.
  • Hands-on experience managing both cloud and on-premise solutions.
  • Automation skills using Python or Go.
  • Proficiency with configuration management tools (Ansible, Terraform).
  • Solid grasp of networking principles and protocols.
  • Understanding of information security principles.
  • Experience with CI/CD pipelines (GitLab, Jenkins).
  • Familiarity with orchestrators (Kubernetes, Rancher).
  • Experience documenting workflows and training support teams.
  • Ability to create intelligent troubleshooting instructions.
  • Skills in incident analysis and pattern recognition.

Nice to Have:

  • Experience working with high-load systems.
  • Deep understanding of APM tools (New Relic, Datadog, etc.).
  • Database and message queue performance tuning.
  • Advanced knowledge of ML-driven monitoring and predictive analysis.
  • Experience with automated incident response (self-healing systems).

Soft Skills:

  • Responsibility, initiative, and strong analytical thinking.
  • Ability to collaborate effectively within a team.
  • Focus on automation and process improvement.
  • Strong documentation and knowledge-sharing skills.
  • Capability to diagnose complex incidents and provide actionable insights.
  • Enjoy 5 paid health days per year for those unforeseen sick days or medical appointments.
  • Recharge your batteries with 25 paid calendar vacation days annually to explore, relax, and rejuvenate.
  • Rest easy with comprehensive medical insurance coverage for employees.
  • Stay active and healthy with a monthly sports allowance of $30 net to support your fitness goals.
  • Enhance your language skills with English lessons facilitated by our two experienced tutors.
  • Stay ahead in your field with access to conferences and professional literature to fuel your growth.
  • Boost your energy and morale with complimentary snacks available in the office.
  • Foster camaraderie and celebrate achievements through engaging in corporate events throughout the year.
Neo Group Glassdoor Company Review
2.9 Glassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star iconGlassdoor star icon
Neo Group DE&I Review
3.0 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star iconGlassdoor star icon
CEO of Neo Group
Neo Group CEO photo
Unknown name
Approve of CEO

Average salary estimate

$100000 / YEARLY (est.)
min
max
$80000K
$120000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, Neo Group

Come on board with Neo Group! As a Site Reliability Engineer, you're not just joining a company; you're stepping into a revolution that's redefining gaming on a global scale. At Neo Group, we're all about mastering profitability and pushing the envelope with innovative solutions. Your role will be instrumental in maintaining and enhancing our monitoring and logging infrastructure, ensuring that our systems are not only reliable but also optimized for peak performance. You'll have the opportunity to dive deep into observability processes, implementing predictive failure analysis and fine-tuning alerting systems to minimize noise while maximizing critical metrics. The work you do will directly influence our cloud-based and on-premise environments and the efficiency at which they operate. As you support the Monday to Sunday App Support staff, you'll also mentor team members, creating runbooks and troubleshooting guides that make a real difference. If you have a passion for automation, enjoy structuring incident management, and thrive in a collaborative environment focused on continuous improvement, Neo Group is the place for you. With comprehensive benefits, including paid health days, vacation days, and professional development opportunities, we value your well-being and growth as part of our dynamic team. Join us to make a significant impact and help us reach new heights together!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at Neo Group
What are the main responsibilities of a Site Reliability Engineer at Neo Group?

As a Site Reliability Engineer at Neo Group, you will maintain and enhance monitoring and logging infrastructure, improve observability processes, and implement predictive failure analysis. Your responsibilities will also include optimizing alerting systems, supporting both cloud-based and on-premise environments, and automating processes using Infrastructure as Code principles.

Join Rise to see the full answer
What qualifications are required for the Site Reliability Engineer position at Neo Group?

To be a successful Site Reliability Engineer at Neo Group, you should have over 4 years of experience in a similar role, a strong understanding of monitoring and observability in production environments, and proficiency in automation using Python or Go. Familiarity with configuration management tools like Ansible or Terraform and a comprehensive understanding of CI/CD pipelines is also crucial.

Join Rise to see the full answer
How does Neo Group support the professional development of its Site Reliability Engineers?

At Neo Group, we prioritize the growth of our Site Reliability Engineers by providing access to conferences, professional literature, and continuous learning opportunities. Additionally, our mentorship programs foster skill development and knowledge sharing among team members.

Join Rise to see the full answer
What benefits does Neo Group offer to its Site Reliability Engineers?

Neo Group offers an array of benefits to its Site Reliability Engineers, including 25 paid vacation days, 5 paid health days, comprehensive medical insurance, a monthly sports allowance, and access to English lessons with experienced tutors to enhance your communication skills.

Join Rise to see the full answer
What tools and technologies do Site Reliability Engineers at Neo Group commonly use?

Site Reliability Engineers at Neo Group work with a variety of tools and technologies, including GitOps practices for infrastructure management, monitoring tools such as New Relic and Datadog, and orchestration solutions like Kubernetes and Rancher to manage and optimize production environments.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer
What strategies do you use to improve system reliability as a Site Reliability Engineer?

To improve system reliability, I focus on implementing comprehensive monitoring solutions, refining alerting mechanisms to minimize noise, and conducting regular post-mortem analyses to learn from incidents. This proactive approach helps in identifying potential issues before they impact users.

Join Rise to see the full answer
Can you describe a situation where you had to troubleshoot a critical incident?

In a previous role, I dealt with a critical incident where a database outage occurred. By quickly analyzing logs and diagnostics, I identified the root cause and communicated effectively with stakeholders. I implemented a fix that not only resolved the issue but also improved our monitoring for future incidents.

Join Rise to see the full answer
How do you prioritize alerts in a production environment?

I prioritize alerts based on severity and impact on users. True indicators of system health are separated from noise through a well-thought-out process, which allows us to address genuine issues without being overwhelmed by non-critical alerts.

Join Rise to see the full answer
What is your experience with Infrastructure as Code, and why is it important?

I've worked extensively with Infrastructure as Code using tools like Terraform and Ansible. It ensures consistency, reduces manual errors, and makes infrastructure management efficient, allowing us to spin up environments quickly and reliably without traditional overhead.

Join Rise to see the full answer
Describe your approach to incident management.

My approach to incident management involves defining clear workflows, conducting thorough post-incident reviews, and establishing knowledge-sharing sessions with the team. This not only aids in immediate recovery but also empowers the team to prevent similar incidents in the future.

Join Rise to see the full answer
How do you handle automation in infrastructure management?

I leverage automation to handle repetitive tasks, such as deployments and scaling, using CI/CD pipelines and configuration management tools. This boosts efficiency and allows the team to focus on more strategic tasks.

Join Rise to see the full answer
What is your experience with monitoring tools?

I have experience with various monitoring tools, like Datadog and New Relic. These tools are crucial for collecting metrics and logs that provide insight into system performance, allowing us to take proactive measures to ensure uptime.

Join Rise to see the full answer
How do you ensure effective collaboration with software developers and other teams?

I maintain open communication channels, schedule regular check-ins, and create documentation that outlines integrations and dependencies. This fosters a culture of collaboration and ensures everyone is aligned on objectives.

Join Rise to see the full answer
Can you explain a complex technical concept in simple terms?

Absolutely! For instance, I often explain load balancing as directing traffic on a busy road. Just like a traffic light helps manage the flow of cars to avoid jams, load balancers distribute incoming network traffic across multiple servers to ensure no single server gets overwhelmed.

Join Rise to see the full answer
What do you see as the future trends in Site Reliability Engineering?

I believe the future trends will shift towards more automated incident response, increased integration of machine learning for predictive analytics, and enhanced focus on observability to better understand system performance and user experience.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Neo Group Remote No location specified
Posted 2 days ago

Join Neo Group as a CRM Junior Business Analyst to help revolutionize customer relationship management while enjoying a dynamic, fully remote work environment.

Photo of the Rise User
Neo Group Remote No location specified
Posted 8 days ago

As a Crypto Trading Analyst, you'll have a significant impact on cryptocurrency trading strategies while being part of a dynamic team.

Posted 11 days ago

Join Vantage Data Centers as a Critical Facilities Engineer, where you'll be pivotal in maintaining our industry-leading data center infrastructure.

Stoke Space Hybrid Kent, Washington, United States
Posted 13 days ago

Stoke is recruiting engineering interns eager to solve complex challenges in the aerospace sector.

Photo of the Rise User
Posted 3 days ago

Boeing is looking for a Lead Manufacturing Technology Analyst to join their Berkeley, MO team and play a pivotal role in advancing manufacturing capabilities.

Photo of the Rise User

Join FTI as a Sr. Engineer M&S Systems Performance to develop cutting-edge simulation environments in energy technologies.

tt Hybrid Chicago, IL, USA
Posted 11 days ago

Join Thornton Tomasetti as a Structural Engineer and play a key role in enhancing and renovating existing structures for a more resilient future.

Become an integral part of General Dynamics Mission Systems as an Entry Level Systems Engineer, where you'll contribute to innovative solutions in a collaborative environment.

Photo of the Rise User
Intel Remote India, Bangalore
Posted 7 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Customer-Centric
Snacks
Onsite Gym
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Learning & Development
Paid Time-Off
401K Matching
Maternity Leave
Paternity Leave

Seeking an experienced Analog Design Engineer at Intel to drive innovation in high-speed serial link IP design.

Photo of the Rise User

Join Ernest, a dynamic construction company, as a skilled electrical contractor for a fixed fee project in Palm Bay.

MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
April 11, 2025

Subscribe to Rise newsletter

Risa star ๐Ÿ”ฎ Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Steubenville just viewed Legal & Compliance Internship at Smiths Group
Photo of the Rise User
Someone from OH, Warren just viewed Senior Front-End Developer at Worldly
Photo of the Rise User
Someone from OH, Tiffin just viewed Game Operations Specialist at Genius Sports
u
Someone from OH, Loveland just viewed Customer Service Agent - Part Time at uhaul
Photo of the Rise User
Someone from OH, Cleveland just viewed HR Manager at Shearer's Foods
Photo of the Rise User
Someone from OH, Columbus just viewed Mid Level, System Administrator - (ETS) at Delivery Hero
Photo of the Rise User
Someone from OH, Mason just viewed Inside Sales Co-Op at VEGA Americas
Photo of the Rise User
Someone from OH, Sandusky just viewed Director of IT at Kyo
Photo of the Rise User
Someone from OH, Delaware just viewed Practice Group Manager at LifeStance Health
Photo of the Rise User
6 people applied to Machinist Apprentice at LLNL
Photo of the Rise User
Someone from OH, Avon Lake just viewed Advancement Specialist at Sierra Club
Photo of the Rise User
Someone from OH, Sidney just viewed Database Engineer Principal at Sagent
Photo of the Rise User
Someone from OH, North Canton just viewed Manager, Customer Success at impact.com
Photo of the Rise User
Someone from OH, Columbus just viewed Customer Experience Representative at MYOB
Photo of the Rise User
Someone from OH, Lakewood just viewed Production Scheduling Supervisor at Shearer's Foods