Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
SRE I image - Rise Careers
Job details

SRE I

Job Title: Site Reliability Engineer (Windows Server)

 

Who We Are:

Headquartered in New York City, Take-Two Interactive Software, Inc. is a leading developer, publisher, and marketer of interactive entertainment for consumers around the globe. The Company develops and publishes products principally through Rockstar Games, 2K, Private Division, and Zynga. Our products are currently designed for console gaming systems, PC, and Mobile, including smartphones and tablets, and are delivered through physical retail, digital download, online platforms, and cloud streaming services. The Company’s common stock is publicly traded on NASDAQ under the symbol TTWO.

While our offices (physical and virtual) are casual and inviting, we are deeply committed to our core tenets of creativity, innovation and efficiency, and individual and team development opportunities. Our industry and business are continually evolving and fast-paced, providing numerous opportunities to learn and hone your skills. We work hard, but we also like to have fun, and believe that we provide a great place to come to work each day to pursue your passions. 

 

The Challenge

SRE team serves as a centralised operations unit under the Technical Operation Centre (TOC), tasked with maintaining the health, availability, and reliability of our games and services. From a broader perspective, our primary mission is to ensure high uptime. As the first line of defence for all production issues, SREs take the lead in monitoring infrastructure and providing primary on-call support, ensuring a quick response to any incidents. We also play a critical role in emergency response, managing communication and coordination to resolve issues as efficiently as possible.In addition to these primary responsibilities, the SREs take a proactive approaches along with the NOC team to improving latency, performance, and efficiency across all services. Our work extends to capacity planning and optimization of systems at both the system and cloud levels, ensuring that services scale efficiently to meet the demands of our games. We don’t just respond to incidents; we continuously look for ways to enhance the performance and reliability of the infrastructure.Ultimately, SRE strives to achieve world-class uptime for all Take-Two products, working to reduce the frequency and impact of downtime while resolving issues promptly and comprehensively. With a focus on the entire production stack, we take a holistic approach to reliability engineering, ensuring that every layer—from the infrastructure to the application level—contributes to the best possible user experience.

 

What You’ll Take On

  • Windows Administration

    • Manage and maintain Windows servers, ensuring their stability, security, and performance.

  • CheckMK

    • Utilize CheckMK for comprehensive monitoring and alerting, ensuring all systems are functioning optimally.

  • Linux Administration

    • Diagnose and resolve issues on Linux systems, ensuring minimal downtime and maximum efficiency.

  • VMWare

    • Manage virtual environments using VMWare, ensuring resources are optimized and available.

  • vSan Understanding

    • Demonstrate a solid understanding of vSan for effective storage management and troubleshooting.

  • Cloud Administration

    • Administer and manage cloud services across AWS, Azure, Splunk, and GCP, ensuring seamless integration and operation.

  • Risk Assessment

    • Assess potential risks and impacts on game services and revenue, taking proactive measures to mitigate them.

  • Issue Identification

    • Identify issues, alerts, and critical service incidents using provided dashboards and monitoring tools.

  • Service Troubleshooting

    • Utilize studio playbooks to troubleshoot and diagnose basic issues across various services.

  • Communication

    • Relay accurate and timely information regarding service impacts to game studios, ensuring effective communication during incidents.

  • Incident Management

    • Spearhead outage management, including communication, triage, and escalation.

  • Daily On Call

    • Responsible for triaging and troubleshooting critical alerts form critical systems

What You Bring 

  • Experience:

    • Live Services Knowledge: Understanding of live services and their operational requirements.

    • Change/Crisis Management: Experience in managing change and crisis situations, ensuring minimal disruption to services.

    • Effective Communicator: Able to relay information accurately and timely to the game studio and other stakeholders.

    • Team Player: Works well in a collaborative environment, sharing knowledge and supporting team members.

  • Proactive Problem-Solving:

    • A commitment to continuous improvement and proactive issue resolution.

    • Proven experience in troubleshooting production problems affecting live services.

    • Able to identify potential issues before they become critical and manage details effectively.

  • Background:

    • At least 1 year of experience in a similar role and/or 3 years experience in a relevant role. 

Great to Have: 

  • Apply Advanced Knowledge: 

    • Utilize your broad understanding of principles, theories, and concepts in IT, integrating advanced knowledge from related fields.

    • Solve Complex Problems: Address diverse and moderately complex problems, using sound judgment to select the best methods and techniques.

    • Network and Collaborate: Engage with senior internal and external personnel to maximize the application of functional expertise.

  • Problem Solving:

    • Innovate Solutions: Develop and recommend solutions to tactical business issues, proactively identifying and addressing potential problems.

    • Lead with Expertise: Use your advanced knowledge to guide your team and drive effective solutions.

  • Decision Making:

    • Exercise Autonomy: Make decisions with considerable latitude, consulting with senior engineers or managers on complex issues and recommending solutions as necessary.

 

What We Offer You:

  • Great Company Culture. We pride ourselves as being one of the most creative and innovative places to work, creativity, innovation, efficiency, diversity and philanthropy are among the core tenets of our organization and are integral drivers of our continued success.
  • Growth: As a global entertainment company, we pride ourselves on creating environments where employees are encouraged to be themselves, inquisitive, collaborative and to grow within and around the company.
  • Work Hard, Enjoy Life. Our employees’ bond, blow-off steam, and flex some creative muscles – through our Office gaming spaces, company parties, game release events, monthly socials, and team challenges.
  • Benefits. Benefits include, but are not limited to; Discretionary bonus, Provident fund contributions, 1+5 medical insurance + top up options and access to Practo online Doctor consultation App, Employee assistance program, 3X CTC Life Assurance, 3X CTC Personal accident insurance, childcare services, 20 days holiday + statutory holidays,
  • Perks. Gym reimbursement up to INR1150 per month, charitable giving program, access to learning platforms, gaming events. 
 
Please be aware that Take-Two does not conduct job interviews or make job offers over third-party messaging apps such as Telegram, WhatsApp, or others. Take-Two also does not engage in any financial exchanges during the recruitment or onboarding process, and the Company will never ask a candidate for their personal or financial information over an app or other unofficial chat channel. Any attempt to do so may be the result of a scam or phishing exercise. Take-Two’s in-house recruitment team will only contact individuals through their official Company email addresses (i.e., via a take2games.com email domain). If you need to report an issue or otherwise have questions, please contact Careers@take2games.com.*
 
As an equal opportunity employer, Take-Two Interactive Software, Inc. (“Take-Two”) is committed to fostering and celebrating the diverse thoughts, cultures, and backgrounds of its talent, partners, and communities throughout its organization. Consistent with this commitment, Take-Two does not discriminate or retaliate against any employee or job applicant because of their race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, and genetic information (including family medical history), or on the basis of any other trait protected by applicable law. If you need to report a concern or have questions regarding Take-Two’s equal opportunity commitment, please contact Careers@take2games.com.
 
#LI-Hybrid

Average salary estimate

$37500 / YEARLY (est.)
min
max
$30000K
$45000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About SRE I, Take-Two Interactive Software, Inc.

Join Take-Two Interactive Software, Inc. in Bengaluru as a Site Reliability Engineer I! Here at Take-Two, we don't just create games; we cultivate a culture of creativity and innovation, setting the stage for some of the most exciting interactive entertainment products worldwide. As an SRE I, you’ll dive into the heart of our operations, maintaining the health and reliability of our games and services. Your main mission? To deliver world-class uptime and tackle any production issues head-on while ensuring a seamless user experience. You'll manage Windows and Linux servers, utilize tools like CheckMK for monitoring, and ensure our infrastructure is both stable and efficient. Collaborating closely with our NOC team, you’ll get to enhance system performance and participate in capacity planning in a fun, fast-paced environment. You'll also be the first point of contact for outages, coordinating communication and resolution to keep our gaming community happy. If you’re passionate about operational excellence and seek an opportunity in a company that values individual development and teamwork, this job could be your ideal next step. Not only will you grow your expertise in server management and cloud services across platforms like AWS and Azure, but you'll also be at the forefront of designing solutions that keep our services running smoothly. Ready to take on this challenge? Join us and be part of a team that plays hard while delivering top-tier gaming experiences!

Frequently Asked Questions (FAQs) for SRE I Role at Take-Two Interactive Software, Inc.
What are the main responsibilities of an SRE I at Take-Two Interactive?

As an SRE I at Take-Two Interactive, you'll be responsible for managing and maintaining Windows and Linux servers, ensuring their stability, security, and performance. Your duties will include monitoring system alerts using CheckMK, troubleshooting production issues, and participating in incident management. Ensuring high uptime for our games and services is your primary mission, along with enhancing system performance and implementing capacity planning strategies.

Join Rise to see the full answer
What qualifications are required for the SRE I position at Take-Two Interactive?

To be a successful candidate for the SRE I position at Take-Two Interactive, you should have at least 1 year of experience in a similar role or 3 years in a relevant field. Familiarity with both Windows and Linux server administration is essential, along with experience in cloud service management. Being an effective communicator and a proactive problem-solver will also be key qualities that will aid your success in this role.

Join Rise to see the full answer
How does Take-Two Interactive ensure a great workplace culture for SREs?

Take-Two Interactive promotes a vibrant workplace culture by fostering creativity, collaboration, and personal development. As an SRE I, you’ll enjoy access to various company events, game release celebrations, and team-building activities that enhance camaraderie among employees. The company also highly values diversity and philanthropy, making it an engaging environment for all team members.

Join Rise to see the full answer
What tools and technologies will SRE I's use at Take-Two Interactive?

As an SRE I at Take-Two Interactive, you'll be working with various tools and technologies, including CheckMK for monitoring and alerting, VMware for virtual environment management, and cloud platforms such as AWS, Azure, and GCP. Having a solid understanding of these technologies and applying them effectively will be crucial to ensuring optimal performance in your role.

Join Rise to see the full answer
What opportunities for growth exist for SREs at Take-Two Interactive?

Take-Two Interactive is committed to fostering employee growth and development. As an SRE I, you'll have numerous opportunities to expand your skills, engage in continuous learning, and take on more complex challenges as you progress in your career. The company encourages inquiry and collaboration, making it a great place for professional development in the field of reliability engineering.

Join Rise to see the full answer
Common Interview Questions for SRE I
What is your understanding of the role of an SRE in maintaining uptime?

An SRE plays a critical role in ensuring service uptime by effectively monitoring infrastructure, responding promptly to incidents, and proactively addressing potential issues to minimize downtime. When answering, illustrate your understanding by referencing strategies you would use in real-life scenarios.

Join Rise to see the full answer
How do you prioritize tasks during an incident?

Prioritization is vital when responding to incidents. A successful SRE identifies critical issues that could significantly impact service availability and addresses those first. Share your approach to assessing an incident’s severity and discuss any frameworks or tools you use for resolutions.

Join Rise to see the full answer
Can you describe a time you improved system performance?

Think back to a specific instance where you identified a performance bottleneck and the actions you took to improve it. Highlight your analytical skills, the tools used, and the outcomes of your efforts to provide a concrete example of your impact on system reliability.

Join Rise to see the full answer
How do you handle on-call disruptions?

On-call disruptions are inevitable in the SRE role. Demonstrate your ability to maintain composure under pressure, communicate effectively with your team, gather data quickly, and troubleshoot efficiently. Share a specific example of a challenge faced and how you resolved it.

Join Rise to see the full answer
What strategies do you employ for effective communication during incidents?

Effective communication during incidents is crucial. Outline your approach to providing updates to stakeholders while relaying accurate and timely information about service impacts. Mention how you document services and ensure all relevant parties are kept in the loop.

Join Rise to see the full answer
What is your experience with cloud services like AWS or Azure?

Share your hands-on experience with cloud services, including managing resources and optimizing configurations. Discuss any specific projects or implementations that showcase your ability to leverage cloud solutions to enhance service reliability.

Join Rise to see the full answer
How do you conduct risk assessments in a production environment?

Risk assessment involves analyzing potential vulnerabilities and assessing their impact on services. Describe your methodology for identifying risks, evaluating their potential effects, and the steps you take to implement mitigative measures to ensure service continuity.

Join Rise to see the full answer
What is your approach to continuous improvement in infrastructure?

Emphasize the importance of continuous improvement in maintaining and optimizing system performance. Share your approach to identifying areas for enhancement and any frameworks you commit to implementing improvements routinely.

Join Rise to see the full answer
Describe a challenge you faced in your previous role and how you overcame it.

Select a specific challenge relevant to site reliability engineering, detailing the context, your initial response, the steps you took to resolve it, and the positive outcome. Highlighting resilience and solution-oriented thinking will resonate well.

Join Rise to see the full answer
How familiar are you with using monitoring tools like CheckMK?

Discuss your experience with monitoring tools, particularly CheckMK. Share how you utilize these tools to track system performance, respond to alerts, and how they inform your proactive maintenance strategies.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Datadog Remote United States
Posted 7 days ago
Customer-Centric
Rapid Growth
Diversity of Opinions
Reward & Recognition
Friends Outside of Work
Inclusive & Diverse
Empathetic
Feedback Forward
Work/Life Harmony
Casual Dress Code
Startup Mindset
Collaboration over Competition
Fast-Paced
Growth & Learning
Open Door Policy
Rise from Within
Maternity Leave
Paternity Leave
Flex-Friendly
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off

Cigar Place seeks a talented Website Manager to oversee and optimize their Magento-based e-commerce website, ensuring top-notch user engagement and functionality.

Photo of the Rise User
Posted 7 days ago

Lawrence Livermore National Laboratory is looking for a Virtual Desktop Engineer to enhance security and reliability within our IT Solutions team.

Photo of the Rise User
Advansys Remote No location specified
Posted 22 hours ago

Join Advansys as a Security Administrator Officer to help secure innovative technology solutions utilized by diverse enterprise clients.

Photo of the Rise User
Posted 10 days ago

As a Strategic Implementation Architect at Lumin Digital, you'll drive the evolution of our online banking solutions through strategic configuration and implementation excellence.

Photo of the Rise User

Join Spektrum as a Cloud Service Management Analyst and play a pivotal role in advancing NATO's IT services through innovative cloud solutions.

Photo of the Rise User

Join MPS as an IT & Infrastructure Architect to shape and secure our cloud operations in a fast-paced SaaS environment.

Photo of the Rise User
Feld Entertainment Hybrid Ellenton, Florida - E-Verify
Posted 11 days ago

Feld Entertainment is on the lookout for a Director of Enterprise Systems to spearhead the digital transformation of their enterprise platforms, ensuring their systems empower strategic success.

SynapsIQ Remote United States
Posted 6 days ago

The company is looking for a knowledgeable Switch Engineer to oversee port activations and troubleshoot network configurations remotely.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
February 21, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Perrysburg just viewed Sourcing Leader, Minerals & Cullet at Owens Corning
Photo of the Rise User
Someone from OH, North Royalton just viewed Remote AI Voice Trainer (High-Quality Microphone Required) at Datadog
C
Someone from OH, Akron just viewed Phlebotomy Technician - Outpatient at CCF
Photo of the Rise User
Someone from OH, Solon just viewed Graphic Designer at Applause
Photo of the Rise User
Someone from OH, North Canton just viewed NodeJs developer at BlackStone eIT
Photo of the Rise User
Someone from OH, North Canton just viewed Software Development Engineer - Recent Grads Welcome at Sonos
Photo of the Rise User
16 people applied to SOC Analyst I at CBIZ
Photo of the Rise User
Someone from OH, Dayton just viewed Data Entry and Word Processing at MoxieIT
Photo of the Rise User
Someone from OH, Dayton just viewed Content Developer - Intern at Big Ideas Learning
Photo of the Rise User
Someone from OH, Pickerington just viewed Salesforce Lead at Bounteous
Photo of the Rise User
Someone from OH, Pickerington just viewed Industry Lead - High Tech (Salesforce) at Thunder
D
Someone from OH, Akron just viewed Junior Motion Designer at DEPT®
R
Someone from OH, Akron just viewed 2D Graphic and Motion Designer at Ruby Labs