Sign up for our
weekly
newsletter
of fresh jobs
Description:
Looking for highly motivated System Administrator/DevOps Engineers to design, develop and implement a global dynamic, innovative Service Reliability Operations Center (known as Mission Control), to provide extraordinary levels of support for our Cloud products and services. AS a key member or the Mission Control team, you will partner with other key members of our organization... including Site Reliability Engineering, Security Operations Center, DevOps teams, and other partners to help make our services capable of providing near 100% availability.
What you will be doing:
- The team will provide their services 24/7 with a follow-the-sun environment which will span continents
- You will report to a Manager in the US
- Each team member will need to work Sunday-Wednesday or Wednesday-Saturday (10 hr shifts) which means no on call will be required
- The heart of Mission Control will be monitoring and running a growing production compute and storage environments
- Every team member will use alerts and alarms to help prevent issues and incidents when possible
- Perform systems administration tasks, network administration tasks, security incident monitoring to drive our actions
Mission control team members will work with developers to learn how the service works, then translate that understanding into runbooks which the entire team will use. As new features and functionality are added, you will also update and evolve the runbooks as needed
- Help discover incidents and issues, including initiating the incident management procedure
- Bring in subject matter authorities or service owners as needed to resolve issues. Feedback will help us continually improve our service
- Your interpersonal skills will help keep the team engaged through resolution and ensure our clients believe we value their time and effort
Skills:
Linux, Ansible, DNS, SRE, bare metal, Aws
Top Skills Details:
Linux,Ansible,DNS,SRE,bare metal
Additional Skills & Qualifications:
Experience using monitoring tools and problem ticketing systems
Shell scripting, automation, DHCP, storage concepts, basic networking, IP tables, etc.
RHCE or equivalent of knowledge
Experience scripting in python
Basic understanding of Git
Experience Level:
Intermediate Level
About TEKsystems:
We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.
The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law