Sign up for our
weekly
newsletter
of fresh jobs
Site Reliability Engineer job at Mastech Digital. Arizona.
POSITION: IT - Technology Engineer Associate...
LOCATION: Phoenix, AZ (3 days a week in office)
DURATION: 5+ months(High Possible to extend/Convert)
Pay: $38/hr on W2
Visa: Only US Citizen, GC and GC EAD
Required Knowledge, Skills, and Abilities
• Experienced in analysis, proactive troubleshooting, and performance troubleshooting for the NAS storage environment including the creation of documentation for new process and procedures
• In-depth review of all replication relationships across the NetApp storage arrays
• Ability to identify and resolve any issues with the replication relationships across the NetApp storage arrays for daily clones, mirrors, snaps and vaults, including re-activation and configuration
• Working knowledge of Windows Server, Unix/Linux, Solaris & AIX platforms
• Knowledge of computer networking to include DNS and Firewall
• Working knowledge of GF Operational Support
• ITIL Fundamentals
Shift:
M-F daylight - 0700-1530 EST
Introduction
• Customer Focused - Knowledgeable of the values and practices that align customer needs and satisfaction as primary considerations in all business decisions and able to leverage that information in creating customized customer solutions.
• Managing Risk - Assessing and effectively managing all of the risks associated with their business objectives and activities to ensure they adhere to and support Client's Enterprise Risk Management Framework.
Job Description
Candidate should possess skills that are aligned to the Site Reliability Engineering (SRE) principles with a focus on the discipline specific to this position. SRE is a software engineering approach to Client SRC Operations. SRE professionals use software as a tool to manage systems, solve problems, and automate tasks. Engineers’ focus specializes in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues and improving tools, automation, processes, and software.
Responsibilities | SRE
• Monitor systems and infrastructure to maintain operational and performance levels
• Rotational on-call responsibilities
• Work closely with other SRC professionals/engineers when issues arise, collaborate on troubleshooting, and provide consultation/resolution with events/incidents
• Anticipate potential problems before they become impacting and collaborate to determine solutions
• Gather and analyze metrics from tools and system/application logs to assist in performance tuning, fault finding, and resolution
• Create sustainable systems and services through automation, processes enhancement, tools, and noise reduction
• Build automation to manage the SRC operations and eliminate/minimize manual functions and toil
• Collaborate with Application/Infrastructure support engineers and operations teams
• Engage in post-incident reviews for improvements and determining the cause to prevent recurrence
• Document work to turn findings into repeatable actions via knowledge articles
• Mentoring and coaching junior engineers
Responsibilities | Discipline
• Analyze and proactively troubleshoot the NAS environment
• Address all NAS related events and incidents that occur during the coverage window
• In-depth review of all replication relationships across the NetApp storage arrays
• Comprehensive review and validation of daily clones, mirrors, snaps and vaults including re-activation and configuration
• Careful review of the environment for non-critical or repetitive warning symptoms possibly indicative of future issues
Soft Skills
• Ability to handle pressure and/or stressful situations
• Capable of balancing multiple projects
• Ability to quickly learn and adapt to testing and support requirements for non-production work, including creating documentation for new process and procedures
• Strong problem resolution skills including the ability to drive problem and problem bridges
• Strong skills addressing production critical incidents
• Strong troubleshooting and problem-solving skills, with the ability to analyze and resolve complex technical issues
• Excellent communication and interpersonal skills, with the ability to collaborate effectively with stakeholders at all levels
• Self-motivated and able to work independently or as part of a team, taking ownership of tasks and driving them to completion
Education / Experience
• Bachelor’s degree in Engineering, Computer Science, or related field required (or equivalent experience)
• 2 + years of experience supporting large enterprise NAS operations
Interview- video, 2 rounds if needed