Job details

Sr Site Reliability Engineer

Your Impact
The primary purpose of this role is to run the production environment by monitoring availability and taking a holistic view of system health. This includes building software and systems to manage platform infrastructure and applications to improve the reliability and quality of our suite of software solutions. This role provides primary operational support and engineering for multiple large, distributed software applications.

What You'll Do
Run the production environment by monitoring availability and taking a holistic view of system health.
Build software and systems to manage platform infrastructure and applications.
Improve reliability, quality, and time-to-market of our suite of software solutions.
Measure and optimize system performance, with an eye toward pushing capabilities forward.
Provide primary operational support and engineering for multiple large, distributed software applications.
Improve reliability and quality, and reduce MTTR.
Participate in system design consulting, platform management, capacity planning, and cost analysis.
Gather and analyze metrics from applications and services to assist in performance tuning and fault finding.
Contribute to capacity planning, demand forecasting, software performance analysis, and systems tuning.
Develop and implement monitoring, observability, and alerting tools such as dashboards and logging systems to understand the health and availability of our infrastructure and applications.
Collect and analyze information from distributed systems into simple views of the technology portfolio to identify trends and spot stability threats.
Monitor application availability, latency, and overall system health.
Develop self-service solutions to help increase productivity by removing toil and reducing unnecessary roadblocks.
Resolve technical issues in production, learn to mitigate them quickly, and find ways to prevent them.
Document every action so lessons learned turn into repeatable actions and then into automation.
Triage, analyze, and provide solutions to critical & high-priority technical issues occurring in the ecosystem, and optimize incident management processes.
Respond, react & communicate as per the ITSM incident management process, including detection of the incident, timely communication to leadership, service restoration, and root cause analysis.
Drive blameless postmortem culture.
Regularly review key site technical metrics such as transaction errors, logging, response times, caching strategies, conversion/bounce rates, capacity & resource utilization.

Qualifications
Proficiency in one or more scripting languages such as Bash, Python, Go, etc.
Experience with Kubernetes or equivalent.
Software development experience in Java, C, C++.
Experience with containers and container orchestrators - Docker, Kubernetes.
Ability to debug and fix system/infrastructure and application issues.
Experience working with monitoring tools such as Prometheus, Google Stackdriver, etc.
Experience with databases (SQL or NoSQL).
Experience with log analysis and building dashboards.
Retail knowledge is a plus.

Where You'll Be
Associates are required to relocate to the Charlotte region to foster collaboration and facilitate improved testing and support.
Lowe's supports a Flex Office concept where in-person work is required two days per week at the Charlotte Tech Hub.
Most business meetings are planned around the Eastern time zone.

About Lowe's
Lowe's Companies, Inc. (NYSE: LOW) is a FORTUNE 50 home improvement company serving approximately 16 million customer transactions a week in the United States. With total fiscal year 2023 sales of more than $86 billion, Lowe's operates over 1,700 home improvement stores and employs approximately 300,000 associates. Based in Mooresville, N.C., Lowe's supports the communities it serves through programs focused on creating safe, affordable housing and helping to develop the next generation of skilled trade experts. For more information, visit Lowes.com.

Lowe's is an equal opportunity employer and administers all personnel practices without regard to race, color, religious creed, sex, gender, age, ancestry, national origin, mental or physical disability or medical condition, sexual orientation, gender identity or expression, marital status, military or veteran status, genetic information, or any other category protected under federal, state, or local law.

Pay Range: $92,300.00 - $175,400.00 annually. Starting rate of pay may vary based on factors including, but not limited to, position offered, location, education, training, and/or experience. For information regarding our benefit programs and eligibility, please visit this link.

Founded in 1999, CyberCoders is built on a success-oriented culture. Above all, we know both candidates and clients want quality, and they want it now. We are fast, and we work with integrity. We work hard, play hard, and believe in always having ...

380 jobs
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
August 1, 2024

Other jobs
Company
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Paid Holidays
Company
Posted 26 days ago