Job details

Site Reliability Engineer

prosource.it is a global IT Managed Service provider working with medium to enterprise-level global clients. We are looking for a Site Reliability Engineer who is interested in joining a global, enterprise-level team that delivers technical solutions to our internal business partners to drive processes and meet business requirements.

 

We understand that we need exceptional talent to accomplish our mission - therefore we place great emphasis on the people component of IT, and we strive constantly to attract, develop, and retain the best people. We cultivate an ethos and environment within which our people are focused, nurtured, and continually challenged to develop and improve their competencies in a fun and rewarding culture. 

OVERVIEW:

We are seeking a seasoned Site Reliability Engineer (SRE) to join our team. The ideal candidate will have extensive experience in AWS infrastructure and a strong focus on security and reliability. This role requires a proactive individual with a strong bias for action who can both recommend and implement solutions to ensure the stability and security of our systems, and who can guide us through the SOC 2 compliance process while performing the necessary work themselves.

  • SOC 2 Compliance: The successful candidate will lead the team through its first SOC 2 compliance process and carry out the work involved: getting the digital estate ready, working with the SOC 2 auditors, and remediating findings as they come up.
  • AWS Infrastructure Management: Manage and optimize AWS services, ensuring high availability, reliability and efficiency.
  • System Monitoring and Automation: Develop and implement monitoring solutions to detect and address system issues proactively. Automate critical recovery processes to minimize downtime.
  • Incident Management: Respond to and resolve incidents quickly and effectively, ensuring minimal downtime and user impact.
  • System Design: Participate in system design and architecture to ensure scalability and resilience.
  • Security Focus: Identify and mitigate security vulnerabilities within the infrastructure. Ensure compliance with security best practices.
  • Tooling and Scripting: Utilize tooling to monitor and maintain infrastructure. Create scripts (Bash, AWS CLI, etc.) where needed for system interrogation, monitoring, and automation.
  • Disaster Recovery: Design and test disaster recovery plans to ensure data integrity and system availability.
  • Documentation: Maintain meticulous documentation of systems, processes, and configurations.

Qualifications:

  • Experience: Minimum of 5 years in a similar role, with at least two different organizations. Experience in well-established companies is preferred.
  • AWS Expertise: Proven experience with AWS services and infrastructure management. Familiarity with AWS security tools and best practices.
  • Security and Compliance: Strong background in security operations (SecOps) and experience with compliance frameworks such as SOC 2.
  • Scripting and Automation: Proficiency in scripting languages and automation tools. Ability to write and maintain scripts for system management and monitoring.
  • Problem-Solving Skills: Strong analytical skills to identify and resolve system issues. Ability to prioritize and address critical components.
  • Communication: Excellent communication skills to collaborate with team members and stakeholders. Ability to explain technical concepts to non-technical audiences.

Additional Skills (Nice to have):

  • CI/CD Pipelines: Knowledge of continuous integration and continuous deployment (CI/CD) processes and tools.
  • System Integration: Experience with integrating various systems and tools to create a cohesive infrastructure.
  • Monitoring Tools: Familiarity with monitoring tools such as CloudWatch and with visualizing metrics in Grafana.
  • Kubernetes and Containers: Experience with container management and orchestration.
  • Performance Tuning: Analyze system performance, identify bottlenecks, and implement optimizations to improve efficiency and speed.
  • Capacity Planning: Plan infrastructure for future capacity needs and ensure that systems can handle anticipated workloads.

EMPLOYMENT DETAILS

 

Location: Remote (USA based), with visits to Denver, CO twice per quarter

Model: Full-time, 40+ hours/week, working in the Mountain Time Zone

Start Date: 1st May 2025

Engagement: W2 Salary (exempt from OT)

Salary: $110K-$130K, depending on experience

We provide all our full-time staff members with an exceptional benefits package, including medical, dental, vision, long-term disability, short-term disability, 401k contribution, paid holidays, and PTO.

Applicants for employment in the US must have work authorization that does not, now or in the future, require sponsorship of a visa for employment authorization in the United States. Applicants are also expected to provide references upon request.

Average salary estimate

$120,000 / yearly (est.)
Min: $110,000
Max: $130,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, ProSource.it - Americas

At prosource.it, we are on the lookout for a talented Site Reliability Engineer to join our dynamic team and help us deliver cutting-edge technical solutions for our internal business partners. If you thrive in a collaborative, energetic environment and possess a strong background in AWS infrastructure management with a keen focus on security and reliability, then this may be the perfect opportunity for you! As a Site Reliability Engineer, you'll play an integral role in guiding us through the SOC 2 compliance process, leading us to ensure our digital estate is secure and efficient. Your responsibilities will include managing AWS services, developing proactive monitoring solutions, and automating recovery processes to minimize downtime. You will also participate in system design for scalability and resilience, and you will have the chance to make a real impact by identifying and mitigating security vulnerabilities. We believe in cultivating an engaging atmosphere where your problem-solving skills and innovative spirit can flourish; we are committed to your personal and professional growth. If you're looking to take your career to new heights while enjoying a fun, stimulating workplace culture, we can't wait to meet you!

Frequently Asked Questions (FAQs) for Site Reliability Engineer Role at ProSource.it - Americas
What are the responsibilities of a Site Reliability Engineer at prosource.it?

As a Site Reliability Engineer at prosource.it, you will be responsible for managing AWS infrastructure, leading the SOC 2 compliance process, developing monitoring solutions, automating recovery processes, and ensuring the security and reliability of our systems. You will also handle incident management, participate in system design, and maintain documentation to support operational excellence.

What qualifications are required for the Site Reliability Engineer position at prosource.it?

To qualify for the Site Reliability Engineer role at prosource.it, you should have a minimum of 5 years of relevant experience, particularly in AWS infrastructure management and security operations. Experience with compliance frameworks like SOC 2 is essential, alongside strong scripting skills and the ability to automate monitoring and system management processes.

What skills are essential for success as a Site Reliability Engineer at prosource.it?

Key skills for a successful Site Reliability Engineer at prosource.it include strong analytical and problem-solving abilities, expertise in AWS services and security best practices, and proficiency in scripting languages. Excellent communication skills to work well with team members and stakeholders are also essential, enabling you to convey complex technical concepts to non-technical audiences.

How does prosource.it support the professional development of its Site Reliability Engineers?

prosource.it is committed to the growth of its Site Reliability Engineers by providing a nurturing environment where team members are continually challenged and encouraged to develop their competencies. This includes access to training resources, collaboration opportunities, and participation in industry events to stay current with the latest technologies and best practices.

What is the work model for the Site Reliability Engineer position at prosource.it?

The Site Reliability Engineer position at prosource.it is a remote role based in the USA, with an expectation of working 40+ hours per week in the Mountain Time Zone. You will also have opportunities for in-person collaboration with your team through visits to Denver, CO twice per quarter.

Common Interview Questions for Site Reliability Engineer
Can you describe your experience with AWS infrastructure management?

In answering this question, focus on specific AWS services you have used, detailing your expertise in managing resources, optimizing costs, and ensuring high availability. Give examples of projects where you successfully utilized AWS to support business needs.

What steps would you take to ensure SOC 2 compliance?

Explain your approach to leading the SOC 2 compliance process, including assessing the current infrastructure, identifying gaps, collaborating with auditors, and addressing vulnerabilities. Highlight any past experiences where you have successfully navigated compliance initiatives.

How do you approach incident management?

Describe your incident management process, including the tools and methodologies you use for detection, response, and recovery. Emphasize how you prioritize incidents and communicate effectively with stakeholders during downtime.

What automated solutions have you implemented in your previous roles?

Share examples of successful automation efforts you have undertaken, such as scripting tasks in Bash or Python that enhanced efficiency, reduced manual errors, and improved system reliability. Focus on specific tools and outcomes.

Can you discuss a challenging technical problem you solved as part of a team?

Think of a challenging situation involving system outages or performance issues, and detail how you collaborated with your team to diagnose and resolve the problem. Highlight your problem-solving skills and how you improve processes based on lessons learned.

How do you stay current with industry best practices regarding security?

Discuss your strategies for staying informed about security trends and best practices, such as following industry blogs, attending conferences, or participating in online forums. Share resources that you find valuable.

What tools do you prefer for system monitoring, and why?

Provide details on the monitoring tools you have experience with, such as CloudWatch or Grafana, explaining why you value them in your work. Discuss how effective monitoring contributes to system reliability.

How do you handle cross-departmental communication about technical concepts?

Emphasize your communication skills, providing examples of how you’ve successfully explained complex topics to non-technical stakeholders, ensuring they fully understand the issues at hand and the necessary resolutions.

What best practices do you follow for disaster recovery planning?

Outline the key components of effective disaster recovery planning, including regular tests of recovery processes, thorough documentation, and stakeholder communication. Sharing specific examples will reinforce your expertise.

What do you consider when evaluating system performance?

Discuss the metrics you analyze when assessing system performance—such as response times, throughput, and resource utilization. Share your approach to identifying bottlenecks and how you implement optimizations based on your findings.

Employment type: Full-time, remote
Date posted: April 11, 2025
