Crisis Text Line is looking for a Senior Infrastructure Site Reliability Engineer to design, implement, and maintain cloud infrastructure to ensure optimal performance, availability, and security. The role involves leading and maintaining scalable infrastructure, mentoring team members, and driving continuous improvements in platform reliability.
Sign up for our
weekly newsletter
of fresh jobs
Skills
AWS Fargate
CloudWatch
Infrastructure as Code (IaC)
DevOps
Python/Bash/PowerShell scripting
Container orchestration (Kubernetes/Amazon ECS)
Responsibilities
Lead and maintain highly available, scalable, and secure infrastructure on AWS Fargate.
Design and maintain CloudWatch alerting and monitoring configurations for issue resolution.
Mentor junior team members and promote excellence in infrastructure management.
Collaborate on infrastructure as code, CI/CD, and SRE methodologies.
Lead incident response, troubleshoot system issues, and implement preventive measures.
Automate tasks for operational efficiency and cost reduction.
Conduct performance tuning and optimization of infrastructure components.
Stay updated on emerging technologies to drive innovation.
Education
Bachelor's degree in Computer Science, Engineering, or related field (Master's preferred)
AWS certifications (preferred)
Benefits
20 paid holidays
Flexible paid time off
Medical, dental, and vision benefits
403B retirement plan
12 weeks paid parental leave
Student loan repayment
Family support and stipends/allowances
Professional and wellness development support
To read the complete job description, please click on the ‘Apply’ button