Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Staff Site Reliability Engineer, Security Engineering Group image - Rise Careers
Job details

Staff Site Reliability Engineer, Security Engineering Group

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth. 

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences. 

Join our team! We’re building a world where Identity belongs to you.

Okta’s Workforce Identity Cloud Security Engineering group is looking for an experienced and passionate Staff Site Reliability Engineer to join a team focused on designing and developing Security solutions to harden our cloud infrastructure. We embrace innovation and pave the way to transform bright ideas into excellent security solutions that help run large-scale, critical infrastructure. We encourage you to prescribe defense-in-depth measures, industry security standards and enforce the principle of least privilege to help take our Security posture to the next level. Our Infrastructure Security team has a niche skill-set that balances Security domain expertise with the ability to design, implement, rollout infrastructure across multiple cloud environments without adding friction to product functionality or performance. We are responsible for the ever-growing need to improve our customer safety and privacy by providing security services that are coupled with the core Okta product.

This is a high-impact role in a security-centric, fast-paced organization that is poised for massive growth and success. You will act as a liaison between the Security org and the Engineering org to build technical leverage and influence the security roadmap. You will focus on engineering security aspects of the systems used across our services. Join us and be part of a company that is about to change the cloud computing landscape forever.

 

Bring all the passion and dedication along and there’s no telling what you could accomplish!

 

What you’ll be doing 

  • Designing, building, running, and monitoring Okta's production infrastructure
  • Be an evangelist for security best practices and also lead initiatives/projects to strengthen our security posture for critical infrastructure
  • Responding to production incidents and determining how we can prevent them in the future
  • Triaging and troubleshooting complex production issues to ensure reliability and performance
  • Identifying and automating manual processes
  • Continuously evolving our monitoring tools and platform
  • Promoting and applying best practices for building scalable and reliable services across engineering
  • Developing and maintaining technical documentation, runbooks, and procedures
  • Supporting a 24x7 online environment as part of an on-call rotation
  • Be a technical SME for a team that designs and builds Okta's production infrastructure, focusing on security at scale in the cloud.

What you’ll bring to the role

  • Are always willing to go the extra mile: see a problem, fix the problem.
  • Are passionate about encouraging the development of engineering peers and leading by example.
  • Have experience automating, securing, and running large-scale production IAM, containerized services in AWS (EC2, ECS, KMS, Kinesis, RDS), GCP (GKE, GCE) or other cloud providers.
  • Have deep knowledge of CI/CD principles, Linux fundamentals, OS hardening, networking concepts, and IP protocols.
  • Have a deep understanding and familiarity with configuration management tools like Chef and Terraform.
  • Have expert-level abilities in operational tooling languages such as Ruby, Python, Go and shell, and use of source control.
  • Experience with industry-standard security tools like Nessus, Qualys, OSQuery, Splunk, etc.
  • Experience with Public Key Infrastructure (PKI) and secrets management

Minimum Required Knowledge, Skills, Abilities, and Qualities:

  • 6+ years of experience architecting and running complex AWS or other cloud networking infrastructure resources
  • 6+ years of experience with Chef and Terraform
  • Unflappable troubleshooting skills
  • Proven experience in collaborating across teams to deliver complex horizontal projects
  • Strong Linux understanding and experience.
  • Strong security background and knowledge.
  • BS In computer science (or equivalent experience).

And extra credit if you have experience in any of the following! 

  • Experience conducting threat assessments, and assessing vulnerabilities in a high-availability setting.
  • Understand MySQL, including replication and clustering strategies, and are familiar with data stores such as DynamoDB, Redis, and Elasticsearch.
  • Experience with US Govt Federal and DoD compliance requirements - FedRAMP, IL

#LI-Hybrid

#LI-TM

Below is the annual salary range for candidates located in Canada. Your actual salary will depend on factors such as your skills, qualifications, and experience. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental, and vision insurance, RRSP with a match, healthcare spending, telemedicine, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program, please visit: https://rewards.okta.com/can.

The annual base salary range for this position for candidates located in Canada is between:
$139,000$209,000 CAD

What you can look forward to as a Full-Time Okta employee!

Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! https://www.okta.com/company/careers/.

Some roles may require travel to one of our office locations for in-person onboarding.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy at https://www.okta.com/privacy-policy/

Okta Glassdoor Company Review
3.6 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
Okta DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Okta
Okta CEO photo
Todd McKinnon
Approve of CEO

Average salary estimate

$131000 / YEARLY (est.)
min
max
$103000K
$159000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Staff Site Reliability Engineer, Security Engineering Group, Okta

At Okta, we're on a mission to give the power of identity back to you. As a Staff Site Reliability Engineer in our Security Engineering Group, you'll play a crucial role in ensuring our cloud infrastructure remains fortified and efficient. We’re looking for an experienced and passionate individual who thrives on innovating and implementing security solutions for large-scale environments. Imagine designing, building, and monitoring Okta's production infrastructure while cultivating a security-conscious culture. In this collaborative and dynamic role, you will champion best practices to protect our technology and guide our engineering teams toward adopting strong measures such as the principle of least privilege. You’ll tackle production incidents, ensure system reliability, and develop automated solutions for various processes, enhancing our operational efficiency. Whether you're responding to complex issues or evolving our monitoring tools, each day will present opportunities to excel and influence our security roadmap. With a diverse team that values unique perspectives, we believe in fostering growth through passionate mentorship. If you have a strong background in cloud services, CI/CD principles, and a commitment to collaborative problem-solving, you’ll find yourself at home here. Join us at Okta, where your skills can help transform how individuals and organizations navigate the digital landscape securely!

Frequently Asked Questions (FAQs) for Staff Site Reliability Engineer, Security Engineering Group Role at Okta
What are the primary responsibilities of a Staff Site Reliability Engineer at Okta?

As a Staff Site Reliability Engineer at Okta, you will design, build, run, and monitor the production infrastructure while focusing on security enhancements. You'll advocate for best practices, manage production incidents, troubleshoot complex issues, automate manual processes, and evolve our monitoring tools, ensuring a resilient and secure operational environment.

Join Rise to see the full answer
What qualifications are required for the Staff Site Reliability Engineer position at Okta?

To qualify for the Staff Site Reliability Engineer role at Okta, candidates should have over 6 years of experience in architecting cloud infrastructure, expertise with automation and security tools like Chef and Terraform, and a strong security background. A BS in Computer Science or equivalent experience is also necessary, alongside deep knowledge of Linux systems and operational languages such as Ruby and Python.

Join Rise to see the full answer
How does the Staff Site Reliability Engineer contribute to Okta's security posture?

At Okta, the Staff Site Reliability Engineer plays a pivotal role by leading initiatives to strengthen security measures within the infrastructure. This includes implementing defense-in-depth strategies, enforcing best practices, and acting as a technical Subject Matter Expert to ensure security is incorporated seamlessly into all engineering projects.

Join Rise to see the full answer
What tools and technologies do Staff Site Reliability Engineers use at Okta?

Staff Site Reliability Engineers at Okta leverage a variety of tools and technologies, including AWS services for cloud infrastructure management, CI/CD principles for continuous integration, and automation technologies such as Chef and Terraform. Familiarity with security tools like Nessus, Qualys, and configuration management is also essential.

Join Rise to see the full answer
What can I expect during the interview process for the Staff Site Reliability Engineer role at Okta?

During the interview for the Staff Site Reliability Engineer position at Okta, you can expect a combination of technical assessments to gauge your expertise, situational questions to assess your problem-solving capabilities, and discussions aimed at understanding how you can contribute to our engineering and security efforts effectively.

Join Rise to see the full answer
Common Interview Questions for Staff Site Reliability Engineer, Security Engineering Group
Can you describe a time when you improved system reliability in a large-scale environment?

To approach this question, start by outlining the specific system challenges you encountered. Discuss the methodologies you employed to analyze and identify the root causes and the solutions you implemented to enhance reliability, illustrating measurable outcomes or improvements following your interventions.

Join Rise to see the full answer
How do you prioritize security tasks while working on production incidents?

When prioritizing security tasks, illustrate a framework or criteria you use to evaluate the urgency and impact of incidents. Include examples of how you balance immediate operational needs with long-term security improvements and how you ensure that both areas receive adequate attention.

Join Rise to see the full answer
What is your experience with CI/CD and how do you integrate security into the pipeline?

Discuss your familiarity with CI/CD practices and provide examples of how you have successfully integrated security measures into the development pipeline, such as implementing automated security testing and ensuring compliance with security policies without hindering release velocity.

Join Rise to see the full answer
How would you handle a security breach in a cloud environment?

Outline a response plan you would implement in the event of a security breach. Highlight your immediate steps for containing the breach, the tools you would utilize for investigation and remediation, and how you would communicate with stakeholders and prevent future occurrences.

Join Rise to see the full answer
What tools have you used for monitoring and automation in production systems?

Enumerate the tools you have experience with, such as Prometheus for monitoring or Jenkins for automation. Provide insights into how you utilized these tools to enhance system observability and automate key processes, discussing any challenges you faced and how you overcame them.

Join Rise to see the full answer
Can you explain the principle of least privilege and its importance?

Describe the principle of least privilege as ensuring that users and systems have the minimum level of access necessary to perform their functions. Discuss its importance in reducing security risks and how you enforce it in your previous roles or how you plan to do so at Okta.

Join Rise to see the full answer
Describe your experience with incident response processes.

Share the specific incident response frameworks or approaches you have worked with. Highlight a notable incident you've managed, detailing the steps taken from detection to analysis to resolution, and the lessons learned that improved future response efforts.

Join Rise to see the full answer
What methods do you use to assess the security posture of a cloud environment?

Talk about various assessment methodologies, such as penetration testing and vulnerability scanning. Include the tools you employ and how you work with development teams to remediate identified vulnerabilities to strengthen the overall security posture.

Join Rise to see the full answer
In your opinion, what are the biggest security challenges facing cloud environments today?

Reflect on current trends and threats in cloud security, discussing topics such as data breaches, misconfigurations, and compliance issues. Provide your insights on how organizations can stay ahead of these challenges through proactive measures.

Join Rise to see the full answer
How do you promote a security-first culture within engineering teams?

Explain strategies you have utilized to foster a security-first mindset among engineering peers. Provide examples of training sessions, documentation creation, or collaborative efforts you’ve enacted that inspire team members to prioritize security in their architectures and workflows.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 13 days ago
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Maternity Leave
Paternity Leave
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Paid Volunteer Time
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Family Coverage (Insurance)
Medical Insurance
Mental Health Resources
Photo of the Rise User
Posted 13 days ago
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Maternity Leave
Paternity Leave
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Paid Volunteer Time
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Family Coverage (Insurance)
Medical Insurance
Mental Health Resources
Posted 7 days ago
Photo of the Rise User
Teleport Remote No location specified
Posted 6 days ago
Transparent & Candid
Growth & Learning
Inclusive & Diverse
Empathetic
Collaboration over Competition
Feedback Forward
401K Matching
Medical Insurance
Dental Insurance
Vision Insurance
Equity
Paid Sick Days
Paid Time-Off
Disability Insurance
Life insurance
Learning & Development
Photo of the Rise User
Posted 4 hours ago
Photo of the Rise User
Roblox Hybrid San Mateo, CA, United States
Posted 2 days ago
Photo of the Rise User
LoopMe Remote No location specified
Posted 10 hours ago
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Vision Insurance
Paid Holidays
Photo of the Rise User
Posted 8 days ago

Okta is a leading identity and access management company headquartered in San Francisco, California that is committed to allowing people to access applications on any device at any time, while still enforcing strong security policies.

446 jobs
MATCH
Calculating your matching score...
BADGES
Badge ChangemakerBadge Future MakerBadge Global CitizenBadge Innovator
CULTURE VALUES
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
BENEFITS & PERKS
Maternity Leave
Paternity Leave
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Paid Volunteer Time
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Family Coverage (Insurance)
Medical Insurance
Mental Health Resources
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
March 27, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!