
Senior Observability Engineer

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth. 

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences. 

Join our team! We’re building a world where Identity belongs to you.

Site Observability Engineer

We are looking for an experienced BT Site Observability Engineer to join our Business Technology team and help build a new function within the BT SRE team. The Site Reliability Engineering team is looking to expand its scope and provide observability capabilities for critical okta.com and auth0.com properties, in addition to critical applications within the Okta corporate environment.

We are looking for a smart, innovative, and passionate engineer for this role, someone who is interested in best practices around observability, incident management, and security. The ideal candidate welcomes the challenge of building in a dynamic and ever-changing environment, and is interested in bringing a culture of operational excellence to a new team. They enjoy seeing their designs run at scale with automation, testing, and an excellent operational mindset. If you exemplify the ethic of "know about a problem before your users do," we want to hear from you!

Responsibilities

  • Build out the observability program and its processes, recommending and implementing tooling and services
  • Manage the security of critical Okta properties and handle security issues such as DDoS attacks and rate limiting
  • Ensure our critical infrastructure is meeting uptime and availability standards, and is stable for our Okta customers
  • Drive initiatives to evolve our observability platforms to increase efficiency in line with current standards and best practices, especially around incident management
  • Build data pipelines into Splunk and use your expertise to build queries and dashboards for a variety of stakeholders
  • Recommend, develop, implement, and manage appropriate policy, standards, process, and procedural updates
  • Identify and act on opportunities to automate work and extend our existing automation

Qualifications

  • Proficient with observability tools including Pingdom, New Relic, CloudWatch, and Prometheus/Grafana
  • Proficient with logging and SIEM tools, especially Splunk
  • Proficient with web security and web security tooling, especially Cloudflare
  • Experience with monitoring of hosted platforms, such as Adobe Experience Manager
  • Experience with automating systems and infrastructure via Terraform
  • Proficient with Git and building deployment pipelines using commercial tools, especially GitLab
  • Demonstrated ability to develop complex applications for cloud infrastructure at scale and deliver projects on schedule and within budget
  • Experience with reliability engineering concepts and security best practices on public cloud platforms and web applications
  • Experience with developing tooling and automation in Bash, Python, Go, etc.
  • Familiarity with Linux system administration
  • Good communication skills, with the ability to influence others and communicate complex technical concepts to different audiences



What you can look forward to as a Full-Time Okta employee!

Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! https://www.okta.com/company/careers/.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws. If reasonable accommodation is needed to participate in the job application, interview process, or onboarding, please use this Form to request an accommodation.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy at https://www.okta.com/privacy-policy/

Okta Glassdoor Company Review: 3.6 stars
Okta DE&I Review: No rating
CEO of Okta: Todd McKinnon

Average salary estimate

$120,000 / yearly (est.)
Min: $100,000
Max: $140,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior Observability Engineer, Okta

At Okta, we’re on the lookout for an experienced Senior Observability Engineer to join our vibrant Business Technology team in Bengaluru. As part of our Site Reliability Engineering team, you'll play a pivotal role in implementing observability across our critical properties, including okta.com and auth0.com, as well as crucial applications within the Okta corporate ecosystem. We believe in empowering everyone to use technology seamlessly, and as a Senior Observability Engineer, your expertise will help us ensure security and operational excellence. You'll develop and manage robust observability processes while leveraging tools such as Splunk, New Relic, and CloudWatch. We’re not just looking for someone who knows the right tools; we want someone who thrives on challenges and loves to automate repetitive tasks. You’ll be collaborating with various stakeholders to create insightful dashboards and data pipelines, enhancing our team's ability to respond to incidents proactively. With your proficiency in observability tools and a solid background in incident management, you’ll be at the forefront of shaping our approach to operational reliability. Your efforts will not only stabilize our infrastructure but also contribute to a culture of excellence here at Okta. If you’re passionate about observability, ready to increase efficiency, and eager to immerse yourself in a dynamic environment, we’d love to hear from you. Join us in building a world where Identity belongs to you!

Frequently Asked Questions (FAQs) for Senior Observability Engineer Role at Okta
What does a Senior Observability Engineer at Okta do?

A Senior Observability Engineer at Okta is responsible for building out observability programs, managing the security of critical Okta properties, and ensuring uptime and stability across our infrastructure. The role is pivotal in implementing tools and processes to enhance our incident management capabilities.

What qualifications are needed for a Senior Observability Engineer role at Okta?

To qualify for the Senior Observability Engineer position at Okta, candidates should have proficiency in observability tools such as New Relic and Splunk, experience with automation via Terraform, and a strong understanding of web security practices. Strong communication skills are essential for collaborating across teams.

How does Okta promote a culture of learning for Senior Observability Engineers?

At Okta, we celebrate diverse perspectives and encourage a culture of continuous learning for Senior Observability Engineers. We provide opportunities for training, workshops, and project involvement to ensure our engineers can develop their skills while contributing to impactful projects.

What is the work environment like for a Senior Observability Engineer at Okta?

Okta offers a dynamic work environment that empowers Senior Observability Engineers to thrive. With flexible work arrangements and access to the latest tools and technologies, our employees can work productively in a setting that suits their individual needs.

What tools will I be using as a Senior Observability Engineer at Okta?

As a Senior Observability Engineer at Okta, you will be using a variety of tools including Splunk, New Relic, CloudWatch, and Prometheus/Grafana. You'll also work with automation tools like Terraform, helping to build out observability and security protocols.

What types of projects will a Senior Observability Engineer at Okta be involved in?

In the Senior Observability Engineer role at Okta, you will be involved in projects focusing on building observability capabilities for our online properties, improving incident management processes, and developing automation solutions to enhance system efficiency and security.

Is there room for growth in the Senior Observability Engineer position at Okta?

Absolutely! At Okta, the Senior Observability Engineer role offers significant opportunities for career advancement. Our emphasis on continuous learning and innovation allows engineers to take on new challenges and evolve their careers within the company.

Common Interview Questions for Senior Observability Engineer
Can you explain what observability means in the context of Site Reliability Engineering?

Observability in SRE relates to the ability to understand the internal state of our systems from external outputs. It encompasses monitoring, logging, and tracing to provide insight into system behavior and performance, which allows for quicker problem diagnosis and resolution.

How would you approach building an observability program from scratch?

Building an observability program involves defining key performance indicators, selecting appropriate tools and technologies, creating a robust monitoring strategy, and ensuring that teams are trained in effective practices. Collaboration across departments is crucial to align objectives and gather requirements.

What tools have you used for logging and monitoring, and how did you implement them?

I've used tools like Splunk and New Relic for monitoring and logging. Implementation involved setting up data sources, configuring dashboards, and creating alerts based on specific thresholds. Regular reviews and updates were conducted to ensure they met business needs.

Describe a challenging incident you effectively managed using your observability practices.

One challenging incident involved a sudden drop in service response times. By leveraging our observability tools, I quickly identified the bottleneck in our database and implemented the necessary optimizations. Post-incident, I enhanced our monitoring thresholds to prevent future occurrences.

How do you ensure the security of observability data and monitoring tools?

Ensuring the security of observability data involves implementing strict access controls, using encryption for data in transit and at rest, and regularly auditing our monitoring tools for vulnerabilities. I also advocate for best practices and continuous education for the team.

What role does automation play in your approach to observability?

Automation is crucial in observability as it allows repetitive tasks like monitoring, alerting, and anomaly detection to be managed efficiently. By automating these processes, we minimize human error, improve response times, and allow teams to focus on more strategic improvements.

How do you prioritize observability improvements in a fast-changing environment?

I prioritize observability improvements by assessing the impact of current challenges, gathering feedback from stakeholders, and aligning improvements with business goals. Regular feedback loops help ensure the most critical needs are addressed in a timely manner.

What experience do you have with incident management processes?

I have extensive experience with incident management processes, including root cause analysis and reporting. I follow the principles of incident response frameworks, ensuring communication is clear among stakeholders and that post-incident reviews lead to actionable improvements.

Can you give an example of a successful automation project you've completed?

A successful automation project involved streamlining our deployment pipelines using GitLab CI/CD. This reduced deployment times significantly and minimized the potential for human error during releases, thereby enhancing our operational efficiency.

What strategies do you use to communicate complex technical concepts to non-technical stakeholders?

To communicate complex concepts, I utilize clear language, visuals such as diagrams or flowcharts, and relatable examples. Tailoring my communication style based on the audience's level of technical understanding ensures effective and impactful dialogue.


Okta is a leading identity and access management company headquartered in San Francisco, California that is committed to allowing people to access applications on any device at any time, while still enforcing strong security policies.

CULTURE VALUES
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
BENEFITS & PERKS
Maternity Leave
Paternity Leave
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Paid Volunteer Time
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Family Coverage (Insurance)
Medical Insurance
Mental Health Resources
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
November 27, 2024
