Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Sr Staff Site Reliability Engineer (Cortex Data Lake) image - Rise Careers
Job details

Sr Staff Site Reliability Engineer (Cortex Data Lake) - job 1 of 3

Company Description

Our Mission

At Palo Alto Networks® everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.
Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contributes to our collective success. Our values were crowdsourced by employees and are brought to life through each of us everyday - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career

Palo Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Sr Staff Site Reliability Engineer for the CDL/SLS team, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance, observability, troubleshooting, security, and reliability.

Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps, Prometheus, Grafana, Loki, Docker, GCP, Vault, Kafka, MySQL, Python, Bash, and Go. 

Your Impact

  • Contribute to the success of SRE and DevOps

  • Develop expertise in new technologies

  • Work with developers, researchers, data scientists, and security experts

  • Design, build and operate reliable, secure Cloud infrastructure

  • Ensure that applications are production-ready, scalable, and reliable

  • Develop tools and automation frameworks

  • Automate robust deployment of robust services

  • Orchestrate end-to-end monitoring and alerting

  • Participate with SRE and Dev teams in the on-call rotation

  • Lead root cause analysis of critical business and production issues

Qualifications

Your Experience 

  • 5+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering

  • 3+ years building high availability, scalable cloud-native applications on AWS and GCP

  • BS or MS in Computer Science, a related field, or equivalent professional experience or equivalent military experience required

  • Expertise in configuration management with a framework such as Ansible, Terraform, Helm

  • Passion for infrastructure and monitoring as code

  • Solid experience in container workloads and Kubernetes

  • Familiarity with PKI concepts, Networking concepts

  • In-depth knowledge of different security controls ( app-id, user-id, security profile, url category, content, ssl decryption, firewall MFA etc)

  • Linux administration, internals, and network troubleshooting

  • Proficiency with programming languages like Golang or Python along with shell scripting to automate tasks.

  • Proficiency with CI/CD pipelines, ArgoCD and GitLab CI/CD. 

  • Ability to diagnose and troubleshoot complex distributed systems handling high volume transactions

  • Experience with managing Kafka is a plus

  • Excellent written and verbal communication, able to collaborate and rally support

  • Self-disciplined, self-managed, self-motivated, strong sense of ownership, urgency, and drive. 

  • Ready to understand and dissect new technology stacks quickly

  • Excellent written and verbal communication, able to collaborate and rally support

Additional Information

The Team

Our engineering team is at the core of our products – connected directly to the mission of preventing cyberattacks. We are constantly innovating – challenging the way we, and the industry, think about cybersecurity. Our engineers don’t shy away from building products to solve problems no one has pursued before. We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.

Compensation Disclosure

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $126,000 - $202,500/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

#LI-TD1

Our Commitment

We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at  [email protected].

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Is role eligible for Immigration Sponsorship?: Yes

Average salary estimate

$164250 / YEARLY (est.)
min
max
$126000K
$202500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Sr Staff Site Reliability Engineer (Cortex Data Lake), Palo Alto Networks

Join Palo Alto Networks as a Sr Staff Site Reliability Engineer for the Cortex Data Lake team in Santa Clara, CA, where your expertise will help shape the future of cybersecurity! You will play a pivotal role in supporting our large infrastructure as one of the biggest customers of Google Cloud Platform (GCP). This position offers you the chance to dive into exciting technologies while collaborating with developers, researchers, data scientists, and security experts to design, build, and maintain reliable and secure Cloud infrastructure. Your work will involve everything from automation to performance enhancement, ensuring applications are production-ready and scalable. If you have a passion for infrastructure, monitoring as code, and building high-availability cloud-native applications, this may be the perfect role for you! You will be responsible for creating deployment automation frameworks, orchestrating end-to-end monitoring, and engaging in critical root cause analyses of production issues. With a dynamic team culture that values innovation, continuous learning, and collaborative problem-solving, your contributions will be respected and utilized to reinforce our mission of safeguarding the digital lifestyle of our customers. If you're up for the challenge, join us and make a difference!

Frequently Asked Questions (FAQs) for Sr Staff Site Reliability Engineer (Cortex Data Lake) Role at Palo Alto Networks
What are the responsibilities of a Sr Staff Site Reliability Engineer at Palo Alto Networks?

As a Sr Staff Site Reliability Engineer at Palo Alto Networks, your main responsibilities include automating and ensuring the reliability of our cloud infrastructure, developing tools and frameworks to enhance service deployment, and collaborating with a multi-disciplinary team to optimize application performance. You'll also participate in the on-call rotation, handle production issues, and lead root cause analysis efforts.

Join Rise to see the full answer
What qualifications are required for the Sr Staff Site Reliability Engineer position at Palo Alto Networks?

To qualify for the Sr Staff Site Reliability Engineer position at Palo Alto Networks, you need at least 5 years of engineering experience in Infrastructure, Operations, or DevOps, with 3 years focused on high availability cloud-native applications on AWS or GCP. A degree in Computer Science or a related field is essential, alongside proficiency in programming languages like Python or Go, and experience with Kubernetes and CI/CD pipelines.

Join Rise to see the full answer
What technologies does a Sr Staff Site Reliability Engineer work with at Palo Alto Networks?

In this role at Palo Alto Networks, you'll have the opportunity to work with a variety of cutting-edge technologies, including Terraform, Kubernetes, GitLab CI/CD, Prometheus, Grafana, Docker, GCP, and more. This diverse technology stack allows you to expand your expertise while directly contributing to our innovative cybersecurity solutions.

Join Rise to see the full answer
How does Palo Alto Networks support the professional growth of Sr Staff Site Reliability Engineers?

Palo Alto Networks is committed to the professional growth of its employees, including Sr Staff Site Reliability Engineers. You will have access to personalized learning opportunities, along with resources for mental and financial health. Additionally, our culture promotes continuous innovation, encouraging you to stay ahead in your field.

Join Rise to see the full answer
What is the work culture like for a Sr Staff Site Reliability Engineer at Palo Alto Networks?

The work culture for a Sr Staff Site Reliability Engineer at Palo Alto Networks is fast-paced, collaborative, and innovative. Emphasizing in-person interactions, the environment fosters casual conversations and relationship-building, ensuring that all employees feel included and valued as they contribute to our mission of cybersecurity.

Join Rise to see the full answer
Common Interview Questions for Sr Staff Site Reliability Engineer (Cortex Data Lake)
Can you describe your experience with cloud-native applications in a Sr Staff Site Reliability Engineer role?

In responding, focus on specific projects where you implemented and maintained cloud-native applications, detailing the technologies used, your role in the development lifecycle, and any challenges you overcame.

Join Rise to see the full answer
What strategies do you employ for ensuring application reliability within cloud infrastructure?

You should detail specific practices like monitoring, alerting, proactive maintenance, and automation processes that you utilize to keep applications reliable, scalable, and performant.

Join Rise to see the full answer
How do you approach troubleshooting complex distributed systems?

Discuss your systematic approach to root cause analysis, including your strategies for isolating issues, utilizing monitoring tools, and how you collaborate with other teams to resolve critical problems.

Join Rise to see the full answer
What is your experience with container orchestration tools like Kubernetes?

Provide insights into how you've utilized Kubernetes in previous roles, including deployment strategies, scaling applications, managing services, and any specific challenges you tackled.

Join Rise to see the full answer
Can you share a successful automation project you have led?

Be specific about the objectives of the automation project, the tools you employed, the results achieved, and how it improved overall efficiency or reliability within a system.

Join Rise to see the full answer
How do you prioritize workload in a DevOps/SRE role when multiple incidents occur?

Share your strategies for prioritization based on impact and urgency, including real-life examples of how you’ve effectively managed high-pressure situations while ensuring that major incidents are resolved promptly.

Join Rise to see the full answer
How do you collaborate with development teams as a Sr Staff Site Reliability Engineer?

Discuss your methods for enhancing collaboration through regular communication, feedback sessions, participating in cross-functional teams, and tools that facilitate seamless information sharing.

Join Rise to see the full answer
How do you stay up-to-date with emerging technologies and trends in site reliability engineering?

Mention specific resources such as industry conferences, journals, online courses, or discussions with peers that help you keep your knowledge current and relevant.

Join Rise to see the full answer
What is your experience with CI/CD pipelines and how have you optimized them?

Provide specific examples of CI/CD practices you've implemented, tools you've integrated, and the improvements you have made to enhance deployment frequency and reduce lead time.

Join Rise to see the full answer
Can you elaborate on your understanding of security practices in infrastructure management?

Showcase your knowledge of security protocols, such as access controls, encryption techniques, and regular security audits and how these practices are fundamental in designing a secure site reliability environment.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 14 hours ago
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Posted 5 days ago
Photo of the Rise User
SpaceX Hybrid McGregor, TX
Posted 20 hours ago
Mission Driven
Social Impact Driven
Passion for Exploration
Reward & Recognition
Photo of the Rise User
EOS Hybrid Pflugerville, TX
Posted 12 days ago
Photo of the Rise User
Posted 13 days ago
Photo of the Rise User
Weekday Remote No location specified
Posted 5 days ago
Photo of the Rise User
Posted 8 hours ago

Being the cybersecurity partner of choice, protecting our digital way of life.

742 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
March 16, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
58 people applied to Electrical Apprentice at Aerotek
A
Someone from OH, Cleveland just viewed Personal Assistant *ASAP* at Alphabe Insight Inc
Photo of the Rise User
Someone from OH, Canton just viewed Senior Director, Communications at Imagine Pediatrics
Photo of the Rise User
20 people applied to REMOTE Sr Piping Designer at Kelly
Photo of the Rise User
Someone from OH, Euclid just viewed Software Engineer - Sr. Consultant level at Visa
Photo of the Rise User
Someone from OH, Dublin just viewed GTM Recruiter (Contract) at Notion Labs
Photo of the Rise User
Someone from OH, West Chester just viewed Marketing Manager, Brand at Felix
Photo of the Rise User
Someone from OH, Amelia just viewed Call Center Representative at Ascensus
Photo of the Rise User
Someone from OH, Amelia just viewed Remote Call Center Representative at Conduent
Photo of the Rise User
Someone from OH, Amelia just viewed Credit and Collection Analyst at AbbVie
O
Someone from OH, Dayton just viewed Data Engineer at On-Hire
Photo of the Rise User
Someone from OH, Cincinnati just viewed Reentry Coordinator at Commonwealth of Kentucky
A
Someone from OH, Lewis Center just viewed 34505367634 - Fraud Analyst at Activate Talent
Photo of the Rise User
Someone from OH, Dublin just viewed Senior Third-Party Risk Analyst at Fenergo