Job details

Sr Staff Site Reliability Engineer (Cortex Data Lake)

Company Description

Our Mission

At Palo Alto Networks® everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.
Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contributes to our collective success. Our values were crowdsourced by employees and are brought to life through each of us everyday - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career

Palo Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the CDL/SLS team, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance, observability, troubleshooting, security, and reliability.

Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps, Prometheus, Grafana, Loki, Docker, GCP, Vault, Kafka, MySQL, Python, Bash, and Go.

Your Impact

Contribute to the success of SRE and DevOps
Develop expertise in new technologies
Work with developers, researchers, data scientists, and security experts
Design, build and operate reliable, secure Cloud infrastructure
Ensure that applications are production-ready, scalable, and reliable
Develop tools and automation frameworks
Automate robust deployment of robust services
Orchestrate end-to-end monitoring and alerting
Participate with SRE and Dev teams in the on-call rotation
Lead root cause analysis of critical business and production issues

Qualifications

Your Experience

4+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering
3+ years building high availability, scalable cloud-native applications on AWS or GCP
BS or MS in Computer Science, a related field, or equivalent professional experience or equivalent military experience required
Expertise in configuration management with a framework such as Ansible, Terraform, Helm
Passion for infrastructure and monitoring as code
Solid experience in container workloads and Kubernetes
Familiarity with PKI concepts, Networking concepts
In-depth knowledge of different security controls ( app-id, user-id, security profile, url category, content, ssl decryption, firewall MFA etc)
Linux administration, internals, and network troubleshooting
Proficiency with programming languages like Golang or Python along with shell scripting to automate tasks
Proficiency with CI/CD pipelines, ArgoCD and GitLab CI/CD. Knowledge of GitLab Runners is a plus
Ability to diagnose and troubleshoot complex distributed systems handling high volume transactions
Experience with managing Kafka is a plus
Excellent written and verbal communication, able to collaborate and rally support
Self-disciplined, self-managed, self-motivated and strong sense of ownership, urgency, and drive
Ready to understand and dissect new technology stacks quickly

Additional Information

The Team

Our engineering team is at the core of our products – connected directly to the mission of preventing cyberattacks. We are constantly innovating – challenging the way we, and the industry, think about cybersecurity. Our engineers don’t shy away from building products to solve problems no one has pursued before.

We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.

Compensation Disclosure

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $126000 - $203500/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

#LI-TD1

Our Commitment

We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at [email protected].

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Average salary estimate

$164750 / YEARLY (est.)

min

max

$126000K

$203500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Sr Staff Site Reliability Engineer (Cortex Data Lake), Palo Alto Networks

At Palo Alto Networks, we're on the lookout for a dynamic Senior Staff Site Reliability Engineer to join our Cortex Data Lake team in sunny Santa Clara, CA. If you're an innovator who thrives on tackling challenges and shaping the future of cybersecurity, this role is your opportunity! As a critical member of our diverse engineering team, your expertise will support our large-scale infrastructure, which is at the forefront of cloud-native applications. Your journey will include designing and operating reliable cloud infrastructure, while ensuring our applications are both scalable and production-ready. You'll partner with a variety of professionals from developers to data scientists, all while driving automation and building frameworks that elevate our operational excellence. Our tech stack is impressive, featuring tools like Terraform, Kubernetes, and Docker, and you’ll have the scope to delve into new technologies that excite you. In this role, you'll be diving into monitoring and alerting practices, and leading root cause analyses of critical production issues to continuosly improve our systems. With a strong focus on personal and professional growth, your contributions will not only shape our products but also enhance our mission of securing the digital world. Come be part of a team that believes in collaboration, empowers its members, and puts a premium on innovative problem-solving.

Frequently Asked Questions (FAQs) for Sr Staff Site Reliability Engineer (Cortex Data Lake) Role at Palo Alto Networks

What are the responsibilities of a Sr Staff Site Reliability Engineer at Palo Alto Networks?

As a Senior Staff Site Reliability Engineer at Palo Alto Networks, your responsibilities include developing and maintaining scalable cloud infrastructure, ensuring applications are production-ready, and leading automation efforts to streamline deployment processes. You'll also work closely with developers and data scientists, lead root cause analyses for critical issues, and participate in the on-call rotation to guarantee reliability and security.