Senior Kubernetes Admin / Systems Engineer, EngProd

Company Description

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world. Our solutions are designed to not only meet the current demands of the digital landscape but to also anticipate and adapt to future challenges.

At Arista, we value the diversity of thought and perspectives that each employee brings to the table. We believe that fostering an inclusive environment, where individuals from various backgrounds and experiences feel welcome, is essential for driving creativity and innovation.

Our commitment to excellence has earned us several prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

Job Description

Who You’ll Work With

Arista Networks is looking for world-class Kubernetes-aware engineers passionate about driving systems reliability and scalability to provide the best possible development experience for our 1400+ person engineering team. You will be part of a fast-paced, high-caliber team building the internal systems and infrastructure used to build the routing and switching products driving the industry's largest data center networks.

Arista’s Software Engineering team runs at a scale rarely found: TBs of source control, 60GB work trees with 1000s of developer branches in flight at any given time, over 400K daily build/test jobs, and over 150 homegrown and cloud-native services running on a 100-node on-prem bare-metal Kubernetes cluster. Operating these systems takes vigilance, responsiveness to alerts, and a steady stream of updates and bug fixes to keep things running smoothly and efficiently, as well as to increase our ability to monitor, understand, and visualize them. The role will cover all aspects of our Kubernetes infrastructure and may include monitoring, responding to, and enhancing alerts; working to unify and standardize our alerts; fine-tuning code for scalability and performance; debugging problems; and simplifying and securing the developer experience with k8s. You will own your projects from definition to deployment, including developer and vendor interactions, and you will be responsible for the quality of everything you deliver.

What You’ll Do

Working in the Engineering Productivity (EngProd) group, you will collaborate with other engineers to design, build, scale, and operate the systems that the rest of Arista’s development teams use. The EngProd team uses industry-standard systems like Ansible, Jenkins, Kubernetes, Grafana, Spinnaker, MySQL, Elasticsearch, Google Cloud, and Varnish, as well as internal systems that we’ve built from the ground up to automate CI/CD, testing, analysis, and visualization.

Responsibilities

  • Work with the existing k8s admin team to own different aspects of managing a production k8s cluster (e.g., upgrades, monitoring, capacity planning, security, and developer experience)
  • Proactively monitor, respond to, and enhance alerts and set up automated alert handling where applicable
  • Create and maintain incident response runbooks in collaboration with the service dev teams
  • Debug and resolve issues impacting developer user experience and infrastructure stability around the k8s platform
  • Adopt current best practices in k8s cluster management. Evaluate and adopt OSS projects that simplify k8s cluster management.
  • Set up guidelines and paved paths for service dev teams to improve the developer experience around the k8s platform.
  • Work with Arista’s software engineers to identify bottlenecks and limitations in our workflows, tooling, and infrastructure around k8s and provide fixes for those problems.
  • Engage with 3rd party vendor support as part of triage

Qualifications

  • At least a BSc in Computer Science or Engineering + 8 years’ experience, an MS in Computer Science or Engineering + 6 years’ experience, a Ph.D. in Computer Science, or equivalent work experience.
  • Knowledge of one or more of Go, Python, JavaScript. Experience with shell scripting sufficient to implement medium-complexity automation workflows.
  • Knowledge of Linux (or UNIX).
  • Experience in operating software systems at scale.
  • Strong understanding of the fundamentals of storage and networking.
  • Comfortable with Ansible and GitOps.
  • Strong expertise with managing on-prem/baremetal Kubernetes clusters.
  • Applied understanding of software engineering principles.
  • Strong problem solving and software troubleshooting skills.
  • Ability to design a solution and implement features independently. Ability to work in small teams.
  • Comfortable with security principles and able to study the source code of OSS projects and conduct experiments as necessary to debug issues.
  • Proven expertise with debugging complex issues that span the technology stack.
  • Experience dealing with network proxies and containerized storage.

Additional Information

   

Arista Networks is an equal opportunity employer. Arista makes all hiring and employment-related decisions in a non-discriminatory manner without regard to race, color, religion, sex, sexual orientation, gender identity, national origin or any other factor determined to be unlawful under applicable federal, state, or local law. All your information will be kept confidential according to EEO guidelines.

Average salary estimate

$135,000 / year (est.)
min: $120,000
max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior Kubernetes Admin / Systems Engineer, EngProd, Arista Networks

Are you ready to take on an exciting challenge as a Senior Kubernetes Admin / Systems Engineer at Arista Networks? Based in Nashua, NH, you'll be stepping into a dynamic environment where innovation thrives, and engineering teams are empowered to exceed their limits. In this role, you'll be part of the Engineering Productivity (EngProd) group, working alongside some of the brightest minds in the industry to enhance our Kubernetes infrastructure! Your days will be filled with maintaining and improving our production Kubernetes cluster while ensuring systems reliability and scalability for over 1400 engineers. With responsibilities that range from monitoring and responding to alerts to fine-tuning code for optimal performance, you'll have the opportunity to own projects from definition to deployment. Imagine the impact of your work as you dive into troubleshooting and enhancing the developer experience surrounding Kubernetes. Plus, your expertise with tools such as Ansible, Jenkins, and Grafana will be vital in automating CI/CD pipelines and streamlining operations. At Arista, we believe in fostering a diverse and inclusive work environment that drives creativity and innovation. If you have a strong background in software engineering, solid experience managing Kubernetes at scale, and a passion for problem-solving, this position could be your next great adventure!

Frequently Asked Questions (FAQs) for Senior Kubernetes Admin / Systems Engineer, EngProd Role at Arista Networks
What are the qualifications needed for the Senior Kubernetes Admin / Systems Engineer position at Arista Networks?

The Senior Kubernetes Admin / Systems Engineer position at Arista Networks requires at least a BSc in Computer Science or Engineering plus eight years of experience, or an MS plus six years, or a Ph.D. in a related field. Candidates should have a solid understanding of operating software systems at scale, strong expertise with on-prem Kubernetes clusters, and knowledge of programming languages like Go, Python, or JavaScript.

What does a typical day look like for a Senior Kubernetes Admin / Systems Engineer at Arista Networks?

A typical day for a Senior Kubernetes Admin / Systems Engineer at Arista Networks involves monitoring and maintaining the Kubernetes infrastructure, enhancing alert systems, debugging issues, and collaborating with other engineers to improve the developer experience. You will also work on setting up incident response runbooks and adopting current best practices in Kubernetes management.

What tools and technologies will a Senior Kubernetes Admin / Systems Engineer use at Arista Networks?

In this role, the Senior Kubernetes Admin / Systems Engineer at Arista Networks will leverage tools and technologies like Ansible, Kubernetes, Grafana, Spinnaker, and Google Cloud, as well as internal systems built to automate CI/CD and testing processes. Familiarity with networking principles and storage management is essential for success.

How does the Senior Kubernetes Admin / Systems Engineer contribute to team success at Arista Networks?

The Senior Kubernetes Admin / Systems Engineer contributes to team success at Arista Networks by ensuring the Kubernetes infrastructure is stable and efficient, thus enabling development teams to focus on building products. By identifying bottlenecks, providing fixes, and implementing best practices, you play a critical role in enhancing overall productivity within the organization.

What skills are essential for a Senior Kubernetes Admin / Systems Engineer at Arista Networks?

Essential skills for a Senior Kubernetes Admin / Systems Engineer at Arista Networks include strong problem-solving abilities, expertise in managing Kubernetes environments, proficiency in scripting for automation, and an understanding of networking fundamentals. Excellent communication and collaboration skills are also crucial for working effectively within the team.

Common Interview Questions for Senior Kubernetes Admin / Systems Engineer, EngProd
What experience do you have managing Kubernetes clusters?

When answering, focus on specific examples of Kubernetes clusters you have managed, highlighting instances where you ensured stability, performed upgrades, or fine-tuned performance. Mention the scale at which you operated and any tools you utilized effectively.

Can you describe a challenging bug you encountered in a Kubernetes environment and how you resolved it?

Share a real-life example: describe the problem, the steps you took to identify its cause, and how you implemented the solution. Emphasize your problem-solving skills and the impact of your resolution on the team.

How do you approach monitoring and alerting in a production Kubernetes cluster?

Explain your methods for setting up monitoring tools, what metrics you focus on, and how you use alerts to maintain system reliability. Information on your experience with tools like Prometheus or Grafana can add depth to your answer.

What strategies do you use for Kubernetes security?

Discuss your approach to securing Kubernetes clusters, including role-based access control, network policies, and regular audits. Providing specifics on security tools and best practices you have utilized will enhance your response.

How do you manage the relationship with third-party vendors for Kubernetes support?

Describe how you engage with vendors for support, the importance of effective communication, and how you ensure that you address issues promptly. Sharing instances where vendor support led to significant improvements will strengthen your answer.

What scripting languages are you comfortable with for automation purposes?

Mention specific scripting languages you've utilized, such as Python or Bash. Provide contexts or examples of automation tasks you have accomplished, emphasizing the impact on efficiency and reliability.

How do you ensure high availability in Kubernetes applications?

Discuss strategies such as setting up redundancy, utilizing multiple replicas, and implementing load balancers. Providing examples of high availability scenarios you managed will illustrate your practical knowledge.

What best practices do you follow when updating Kubernetes clusters?

Outline the systematic approach you follow for updates, including testing in a staging environment, reviewing change logs, and having a rollback plan in case of issues. Mentioning your experience in successfully executing updates is beneficial.

Can you explain the role of continuous integration and continuous deployment in Kubernetes management?

You should explain how CI/CD pipelines facilitate quick and reliable deployments in Kubernetes environments. Discuss your experience setting up CI/CD using tools within the Kubernetes ecosystem and the benefits to development workflows.

What do you enjoy most about working as a Senior Kubernetes Admin / Systems Engineer?

Share your passion for problem-solving, the satisfaction of improving development processes, and the excitement of working with cutting-edge technologies. Personal experiences that align with the company's values can make your answer more relatable.


Arista Networks is an industry leader in cognitive cloud networking for mission-critical data center and campus environments. Arista’s award-winning platforms deliver availability, agility, automation analytics and security through CloudVision® an...

Employment type: Full-time, on-site
Date posted: January 3, 2025