We're seeking a versatile Cloud Platform Engineer passionate about building and maintaining highly reliable, scalable, cloud-native infrastructure. You'll play a vital role in bridging the gap between development, operations, and SRE, ensuring our applications run smoothly on Kubernetes across multiple cloud platforms. Your deep understanding of Kubernetes, cloud technologies, and automation will be instrumental in empowering our teams to deliver high-quality software quickly and reliably.
What will you do?
Design, deploy, and operate Kubernetes clusters across AWS, Azure, and GCP. Optimize cluster performance, ensure high availability, and implement robust security practices.
Build and maintain cloud-native infrastructure components (load balancers, networking, storage, etc.) to support applications running on Kubernetes. Leverage Infrastructure as Code (IaC) with Terraform to automate and manage infrastructure provisioning and configuration.
Embrace GitOps principles using ArgoCD to automate deployments and configuration changes and ensure consistency between the desired and actual system state.
Establish comprehensive monitoring, logging, and alerting systems to gain insights into platform health and performance. Troubleshoot incidents swiftly and apply SRE principles to improve reliability and resilience.
Develop automation scripts and tools (Python, Go, or other languages) to streamline workflows, eliminate manual tasks, and reduce operational overhead.
Partner closely with development teams to understand their needs, provide guidance on platform best practices, and enable smooth integration and deployment of their applications.
Implement and maintain stringent security measures for Kubernetes and cloud environments, ensuring compliance with industry standards and data protection regulations.
Analyze resource usage and implement optimization strategies to maximize performance while controlling cloud costs.
Participate in an on-call rotation, troubleshooting and resolving production issues promptly.
What makes you a match?
3+ years of experience working with Kubernetes in production environments. Deep understanding of cluster operations, networking, storage, and security within Kubernetes.
Strong knowledge of AWS, Azure, and GCP, including core services, networking concepts, and security best practices.
Proven experience implementing GitOps workflows with ArgoCD and managing infrastructure using Terraform.
Fluency in at least one programming language (such as Python, Go, or Java) for automation, scripting, and tool development.
Familiarity with SRE practices like SLOs (Service Level Objectives), error budgeting, and blameless postmortems.
Excellent analytical and troubleshooting skills to identify and resolve issues in complex cloud environments.
Ability to communicate effectively with development, operations, and security teams to drive cross-functional initiatives.
Ability to work from 8:30 PM to 5:30 AM IST to provide coverage for US time zones.