About Datadog:
We're on a mission to build the best platform in the world for engineers to understand and scale their systems, applications, and teams. We operate at high scaleâtrillions of data points per dayâallowing for seamless collaboration and problem-solving among Dev, Ops and Security teams globally for tens of thousands of companies. Our engineering culture values pragmatism, honesty, and simplicity to solve hard problems the right way.
The Opportunity:
Datadog is going through a transformation from being focussed on the âthree pillars of observabilityâ (logs, metrics, traces) to having products that cross modern enterprises needs - Security Monitoring, Real User Monitoring, and Synthetic Monitoring, with many more planned. At the heart of the massive amount of streaming data generated by these systems are our Core Storage platforms. These platforms comprise 10s of thousands of Kubernetes pods, PBs of data and complex storage and alerting technologies powering highly available distributed multi-tenant solutions for data ingestion, processing and storing, and for data queries serving. Our software stack is deployed in multiple geo regions and across the three major public cloud infrastructure providers. Given the technical complexity and at Datadogâs neck-breaking pace of growth, operating the storage solutions requires a non-trivial engineering effort which is hard to sustain.
We are looking for an experienced leader to spearhead the platform automation efforts in Core Storage. As the Engineering Manager, Core Storage Platform Automation, you will manage multiple teams operating embedded and side-by-side with the platform development teams. The team will measure and reduce operational toil by devising advanced heuristics codified in automation workflows. Your organization will leverage infrastructure provided by other Datadog internal teams, and will develop platform specialized tooling, manageability solutions integrated with our platforms, automation components and more. You will staff and grow the team with talent featuring SW development and SRE backgrounds. The mission of the Platform Automation team will be to eliminate through software automation 75% of all operational activities requiring human involvement - these include, but are not limited to, release deployments, failures remediation, long running maintenance and business processes.
You Will:
You Are:
Bonus Points:
Why You Should Apply:
This is a remote position
Equal Opportunity at Datadog:
Datadog is an Affirmative Action and Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.
Your Privacy:
Any information you submit to Datadog as part of your application will be processed in accordance with Datadogâs Applicant and Candidate Privacy Notice.
Datadog (NYSE: DDOG) is a prominent global SaaS provider that uniquely balances growth and profitability. It offers cloud-scale monitoring and security by combining metrics, traces, and logs within one platform.
107 jobsSubscribe to Rise newsletter