Sign up for our
weekly
newsletter
of fresh jobs
Survey Monkey is a global leader in online surveys and forms that empowers people with the insights they need to make decisions with speed and confidence. Our fast, intuitive feedback management platform connects millions of users worldwide with real-time AI-powered insights that drive meaningful decisions. We provide answers to more than 20 million questions every day so that people and... organizations can attract new audiences, delight customers, create advocates, and extend their competitive advantage in the marketplace.
Our vision is to raise the bar for human experiences by amplifying individual voices. Learn more at
What we're looking for
As a member of the SRE team, you will automate away toil, help teams work in a cloud-first AWS environment, and develop processes to make development seamless. The SRE team consults with several development teams that are responsible for some of the most trafficked applications within Survey Monkey. This role presents a prime opportunity to ensure reliability, maintainability, performance, scalability, and security are at the forefront of our teams' products.
The SRE team will be fully remote across North American time zones.
The Senior Site Reliability Engineer will partner with the application development and main infrastructure teams to architect and operate reliable, scalable, and performant services. Our organization is running fully in AWS and is currently transforming within the new environment to best utilize cloud-first technologies. Because of this, you will be working on many greenfield projects and new-to-us technologies. Not only will site reliability be a keen focus but the team operates with a Dev Ops view.
This team's main impact is taking our engineering excellence to the next level. You will report to the Senior Manager of Site Reliability Engineering.
What you'll be working on
• Partner with application developers and architects to ensure our services are built for scale, reliability and performance.
• Develop the monitoring solutions on top of existing observability platforms
• Refine the development, build and deployment processes on top of our main infrastructure
• Work with the engineering teams to architect and build our platform services to simplify real-time troubleshooting and operational response to incidents and outages
• Be the expert on how to best use AWS technologies to build our next-generation cloud-native platform, including Kubernetes and Git Ops based deployments
• Be the bridge between our core application engineers and our main infrastructure teams
• Provide capacity management expertise to ensure our deployments are managed for robustness and cost
• Bring best practices and own environment management, ensuring all of our dev/test/prod environments are reproducible with high availability
We'd love to hear from people with
• A minimum 8 years experience within Site Reliability or Dev Ops roles operating in a large-scale environment
• Experience working within AWS or similar cloud providers.
• Familiarity with tooling such as:
• Infrastructure as Code (Terraform or Ansible)
• Code build and deployment (Github Actions, Docker registries, Makefiles)
• Kubernetes (Helm, minikube, EKS)
• Logging and Monitoring (Splunk or Signal Fx)
• Experience owning the improvement of the services and customer experiences of the platforms you support
• Experience in application architecture designs and implementations, including the operational trade-offs of different designs
• Knowledge of different aspects of service design: including messaging protocols and behavior, caching strategies, and software design practices
• Experience making strategic trade-offs when needed
• Have mentored SRE engineers with ranging levels of experience
The base pay provided for this position ranges from $130,240 / year - $195,360 / year depending on the geographic market and assuming a full-time schedule. Actual base pay is based on a number of factors including market location, job-related knowledge, education or training, skills, and experience.
Bonuses and commissions may also be offered as part of the total compensation package, in addition to a competitive benefits package including medical, dental, vision, life, and disability insurance; 401(k) retirement plan; flexible spending & health savings account; paid holidays; paid time off; employee assistance program; and other company benefits.
#LI-remote
Why Survey Monkey? We’re glad you asked
Survey Monkey is a place where the curious come to grow. We’re building an inclusive workplace where people of every background can excel no matter their time zone. At Survey Monkey, we weave employee feedback into everything we do to create forward-looking benefits policies, employee programs, and an award-winning culture, including best workplace for parents, our annual holiday refresh, our annual week of service, and our C.H.O.I.C.E Fund.
In addition, we’ve reimagined the way we work to allow employees to choose what works best for them -- working in-person, fully remote, or a hybrid model…