Site Reliability Engineers are responsible for ensuring that our platform is stable and healthy. We break down barriers by fostering developer ownership and empowering developers. We support them by building creative and robust solutions to operations problems. We use our background as generalists to work closely with product development teams from the early stages of design all the way through... identifying and resolving production issues. We see the big picture. We help create and enforce standards while facilitating an agile and learning culture. We use SRE principles such as blameless postmortems and operational load caps to ensure we're constantly improving our knowledge and maintaining a good quality of life. Overall, we're passionate about automation, learning, and participating in dynamic day-to-day work.

Site Reliability is about combining development and operations knowledge and skills to help make the organization better. Whether you have a development background and are interested in learning more about operations, or are a DevOps/Systems Engineer who is interested in developing internal tools, Cvent SRE can benefit from your skill set. Ultimately, we are looking for passionate people who love learning and technology.

We use a wide variety of technologies and avoid getting locked into a single path. If we find something that works better than what we have, we are always open to trying it out. Here is a taste of the technologies you'll get to work with:
AWS (EC2 / ECS / Lambda / RDS / S3 / Route 53 / DynamoDB)
Docker
Java, .NET, Ruby
Linux, Windows
PostgreSQL, SQL Server
Kafka / Couchbase / CouchMobile
Chef, Puppet
Datomic
Consul
Terraform, CloudFormation
Jenkins
React Native
Native iOS and Android

What You Will Be Doing:
As a Lead/Principal Site Reliability Engineer, you will use your advanced development and operations knowledge to identify and prioritize issues. You will:

Find universal solutions to common problems, and mentor and support junior staff.
Enlighten, enable, and empower a fast-growing set of multi-disciplinary teams across multiple applications and locations.
Tackle complex development, automation, and business process problems.
Champion Cvent standards and best practices.
Ensure the scalability, performance, and resilience of our suite of products.
Work with the development and product team of a new application to establish the right monitoring and alerting strategy.
Work with a new acquisition's DevOps team to cross-pollinate best practices, educate, and close gaps in Cvent standards.
Develop build, test, and deployment automation that seamlessly targets multiple on-premises and AWS regions.
Help a dev team working on a legacy code base to realize zero-downtime deployments.
Give back by working on and contributing to open source projects.
Automate all the things!

What You Need for this Position:
We believe that passion and willingness to learn outweigh any list of skills; however, experience in some of the areas below would help you hit the ground running and show that you can be successful as an SRE at Cvent:

Object-oriented software development in Java, Scala, etc.
CI server administration and support (Jenkins).
Configuration automation using Chef or Puppet.
Building tools and scripting frameworks from scratch.
Solid Windows and Linux administration skills.
Working with APM, monitoring, and logging tools (New Relic, Datadog, Splunk).
Project management tools like Jira and Trello.
NoSQL databases (e.g., Couchbase, Cassandra).
SQL databases (MSSQL, PostgreSQL, etc.).
Message queues (RabbitMQ).
Scripting languages like Ruby, Groovy, Bash, PowerShell, or Python.

Bachelor's or master's degree in a technical field required.
Salary Range:
$80K -- $100K
Minimum Qualification
DevOps & Site Reliability
Estimated Salary: $20 to $28 per hour based on qualifications