Sign up for our
weekly
newsletter
of fresh jobs
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Rewst, is seeking the following. Apply via Dice today!Rewst is a venture-backed B2B SaaS startup founded in January 2021. We've raised over $50M to date and have hundreds of paying customers.We're a low-code workflow automation platform targeted at the Managed Service Provider (MSP) market. To get an idea of how Rewst works and what it does, take a look at our YouTube channel. There you can find training videos from our education team, demos of the product by our ROC (Robotic Operations Center) team, and even customers showcasing how they use Rewst in their business.We don't have dedicated managers and we don't put people in charge who don't know how to write code themselves. We're a relatively flat organization that prioritizes talented engineers who want to take charge. We see management, rules, and performance metrics as roadblocks to creativity and innovative thinking. We expect engineers to adhere to a single rule: create value for the organization. We tell you the problem, provide some constraints or example scenarios, and expect you to work with product management and stakeholders to craft the right solution and maximize the value to the business.About YouAs a Senior Site Reliability Engineer (SRE), you will ensure that Rewst's systems are reliable, scalable, and performing optimally. You're comfortable identifying system bottlenecks, automating tasks, and implementing best practices to prevent downtime and ensure high availability. You can talk to product stakeholders, record requirements, and ask the right questions where things are ambiguous. You have a passion for maintaining the stability of production systems and improving overall user experience. You are adept at using monitoring tools to foresee issues before they affect the user and have experience in incident management to resolve issues swiftly. You understand the balance between reliability and rapid feature development and are skilled at implementing strategies that align with business needs.You're a proactive engineer who thrives in environments where you can automate and improve existing processes. You are comfortable leading initiatives that improve system performance and reliability, and you can collaborate effectively with other engineers to achieve these goals. You are driven by results and are not afraid to make decisions that ensure the robustness and reliability of production systems.Our Tech StackAWSDockerKubernetesKafkaRedisPostgresApollo GraphQLNext.jsPrometheusGrafanaAWS CDKPythonJavascript/TypescriptRequirements5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role.Strong experience with cloud platforms such as AWS or Azure.Proficiency in containerization and orchestration technologies such as Docker and Kubernetes.Experience with infrastructure as code tools like AWS CDK.Familiarity with monitoring and logging tools such as Prometheus, Grafana, OpenTelemetry, or SigNoz.Strong scripting skills in languages such as Python or Typescript.Proven ability to design and implement automated CI/CD pipelines.Ability to debug complex issues in a distributed system and identify root causes.Excellent communication skills, able to convey complex technical information to non-technical stakeholders.Self-motivated and able to take ownership of projects with minimal oversight.Experience with incident management and on-call rotation.Above and BeyondExperience working in a startup environment and excited to tackle new challenges.Knowledge or experience with low-code tools or robotic process automation tools.Familiarity with advanced backend concepts such as state machines, distributed systems, or network protocols.Contributions to open-source projects related to SRE or DevOps.Certifications in cloud technologies such as AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer.