Remote - US
DevOps Engineer
Remote – US and Canada only
thatgamecompany is best recognized for creating award-winning, enriching, and meaningful game titles such as Journey, Flower, and flOw. Our most recent game, Sky, is our most complex undertaking to date. It is a social network built around the values inherited from a powerful humanistic story. It is a live experience continuously evolving inside a global online theme park.
We are seeking passionate DevOps engineers to join us in embracing Infrastructure as Code practices for our backend services, including but not limited to:
- Microservices running on container orchestration for rapid iteration of in-game social features
- Infrastructure to enable rich in-game user-generated content
- Player support tools that enable our team to effectively assist players and moderate content
- Data warehouse and analytics platforms utilized by player insight and game design teams
- Cloud-based game development tools that empower the studio to continue crafting adventures
These services and platforms will be the core technology powering our current and future game titles, and will eventually be made available to external customers. We believe that these solutions will fundamentally transform the future of multiplayer social games. We are also live-operating Sky: Children of Light with millions of active users generating terabytes of data per day.
As a DevOps Engineer, you will play a pivotal role in ensuring our backend system runs smoothly without downtime, so that we can continue to bring exciting social experiences to hundreds of thousands of concurrent users at any time.
On any given day at thatgamecompany, you might:
- Write code to describe the backend infrastructure, and make the deployment and configurations visible, readable and maintainable.
- Build tools for rapid iteration, CI/CD, monitoring, diagnosing and easily accessing backend systems.
- Embrace modern technology in the container and cluster management space to improve the scalability and robustness of the backend stack.
- Improve and maintain an agile and reliable development environment for the backend stack, so that people with different skill sets in the company can run social experiments easily, and new hires can ramp up quickly.
- Monitor the backend health and respond to any anomalies in order to deliver a smooth online experience to players all over the world.
- Continually improve DevOps tooling, embracing automation to reduce the possibility of errors.
- Contribute to proactively improving our security posture, as well as reviewing security signals generated by our monitoring systems.
- Review pull requests for security compliance, reliability, and adherence to best practices.
We expect you to:
- Have a deep passion for video games and thoughtful opinions about them; be a gamer and think on behalf of players.
- Be comfortable taking risks and accomplishing engineering feats that no one else has achieved.
- Enjoy working with fast-moving and rapidly growing small teams.
- Be comfortable with periodic on-call duty.
- Document best practices, infrastructure resources, and procedures to facilitate collaboration and open knowledge sharing between teams.
Required Skills
- Be comfortable working within the Linux ecosystem, with fluency in Linux or macOS bash CLI tools.
- Have familiarity with Docker, Kubernetes, Helm, and Terraform.
- Have basic knowledge of operating systems and low-level network protocols.
- Be able to extract useful information from different sources of logs, find correlations between multiple layers of systems and diagnose failures, suspicious behaviors, and performance bottlenecks from bottom to top.
- Be eager to learn any new technology and always open to stepping out of your comfort zone.
- Have managed and maintained a production environment on AWS or GCP.
Preferred Skills
Any of the following would be highly preferred, but most of all, we value engineers who are eager to learn new ways to deliver value to players:
- Working knowledge of one or more of C++, Erlang, Go, JavaScript, or Python.
- Familiarity with various security products, including HashiCorp Vault, IAM, SIEM/CIEM tools, and WAF.
- Experience with production deployments of services in Kubernetes with ArgoCD and Helm.
- Have deep knowledge of one SQL or NoSQL database and be aware of how its storage engine works under the hood.
- Have experience using and configuring monitoring and observability tools such as Datadog or Grafana.
- Familiarity with Elasticsearch and Kibana.
We look forward to meeting you!
Applicants must be authorized to work for any employer in the U.S. or Canada. We are unable to sponsor or take over sponsorship of an employment visa at this time.