We are seeking an experienced GPU Engineer with a strong background in Python and large-scale model training. In this role you will design and implement improvements to our large scale training infrastructure and directly help make technical decisions to optimise our models' training performance and efficiency.
Ideal Experience
Strong engineering skills with fluency in Python and PyTorch or other acceleration libraries.
Experience writing and debugging low-level GPU code (CUDA, Triton) and debugging hardware errors.
Experienced in scaling up GPU jobs via large-scale compute clusters using Slurm or Kubernetes.
Preferred: knowledge of advanced filesystems, particularly Ceph, and their integration and optimization in large systems.
Preferred: proficient in implementing robust monitoring systems for performance tracking and anomaly detection.
Reka's Mission
Reka's mission is to build useful multimodal artificial intelligence and use it to empower organisations and businesses. We are a globally distributed foundation model startup, headquartered in the San Francisco Bay Area, California. Embracing a remote-first approach, our team brings together top talent from around the world. Our founding team, along with many of our team members, has contributed to many of the breakthroughs in AI over the past decade.
Why Reka?
An Elite Team: Collaborate with top-tier engineers, researchers, operators from renowned organizations like Google DeepMind and Facebook AI Research (FAIR) and successful startups, driving innovation in cutting-edge AI technology.
Massive Market Opportunity: Be part of a rapidly growing industry poised to transform multiple sectors globally, offering the chance to make a significant impact.
Mission-Driven Environment: Work alongside a collaborative, mission-focused team dedicated to advancing AI for meaningful applications.
Inclusive and Open Culture: Thrive in an open and inclusive work environment that values diverse perspectives and fosters creativity.
Generous Benefits: Enjoy 5 weeks of paid leave to recharge, comprehensive healthcare benefits including vision and dental, and additional perks that support your well-being.
Visa Support: We provide visa assistance, including H1B and OPT transfers, for US employees to ensure a smooth transition and support your career with us.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
If you're an experienced GPU Engineer looking to make a significant impact, Reka has an exciting opportunity for you! Our team is on the lookout for a talented member of the Technical Staff to help us enhance and optimize our large-scale training infrastructure. You'll be using your strong background in Python along with libraries like PyTorch to design and implement improvements that could take our model training performance to the next level. With your experience in writing and debugging low-level GPU code, particularly with CUDA or Triton, you'll play a critical role in deciding how we tackle complex technical challenges. Plus, if you have a knack for scaling GPU jobs via compute clusters like Slurm or Kubernetes, you're exactly who we're looking for! At Reka, we value innovation and interdisciplinary approaches, making it the perfect setting for you to thrive. Join a globally distributed team that’s dedicated to advancing artificial intelligence and enjoy a collaborative environment with generous benefits, including extensive paid leave and healthcare support. Let's build the future of AI together!
Subscribe to Rise newsletter