We are looking for engineers with significant problem solving experience in PyTorch, CUDA and distributed systems. You will work with Research Scientists to build & train cutting edge foundation models on thousands of GPUs.
Responsibilities
Ensure efficient implementation of models & systems for data processing, training, inference and deployment
Identify and implement optimization techniques for massively parallel and distributed systems
Identify and remedy efficiency bottlenecks (memory, speed, utilization) by profiling and implementing high-performance CUDA, Triton, C++ and PyTorch code
Work closely together with the research team to ensure systems are planned to be as efficient as possible from start to finish
Build tools to visualize, evaluate and filter datasets
Implement cutting-edge product prototypes based on multimodal generative AI
Experience
Experience training large models using Python & Pytorch, including practical experience working with the entire development pipeline from data processing, preparation & data loading to training and inference.
Experience optimizing and deploying inference workloads for throughput and latency across the stack (inputs, model inference, outputs, parallel processing etc.)
Experience with profiling CPU & GPU code in PyTorch, including Nvidia Nsight or similar.
Experience writing & improving highly parallel & distributed PyTorch code, with familiarity in DDP, FSDP, Tensor Parallel, etc.
Experience writing high-performance parallel C++. Bonus if done within an ML context with PyTorch, like for data loading, data processing, inference code.
Experience with high-performance Triton / CUDA and writing custom PyTorch kernels. Top candidates will be able to utilize tensor cores; optimize performance with CUDA memory and other similar skills.
Good to have experience working with Deep learning concepts such as Transformers & Multimodal Generative models such as Diffusion Models and GANs.
Good to have experience building inference / demo prototype code (incl. Gradio, Docker etc.)
Compensation
The pay range for this position in California is $180,000 - $250,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.
Your applications are reviewed by real people.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Luma AI invites innovative thinkers to join their Talent Community and help shape the future of generative video technology.
Join Luma as a Technical Artist focused on enhancing language prompts for cutting-edge generative AI models.
Join Sanofi as a Stability Monitoring Expert and help innovate in drug development while working in a dynamic, team-oriented environment.
As a Clinical Safety Analyst at AbbVie, you'll play a crucial role in ensuring the quality of clinical trial data while working remotely.
Join Open Philanthropy as a Program Associate/Senior Program Associate to help address potential risks from advanced AI through impactful funding and research initiatives.
Join LG Electronics as an Embodied AI Engineer and shape the future of robotics and AI technology with innovative solutions.
Join Novartis as a Medical Science Liaison to innovate in patient care within the Nephrology therapeutic area.
Embark on a PhD journey at Bosch, focusing on innovative hybrid models that enhance the performance of electric drives and mechanical systems.
Join Peraton as an Ocean Modeling Reinforcement Learning Researcher to advance autonomous oceanic explorations through cutting-edge technology.
Join Euromonitor International as a Research Manager to lead a dynamic team in delivering high-quality data analysis and research.
Subscribe to Rise newsletter