As a Machine Learning Engineer (MLE) on Luma's Data team, you are responsible for raising the bar for our data quality. Data is the critical foundation of our products, and we are looking for individuals who can identify creative approaches to data and captioning and then implement solutions for processing at petabyte (PB) scale. Good candidates should have exceptional general Python engineering skills alongside a combination of industry ML experience, data experience, and a passion for building AI products.
- Work with Research and Product on dataset creation and dataset preparation
- Train and run ML classifiers and embedding models to categorize data by quality and other attributes
- Manage processing of PB-level data across thousands of GPUs
- Run experiments to gauge the impact of certain datasets on our models
- Explore, implement and deploy novel research techniques in multi-modal AI for use as features in Luma's products
Experience
- 5+ years of relevant experience, or demonstrated high-impact projects, as a Data Engineer, Machine Learning Engineer, or Data Scientist, working with large amounts of data on a daily basis.
- A strong belief in the importance of high-quality data and high motivation to work on the associated challenges.
- Experience working with large distributed systems.
- Strong generalist Python and PyTorch skills.
- Experience using SQL, Spark, or other tools for processing large amounts of data.
- Please note this role is not meant for recent grads.
$180,000 - $250,000 a year
The pay range for this position in California is $180,000 - $250,000/yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.
Your application is reviewed by real people.