Job details

Member of Technical Staff - ML Research for Data Generation

Liquid AI, an MIT spin-off, is a foundation model company headquartered in Boston, Massachusetts. Our mission is to build capable and efficient general-purpose AI systems at every scale.

Our goal at Liquid is to build the most capable AI systems to solve problems at every scale, such that users can build, access, and control their AI solutions. This is to ensure that AI will get meaningfully, reliably and efficiently integrated at all enterprises. Long term, Liquid will create and deploy frontier-AI-powered solutions that are available to everyone.

We are seeking a highly skilled ML Engineer to play a critical role in our foundation model development process. The ideal candidate will be responsible for designing, developing, and implementing sophisticated synthetic and real-world data generation strategies that will feed and improve our AI model's training pipeline.

Key Responsibilities

Design and implement comprehensive data generation strategies for foundation model training

Develop synthetic data generation techniques that enhance model performance and diversity

Curate, clean, and validate large-scale real-world datasets

Create advanced data augmentation and transformation pipelines

Ensure data quality, ethical considerations, and bias mitigation in data generation

Develop tools and frameworks for reproducible and scalable data generation

Monitor and assess the impact of generated data on model performance

Required Qualifications

Ph.D. or Master's degree in Computer Science, Machine Learning, Statistics, or related field

Experience in data generation, synthetic data creation, or machine learning data pipelines

Strong programming skills

Experience with machine learning frameworks, ideally Pytorch

Deep understanding of generative AI techniques

Expertise in data augmentation, transformation, and cleaning methodologies

Strong statistical and mathematical background

Preferred Skills

Experience with large language models or multimodal foundation models

Knowledge of differential privacy and data anonymization techniques

Experience with data ethics and bias detection

Publications or research in synthetic data generation

Understanding of scalable data processing architectures

projects

Average salary estimate

$135000 / YEARLY (est.)

min

max

$120000K

$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Member of Technical Staff - ML Research for Data Generation, Liquid AI

Join Liquid AI, an innovative MIT spin-off headquartered in Boston, Massachusetts, as a Member of Technical Staff - ML Research for Data Generation! We're on a mission to build some of the most capable and efficient general-purpose AI systems out there. Your role will be vital in shaping the foundation of our AI models by designing, developing, and implementing advanced data generation strategies that enhance our AI training pipelines. If you're someone who dives deep into the intricacies of synthetic data generation and real-world dataset validation, we want to hear from you! You’ll be responsible for curating large datasets while ensuring ethical data practices and bias mitigation. Collaborate with a talented team and use your expertise in generative AI techniques and machine learning frameworks like Pytorch to help us drive innovation at every scale. Your ability to develop tools for reproducible and scalable data generation will not only support our models but also help us make impactful contributions to the AI community. With a Ph.D. or Master's degree in fields like Computer Science or Machine Learning, coupled with your strong programming skills and deep understanding of data augmentation methodologies, you’ll play a key role in refining our AI models for efficient deployment across enterprises. If you’re ready to push the boundaries of AI and create solutions that are accessible to all, consider this an exciting opportunity for both personal and professional growth at Liquid AI!

Frequently Asked Questions (FAQs) for Member of Technical Staff - ML Research for Data Generation Role at Liquid AI

What qualifications are needed for the Member of Technical Staff - ML Research for Data Generation position at Liquid AI?

To qualify for the Member of Technical Staff - ML Research for Data Generation role at Liquid AI, candidates should hold a Ph.D. or Master's degree in Computer Science, Machine Learning, Statistics, or a related field. Experience in data generation and strong programming skills, particularly with machine learning frameworks like Pytorch, are essential. A solid background in statistical and mathematical principles will also give you a competitive edge.