
Member of Technical Staff - Vision-Language Model Data

Liquid AI, an MIT spin-off, is a foundation model company headquartered in Boston, Massachusetts. Our mission is to build capable and efficient general-purpose AI systems at every scale.


Our goal at Liquid is to build the most capable AI systems to solve problems at every scale, so that users can build, access, and control their AI solutions. This ensures that AI is integrated meaningfully, reliably, and efficiently at all enterprises. Long term, Liquid will create and deploy frontier-AI-powered solutions that are available to everyone.


We are seeking a highly skilled Member of Technical Staff - Vision-Language Model Data to play a critical role in the development of Liquid Vision-Language models. This role focuses on gathering high-quality vision-language midtraining and SFT datasets.


Key Responsibilities
  • Create and maintain data processing, cleaning, filtering, and selection pipelines that can handle image-text data.
  • Monitor the release of public high-quality VLM datasets.
  • Create and maintain synthetic data augmentation pipelines to enhance VLM data quality.
  • Work with the multimodal vision team to run ablations on new datasets.


Required Qualifications
  • Experience Level: B.S. + 5 years of experience, M.S. + 3 years of experience, or Ph.D. + 1 year of experience.
  • Dataset Engineering: Expertise in data curation, cleaning, augmentation, and synthetic data generation techniques.
  • Machine Learning Expertise: Ability to write and debug models in popular ML frameworks, and experience working with LLMs and VLMs.
  • Software Development: Strong programming skills in Python, with an emphasis on writing clean, maintainable, and scalable code.


Preferred Qualifications
  • M.S. or Ph.D. in Computer Science, Electrical Engineering, Math, or a related field.
  • Experience fine-tuning or customizing LLMs and VLMs.
  • 2+ years working in computer vision.
  • First-author publications in top ML or vision conferences (e.g. NeurIPS, ICML, ICLR, CVPR, ICCV).
  • Contributions to popular open-source projects.


Average salary estimate

$140,000 / year (est.)
Min: $120,000
Max: $160,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Member of Technical Staff - Vision-Language Model Data, Liquid AI

At Liquid AI, an innovative MIT spin-off based in Boston, we are pioneering the future of artificial intelligence with our cutting-edge general-purpose AI systems. As we strive to integrate these intelligent solutions across enterprises seamlessly, we are on the lookout for a talented Member of Technical Staff - Vision-Language Model Data. In this exciting role, you will play a vital part in developing our Vision-Language models, primarily focusing on gathering and curating top-notch vision-language midtraining and SFT datasets. Your day-to-day responsibilities will include creating and maintaining robust data processing pipelines for image-text data, scouting for high-quality public VLM datasets, and developing synthetic data augmentation techniques to boost data quality. Collaboration is key here, as you'll work closely with our multimodal vision team to conduct ablation studies on new datasets. If you hold a Bachelor’s degree with five years of experience, a Master’s with three years, or a Ph.D. with at least one year of experience in the field, along with strong programming skills in Python and hands-on experience in dataset engineering and machine learning, we'd love to hear from you. Let's build the future of AI together at Liquid AI!

Frequently Asked Questions (FAQs) for Member of Technical Staff - Vision-Language Model Data Role at Liquid AI
What are the key responsibilities of a Member of Technical Staff - Vision-Language Model Data at Liquid AI?

As a Member of Technical Staff - Vision-Language Model Data at Liquid AI, your primary responsibilities will include creating and maintaining data processing, cleaning, filtering, and selection pipelines for handling image-text data. You will also monitor the release of public high-quality VLM datasets and enhance data quality through synthetic data augmentation pipelines. Additionally, collaborating with the multimodal vision team to run ablations on new datasets will be a critical part of your role.

What qualifications are necessary for the Member of Technical Staff - Vision-Language Model Data position at Liquid AI?

To be considered for the Member of Technical Staff - Vision-Language Model Data role at Liquid AI, candidates should possess a B.S. with 5 years of experience, an M.S. with 3 years, or a Ph.D. with at least 1 year of relevant experience. Candidates should demonstrate expertise in dataset engineering, machine learning frameworks, and strong programming skills in Python. Preferred candidates will have an advanced degree in relevant fields and experience fine-tuning LLMs and VLMs.

What technical expertise is required for the Member of Technical Staff - Vision-Language Model Data role at Liquid AI?

Candidates for the Member of Technical Staff - Vision-Language Model Data role at Liquid AI should showcase robust expertise in data curation, cleaning, augmentation, and synthetic data generation. Additionally, proficiency in machine learning frameworks and the ability to write and debug models in these environments is essential. A solid understanding of working with Large Language Models (LLMs) and Vision-Language Models (VLMs) will also be beneficial.

What experience is preferred for the Member of Technical Staff - Vision-Language Model Data role at Liquid AI?

For the Member of Technical Staff - Vision-Language Model Data position at Liquid AI, preferred qualifications include an M.S. or Ph.D. in relevant fields, 2+ years of experience in computer vision, first-author publications in top machine learning or vision conferences, and contributions to significant open-source projects. These experiences will help you contribute to the team’s cutting-edge projects effectively.

How does the Member of Technical Staff - Vision-Language Model Data collaborate within the team at Liquid AI?

In the Member of Technical Staff - Vision-Language Model Data role at Liquid AI, collaboration is key. You will work closely with the multimodal vision team to run ablations on new datasets and enhance the vision-language models through shared insights and collective problem-solving. This collaborative environment will enable you to leverage the expertise of your peers while contributing to innovative AI solutions.

Common Interview Questions for Member of Technical Staff - Vision-Language Model Data
Can you explain the importance of data processing in Vision-Language models?

When responding to this question, highlight that data processing is crucial for the performance of Vision-Language models, as high-quality input data leads to better model training and outcomes. Discuss techniques such as filtering, cleaning, and augmenting datasets to enhance data quality and overall model effectiveness.

How would you approach creating a synthetic data augmentation pipeline?

In your answer, outline a systematic approach to synthetic data augmentation, addressing the selection of techniques that align with your project goals. You could mention using methods like image transformations, text variations, or generative models to create diverse training samples to increase dataset variety and improve model robustness.

What’s your experience with machine learning frameworks?

Discuss your hands-on experience with popular machine learning frameworks, focusing on specific projects where you used frameworks like TensorFlow or PyTorch. Highlight your ability to write and debug models and to implement complex algorithms, illustrating your familiarity with best practices in coding and model deployment.

Describe a project where you fine-tuned a language model.

In your answer, choose a relevant project where you successfully fine-tuned a language model. Explain your steps, including data preparation, model selection, and evaluation metrics. Highlight the impact your fine-tuning had on the model's performance to showcase your practical experience.

What are some challenges you've faced in computer vision tasks, and how did you overcome them?

Discuss specific challenges such as data scarcity, model overfitting, or computational constraints. Explain the strategies you employed to address those challenges, such as using advanced augmentation techniques or optimizing model parameters, which will reflect your problem-solving skills in a technical environment.

How would you monitor the release of high-quality VLM datasets?

Explain your approach to actively tracking new dataset releases through academic journals, data repositories, and community forums. Highlight the importance of maintaining an up-to-date knowledge base in your field to leverage the most current datasets for model training, ensuring your projects are built on solid foundations.

Can you describe your coding style when writing Python scripts?

Provide insights into your coding style, emphasizing principles like writing clean, maintainable, and scalable code. Discuss adherence to PEP 8 guidelines, effective use of comments, and modular programming techniques that enhance the readability and usability of your code for future developments.

What techniques do you implement for data cleaning?

In answering this question, detail your methodologies for data cleaning, including handling missing values, outlier detection, and standardization procedures. Show your understanding of how effective data cleaning contributes to model accuracy and reliability.

What role do you think data quality plays in machine learning?

Highlight the critical relationship between data quality and model performance. Emphasize that high-quality data leads to improved generalization, reduced bias, and dependable predictions in machine learning models, which is central to the role of a Member of Technical Staff - Vision-Language Model Data.

How do you ensure collaboration in a remote team setting?

Discuss the tools and practices you use to foster collaboration in a remote environment, such as regular stand-up meetings, collaborative documentation platforms, and effective communication through messaging tools. Highlight the importance of building relationships with team members for effective teamwork.

Employment type: Full-time, remote
Date posted: April 12, 2025
