At Hugging Face, we’re on a journey to democratize good AI. We are building the fastest growing platform for AI builders with over 5 million users & 100k organizations who collectively shared over 1M models, 300k datasets & 300k apps. Our open-source libraries have more than 400k+ stars on Github.
About the Role
Quantization is a technique to reduce the computational and memory costs of running inference by representing the weights and activations with low-precision data types like 8-bit integer (int8) instead of the usual 32-bit floating point (float32). It is a very promising technique as it allows to run and fine-tune on consumer-grade hardware LLMs with minimal performance degradation.
This internship works at the intersections of software engineering, machine learning engineering, and education. The focus will be to integrate new quantization methods in Hugging Face ecosystem (transformers, accelerate, peft, diffusers), maintain existing integration (bitsandbytes, awq, autogptq) as well as making sure that the community is aware of these tools through benchmarks and blogposts. The ultimate goal of this internship is to drive forward quantization in the open source ecosystem.
About You
If you love open-source but also have an eye for art and creativity, are passionate about making complex technology more accessible to engineers and artists, and want to contribute to one of the fastest-growing ML ecosystems, then we can't wait to see your application!
If you're interested in joining us, but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and background complement one another. We're happy to consider where you might be able to make the biggest impact.
More about Hugging Face
We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where people feel respected and supported—regardless of who you are or where you come from. We believe this is foundational to building a great company and community. Hugging Face is an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to continuously grow. We provide all employees with reimbursement for relevant conferences, training, and education.
We care about your well-being. We offer flexible working hours and remote options. We support our employees wherever they are. While we have office spaces around the world, especially in the US, Canada, and Europe, we're very distributed and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.
We support the community. We believe significant scientific advancements are the result of collaboration across the field. Join a community supporting the ML/AI community.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
At Hugging Face, we’re excited to welcome a new Machine Learning Engineer Intern focusing on Quantization to join our mission of democratizing good AI! In this fun and creative environment, you’ll dive into the world of machine learning while helping to propel our rapidly growing ecosystem that already boasts over 5 million users. You’ll play a key role in integrating cutting-edge quantization methods into our existing libraries and tools, such as transformers and diffusers. If you’ve got a knack for turning complex concepts into accessible resources, you’ll thrive in this role as you create benchmarks and informative blog posts. We believe diverse teams lead to innovative solutions, so whether you check all the boxes or not, if you’re passionate about machine learning and open-source, we want to hear from you! Flexible working hours, remote options, and a commitment to your overall well-being make Hugging Face a fantastic place to kick-start your career in a supportive community that values growth and collaboration. Plus, you'll be rubbing shoulders with some of the brightest minds in the industry. Your journey with us at Hugging Face means you’ll be contributing to significant advancements in the ML field while making lasting friendships. If you’re looking for a meaningful internship where your ideas matter and your creativity shines, then this Machine Learning Engineer Internship could be the perfect fit for you. Join us and help take quantization to new heights in our vibrant open-source community!
Subscribe to Rise newsletter