Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.
Gimlet is spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware with previous successful exits.
Gimlet Labs is seeking a Software Engineer (Intern) to help develop Gimlet’s platform for deploying and monitoring AI workloads. In this role, you will be applying the latest AI techniques to develop frameworks to help generate and optimize AI workloads. You will contribute to Gimlet’s novel compilation framework for partitioning and orchestrating AI workloads across diverse hardware environments. You will design and implement scalable systems that can run production workloads of millions of requests a second.
Responsibilities:
Building, deploying and scaling AI systems for production
Evaluating and implementing cutting-edge AI research
Researching ways to improve model accuracy, performance and efficiency
Qualifications:
Currently pursuing degree in computer science, engineering, or comparable area of study
Experience with AI/ML or distributed systems.
Preferred Qualifications:
Experience with PyTorch, TensorFlow, ONNX and other AI frameworks
Familiarity with distributed systems and orchestration frameworks (e.g., Kubernetes)
Software development experience with Python and C++
Understanding of the latest AI research and techniques
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Are you passionate about artificial intelligence and ready to kickstart your career in a dynamic environment? At Gimlet Labs, we’re on a mission to revolutionize AI applications, and we want talented Software Engineer Interns to join our innovative team in San Francisco! Here, you’ll have the unique opportunity to work on groundbreaking technology that enhances AI inference efficiency, an area that’s rapidly evolving as AI workloads grow. Forget about mundane tasks; instead, you will be at the forefront of developing frameworks that generate and optimize AI workloads. Your contributions will play a vital role in our compilation framework, which manages and orchestrates AI tasks across various hardware setups. You’ll engage with some of the best minds in the field, learning from professionals with rich backgrounds in AI and distributed systems. Your responsibilities will include building, deploying, and scaling cutting-edge AI systems while diving deep into research aimed at improving model accuracy and performance. As a Software Engineer Intern at Gimlet, you should be currently pursuing a degree in computer science or a similar field and have some familiarity with AI/ML or distributed systems. If you have experience with popular AI frameworks like PyTorch or TensorFlow and can code in Python or C++, you’re already ahead of the curve. This internship not only offers you a chance to learn and grow but also to be part of a company that’s paving the way for the future of AI technology. Join us and let’s make groundbreaking innovations together!
Become a vital part of Visa's Technology Organization as a Staff Software Engineer, focusing on innovative payment solutions while working with a global client base.
Join the AI Networking Software team at Meta to lead innovations in GPU communication and optimize machine learning performance.
Join Shrikon as a Software Developer, specializing in Java, and contribute to innovative enterprise-level applications in Philadelphia.
先進的な自動運転技術を支えるため、AIを活用したフレームワークを開発するプリンシパルソフトウェアエンジニアを募集しています。
Acuity, Inc. is seeking a Senior Full Stack Developer to enhance government applications through superior user interface design and innovative technology solutions.
Join Suvoda as a Software Developer focusing on innovative solutions in clinical trials with a strong commitment to safety and integrity.
Become a Senior Software Engineer at Google, pioneering innovations in user engagement through cutting-edge technologies.
Subscribe to Rise newsletter