Job details

Member of technical staff (Post-training)

About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential.

H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning and collaboration, where everyone has something to contribute.

Holistic, Humanist, Humble.

About the Team: The Post-training team focuses on enhancing pre-trained models for use in H products, particularly by improving instruction following and tool use abilities.

This is achieved by learning from extensive user and machine feedback, driving improvements through reward modeling and scaling feedback-driven training.

Our team operates at the intersection of research and product, developing post-training methods to improve H-models' ability to solve complex tasks, make decisions, and interact with dynamic environments.

Key Responsibilities:

Research and develop post-training methods across our research stack to enhance the instruction following and tool use abilities of H models targeting next-generation capabilities
Design and implement automated data collection pipelines for reward modeling and large-scale reinforcement learning
Build robust evaluations for tracking modeling improvements
Transform learnings from use cases into post training methods to improve H-models
Collaborate closely with the other research research and product teams

Requirements:

Technical skills
- Proficient in Python & Git
- Expert in at least one deep learning framework (PyTorch, JAX, TensorFlow)
- Experience training large models on a distributed infrastructure
- Hands-on experience with LLM post-training, alignment and large-scale reinforcement learning
Research skills
- Publications in top-tier AI conferences (e.g. NeurIPS, ICML, CVPR, ACL, ICCV)
- PhD or MSc with equivalent experience, in machine learning, deep learning, natural language processing, or related field
Soft skills
- Team player
- Strong communication and presentation skills to articulate complex ideas clearly
Bonuses
- Industry experience is a plus
- Experience in LLM post-training, RL

Location:

H's teams are distributed throughout France, the UK, and the US
This role has the potential to be fully remote or hybrid for candidates based in cities where we have an office - currently Paris and London
The final decision for this will lie with the hiring manager for each individual role

What We Offer:

Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
Enjoy a competitive salary
Unlock opportunities for professional growth, continuous learning, and career development

If you want to change the status quo in AI, join us.

Average salary estimate

$110000 / YEARLY (est.)

min

max

$90000K

$130000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Member of technical staff (Post-training), H Company

Are you ready to embark on an exciting journey in the world of artificial intelligence? H is thrilled to announce an opening for a Member of Technical Staff (Post-training) to join our innovative team. At H, we are committed to pushing the boundaries of superintelligence with agentic AI, and our Post-training team plays a crucial role in enhancing our pre-trained models for use in H products. We’re all about collaboration, learning, and fostering a workplace where ideas flourish. As a Member of Technical Staff, you will lead research and development efforts targeting cutting-edge capabilities, improving instruction following, and tool usage abilities of H models. You’ll design automated data collection pipelines for reward modeling and large-scale reinforcement learning, ensuring that our models are consistently turning insights into actionable improvements. With your proficiency in Python and experience with deep learning frameworks, you’ll not only evaluate performance metrics but also leverage your research acumen to transform user feedback into valuable post-training methodologies. Your ability to communicate complex ideas clearly will be key in collaborating with interdisciplinary teams and driving impactful changes. This role offers the flexibility of being fully remote or hybrid, with the opportunity to work within our diverse teams based in vibrant cities such as Paris and London. If you are excited about shaping the future of AI in a fun and dynamic environment, where continuous learning is part of the culture, then we’d love for you to be a part of H!

Frequently Asked Questions (FAQs) for Member of technical staff (Post-training) Role at H Company

What are the key responsibilities of a Member of Technical Staff (Post-training) at H?

As a Member of Technical Staff (Post-training) at H, you will be responsible for researching and developing post-training methods that enhance the capabilities of our models. This includes designing automated data collection pipelines for reward modeling and reinforcement learning, evaluating modeling improvements, and integrating user feedback into practical training methodologies.