Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Software Engineer - ML Research Platform image - Rise Careers
This job is expired We're automatically mark job as expired after 180 days of its inactivity
Job details

Software Engineer - ML Research Platform

  • Please note, the role is remote and candidates should be based in Poland.
The Opportunity
insitro’s machine learning research platform is central to our approach to rethinking drug discovery. Our tools empower a team of 25+ data scientists and engineers to conduct cutting-edge applied ML research with diverse types of biological data. We provide the foundations for reproducible ML experimentation, including frameworks for defining and running experiments, tools for experiment tracking and hyperparameter search, and primitives for constructing inference pipelines. We also develop tooling to support rapid experimentation in Jupyter notebooks by diverse sets of users including both software engineers and wet lab scientists. Our tools directly enable insitro’s data science and ML engineering teams to train and evaluate ML models on multi-petabyte collections of biological data spanning high content imaging, functional genomics, and biomolecular structures. You will work as part of a team to define, build, and improve key components of insitro’s ML experimentation platform, elevating the rigor and efficiency of ML research company-wide.
This is a unique role that sits at the interface of insitro’s software engineering and data science teams, weaving together the fabric of tools, systems, and interfaces that enable ML-powered discoveries from our large-scale biological data collections. An ideal candidate has experience implementing and training ML models allowing them to relate to ML researchers, while also having significant software engineering craft enabling them to design and implement extensible ML systems. The role does not focus on conducting research but rather developing novel tools and capabilities that enable researchers to be more productive. While not required, some knowledge of biological or chemical data is especially valuable in understanding the unique requirements and applications of ML to biology and drug discovery.
You will be joining a biotech startup that has long-term stability due to significant funding, providing many opportunities for meaningful impact. You will work closely with a very talented team, learn a broad range of skills, and help shape insitro’s culture, strategic direction, and outcomes. Join us, and help make a difference to patients!

About You
  • BS, MS, or Ph.D. in computer science, statistics, mathematics, physics, engineering, or equivalent practical experience
  • Expertise in one or more general-purpose programming languages (strong preference for significant experience in scientific Python; Java, Scala, C/C++, and Go are also relevant)
  • Demonstrated ability to critique, design and implement ML abstractions that balance experimental flexibility with constraints that enable reusability and portability
  • Experience training DNNs in PyTorch or TensorFlow, including knowledge of key performance metrics for common tasks, diagnosing learning curves, and troubleshooting optimization dynamics
  • Familiarity with current approaches to distributed training and inference
  • Knowledge of performance characteristics of modern GPUs and other hardware accelerators, experience troubleshooting CUDA/cuDNN/GPU drivers running in containers, and experience with profiling GPU code to identify potential performance improvements
  • Ability to empathize with diverse ML platform users, balancing proposing pragmatic fixes to support short-term experimental iteration with identifying non-obvious underlying needs and designing longer-term solutions
  • Comfort with the ambiguity and changing requirements of supporting early-stage ML research
  • Ability to identify and lead redesigns of ML code to support reusability, robustness and readability
  • Experience making buy-vs-build decisions and evaluating third-party ML tools (commercial and/or open source), and exposure to managing relationships with software vendors
  • Passion for making a difference in the world

Nice to Have
  • Experience with optimizing datasets and file formats for ML use cases (e.g. HDF5, Parquet, Zarr, etc), and/or using database or distributed query systems (e.g. PostgreSQL/MySQL, Presto/Athena/BigQuery, etc)
  • Experience with image, molecular structure, genetic, or genomic data modalities
  • Previous open-source contributions or publications demonstrating impact in relevant projects

Benefits at insitro
  • Highly competitive salary
  • Health insurance benefits
  • Gym allowance
  • Flexible work schedule
  • Home office equipment

GDPR
The Controller of your personal data is Insitro, Inc., with offices at 279 East Grand Avenue, South San Francisco, California, United States. Your personal data is processed for the purposes of the current recruitment process. Providing your personal data is voluntary, but its processing and transfer to the United States by or on behalf of Insitro, Inc. is necessary for this purpose. You have the right to access, correct, modify, update, rectify, and request the transfer or deletion of your personal data.
You hereby consent to Insitro, Inc., with offices at 279 East Grand Avenue, South San Francisco, California, United States, retaining and processing your personal data after the current recruitment process is finished, for the purposes of future recruitment processes. You have the right to withdraw this consent at any time by sending a notification to recruiting@insitro.com.
About insitro
insitro is a data-driven drug discovery and development company using machine learning and data at scale to transform the way that drugs are discovered and developed for patients. insitro is developing predictive machine learning models to discover underlying biologic state based on human cohort data and in-house generated cellular data at scale. These predictive models can be brought to bear on key bottlenecks in pharmaceutical R&D to advance novel targets and patient biomarkers, design therapeutics, and inform clinical strategy. insitro is advancing a wholly owned and partnered pipeline of biologic insights and molecules in neuroscience and metabolic diseases. Since formation in mid 2018, insitro has raised over $700 million from top tech, biotech, and crossover investors and from collaborations with pharmaceutical partners. For more information on insitro, please visit the company’s website at www.insitro.com.
insitro Glassdoor Company Review
3.8 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
insitro DE&I Review
3.6 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
CEO of insitro
insitro CEO photo
Unknown name
Approve of CEO
MATCH
Calculating your matching score...
FUNDING
TEAM SIZE
DATE POSTED
August 12, 2022

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
Other jobs
Company
Bayer Hybrid Leverkusen, Germany
Posted 9 months ago
Company
AMD Hybrid Santa Clara, CA
Posted last year
Company
Posted 5 months ago