About Trace Machina:
Trace Machina is transforming the software development lifecycle with NativeLink, a high-performance build caching and remote execution system. NativeLink accelerates software compilation and testing processes while reducing infrastructure costs, enabling organizations to optimize their build workflows. We work with clients of all sizes to help them scale and streamline their build systems efficiently and effectively.
We are looking for an innovative and driven Data Scientist with a focus on AI/ML to join our team. As a key member of our team, you will apply your data science expertise to enhance NativeLink’s capabilities, from optimizing build processes to developing machine learning models that improve performance, scalability, and efficiency.
Job Description:
As a Data Scientist focusing on AI/ML at Trace Machina, you will work on developing, testing, and implementing machine learning models and algorithms to solve complex problems related to software build optimization and testing. You will work closely with engineering teams to improve the performance of NativeLink’s platform and collaborate with data engineers to ensure robust data pipelines and infrastructure. Your work will help build intelligent systems that make software development faster, more reliable, and more cost-efficient.
Job Responsibilities:
Design, implement, and deploy machine learning models to optimize software build systems, including caching, task distribution, and execution workflows
Work with large datasets to identify patterns, anomalies, and insights that inform decisions for improving build processes and remote execution
Develop predictive models to optimize build times, cache hit rates, and system resource utilization
Conduct experiments to improve the efficiency of build systems through data-driven decisions, leveraging AI/ML techniques such as reinforcement learning and optimization
Collaborate with cross-functional teams (engineering, product, and operations) to translate business problems into AI/ML-driven solutions
Analyze customer usage data to identify opportunities for feature improvements and innovations within the NativeLink platform
Develop custom algorithms for performance monitoring, anomaly detection, and optimization of CI/CD pipelines
Build, test, and validate machine learning models using a variety of techniques, ensuring they are scalable, robust, and interpretable
Build and maintain data pipelines to support model training, testing, and deployment in production environments
Communicate findings and insights to both technical and non-technical stakeholders in a clear and actionable way
Required Skills and Experience:
3+ years of experience as a Data Scientist, with a strong focus on AI and machine learning
Expertise in machine learning algorithms, data analysis, and statistical modeling techniques
Proficiency in Python, R, or other data science programming languages, with experience using libraries such as TensorFlow, PyTorch, Scikit-learn, and Pandas
Strong knowledge of deep learning, reinforcement learning, or other advanced AI techniques
Experience with large-scale data processing, including working with big data technologies (e.g., Spark, Hadoop)
Familiarity with cloud infrastructure (AWS, GCP, Azure) and deploying machine learning models in production
Strong understanding of data wrangling, feature engineering, and building predictive models
Experience with version control (Git) and working in collaborative environments
Excellent problem-solving skills and ability to generate actionable insights from data
Ability to communicate complex AI/ML concepts effectively to both technical and non-technical teams
Nice to Have:
Experience with build systems or CI/CD pipeline optimization
Background in natural language processing (NLP) or time-series forecasting for predictive analytics
Familiarity with containerization tools like Docker and Kubernetes for deploying AI models
Experience in AI model explainability and interpretability
Published research or contributions to open-source machine learning projects
Why Join Trace Machina?
Work with cutting-edge AI and machine learning technologies to optimize high-performance build systems
Collaborate with a talented and innovative team of engineers, data scientists, and product managers
Shape the future of software build processes for leading companies around the world
Competitive salary and benefits package
Opportunities for career growth, professional development, and continuous learning
If you're passionate about applying AI/ML to solve real-world problems in software development, we’d love to hear from you!
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Northwestern Medicine seeks a Data Integrity Coordinator to maintain patient data accuracy and support clinical system integrity in a dynamic hospital environment.
A community-focused organization seeks an HMIS Associate to manage data quality, reporting, and system compliance in a hybrid full-time role based in Albany, NY.
Experienced data leader needed to spearhead Govini’s data strategy and product development for national security-focused analytics.
Lead LinkedIn's Sales Operations analytics team to deliver innovative data solutions and strategic insights in a dynamic hybrid work setting.
Lead the development of trusted, scalable product-data infrastructure as a senior Business Intelligence analytics engineer at Plaid.
Lead Informatica data governance and integration efforts using IDMC, cloud platforms, and ETL/ELT best practices.
Lead Finance Data Governance initiatives at American Express as a Manager focused on metadata solutions and compliance to drive trusted financial data.
Lead Sandisk's enterprise data management and AI initiatives as a senior director focused on strategy, governance, and digital innovation.
Lead the design and implementation of scalable data architecture as a Lead Data Engineer at DoseSpot, pushing forward innovation in healthcare software.
Leading TetraScience's Scientific Data Engineering team, the Tech Lead will architect AI-native data solutions and champion delivery excellence.
A leading fintech company is hiring a Staff Data Engineer to develop and optimize modern data platforms supporting critical financial operations in a fully remote US-based role.
Lead Paddle’s analytics engineering team to build scalable data solutions that drive informed business decisions across global markets.
Data Integrity Specialist II role at Intermountain Health to ensure accuracy of patient records and support critical patient safety initiatives.
Subscribe to Rise newsletter