AryaXAI stands at the forefront of AI innovation, revolutionizing AI for mission-critical businesses by building explainable, safe, and aligned systems that scale responsibly. Our mission is to create AI tools that empower researchers, engineers, and organizations to unlock AI's full potential while maintaining transparency and safety.
Our team thrives on a shared passion for cutting-edge innovation, collaboration, and a relentless drive for excellence. At AryaXAI, everyone contributes hands-on to our mission in a flat organizational structure that values curiosity, initiative, and exceptional performance.
Requirements:
Core Languages: CUDA, Python
Frameworks: CUTLASS, pybind11 (or similar)
Tools: Nsight, JAX/XLA bindings
Focus Areas: GPU Kernel Optimization, Deep Learning Inference & Training
Role Overview
We are looking for a highly skilled AI Researcher - GPU Kernel Developer to join our team and push the boundaries of high-performance AI computation. In this role, you will design, develop, and optimize GPU kernels that power state-of-the-art AI models. Your work will directly influence the performance and scalability of our AI systems.
Key Responsibilities:
Develop and refine low-level CUDA kernel optimizations for deep learning inference and training.
Profile, debug, and optimize single and multi-GPU operations using tools like Nsight.
Deeply understand and exploit GPU memory hierarchies and computational capabilities.
Implement cutting-edge methods from research papers into CUDA kernels.
Collaborate on designing innovative solutions to achieve peak GPU performance.
Ideal Candidate Profile
We are looking for candidates with a proven track record of excellence in GPU programming and AI system optimization. You should bring:
Core Experiences:
Expertise in designing high-performance GeMM CUDA kernels using Tensor cores or CUDA cores, leveraging tools like CuTe or CUTLASS.
Proficiency in extending or writing custom attention and deep learning kernels from scratch.
Confidence in writing both forward and backward kernels while managing floating-point precision errors.
Strong optimization skills for both memory-bound and compute-bound operations.
Advanced knowledge of GPU architecture, including register pressure, shared-memory usage, and GPU utilization.
Preferred Skills:
Familiarity with profiling tools (e.g., Nsight) to identify bottlenecks and improve performance.
Experience integrating custom kernels with frameworks like JAX/XLA through tools like pybind11.
Awareness of the latest advancements in GPU optimization techniques for AI workloads.
Why Join AryaXAI?
Mission-Driven Impact: Work on challenges that shape the future of responsible AI.
Technical Excellence: Collaborate with a team of passionate and experienced professionals.
Growth Opportunities: Contribute across domains and expand your expertise in GPU kernel development and AI research.
Flexible Work Environment: Choose between remote work or relocation support to one of our key offices.
Interview Process
Application Review: We review your CV and a statement of exceptional work.
Initial Interview (15 Minutes): A technical team member will evaluate your basic skills and fit for the role.
Main Process:
Coding Assessment: Solve programming challenges in your preferred language.
Systems Problem-Solving: A live, hands-on session to showcase practical expertise.
Project Deep-Dive: Present your most notable project to our team.
Team Meet & Greet: Engage with the broader AryaXAI team.
Note: Our interviews are designed to conclude within one week to streamline your onboarding process.
Lead innovation in downstream manufacturing at Takeda as a Senior Staff Engineer focusing on process improvement, troubleshooting, and mentorship in a hybrid work environment.
A cutting-edge synthetic biology startup is looking for a Scientist experienced in CHO cell line development to drive innovative biologic therapeutic workflows in a hybrid Boston lab environment.
A leading battery materials company invites a Senior Adhesive Engineer to lead adhesive formulation, validation, and scale-up efforts in San Jose.
An entry-level Bioanalytical Assay Development Scientist position at Eurofins PSS focused on assay development and LC-MS/MS analysis in Groton, CT.
Lead advanced biophysical characterization efforts at Kite Pharma to drive innovation in viral vector and T cell therapy product quality and development.
Contribute your game design expertise and AI knowledge to help RYZ Labs train and refine AI models for realistic, adaptive gameplay experiences in a fully remote role.
Waters Corporation seeks a dynamic LC-MS Development Scientist to drive innovation in clinical mass spectrometry systems within their newly formed Clinical Systems Development team.
Lead GSK’s digital transformation in automation by driving innovative solutions that enhance research workflows and data management within their RTech division.
Contribute to cutting-edge quantum device development at Vector Atomic as an adaptable R&D Engineer working on novel navigation and communication technologies.
Lead Syngenta’s Vero Beach Research Center R&D group driving innovation in turf and urban pest science to deliver advanced crop protection solutions.
Participate in Susquehanna's hands-on Quantitative Strategy Developer Summer Internship, combining trading desk work with deep financial markets education.
Eurofins Scientific is looking for an experienced Analytical Chemist skilled in LC-MS systems to join their Boston-based R&D team for advanced analytical testing and method development.
An opportunity at OHSU Knight Cancer Institute to innovate in cancer diagnostic technology as a Postdoctoral Researcher specializing in Biomedical Engineering or Cancer Biology.
Subscribe to Rise newsletter