Our client is a publicly traded company at the forefront of the AI revolution, offering an AI-centric cloud platform that's reshaping the landscape of artificial intelligence. The company provides cutting-edge infrastructure, including large-scale GPU clusters, cloud platforms, tools, and services for developers to service the explosive growth of the global AI industry for Fortune 1000 companies, top-tier innovative startups, and AI researchers.
Company type: Publicly traded
Industry: AI/ML, Cloud Computing, Infrastructure-as-Code
Candidate Location: Remote U.S.
Their mission is to democratize access to AI infrastructure and empower organizations to create, optimize, and deploy AI solutions at any scale. They aim to simplify the complexities of AI development by providing a full-stack AI platform that combines powerful hardware with user-friendly tools and services.
We are seeking a Senior AI/ML Specialist Solutions Architect to join our client's team. This role offers the chance to design and implement scalable AI solutions for AI-focused customers, working with state-of-the-art technologies and contributing to one of the most powerful commercially available supercomputers.
Architect and optimize distributed training and inference systems for large-scale AI models
Design and deliver customer-focused solutions that maximize performance and business value
Lead the transition of ML pipelines from POC to scalable production systems
Build long-term customer relationships, ensuring satisfaction and alignment with strategic goals
Create whitepapers, deliver technical presentations, and host webinars to share insights and best practices
Provide technical leadership and mentor teams on AI infrastructure and deployment strategies
Collaborate with engineering and product teams to prioritize customer feedback and influence product roadmaps
5+ years of experience with cloud technologies and infrastructure, ideally in senior MLOps or Solutions Architect roles
Proven expertise in scaling and optimizing AI workloads across multi-node and multi-GPU environments
Demonstrated success delivering ML products, scaling from POC to production
Deep knowledge of ML frameworks like PyTorch and JAX
Strong background in the NVIDIA HPC ecosystem (CUDA, NCCL, Infiniband)
Active involvement in the ML community (public speaking, open-source contributions, competitions like Kaggle and Hackathons)
Exceptional communication skills to engage both technical teams and business stakeholders
Programming Languages: Python, Go, Java, C++
Infrastructure as Code (IaC): Terraform, Ansible
Orchestration: Kubernetes (K8s), Slurm
DevOps Tools: Git, Docker, Helm
Big Data Frameworks: Spark, Kafka, Hadoop
Databases: SQL, NoSQL, and vector databases
ML Frameworks: PyTorch, TensorFlow, JAX, HuggingFace, Scikit-learn
Competitive compensation: $180,000 - $300,000 per year (negotiable based on experience and location)
Full medical benefits: 100% company-paid medical, dental, and vision coverage for employees and families
401(k) plan with a 4% match program
Stock options plan
Flexible remote work environment
Company-paid short-term, long-term disability, and life insurance coverage
20 weeks paid parental leave for primary caregivers, 12 weeks for secondary caregivers
Up to $85/month for mobile and internet
Work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs
Be part of a team that operates one of the most powerful commercially available supercomputers
Contribute to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings
Level 1 - Interview with Talent Acquisition
Level 2 - Interview with the Hiring Manager
Level 3 - Technical Assessment
Reference and Background Checks: conducted after successful interviews
Job Offer: provided to the selected candidate
We are proud to be an equal opportunity workplace and are committed to equal employment opportunity regardless of race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity, or expression, sexual orientation, or any other characteristic protected by applicable federal, state or local law.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Drive AI innovation as a Cloud Solutions Architect supporting advanced GPU cloud infrastructure for a leading publicly traded AI company.
Innovative YC-backed AI startup in San Francisco hires a founding GTM Account Executive to lead and build its sales function from the ground up.
Maxar Technologies seeks a Lead DevOps Engineer with extensive experience to develop and integrate intelligence capabilities via hybrid cloud and local infrastructure deployments.
Lead Kanbrick's AI and technology initiatives as VP, driving strategic adoption and scalable implementation across midsize operating companies.
The Application Help Desk Support Specialist will drive IT service improvements and provide expert support within LLNL’s secure and mission-driven environment.
Experienced Systems Administrator needed to oversee hospital IT infrastructure ensuring secure and efficient operations at St Mary of Nazareth Hospital in Chicago.
Support and optimize cloud and on-premise database systems as a Jr. Cloud DBA at a values-driven company committed to career growth and development.
An experienced Provider System Analyst II role at OHCA to optimize and support healthcare provider systems through analysis, testing, and process improvements.
Provide hands-on IT support and system maintenance in a dynamic, fast-paced casino environment as a full-time IT Support Technician.
Integres, LLC is looking for a Junior Database Developer to build and enhance databases and reporting tools that support operational and maintenance metrics.
Lead IT business systems analysis and solutions for Legal at Gilead Sciences to support and enhance business processes and technology.
Lead Linux system administration and provide technical support for engineering labs at UMBC in this hybrid full-time role.
Lead and develop the IT Support team at Uncommon Schools, overseeing technology services and data management to support educational success in underserved communities.
Contribute to advancing healthcare technology by providing expert onsite technical support and system infrastructure management at Medtronic's Brooklyn Center site.
Chime is seeking an IT Support Technician to deliver frontline technical support and empower employees with effective IT solutions within a dynamic financial technology firm.
Subscribe to Rise newsletter