Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
AI Software Appications Engineer - Technical Lead / Principal image - Rise Careers
Job details

AI Software Appications Engineer - Technical Lead / Principal

d-Matrix has fundamentally changed the physics of memory-compute integration with our digital in-memory compute (DIMC) engine. The “holy grail” of AI compute has been to break through the memory wall to minimize data movements. We’ve achieved this with a first-of-its-kind DIMC engine. Having secured over $154M, $110M in our Series B offering, d-Matrix is poised to advance Large Language Models to scale Generative inference acceleration with our chiplets and In-Memory compute approach. We are on track to deliver our first commercial product in 2024. We are poised to meet the energy and performance demands of these Large Language Models. The company has 100+ employees across Silicon Valley, Sydney and Bengaluru.

Our pedigree comes from companies like Microsoft, Broadcom, Inphi, Intel, Texas Instruments, Lucent, MIPS and Wave Computing. Our past successes include building chips for all the cloud hyperscalers globally - Amazon, Facebook, Google, Microsoft, Alibaba, Tencent along with enterprise and mobile operators like China Mobile, Cisco, Nokia, Ciena, Reliance Jio, Verizon, AT&AT. We are recognized leaders in the mixed signal, DSP connectivity space, now applying our skills to next generation AI.  

Location

Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week.

AI Software Application Engineer – Technical lead / Principal

d-Matrix is seeking an experienced AI Applications Engineer to drive the successful deployment and support of d-Matrix’s cutting-edge AI products and solutions, specifically in the realm of generative AI inference and AI/ML software support. In this highly technical role, you will work closely with customers and internal teams to resolve complex software, hardware, and firmware challenges related to AI workloads. The ideal candidate will have expertise in AI/ML infrastructure, with a focus on inference solutions and performance optimization for data center environments. This position requires a strong blend of engineering acumen and customer-facing skills to ensure the seamless deployment and continued success of our products.

What You Will Do:

  • Customer Enablement & Support: Provide expert guidance and support to customers deploying generative AI inference models, including assisting with integration, troubleshooting, and optimizing AI/ML software stacks. Respond promptly to customer queries, perform root cause analysis, and develop timely resolutions for complex issues.

  • AI/ML Inference Optimization: Work directly with customers to understand their generative AI inference needs and deliver solutions that maximize performance across their AI workloads. Collaborate with customers to implement best practices for model deployment and inference tuning.

  • System Design & Consultation: Conduct design reviews and provide consultation on AI/ML infrastructure, focusing on optimizing systems for generative AI workloads in datacenters. Develop reference solutions and technical documentation tailored to the needs of AI/ML applications.

  • AI/ML Software Stack Installation & Validation: Lead the installation, configuration, and bring-up of d-Matrix’s AI software stack. Perform functional and performance validation testing, ensuring that generative AI models run efficiently and meet customer expectations.

  • Collaboration on Technical Collateral: Partner with internal engineering and product teams to produce developer guides, technical notes, and other supporting materials that facilitate the adoption of our AI/ML solutions by customers.

What You Will Bring:

  • Engineering degree in Electrical Engineering, Computer Engineering, Computer Science, or related field, with substantial experience in AI/ML software and infrastructure with 10+ years of experience in customer engineering and field support for enterprise-level AI and datacenter products, with a focus on AI/ML software and generative AI inference.

  • In-depth knowledge and hands-on experience with generative AI inference at scale, including the integration and deployment of AI models in production environments.

  • Experience with automation tools and scripting languages (Linux or Windows shell scripting, Python, Go) to streamline deployment, monitoring, and issue resolution processes.

  • Ability to communicate complex technical concepts to diverse audiences, from developers to business stakeholders.

Preferred Experience

  • Hands-on experience with AI/ML infrastructure accelerators (e.g., GPUs, TPUs) and expertise in optimizing performance for generative AI inference workloads.

  • Strong analytical skills with a proven track record of solving complex problems in AI/ML systems, including performance optimization and troubleshooting in AI/ML frameworks.

  • Extensive experience with the deployment of AI/ML frameworks such as PyTorch, OpenAI Triton, VLLM, and familiarity with container orchestration platforms like Kubernetes.

  • Excellent communication and presentation skills, with a demonstrated ability to guide customers through complex AI/ML system integration and troubleshooting.

Equal Opportunity Employment Policy

d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.

d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individual interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of al applicants. Thank you for your understanding and cooperation.

d-Matrix Glassdoor Company Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
d-Matrix DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of d-Matrix
d-Matrix CEO photo
Unknown name
Approve of CEO

Average salary estimate

$175000 / YEARLY (est.)
min
max
$150000K
$200000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About AI Software Appications Engineer - Technical Lead / Principal, d-Matrix

Are you passionate about pushing the boundaries of AI and technology? d-Matrix is on the lookout for an AI Software Applications Engineer - Technical Lead / Principal to join our innovative team in Santa Clara, CA. We have revolutionized memory-compute integration with our digital in-memory compute (DIMC) engine, aiming to tackle the 'holy grail' of AI compute by minimizing data movement and accelerating generative inference. With our impressive success and backing of over $154M, we're set to deliver our first commercial product in 2024. In this exciting role, you’ll be the go-to expert for our cutting-edge AI solutions, working closely with customers to solve intricate software and hardware challenges related to generative AI. You’ll also collaborate with internal teams, optimizing AI/ML software stacks and providing essential training and support to ensure seamless product deployment. Bring your extensive knowledge in AI/ML infrastructure and your love for customer engagement to this hybrid position, where you’ll be on-site in our Santa Clara office three days a week. Join us in shaping the future of AI technology, where your expertise will not only contribute to our success but will also lead the charge in deploying next-gen AI systems. If you're ready to embrace challenges and thrive in an inclusive environment committed to innovation, d-Matrix is the place for you!

Frequently Asked Questions (FAQs) for AI Software Appications Engineer - Technical Lead / Principal Role at d-Matrix
What are the key responsibilities of the AI Software Applications Engineer - Technical Lead at d-Matrix?

As an AI Software Applications Engineer - Technical Lead at d-Matrix, you will be responsible for customer enablement and support, helping deploy generative AI inference models, troubleshooting, and optimizing AI/ML software stacks. You will also focus on AI/ML inference optimization, conduct design reviews for systems tailored to AI workloads, and lead the installation and validation of our AI software stack.

Join Rise to see the full answer
What qualifications do I need to apply for the AI Software Applications Engineer position at d-Matrix?

To qualify for the AI Software Applications Engineer - Technical Lead position at d-Matrix, you should have an engineering degree in Electrical Engineering, Computer Engineering, or Computer Science, accompanied by over 10 years of experience in AI/ML software and field support for enterprise-level products, specializing in generative AI inference.

Join Rise to see the full answer
How does d-Matrix support the professional growth of an AI Software Applications Engineer?

At d-Matrix, we believe in fostering an inclusive environment that encourages continuous learning and professional development. As an AI Software Applications Engineer - Technical Lead, you will have access to various training opportunities and the chance to work alongside industry leaders, which can help broaden your skills and enhance your career trajectory.

Join Rise to see the full answer
What types of AI infrastructure tools should I be familiar with for the AI Software Applications Engineer role?

In the AI Software Applications Engineer - Technical Lead role at d-Matrix, familiarity with AI/ML infrastructure accelerators such as GPUs and TPUs, along with experience in automation tools and scripting languages like Python, is essential. Additionally, a strong understanding of deploying frameworks such as PyTorch and working with container orchestration platforms like Kubernetes is highly beneficial.

Join Rise to see the full answer
What is the work environment like for the AI Software Applications Engineer at d-Matrix in Santa Clara?

The work environment for the AI Software Applications Engineer - Technical Lead at d-Matrix is dynamic and collaborative. This hybrid role requires you to work onsite at our Santa Clara office three days a week, ensuring you have the opportunity to engage with team members directly while also enjoying some flexibility.

Join Rise to see the full answer
Common Interview Questions for AI Software Appications Engineer - Technical Lead / Principal
Can you explain your experience with AI/ML deployment in production environments?

When answering this question, focus on specific instances where you successfully deployed AI/ML models in production. Discuss the challenges you encountered, the strategies you employed for optimization, and any tools or frameworks used to achieve seamless integration.

Join Rise to see the full answer
How do you approach troubleshooting complex software and hardware issues?

Provide examples of your systematic approach to troubleshooting. Highlight how you identify root causes, the strategies you use to test and validate possible solutions, and any collaboration with cross-functional teams to resolve issues effectively.

Join Rise to see the full answer
What methods do you use to optimize AI workloads for performance?

Discuss the techniques you’ve implemented to optimize AI workloads, including profiling, benchmarking, and using specific tools or algorithms that enhance performance. Mention any real-world examples of significant performance gains achieved through your efforts.

Join Rise to see the full answer
How do you stay updated with advancements in AI/ML technologies?

Explain the resources and methods you utilize to keep your knowledge current, such as attending industry conferences, reading research papers, participating in webinars, or engaging in online communities related to AI/ML.

Join Rise to see the full answer
Describe your experience with customer-facing roles in technical support.

Share examples that highlight your communication skills and ability to translate complex technical concepts into understandable terms for diverse audiences. Mention specific instances where your support led to customer success and satisfaction.

Join Rise to see the full answer
What do you consider best practices for deploying generative AI models?

Outline your approach to generative AI model deployment, including considerations for hardware, software stack configurations, compliance, and monitoring. Discuss how implementing best practices can lead to improved efficiency and reliability in production environments.

Join Rise to see the full answer
Can you give an example of a time when you overcame a significant challenge in a project?

Provide a detailed narrative focusing on the challenge faced, your thought process in addressing it, the steps you took, and the ultimate resolution. Emphasize your problem-solving skills and resilience.

Join Rise to see the full answer
What tools do you prefer for automating deployment processes?

Discuss the automation tools you are familiar with, such as Ansible, Terraform, or scripting languages. Share how these tools have improved efficiency in your projects and any specific examples of their successful use.

Join Rise to see the full answer
How do you ensure that AI solutions meet customer expectations?

Explain your methods for gathering customer feedback, setting expectations, and conducting validation tests to ensure solutions meet their needs. Emphasize your focus on collaboration with clients throughout the deployment process.

Join Rise to see the full answer
What do you know about d-Matrix and its approach to AI technologies?

Research d-Matrix's mission and its groundbreaking work in memory-compute integration. Demonstrate your understanding of the company's AI strategies and emphasize how your background aligns with their innovative goals.

Join Rise to see the full answer
Similar Jobs
Posted 11 days ago
Photo of the Rise User
Posted 6 days ago
Photo of the Rise User
Auria Hybrid No location specified
Posted 11 days ago
Posted 13 days ago
Photo of the Rise User
Experian Remote ., ., ., United States
Posted 7 days ago
Photo of the Rise User
Posted yesterday
Photo of the Rise User
Youtap Limited Remote No location specified
Posted yesterday
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
December 18, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!