At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration.
We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals who are passionate about tackling challenges and driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI.
Location:
Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week.
The role: AI Systems Solutions Architect
What you will do:
d-Matrix is looking for an AI Systems Solutions Architect to develop world-class products around d-Matrix inference accelerators. In this role, you will engage with key customers, internal architects, and other internal and external stakeholders to drive overall system solutions. This requires technical analysis, defining outside-in use cases, and using a broad spectrum of technologies to drive an AI server system solution spanning silicon, platform HW/SW, and usages to deliver the best customer experience with d-Matrix inference accelerators.
Design, develop, and deploy scalable GenAI inference solutions with d-Matrix accelerators.
Work closely with team members across architecture, engineering, product management, and business development to optimize d-Matrix system solutions for the best balance of performance and power, feature set, and overall system cost.
Work closely with datacenter, OEM, and ODM customers during the early product concept and planning phases to enable system design with partners and the industry ecosystem.
Influence and shape the future generations of products and solutions by contributing to the system architecture and technology through the early engagement cycle with customers and industrial partners.
Stay abreast of the latest advancements in GenAI hardware and software technologies and assess their suitability for integration into d-Matrix GenAI inference solutions.
Establish credibility with both engineering and leadership counterparts at top technology companies, communicate technical results and positions clearly and accurately, and drive alignment on solutions.
What you will bring:
15+ years of industry experience and an engineering degree in Electrical Engineering, Computer Engineering, or Computer Science.
5+ years of AI server system experience across multiple projects, spanning architecture, development, and design (including memory, I/O, power delivery, power management, boot process, FW, and BMC/hardware management) through bring-up, validation, and release to production.
5+ years of experience in a customer-facing role interfacing with OEMs, ODMs and CSPs.
Detailed understanding of industry-standard server buses and interfaces, such as DDR, PCIe, CXL, and other high-speed I/O protocols, is required.
Ability to work seamlessly across engineering disciplines and geographies to deliver excellent results.
Deep understanding of datacenter AI infrastructure requirements and challenges.
Preferred:
Hands-on understanding of AI/ML infrastructure and hardware accelerators
Experience with leading AI/ML frameworks such as PyTorch, TensorFlow, and ONNX, and with container orchestration platforms such as Kubernetes
Outstanding communication and presentation skills
Equal Opportunity Employment Policy
d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.
d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individuals interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of all applicants. Thank you for your understanding and cooperation.
Are you an innovative thinker ready to dive into the exciting world of AI? As an AI Systems Solutions Architect at d-Matrix, based in Santa Clara, CA, you'll play a critical role in shaping the future of generative AI technologies. At d-Matrix, we pride ourselves on pushing the boundaries of software and hardware innovation, carving out new paths and possibilities in the realm of AI. In this role, you will help develop world-class products using our cutting-edge inference accelerators while collaborating with a diverse and inclusive team. Your responsibilities will include designing and deploying scalable GenAI inference solutions, working closely with customers and internal stakeholders to optimize system performance and cost. As you influence future product generations, you will need to stay on top of the latest advancements in AI technologies. If you're ready to face challenges head-on, communicate your ideas effectively, and drive exciting solutions to fruition, we want to hear from you! Enjoy a hybrid work environment where you'll be onsite at our Santa Clara headquarters three days a week, collaborating with your fellow AI enthusiasts.