Job title: Software Engineer, LLM Inference Engine and Product / Member of Technical Staff
Who We Are
WaveForms AI is an Audio Large Language Model (LLM) company building the future of audio intelligence through advanced research and products. Our models will transform human-AI interactions, making them more natural, engaging, and immersive.
Role overview: The Software Engineer, LLM Inference Engine and Product will focus on developing and optimizing a real-time inference engine for multimodal large language models (LLMs) that handle audio and text inputs seamlessly. This role involves leveraging technologies such as LiveKit, RTC engines, WebRTC, and FastAPI to create an efficient, real-time API layer. You will contribute to cutting-edge AI systems that enable smooth user experiences across platforms, including iOS, Android, and desktop.
Key Responsibilities
Real-time Inference Development: Build and optimize a robust inference engine that supports multimodal LLMs, handling real-time audio and text inputs.
Technology Integration: Leverage tools like LiveKit, RTC engines, WebRTC, and FastAPI to enable low-latency, real-time communication and inference.
End-to-End Pipeline Design: Create and maintain the complete inference pipeline, from data ingestion to model serving, ensuring real-time performance.
Cross-platform Compatibility: Ensure the inference engine operates efficiently across various platforms, including mobile (iOS/Android) and desktop.
Optimization & Performance Tuning: Optimize the inference system to reduce latency, improve throughput, and enhance user experience.
API Development: Design and maintain scalable APIs to support real-time LLM interaction for diverse applications.
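To give a flavor of the kind of work the responsibilities above describe, here is a minimal, illustrative sketch of a streaming inference loop: audio chunks are consumed from a bounded queue (providing backpressure) and partial text is emitted incrementally rather than after the full utterance, which is the core of keeping perceived latency low. All names here (`fake_model_step`, `stream_inference`) are hypothetical stand-ins, not WaveForms AI code.

```python
import asyncio


async def fake_model_step(chunk: bytes) -> str:
    """Stand-in for one incremental decode step of a multimodal LLM."""
    await asyncio.sleep(0)  # yield to the event loop, as a real model call would
    return f"token_for_{len(chunk)}_bytes"


async def stream_inference(chunks):
    """Consume audio chunks and produce partial text incrementally."""
    queue: asyncio.Queue = asyncio.Queue(maxsize=8)  # bounded queue => backpressure

    async def producer():
        for c in chunks:
            await queue.put(c)  # blocks if the consumer falls behind
        await queue.put(None)  # end-of-stream sentinel

    prod = asyncio.create_task(producer())
    results = []
    while True:
        chunk = await queue.get()
        if chunk is None:
            break
        # Emit a partial result per chunk instead of waiting for the whole stream.
        results.append(await fake_model_step(chunk))
    await prod
    return results


if __name__ == "__main__":
    out = asyncio.run(stream_inference([b"\x00" * 320, b"\x00" * 640]))
    print(out)
```

In a production system the queue would be fed by a WebRTC/LiveKit audio track and the results streamed back to the client over a FastAPI WebSocket, but the incremental, backpressured shape of the loop is the same.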
Required Skills & Qualifications
Inference Engine Expertise: Proven experience in building and optimizing inference engines for multimodal AI systems, particularly combining audio and text inputs.
Technical Proficiency: Strong experience with LiveKit, RTC engines, WebRTC, and FastAPI for real-time communication and model inference.
Real-time System Design: Expertise in creating real-time pipelines and maintaining low-latency performance in production systems.
Cross-platform Development: Familiarity with iOS, Android, and desktop app development, ensuring seamless integration with inference systems.
Performance Optimization: Proficiency in optimizing inference engines to reduce latency and improve computational efficiency.
API Development: Experience in designing and maintaining APIs for real-time AI applications.
Join the innovative team at WaveForms AI as a Software Engineer, LLM Inference Engine and Product! In this role you'll be at the forefront of audio intelligence, developing and optimizing a real-time inference engine for multimodal large language models (LLMs) and helping make human-AI interactions more natural and engaging. Working with technologies like LiveKit, RTC engines, WebRTC, and FastAPI, you'll build infrastructure that seamlessly integrates audio and text inputs and performs smoothly across iOS, Android, and desktop. You'll maintain the entire inference pipeline, design scalable APIs, and fine-tune system performance to deliver the best possible user experience. If you have a passion for AI technologies and a knack for cross-platform development, come join us at WaveForms AI and be part of the transformation!