Job details

Software Engineer, LLM Inference Engine and Product

Job title: Software Engineer, LLM Inference Engine and Product / Member of Technical Staff

Who We Are
WaveForms AI is an Audio Large Language Models (LLMs) company building the future of audio intelligence through advanced research and products. Our models will transform human-AI interactions, making them more natural, engaging, and immersive.

Role overview: The Software Engineer, LLM Inference Engine and Product will focus on developing and optimizing a real-time inference engine for multimodal large language models (LLMs) that handle audio and text inputs seamlessly. This role involves leveraging technologies such as LiveKit, RTC engines, WebRTC, and FastAPI to create an efficient, real-time API layer. You will contribute to cutting-edge AI systems that enable smooth user experiences across platforms, including iOS, Android, and desktop.

Key Responsibilities

  • Real-time Inference Development: Build and optimize a robust inference engine that supports multimodal LLMs, handling real-time audio and text inputs.

  • Technology Integration: Leverage tools like LiveKit, RTC engines, WebRTC, and FastAPI to enable low-latency, real-time communication and inference.

  • End-to-End Pipeline Design: Create and maintain the complete inference pipeline, from data ingestion to model serving, ensuring real-time performance.

  • Cross-platform Compatibility: Ensure the inference engine operates efficiently across various platforms, including mobile (iOS/Android) and desktop.

  • Optimization & Performance Tuning: Optimize the inference system to reduce latency, improve throughput, and enhance user experience.

  • API Development: Design and maintain scalable APIs to support real-time LLM interaction for diverse applications.

Required Skills & Qualifications

  • Inference Engine Expertise: Proven experience in building and optimizing inference engines for multimodal AI systems, particularly combining audio and text inputs.

  • Technical Proficiency: Strong experience with LiveKit, RTC engines, WebRTC, and FastAPI for real-time communication and model inference.

  • Real-time System Design: Expertise in creating real-time pipelines and maintaining low-latency performance in production systems.

  • Cross-platform Development: Familiarity with iOS, Android, and desktop app development, ensuring seamless integration with inference systems.

  • Performance Optimization: Proficiency in optimizing inference engines to reduce latency and improve computational efficiency.

  • API Development: Experience in designing and maintaining APIs for real-time AI applications.

What You Should Know About Software Engineer, LLM Inference Engine and Product, WaveForms AI

Join the innovative team at WaveForms AI as a Software Engineer, LLM Inference Engine and Product! In this exciting role, you’ll be at the forefront of audio intelligence, helping to develop and optimize a real-time inference engine for multimodal large language models (LLMs). Imagine being part of a company that is set to revolutionize human-AI interactions by creating more natural and engaging experiences. Your responsibilities will include leveraging cutting-edge technologies like LiveKit, RTC engines, WebRTC, and FastAPI to build a robust inference engine that seamlessly integrates audio and text inputs. This position isn't just about coding; it's about crafting an infrastructure that supports real-time communication and ensures smooth performance across platforms such as iOS, Android, and desktop. You'll be in charge of maintaining the entire inference pipeline, designing scalable APIs, and fine-tuning system performance to provide the best possible user experience. If you have a passion for AI technologies and a knack for cross-platform development, we invite you to contribute to groundbreaking projects that shape the future of AI in audio. Your expertise will play a crucial role in enabling our advanced systems to perform flawlessly and efficiently. Come join us at WaveForms AI and be part of the transformation!

Frequently Asked Questions (FAQs) for Software Engineer, LLM Inference Engine and Product Role at WaveForms AI
What are the responsibilities of a Software Engineer, LLM Inference Engine and Product at WaveForms AI?

As a Software Engineer, LLM Inference Engine and Product at WaveForms AI, your primary responsibilities will include building and optimizing inference engines for multimodal AI systems, designing a complete inference pipeline, ensuring cross-platform compatibility, and developing scalable APIs for real-time LLM interaction. You'll also focus on performance optimization to enhance user experience.

What qualifications do I need to be a Software Engineer, LLM Inference Engine and Product at WaveForms AI?

To qualify for the Software Engineer, LLM Inference Engine and Product position at WaveForms AI, you should have proven experience in building and optimizing inference engines, strong proficiency with technologies like LiveKit, WebRTC, and FastAPI, and the ability to design real-time systems. Knowledge of cross-platform app development is also crucial.

What technologies are important for a Software Engineer, LLM Inference Engine and Product at WaveForms AI?

Key technologies for the Software Engineer, LLM Inference Engine and Product role at WaveForms AI include LiveKit for real-time communications, WebRTC and RTC engines for low-latency interactions, and FastAPI for building APIs. Familiarity with audio processing tools and frameworks will also be beneficial.

What kind of projects will I work on as a Software Engineer, LLM Inference Engine and Product at WaveForms AI?

In this role at WaveForms AI, you will work on cutting-edge AI systems aimed at enhancing human-AI interactions and developing real-time inference engines that integrate audio and text inputs. You’ll contribute to multi-platform applications, focusing on delivering smooth and engaging user experiences.

What role does performance optimization play in the Software Engineer, LLM Inference Engine and Product position at WaveForms AI?

Performance optimization is a vital aspect of the Software Engineer, LLM Inference Engine and Product position at WaveForms AI, where you'll be tasked with reducing latency and improving throughput. This ensures that the real-time inference systems provide efficient and high-quality interactions, crucial for user satisfaction.

Common Interview Questions for Software Engineer, LLM Inference Engine and Product
Can you explain your experience with building inference engines for multimodal LLMs?

In answering this question, provide detailed examples of projects you've worked on, highlighting your role in developing inference systems that handle both audio and text inputs. Discuss the challenges you faced and how you overcame them.

What tools and technologies do you prefer for real-time system design?

When discussing your preferences, mention tools like LiveKit, WebRTC, and FastAPI, explaining why they are suitable for low-latency communication and how you have utilized them in past projects.

How do you ensure cross-platform compatibility in your development work?

Explain your approach to creating systems that work seamlessly across different platforms. Discuss strategies like responsive design, continuous integration practices, and testing on various devices to ensure a smooth user experience.

What strategies do you use for performance tuning an inference engine?

Share specific techniques you've applied for optimizing latency and throughput, such as profiling tools, algorithm improvements, or resource management tactics to achieve peak performance.

Could you detail a project where you had to design an API for real-time AI applications?

Provide a narrative about your role in the project, the design decisions made, and how the API facilitated real-time interactions. Highlight the challenges faced and the solutions implemented.

Describe your experience with working on real-time audio processing systems.

In your response, talk about the systems you've developed or contributed to that involved real-time audio processing. Discuss the technologies used and the specific functionalities you implemented.

How do you approach debugging in real-time systems?

Outline your debugging process, emphasizing the importance of monitoring, logging, and how you use analytics to identify and fix performance issues swiftly.

What methodologies do you follow in maintaining code quality for large-scale projects?

You'll want to talk about practices like code reviews, automated testing, and CI/CD pipelines that help maintain high code quality in large projects.

How do you stay updated with advancements in audio intelligence and multimodal AI?

Share your approach to continuous learning, such as attending workshops, following industry publications, or participating in relevant communities to keep abreast of new technologies and developments.

What excites you most about working as a Software Engineer at WaveForms AI?

This is an opportunity to express your enthusiasm for audio AI technologies and how you see your contributions elevating the user experience. Discuss your passion for innovation and the company's vision.

Dental Insurance
Disability Insurance
Vision Insurance
Paid Holidays
Employment type: Full-time, remote

Date posted: December 9, 2024
