Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Performance engineer image - Rise Careers
Job details

Performance engineer

📐 About this role 
Writer is seeking a highly skilled and motivated Principal Performance Engineer to lead the performance optimization of our cutting-edge Generative AI technology stack. This role is critical in ensuring the scalability, efficiency, and reliability of our Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) systems. You will be a key driver in identifying and resolving performance bottlenecks, optimizing resource utilization, and ensuring a seamless user experience. You will work closely with our AI research, software engineering, and infrastructure teams to deliver world-class AI solutions.


🦸🏻‍♀️ Your responsibilities 

  • Performance Leadership:

    • Define and implement performance engineering strategies for our Generative AI full stack, including services, application, LLMs, RAG pipelines, and related infrastructure.

    • Lead performance testing, profiling, and analysis efforts to identify and resolve performance bottlenecks.

    • Establish and maintain performance benchmarks and SLAs for critical AI services.

    • Provide technical leadership and mentorship to performance engineering team members.

  • LLM Capacity and Tuning:

    • Analyze and improve LLM inference performance, including latency, throughput, and resource utilization.

    • Develop and implement strategies for LLM capacity planning and scaling.

    • Collaborate with AI researchers to evaluate and improve LLM model architectures and training techniques for performance.

    • Optimize LLM inference through techniques such as quantization, distillation, and optimized kernel implementation.

  • RAG Performance Optimization:

    • Design and implement performance tests for RAG pipelines, including retrieval, ranking, and generation components.

    • Identify and optimize performance bottlenecks in RAG systems, such as database queries, vector search, and document processing.

    • Evaluate and optimize RAG system architectures for scalability and efficiency.

    • Tune vector databases for optimal recall and latency.

  • Infrastructure Optimization:

    • Collaborate with infrastructure teams to optimize hardware and software configurations for AI workloads.

    • Evaluate and recommend new technologies and tools for performance monitoring and analysis.

    • Develop and maintain performance dashboards and reports to track key metrics.

    • Optimize GPU utilization and memory management for LLM inference.

  • Collaboration and Communication:

    • Work closely with AI researchers, software engineers, and product managers to ensure performance requirements are met.

    • Communicate performance findings and recommendations to stakeholders at all levels.

    • Stay up-to-date with the latest developments in Generative AI and performance engineering.

⭐️ Is this you?

  • Education:

    • Bachelor's degree in Computer Science, Engineering, or a related field (Master's preferred).

  • Experience:

    • 10+ years of experience in performance engineering, with a focus on large-scale distributed systems.

    • 2+ years of experience working with AI/ML technologies

    • Proven experience in performance testing, profiling, and analysis of complex software systems.

    • Deep understanding of NLP architectures, training, and inference.

    • Experience with vector databases and search technologies.

    • Experience with cloud computing platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).

    • Strong programming skills in python.

    • Experience with performance analysis tools (e.g., profilers, debuggers, monitoring tools).

  • Skills:

    • Strong analytical and problem-solving skills.

    • Excellent communication and collaboration skills.

    • Ability to work in a fast-paced and dynamic environment.  

    • Passion for AI and a desire to push the boundaries of performance engineering

      #LI-Remote


🍩 Benefits & perks (US Full-time employees)

  • Generous PTO, plus company holidays

  • Medical, dental, and vision coverage for you and your family

  • Paid parental leave for all parents (12 weeks)

  • Fertility and family planning support

  • Early-detection cancer testing through Galleri

  • Flexible spending account and dependent FSA options

  • Health savings account for eligible plans with company contribution

  • Annual work-life stipends for:

    • Home office setup, cell phone, internet

    • Wellness stipend for gym, massage/chiropractor, personal training, etc.

    • Learning and development stipend

  • Company-wide off-sites and team off-sites

  • Competitive compensation, company stock options and 401k

Writer is an equal-opportunity employer and is committed to diversity. We don't make hiring or employment decisions based on race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other basis protected by applicable local, state or federal law. Under the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

By submitting your application on the application page, you acknowledge and agree to Writer's Global Candidate Privacy Notice.

Writer Glassdoor Company Review
4.8 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
Writer DE&I Review
5.0 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Writer
Writer CEO photo
Unknown name
Approve of CEO

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Performance engineer, Writer

Writer is on the lookout for a dynamic Principal Performance Engineer to join our innovative team in New York City. If you're passionate about pushing the boundaries of Generative AI technology, you're in the right place! In this unique role, you'll take charge of optimizing our cutting-edge Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) systems for unmatched performance and reliability. You'll dive deep into identifying performance bottlenecks and optimizing resource usage to ensure our users enjoy a seamless experience. Collaborating closely with AI researchers, software engineers, and infrastructure teams, you will help shape the future of AI solutions while establishing performance benchmarks and providing mentorship to our engineering team. With more than 10 years of experience in performance engineering and a solid background in AI/ML technologies, you'll employ your skills in LLM capacity planning, infrastructure optimization, and advanced profiling techniques. Your role will also include evaluating new technologies, developing performance dashboards, and enhancing LLM inference capabilities. At Writer, we thrive on innovation and teamwork, and we believe that your expertise can help propel us to extraordinary heights! Get ready to lead us into the future of Generative AI!

Frequently Asked Questions (FAQs) for Performance engineer Role at Writer
What are the primary responsibilities of a Principal Performance Engineer at Writer?

As a Principal Performance Engineer at Writer, your key responsibilities will include defining performance engineering strategies for our Generative AI technology stack, leading performance testing and analysis to identify bottlenecks, and providing mentorship to team members. You'll also collaborate with various teams to enhance LLM capacity and optimize RAG performance, making sure that our AI solutions perform exceptionally well under high demands.

Join Rise to see the full answer
What qualifications are required to apply for the Principal Performance Engineer position at Writer?

To qualify for the Principal Performance Engineer role at Writer, candidates should ideally possess a Bachelor's degree in Computer Science or Engineering, with a Master's preferred. Additionally, at least 10 years of experience in performance engineering, especially with large-scale distributed systems, is essential. Familiarity with AI/ML technologies and strong programming skills in Python are also required.

Join Rise to see the full answer
What kind of experience is valued for the Principal Performance Engineer role at Writer?

Candidates applying for the Principal Performance Engineer position at Writer should bring over 10 years of experience in performance engineering. At least 2 years should focus on AI/ML technologies, alongside proven expertise in performance testing and profiling. A deep understanding of NLP architectures and experience with cloud platforms like AWS, Azure, or GCP are also critical for success in this role.

Join Rise to see the full answer
How does the Principal Performance Engineer contribute to the team at Writer?

The Principal Performance Engineer at Writer plays a pivotal role in team collaboration, working closely with AI researchers, software engineers, and product managers. By providing technical leadership in optimizing system performance, sharing insights about performance findings, and driving innovation in performance engineering practices, you'll help elevate our AI projects and ensure robust user experiences.

Join Rise to see the full answer
What opportunities for growth and development does Writer offer a Principal Performance Engineer?

Writer is committed to employee development and offers a variety of opportunities for growth. As a Principal Performance Engineer, you'll have access to learning and development stipends, which can be used for professional development courses and resources. Additionally, engaging in company-wide off-sites and team retreats allows for collaborative learning and networking with peers.

Join Rise to see the full answer
Common Interview Questions for Performance engineer
What experience do you have in performance engineering related to AI/ML technologies?

In interviews for the Principal Performance Engineer role at Writer, reflect on your specific projects involving AI/ML technologies. Discuss how your previous roles required performance testing, profiling, and any optimizations you achieved in complex systems. Be sure to highlight your measurable impacts on performance improvements and learnings from those experiences.

Join Rise to see the full answer
How do you approach identifying and resolving performance bottlenecks?

When discussing your approach to identifying performance bottlenecks, focus on your systematic analysis techniques. Explain how you utilize profiling tools, monitor system metrics, and conduct load testing to gather data. Share any specific methodologies you've applied to diagnose issues and the steps you took to resolve them effectively.

Join Rise to see the full answer
Can you share an example of optimizing large language model (LLM) performance?

In response to this question, provide a detailed case study from your experience, outlining how you analyzed LLM performance metrics such as latency and throughput. Discuss the strategies you employed, such as model architecture adjustments or inference optimizations, and the resultant performance gains that enhanced user experience.

Join Rise to see the full answer
What techniques do you use to tune vector databases for optimal performance?

Discuss specific techniques you have implemented for tuning vector databases, such as adjusting retrieval and indexing parameters, or exploring various database architectures for efficiency. Give real examples where your optimizations led to measurable improvements in recall and latency, demonstrating your technical expertise.

Join Rise to see the full answer
Describe your experience with cloud computing platforms in optimizing performance.

Describe your familiarity with cloud platforms like AWS, Azure, or GCP, specifically in leveraging their tools and services to optimize system performance. Talk about any cloud-native performance enhancements you've implemented and how they contributed to the scalability and reliability of applications.

Join Rise to see the full answer
How do you stay updated with the latest trends in Generative AI and performance engineering?

In answering this question, discuss your commitment to continuous learning through resources such as technical journals, online courses, participation in industry conferences, or engagement in relevant online communities. Showcase specific examples of how staying informed has impacted your work or stimulated innovative ideas.

Join Rise to see the full answer
What leadership qualities do you believe are essential for a Principal Performance Engineer?

Highlight the importance of strong communication, collaboration, and mentorship in your leadership style. Emphasize that a Principal Performance Engineer should inspire teamwork and support the professional growth of others while fostering a culture of continuous improvement in performance engineering practices.

Join Rise to see the full answer
How do you balance performance optimizations with project deadlines?

Share your strategies for time management and prioritization in balancing performance optimization tasks with project timelines. Discuss how you assess trade-offs, involve team input, and maintain open communication with stakeholders to ensure performance objectives are met without compromising deadlines.

Join Rise to see the full answer
What performance analysis tools have you used in your previous roles?

Talk about the performance analysis tools you are familiar with, such as profilers, debuggers, or monitoring solutions you have used in the past. Describe how you utilized these tools to analyze system performance and the insights gained from them that led to impactful changes.

Join Rise to see the full answer
Why do you believe performance engineering is critical for Generative AI applications?

In answering this question, elaborate on the relationship between performance engineering and user experience in Generative AI applications. Stress the importance of optimizing latency and resource usage to create responsive AI solutions that can handle complex tasks at scale while assuring reliability and efficiency.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 10 days ago
Dare to be Different
Diversity of Opinions
Inclusive & Diverse
Collaboration over Competition
Fast-Paced
Growth & Learning
Photo of the Rise User
Posted 8 days ago
Dare to be Different
Diversity of Opinions
Inclusive & Diverse
Collaboration over Competition
Fast-Paced
Growth & Learning
Photo of the Rise User
Posted 5 hours ago
Photo of the Rise User
Posted 6 days ago
Photo of the Rise User
Renesas Electronics Remote Bengaluru, Karnataka, India
Posted 13 days ago
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Posted 3 hours ago
Posted 2 days ago
Photo of the Rise User
Waymo Remote Mountain View, CA, USA; San Francisco, CA, USA
Posted 2 days ago
Social Impact Driven
Empathetic
Collaboration over Competition
Growth & Learning
Photo of the Rise User
Sopra Steria Remote 550 Rue Pierre Berthier, 13290 Aix-en-Provence, France
Posted 5 days ago

Writer is the full-stack generative AI platform for enterprises. We empower your entire organization — support, operations, product, sales, HR, marketing, and more.

198 jobs
MATCH
Calculating your matching score...
BADGES
Badge ChangemakerBadge Future MakerBadge InnovatorBadge Rapid Growth
CULTURE VALUES
Dare to be Different
Diversity of Opinions
Inclusive & Diverse
Collaboration over Competition
Fast-Paced
Growth & Learning
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
March 28, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
F
Someone from OH, Columbus just viewed Mortgage Loan Officer Assistant at Fulton Bank
Photo of the Rise User
Someone from OH, Ironton just viewed Software Engineer Intern (Summer 2025) at Curri
Photo of the Rise User
6 people applied to Software Engineer I at Affirm
J
Someone from OH, Westerville just viewed Oracle Database Administrator- Remote only at JASCI
Photo of the Rise User
8 people applied to Game Developer at Altera
V
Someone from OH, Toledo just viewed Sports Event Coordinator at Ventures With Jen
Photo of the Rise User
Someone from OH, Dayton just viewed Research Assistant at Leidos
Photo of the Rise User
Someone from OH, Cincinnati just viewed Finance & Accounting Associate at HeadQuarters
Photo of the Rise User
Someone from OH, Canton just viewed Communications Manager at Shearer's Foods
Photo of the Rise User
12 people applied to Frontend Engineer I at Outliant
Photo of the Rise User
Someone from OH, Sandusky just viewed Supply Chain Trainee Program (SCTP) at Anheuser-Busch
Photo of the Rise User
11 people applied to Unity Developer at FS Studio
Photo of the Rise User
139 people applied to Scrum Master-Remote at DICE
Photo of the Rise User
Someone from OH, Mason just viewed HR/Recruiting Assistant at Illumination
Photo of the Rise User
Someone from OH, Strongsville just viewed Used Car Buyer - Concord Toyota at Sonic Automotive
Photo of the Rise User
Someone from OH, Cincinnati just viewed Mid-level Creative (f/m/d) at Landor
P
Someone from OH, Kent just viewed Graphic Designer at ProjectGrowth
Photo of the Rise User
Someone from OH, Waverly just viewed Client Services Manager at Pepperstone
Photo of the Rise User
Someone from OH, Plain City just viewed Aesthetic Telehealth Nurse Practitioner (remote) at Moxie
Photo of the Rise User
Someone from OH, Columbus just viewed EdTech Product/Program Manager at Planner5D
S
Someone from OH, Lorain just viewed Test Engineer- Ninja at SharkNinja
Photo of the Rise User
Someone from OH, Youngstown just viewed Channel Development Representative at Arrow Electronics
Photo of the Rise User
Someone from OH, Cincinnati just viewed Buyer at Novolex