Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Software Engineer, Infrastructure - Analytics image - Rise Careers
Job details

Software Engineer, Infrastructure - Analytics

About the Team

The Scaling team designs, builds, and operates critical infrastructure that enables research at OpenAI.

Our mission is simple: accelerate the progress of research towards AGI. We do this by building core systems that researchers rely on - ranging from low-level infrastructure components to research-facing custom applications. These systems must scale with the increasing complexity and size of our workloads, while remaining reliable and easy to use.

About the Role

As we grow, we’re looking for a pragmatic and versatile software engineer who thrives in fast-moving environments and enjoys building systems that empower others.

This is a generalist software engineering role with an emphasis on distributed systems, data processing infrastructure, and operational excellence. You’ll develop and operate foundational backend services that power key OpenAI’s research workflows - both by creating new infrastructure and by building on existing systems. The use cases will span across observability, analytics, performance engineering, and other domains, all with the goal of solving meaningful and impactful problems to research.

This role is based in San Francisco, CA or open to being remote within the US. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Design, build, and operate scalable backend systems that support various ML research workflows, including observability and analytics.

  • Develop reliable infrastructure that supports both streaming and batch data processing at scale.

  • Creating internal-facing tools and applications as needed.

  • Debug and improve performance of services running on Kubernetes, including operational tooling and observability.

  • Collaborate with engineers and researchers to deliver reliable systems that meet real-world needs in production.

  • Help improve system reliability by participating in the on-call rotation and responding to critical incidents.

You might thrive in this role if you have:

  • Strong proficiency in Python/Rust and backend software development, ideally in large codebases.

  • Experience with distributed systems and scalable data processing infrastructure, including technologies like Kafka, Spark, Trino/Presto, Iceberg.

  • Hands-on experience operating services in Kubernetes, with familiarity in tools like Terraform and Helm.

  • Comfort working across the stack - from low-level infrastructure components to application logic - and making trade-offs to move quickly.

  • A focus on building systems that are both technically sound and easy for others to use.

  • Curiosity and adaptability in fast-changing environments, especially in high-growth orgs.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. 

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

OpenAI Glassdoor Company Review
4.2 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
OpenAI DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of OpenAI
OpenAI CEO photo
Sam Altman
Approve of CEO

Average salary estimate

$125000 / YEARLY (est.)
min
max
$100000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Software Engineer, Infrastructure - Analytics, OpenAI

Join OpenAI as a Software Engineer, Infrastructure - Analytics, where you’ll be part of the Scaling team, a group at the forefront of developing systems that empower research in artificial intelligence. Located in the vibrant city of San Francisco, or potentially remote within the US, you'll engage in building scalable backend systems crucial for ML research workflows. Your role will require a combination of creativity and technical expertise, as you design and operate systems that handle both real-time and batch data processing. You’ll collaborate with fellow engineers and researchers to ensure that the infrastructure we're building not only meets the demands of complex workloads but also remains user-friendly. You’ll dive into observability tools and performance engineering, making meaningful contributions that directly impact key research initiatives. OpenAI fosters a hybrid work environment, allowing flexibility while ensuring a collaborative and engaging workplace. If you possess a strong proficiency in Python or Rust, experience with Kubernetes, and a keen interest in creating reliable systems, this is the perfect opportunity for you. We encourage curiosity and adaptability, as you will be working in a fast-paced setting with rapidly evolving challenges. As we strive towards our goal of advancing AI for the benefit of humanity, your work will be fundamental in shaping the future of technology. With OpenAI, you can be part of a mission-driven company devoted to pushing the boundaries of AI responsibly and equitably.

Frequently Asked Questions (FAQs) for Software Engineer, Infrastructure - Analytics Role at OpenAI
What responsibilities does a Software Engineer, Infrastructure - Analytics at OpenAI have?

At OpenAI, a Software Engineer, Infrastructure - Analytics is responsible for designing, building, and operating scalable backend systems that support various ML research workflows. This includes enhancing observability and analytics, developing reliable infrastructure for both streaming and batch data processing, and creating internal tools and applications. The role also involves debugging performance issues for services running on Kubernetes and collaborating with other engineers and researchers to deliver effective systems that address real-world production needs.

Join Rise to see the full answer
What qualifications are needed for the Software Engineer, Infrastructure - Analytics position at OpenAI?

To qualify for the Software Engineer, Infrastructure - Analytics role at OpenAI, candidates should have strong proficiency in programming languages such as Python or Rust, alongside experience in backend software development, ideally within large codebases. It’s important for candidates to have hands-on experience with distributed systems, scalable data processing infrastructure, and a solid understanding of operating services in Kubernetes. Familiarity with technologies like Kafka and Spark is a plus, and candidates should demonstrate adaptability and curiosity in a fast-changing environment.

Join Rise to see the full answer
Can I work remotely as a Software Engineer, Infrastructure - Analytics at OpenAI?

Yes, the Software Engineer, Infrastructure - Analytics position at OpenAI offers flexibility for remote work within the US, while also maintaining a hybrid model that encourages in-office collaboration three days a week. This setup allows team members to engage with each other directly while providing the convenience of remote work, ensuring a balanced and productive work environment for all.

Join Rise to see the full answer
What technologies should a Software Engineer, Infrastructure - Analytics be familiar with?

A Software Engineer, Infrastructure - Analytics at OpenAI should be well-versed in Python or Rust for backend development. Familiarity with distributed systems and scalable data processing infrastructure technologies, such as Kafka, Spark, and Trino/Presto, is highly advantageous. Experience working with Kubernetes, Terraform, and Helm also aligns well with the role, allowing for effective deployment and management of services.

Join Rise to see the full answer
How can I improve my chances of being selected for the Software Engineer, Infrastructure - Analytics position at OpenAI?

Improving your chances for the Software Engineer, Infrastructure - Analytics role at OpenAI involves showcasing your proficiency in relevant programming languages like Python or Rust, demonstrating experience with distributed systems, and highlighting any hands-on work with Kubernetes or data processing technologies. Additionally, displaying a strong understanding of system design principles and the ability to adapt to evolving challenges will resonate well with OpenAI's mission-driven culture.

Join Rise to see the full answer
Common Interview Questions for Software Engineer, Infrastructure - Analytics
What experience do you have with distributed systems as a Software Engineer?

When answering this question, emphasize specific projects or systems you've worked on that demonstrate your understanding of distributed architectures, such as real-time analytics pipelines or high-scale data processing systems. Mention any specific technologies you've used, like Kafka or Spark, and discuss the challenges you faced and how you overcame them.

Join Rise to see the full answer
Can you explain the difference between streaming and batch data processing?

A strong response should clarify that streaming data processing involves handling real-time data flows, while batch processing deals with processing larger chunks of data at once. Offering concrete examples of situations for each method can showcase your practical understanding, demonstrating how you might choose one over the other based on use cases in real-world applications.

Join Rise to see the full answer
How do you ensure the reliability of the services you develop?

Highlight proactive approaches like thorough testing, implementing monitoring and observability tools, and developing a robust incident response plan. Mention experiences you have with on-call rotations and how they’ve prepared you to manage service reliability and troubleshoot issues quickly.

Join Rise to see the full answer
Describe your experience with Kubernetes and managing containerized applications.

Discuss specific projects where you've utilized Kubernetes for deploying and managing applications. Explain your understanding of its components such as pods, services, and deployments, and how you've ensured scalability and resilience within your applications, providing examples of challenges faced and solutions implemented.

Join Rise to see the full answer
How do you approach debugging performance issues in backend systems?

Detail a structured process for debugging performance issues, such as gathering metrics, identifying performance bottlenecks, and using profiling tools. Sharing real examples can make your explanation more relatable, showing how you systematically addressed these issues in past projects.

Join Rise to see the full answer
What role do observability tools play in your engineering process?

Discuss how observability tools help provide visibility into system performance, enabling you to make informed decisions on improvements. Share your familiarity with tools you've used and how they've contributed to your ability to analyze system metrics and performance.

Join Rise to see the full answer
Can you describe a time when you had to work with a cross-functional team?

Share a specific instance where you collaborated with engineers, data scientists, or stakeholders from other departments. Highlight the communication strategies and methodologies you used to ensure alignment, successfully achieving project goals.

Join Rise to see the full answer
What trade-offs have you made in system design for performance vs. scalability?

Discuss a specific scenario where you had to balance performance requirements against scalability. Explain the factors you considered, such as system constraints and user needs, and how you ultimately arrived at a decision that met both goals moderately well.

Join Rise to see the full answer
Describe how you stay up to date with emerging technologies relevant to your role.

Share your strategies for staying informed, including following industry blogs, participating in online forums, or attending conferences. Emphasize any personal projects or contributions to open source that have kept you engaged with current trends and best practices in the realm of software engineering.

Join Rise to see the full answer
How would you approach a critical incident in production?

Detail a response plan that includes immediate steps such as assessing the impact, communicating with stakeholders, and executing a rollback if necessary. Talking about your previous experiences handling production incidents can add authenticity and show readiness to manage high-pressure situations effectively.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
OpenAI Hybrid New York
Posted 5 days ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

As a product engineer in OpenAI's GTM Innovation team, you’ll help shape customer experiences with cutting-edge AI applications.

Photo of the Rise User
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

As a Software Engineer at OpenAI, you'll enhance ML training infrastructure through systems programming and runtime optimization.

Photo of the Rise User
BILL Remote San Jose, California, United States
Posted 8 days ago

Join BILL as a Senior Software Engineer to create cutting-edge solutions that empower businesses in a dynamic fintech environment.

Monadical Remote No location specified
Posted 5 days ago

Join our remote team as a Senior Full-Stack Developer, where you'll lead projects and enhance AI-powered products with your expertise.

Photo of the Rise User
Posted 8 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Take Risks
Collaboration over Competition
Growth & Learning
Transparent & Candid
Customer-Centric
Social Impact Driven
Rapid Growth
Passion for Exploration
Dare to be Different
Reward & Recognition
Friends Outside of Work
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Conferences Stipend
Bias Training
Employee Resource Groups
401K Matching
Paternity Leave
Maternity Leave
Some Meals Provided
Social Gatherings

As a Senior Software Engineer at Google Cloud AI, you'll be at the forefront of creating innovative machine learning solutions that impact billions of users globally.

Photo of the Rise User
Posted 11 days ago

Join Visa as a Chief Software Engineer to lead architectural innovations in payment processing systems on a global scale.

Elevate your career with LPL Financial as a Senior Engineer focused on Software Developer Test QA, thriving in a collaborative, innovative environment.

Photo of the Rise User
Epiq Remote POL - Poland Remote Office
Posted 2 days ago

Join Epiq as a Senior Software Engineer and leverage your skills in Python and cloud technologies to enhance their cutting-edge eDiscovery platform.

Udelta Remote No location specified
Posted 4 days ago

Join a dynamic international team as a Senior Unity Developer, creating popular mobile games that captivate millions of users.

Photo of the Rise User
Flourish Hybrid New York, United States
Posted 3 days ago

Flourish is looking for a Full Stack Engineer to innovate in financial technology and deliver outstanding user experiences.

OpenAI is a US based, private research laboratory that aims to develop and direct AI. It is one of the leading Artifical Intellgence organizations and has developed several large AI language models including ChatGPT.

959 jobs
MATCH
Calculating your matching score...
BADGES
Badge ChangemakerBadge Future MakerBadge InnovatorBadge Future UnicornBadge Rapid Growth
CULTURE VALUES
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning
FUNDING
SENIORITY LEVEL REQUIREMENT
INDUSTRY
TEAM SIZE
No info
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 10, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Dayton just viewed Medical Receptionist at LifeStance Health
C
Someone from OH, Massillon just viewed RN Ambulatory - Outpatient Infusion Therapy at CCF
Photo of the Rise User
Someone from OH, Columbus just viewed HR Business Partner (Maternity Cover) at Marshmallow
Photo of the Rise User
Someone from OH, Columbus just viewed Community Outreach Canvasser $24/Hr at Confidential
Photo of the Rise User
Someone from OH, Cincinnati just viewed Email Marketing Coordinator at Creative Circle
Photo of the Rise User
Someone from OH, Columbus just viewed UX Researcher, Amazon Autos at Amazon
Photo of the Rise User
24 people applied to Front-end Developer at Venturenox
Photo of the Rise User
Someone from OH, Cincinnati just viewed AI training and enablement at Writer
Photo of the Rise User
Someone from OH, Cincinnati just viewed Data Analyst (Contact Center-Hybrid) at Dow Jones
Photo of the Rise User
Someone from OH, Delaware just viewed Practice Group Manager at LifeStance Health
Photo of the Rise User
Someone from OH, Youngstown just viewed Event Services Human Resources Coordinator at Allied Universal
Photo of the Rise User
Someone from OH, Columbus just viewed IP Network Engineering Intern - Summer 2025 at Bandwidth
Photo of the Rise User
Someone from OH, Cleveland just viewed Director, Education Programs & Partnerships at Encoura
Photo of the Rise User
Someone from OH, Cleveland just viewed Operations Associate (Part-Time) - Pinecrest at Alo Yoga
Photo of the Rise User
Someone from OH, Coldwater just viewed Engineering Design Checker Jobs at Lockheed Martin
Photo of the Rise User
Someone from OH, Loveland just viewed SEO Admin & Business Support at Outliant
Photo of the Rise User
Someone from OH, Columbus just viewed Casting: Cedar Lake - Pilot Episode at Backstage