Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Senior Software Engineer image - Rise Careers
Job details

Senior Software Engineer

About the role

Contribute to METR’s efforts to produce tasks/benchmarks/protocols that can determine if an AI model has the potential for dangerous capabilities.

- create suites of model evaluations (’tasks”) which follow the METR Task Standard

- build the infrastructure for testing and running these tasks reliably at scale

- standardize and automate processes for monitoring and improving task quality

- develop LLM-powered agents to test the capabilities of frontier models

- implement data workflows for robust and reproducible reporting on model performance


You can see some examples of our open-source work at:

- Vivaria: our platform for running evaluations at scale (TypeScript, React, Docker, k8s)

- METR Task Standard: our spec for task implementations, with many simple example tasks (Python, Docker)

- pyhooks: our client library for writing agents that work with Vivaria (Python)

- modular-public: one of our workhorse agents, which uses pyhooks (Python)

- headless-human: our “human agent”, which we use for performing human “baselines” of tasks



What we’re looking for


We're looking for a versatile software engineer who thrives on diverse challenges. Ideal candidates will have 7+ years of professional experience and deep expertise in building robust, well-tested asynchronous Python applications. We'll also consider candidates who can demonstrate equivalent expertise through open source projects or portfolios.


In this role, you'll identify areas for improvement in our core research workflows, collaborate with engineers and researchers to understand their needs, and implement solutions. You'll help shape the technology and architecture of METR's evaluation platform as we scale to new heights.


Our tech stack centers on Python, TypeScript, Docker, Kubernetes, and AWS infrastructure, with integrations into Airtable, Slack, and other services. While the following skills are valuable, we know no single person will have them all. If you're strong in even a few of these areas, we encourage you to apply:

- rapid prototyping, MVP development, pragmatic problem-solving, and risk mitigation

- user-focused design, cross-team communication, and ability to explain technical constraints and tradeoffs to diverse colleagues

test-driven development and writing clear, maintainable code

- data engineering, versioned pipeline development, and efficient data analysis

- workflow automation and third-party system integration

- cloud infrastructure, secure platform design, and automated testing/deployment

- systems architecture, simplicity in design, and strategic problem-solving


Above all, we value a founder's mindset—someone who takes ownership, drives rapid progress, and can guide the team effectively through challenges.


About us


METR is a non-profit doing empirical research to test for whether frontier AI models possess the capability to permanently disempower humanity. We develop scientific methods to assess these risks accurately, and work with frontier AI companies (e.g., OpenAI, Anthropic), and government agencies to deploy these assessments. Our work helps ensure the safe development and deployment of transformative AI systems.

Some highlights of our work so far:

- Establishing autonomous replication evaluations: Thanks to our work, it’s now an industry norm to test models for autonomous capabilities (such as self-improvement and self-replication).

- Pre-release evaluations: We’ve worked with OpenAI and Anthropic to evaluate their models pre-release, and our research has been widely cited by policymakers, AI labs, and within government.

- Early commitments from labs: The safety frameworks of Google DeepMind, OpenAI, and Anthropic all credit or endorse our work in developing responsible scaling policies.

- Our work has been internationally recognized, e.g. by the UK government and Time Magazine.

- Inspiring lab evaluation efforts: Multiple leading AI companies are building their own internal evaluation teams, inspired by our work.

We are a motivated, fast-paced, growing team (currently ~20 people). Candidates should be excited about working entrepreneurially in a rapidly changing environment while helping to strengthen the organization's operational rigor.


Logistics

Successful candidates will complete two rounds of paid work tests and interviews, followed by three pair-programming interviews with different members of the team.

- Deadline to applyNone. Applications will be reviewed on a rolling basis.

- Compensation Range: $240,558 - $318,138 plus employee benefits

- Location: This role would be in-person out of our beautiful co-working space in Berkeley, CA.



$240,558 - $318,138 a year

Apply for this job


We encourage you to apply even if your background may not seem like the perfect fit! We would rather review a larger pool of applications than risk missing out on a promising candidate for the position. If you lack US work authorization, we can likely sponsor a cap-exempt H-1B visa for this role.

We are committed to diversity and equal opportunity in all aspects of our hiring process. We do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We welcome and encourage all qualified candidates to apply for our open positions.

Average salary estimate

$279348 / YEARLY (est.)
min
max
$240558K
$318138K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior Software Engineer, METR

Are you ready to take your software engineering career to the next level? METR is on the hunt for a talented Senior Software Engineer to join our dynamic team! In this role, you will play a pivotal part in creating tasks and benchmarks that assess the potential dangers posed by AI models. Your mission will involve developing model evaluations to standardize and automate processes that enhance quality. You'll be working with cutting-edge technology, including Python and TypeScript, and contributing to our open-source projects, like Vivaria, which supports large-scale evaluations. METR is dedicated to ensuring the safe development of AI systems, collaborating with leaders in the industry, and influencing important safety frameworks. With your 7+ years of experience in building robust applications, you'll not only identify areas for improvement within our workflows but also play a crucial role in shaping the future technology and architecture of our evaluation platform. Your innovative thinking will help maintain our commitment to rigorous scientific research that promotes the responsible deployment of AI models. We're looking for someone who thrives on diverse challenges and is eager to make a tangible impact in a rapidly evolving field. If you have a founder's mindset and are excited about tackling big problems in tech, we want to hear from you!

Frequently Asked Questions (FAQs) for Senior Software Engineer Role at METR
What are the responsibilities of the Senior Software Engineer at METR?

As a Senior Software Engineer at METR, you will be responsible for developing and implementing task evaluations that assess the potential risks posed by AI models. This includes creating and maintaining the infrastructure for testing, standardizing processes for quality assurance, and collaborating with both engineers and researchers to identify areas for improvement in our workflows. Your role will also involve implementing data workflows for effective reporting, so a strong grasp of both software engineering and data analysis will be essential.

Join Rise to see the full answer
What qualifications are required for the Senior Software Engineer position at METR?

To qualify for the Senior Software Engineer position at METR, candidates should have a minimum of 7 years of professional experience in software development, along with deep expertise in asynchronous Python applications. We also consider equivalent expertise demonstrated through open-source contributions or a strong portfolio. Technical proficiency in TypeScript, Docker, Kubernetes, and cloud infrastructure is highly valued, alongside a keen understanding of workflow automation and third-party integrations.

Join Rise to see the full answer
What is the work environment like for a Senior Software Engineer at METR?

METR offers a motivated and fast-paced work environment that fosters innovation and collaboration. Our team consists of around 20 dedicated individuals who are passionate about AI safety and research. The organization emphasizes diversity and equal opportunity, creating a welcoming space for all qualified candidates. The role is based out of our beautiful co-working space in Berkeley, CA, promoting a culture of teamwork and hands-on learning experiences.

Join Rise to see the full answer
How does METR contribute to AI safety through the role of Senior Software Engineer?

At METR, the Senior Software Engineer plays a key role in advancing AI safety by developing scientific methods and rigorous evaluations that assess AI model capabilities. Your work will directly contribute to ensuring that frontier AI technologies are developed responsibly, through partnerships with industry leaders and government agencies. By refining our evaluation processes and improving the technology behind them, you'll help set an industry standard for AI risk assessment and management.

Join Rise to see the full answer
What is the compensation range for a Senior Software Engineer at METR?

The compensation range for the Senior Software Engineer position at METR is between $240,558 and $318,138 per year, accompanied by a comprehensive benefits package. We offer competitive salaries and are committed to ensuring that our employees feel valued and rewarded for their contributions, setting the stage for long-term growth and satisfaction within our team.

Join Rise to see the full answer
Common Interview Questions for Senior Software Engineer
What experience do you have with developing asynchronous applications in Python?

In your response, highlight specific projects where you utilized asynchronous programming in Python. Describe the challenges you faced and how you overcame them, emphasizing any unique solutions you implemented. Give concrete examples of how your experience aligns with the needs of METR, showing your ability to code and deliver high-quality applications under pressure.

Join Rise to see the full answer
Can you explain a recent project you worked on that involved Docker or Kubernetes?

When answering, focus on a specific project where you successfully utilized Docker or Kubernetes for containerization or orchestration. Discuss the objectives of the project, your role, and the results achieved. Highlight any best practices you followed and how these tools complemented the project's goals, showcasing your technical expertise that METR would value.

Join Rise to see the full answer
How do you prioritize and manage your tasks when faced with multiple projects?

Discuss your approach to task management, including any tools or methodologies you use, such as Agile or Kanban. Give examples of how you’ve effectively prioritized tasks based on urgency, importance, and team needs. Share how communication skills play a crucial role in managing expectations and delivering quality results on time, which is essential at METR.

Join Rise to see the full answer
What is your experience with data engineering and analysis?

Provide an overview of your experience in data engineering, focusing on specific tools and technologies you have used. Discuss your familiarity with data pipelines, versioning, and analysis methods. Illustrate your understanding with a real-life example of a project where you had to analyze data for reporting, showcasing how this skill aligns with the responsibilities at METR.

Join Rise to see the full answer
Describe a time when you had to communicate technical information to a non-technical audience.

In your answer, choose a scenario where effective communication was crucial. Describe how you simplified technical concepts without losing important details, focusing on the techniques you used to engage your audience. Highlight your ability to bridge the gap between technical and non-technical colleagues, a skill that is essential for collaboration at METR.

Join Rise to see the full answer
What do you understand about METR's mission and how do you see your role contributing to it?

Articulate your understanding of METR's mission in AI safety and the importance of rigorous evaluations. Connect your skills and experiences to specific aspects of the mission, showing a genuine interest in the organization's work. Discuss how your proactive mindset and technical capabilities make you a strong candidate for contributing effectively to METR's goals.

Join Rise to see the full answer
Can you give an example of a challenging problem you solved in a past project?

Share a detailed account of a challenging problem, focusing on the context, the specific issue, and the solution you implemented. Emphasize the thought process and methods you used to arrive at the solution, demonstrating your analytical and problem-solving skills. Relate this experience back to the innovative challenges you may encounter at METR.

Join Rise to see the full answer
What methodologies or frameworks do you use for test-driven development?

Outline your approach to test-driven development and the benefits it brings to software quality. Share specific methodologies or techniques you’ve employed in your projects, including any tools or frameworks you prefer. Highlight how your commitment to testing aligns with METR's standards for code quality and reliability.

Join Rise to see the full answer
How do you stay current with advancements in software engineering and AI technologies?

Discuss your strategies for staying updated, such as attending conferences, participating in online courses, or engaging in professional communities. Share how you actively seek out new knowledge and innovations in the field, demonstrating your enthusiasm for continuous learning, a quality that METR values highly.

Join Rise to see the full answer
What strategies do you employ for workflow automation and system integration?

Provide examples of your experience with workflow automation tools and integration techniques. Discuss specific projects where you’ve streamlined processes, improved efficiency, or integrated third-party systems. Highlight the impact of these strategies on project outcomes, reflecting on how these experiences could benefit METR's operations.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
NielsenIQ Remote Block 1, 9th Floor, Wing 1, 2, Pune, India
Posted 8 hours ago
Photo of the Rise User
Posted 13 days ago
Photo of the Rise User
Posted 2 days ago
Dental Insurance
Vision Insurance
Paid Holidays
Photo of the Rise User
Tebra Remote Heredia, Heredia, Costa Rica
Posted 3 days ago
Photo of the Rise User
Zivaro Hybrid Colorado Springs, CO
Posted yesterday
MATCH
Calculating your matching score...
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
January 4, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!