METR
Verified
We've verified that the organization METR controls the domain:
- metr.org
METR is a research nonprofit that works on assessing whether cutting-edge AI systems could pose catastrophic risks to society.
We build the science of accurately assessing risks, so that humanity is informed before developing transformative AI systems.
Read more about our work here.
- Vivaria
- Public Task Suite
- RE-Bench Task Suite
- Some of our open-source agents can be found at github.com/poking-agents
Popular repositories
- public-tasks (Public)
- eval-analysis-public (Public): Public repository containing METR's DVC pipeline for eval data analysis
- hcast-public (Public)
Repositories
- cross-domain-horizon (Public): Estimate the time horizon of AIs over time on various domains like knowledge and vision
- Measuring-Early-2025-AI-on-Exp-OSS-Devs (Public): Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity: https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/
- inspect-tasks-public (Public)
- inspect_ai (Public, forked from UKGovernmentBEIS/inspect_ai): Inspect: A framework for large language model evaluations
- task-assets (Public)
- autonomy-evals-guide (Public)
- triframe_inspect (Public)
People
This organization has no public members. You must be a member to see who’s a part of this organization.