Vendor Profile

METR

AI safety evaluations and autonomous agent testing for governance.

METR provides evaluation infrastructure for testing AI systems and autonomous agents against safety benchmarks. Organizations use it to assess model capabilities, identify failure modes, and gather evidence for compliance documentation. It favors empirical testing over theoretical frameworks, so teams can measure how AI systems behave under realistic deployment scenarios.