RANDOM CHAOS

AI benchmarks

The smooth line hiding a noisy benchmark

The METR AI time horizons graph contains structural errors that mislead teams building agents, automation, and AI workflows. Here is what it actually shows.