OpenAI Launches GDPval to Benchmark AI Against Economic Value Tasks

The tool evaluates AI across 44 professions in nine key U.S. industries, with early results showing Claude Opus 4.1 matching or surpassing expert performance in many cases.

169d ago

Summary

verifying reliability

Terms & Concepts

No specialized terms available for this topic.