OpenAI has launched GDPval, a new evaluation framework designed to measure how well AI models perform on economically valuable, real-world tasks.
Unlike traditional academic benchmarks, GDPval spans 44 occupations across nine industries, including law, healthcare, finance, and engineering, with over 1,300 tasks crafted by experienced professionals.
Each task is based on real work products, like legal briefs, nursing care plans, and engineering designs, making it one of the most realistic tests of AI’s usefulness in the workplace.
Early results show frontier models like GPT-5 and Claude Opus 4.1 are already approaching industry-expert quality, completing tasks up to 100x faster and cheaper than humans in controlled evaluations.
With GDPval, OpenAI aims to track AI progress over time, highlight where models can support professionals, and provide transparency into AI’s economic impact.
You may also want to check out some of our other recent updates.
Wanna know what’s trending online every day? Subscribe to Vavoza Insider to access the latest business and marketing insights, news, and trends daily with unmatched speed and conciseness! 🗞️











