Artificial intelligence has traditionally advanced through automatic accuracy tests in tasks meant to approximate human knowledge. Carefully crafted benchmark tests such as The General Language ...
OpenAI’s new GDPval benchmark tested GPT-5 on real-world jobs across nine industries, revealing that the AI matched or outperformed experts 40% of the time. While not a full replacement, OpenAI ...