I put ChatGPT-4o and 5.1 through 9 real-world tests — from logic puzzles to coding, writing and image analysis.
The core use case of this Proof of Concept (PoC) revolves around a banking personal assistant designed to revolutionize the way users interact with their bank account information, transaction history, ...