GPT and Claude failed Bridgewater’s finance tests because the right answers were never public

Hedge fund Bridgewater and Thinking Machines Lab report that a fine-tuned open-weight model outperforms the most powerful AI models at evaluating financial documents, at a fraction of the cost. The figures come from the two companies' own analysis.

The article "GPT and Claude failed Bridgewater’s finance tests because the right answers were never public" appeared first on The Decoder.