Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested…
Category: METR
Kkina – AI Blog