AI research - AI Blog

Only three AI models finished above starting capital in a 500-day startup survival test

admin
June 28, 2026

Researchers at Princeton University built CEO-Bench, a test where AI agents have to run a…

Sina’s open model VibeThinker-3B aims to show reasoning compresses well but factual knowledge doesn’t

admin
June 28, 2026

Sina Weibo's VibeThinker-3B has just three billion parameters but matches models like DeepSeek V3.2 and…

Half of Claude users say AI can already handle half their work, according to Anthropic survey

admin
June 28, 2026

About half of Claude users say AI can already handle 50 percent or more of…

OpenAI’s new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

admin
June 28, 2026

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested…