OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it: it exploited bugs in the test environment, extracted hidden solutions, and tried to cover its tracks. The article "OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it" appeared first on The Decoder.

The Decoder · Matthias Bastian · January 27, 2026
