Blog Dados & Embeddings

Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series

Anthropic released Claude Sonnet 5, which beats its predecessor Sonnet 4.6 across all benchmarks and even edges past the larger Opus 4.8 on the GDPval-AA v2 knowledge work test with a score of 1,618. Anthropic is also quick to point out that the model scores far below the models the US government currently has blocked when it comes to cybersecurity tasks, a likely deliberate signal given the ongoing debate. The article Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model seri...

The Decoder ·Matthias Bastian · 30 de janeiro de 2026

Ver no Hugging Face

// relacionados

Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series

Leia também

Linq’s iMessage Apps Bring Payments, Tickets, Flights, and Games Into the iMessage Bubble Through the imessage_app Part

The DeepMind trio who built a poker AI are now making money for quant hedge funds

hotdogs/uka-fable-reasoning

ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration