Blog
Dados & Embeddings
Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series
Anthropic released Claude Sonnet 5, which beats its predecessor Sonnet 4.6 across all benchmarks and even edges past the larger Opus 4.8 on the GDPval-AA v2 knowledge work test with a score of 1,618. Anthropic is also quick to point out that the model scores far below the models the US government currently has blocked when it comes to cybersecurity tasks, a likely deliberate signal given the ongoing debate. The article Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model seri...
The Decoder
·Matthias Bastian
·
// relacionados
Leia também
Blog
Linq’s iMessage Apps Bring Payments, Tickets, Flights, and Games Into the iMessage Bubble Through the imessage_app Part
Blog
The DeepMind trio who built a poker AI are now making money for quant hedge funds
Dataset
hotdogs/uka-fable-reasoning
Blog