Skip to content

Oliver Ng. Ai experiments.

Blog
About

4.8

Written by

in

New Opus 4.8 came out. I saw the video announcement on YouTube but what wasn’t captured was this gem in the release notes.

One of the most prominent improvements in Opus 4.8 is its honesty. We train all our models to be honest—for instance, to avoid making claims that they can’t support. But a general problem with AI models is that they sometimes jump to conclusions, confidently claiming to have made progress in their work despite the evidence being thin. Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. This is borne out in our evaluations, which show that Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.

It’s nice to see a feature update focused on improving LLM alignments.

←Choosing the right model

More posts

Cowork Mobile

July 17, 2026
A day of Claude workflows

July 14, 2026
On-device LLMs in iOS 27

June 26, 2026
You token maxxin’ too?

June 19, 2026

Oliver Ng. Ai experiments.

Blog
About