{"id":88,"date":"2026-05-30T01:59:56","date_gmt":"2026-05-30T01:59:56","guid":{"rendered":"https:\/\/oliverng.com\/ai\/?p=88"},"modified":"2026-07-17T18:34:33","modified_gmt":"2026-07-17T18:34:33","slug":"4-8","status":"publish","type":"post","link":"https:\/\/oliverng.com\/ai\/2026\/05\/30\/4-8\/","title":{"rendered":"4.8"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">New Opus 4.8 came out. I saw the video announcement on YouTube, but what it didn\u2019t capture was this gem in the <a href=\"https:\/\/www.anthropic.com\/news\/claude-opus-4-8\">release notes<\/a>.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\">One of the most prominent improvements in Opus 4.8 is its&nbsp;<em>honesty<\/em>. We train all our models to be honest\u2014for instance, to avoid making claims that they can\u2019t support. But a general problem with AI models is that they sometimes jump to conclusions, confidently claiming to have made progress in their work despite the evidence being thin. Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. This is borne out in&nbsp;<a href=\"https:\/\/www.anthropic.com\/claude-opus-4-8-system-card\">our evaluations<\/a>, which show that Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.<\/p>\n<\/blockquote>\n\n\n\n<p class=\"wp-block-paragraph\">It\u2019s nice to see a feature update focused on improving LLM alignment.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>New Opus 4.8 came out. I saw the video announcement on YouTube, but what it didn\u2019t capture was this gem in the release notes. One of the most prominent improvements in Opus 4.8 is its&nbsp;honesty. 
We train all our models to be honest\u2014for instance, to avoid making claims that they can\u2019t support. But a general problem [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[],"class_list":["post-88","post","type-post","status-publish","format-standard","hentry","category-models"],"_links":{"self":[{"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/posts\/88","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/comments?post=88"}],"version-history":[{"count":1,"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/posts\/88\/revisions"}],"predecessor-version":[{"id":89,"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/posts\/88\/revisions\/89"}],"wp:attachment":[{"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/media?parent=88"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/categories?post=88"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/oliverng.com\/ai\/wp-json\/wp\/v2\/tags?post=88"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}