Anthropic Launches Claude Opus 4.8 With Gains in Coding and Honesty - MacRumors
Skip to Content

Anthropic Launches Claude Opus 4.8 With Gains in Coding and Honesty

Anthropic today announced the launch of its latest AI model, Claude Opus 4.8. Anthropic claims the model is a "more effective collaborator" with improvements in agentic coding, multidisciplinary reasoning, agentic computer use, knowledge work, and agentic financial analysis.

anthopic claude
Testers have found Opus 4.8 to be "more reliable and sharper in its judgement" when doing agentic tasks, and the model also made gains in honesty.

Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. This is borne out in our evaluations, which show that Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.

Alignment assessments suggest the model hits new highs on measures of prosocial traits like supporting user autonomy and acting in the user's best interest. Rates of misaligned behavior like deception are lower than Opus 4.7 and similar to the Claude Mythos Preview.

Anthropic benchmarks indicate Opus 4.8 scored a 69.2% on SWE-Bench Pro, outperforming GPT–5.5 and Gemini 3.1 Pro on the test and several other benchmarks, though GPT–5.5 leads on the terminal-coding benchmark.

Opus 4.8's fast mode also runs at 2.5x the speed, and it is now three times cheaper than prior models.

Along with Opus 4.8, Anthropic is adding new features to its product lineup.

  • Dynamic workflows (research preview) - Claude can complete bigger tasks in Claude Code. It is able to plan work and run hundreds of parallel subagents in a single session. It is able to complete codebase-scale migrations across hundreds of thousands of lines of code. The feature is available for Claude Code for Enterprise, Team, and Max plans.
  • Effort control - In Claude.ai and Cowork, users can choose how much effort Claude puts into a response. With a lower setting, Claude will respond faster and use up rate limits more slowly. Opus 4.8 defaults to high effort, which Anthropic says is the best balance of quality and user experience.
  • Messages API - The Messages API accepts system entries inside the messages array, so developers can update Claude's instructions mid-task.

Claude Opus 4.8 is available everywhere today. Pricing for regular use has not changed compared to Opus 4.7.

Anthropic is working on models that have the same capabilities as Opus 4.8 at a lower cost, and a new class of model that's even more intelligent than Opus. Anthropic says it has been developing safeguards for the Claude Mythos model it is testing with a small number of organizations, and it expects to be able to bring Mythos-class models to all customers "in the coming weeks."

Popular Stories

macOS Tahoe and iPhone

Apple Alerted to macOS Security Vulnerability Uncovered With AI Tool

Thursday May 14, 2026 9:04 am PDT by
Anthropic recently announced Project Glasswing, an initiative that enables tech companies like Apple to use its new frontier AI model Claude Mythos Preview to find security vulnerabilities across operating systems and web browsers. The Wall Street Journal today reported that researchers at cybersecurity firm Calif used Claude Mythos Preview to uncover a new macOS security vulnerability last...
Four iPhone 18 Pro Colors Mock Feature

iPhone 18 Pro Launching Later This Year With These 10 New Features

Tuesday May 26, 2026 6:32 am PDT by
While the iPhone 18 Pro and iPhone 18 Pro Max are not launching until September, there are already plenty of rumors about the devices. It was initially reported that the iPhone 18 Pro models would have fully under-screen Face ID, with only a front camera visible in the top-left corner of the screen. However, the latest rumors indicate that only one Face ID component will be moved under the...
Apple Watch Blood Glucose Monitoring Feature 2

Apple Watch for Diabetes: The Latest on Apple's Plans for Non-Invasive Blood Sugar Monitoring

Tuesday May 26, 2026 9:30 am PDT by
For many years now, it has been rumored that the Apple Watch will eventually gain non-invasive blood sugar monitoring capabilities, which would enable millions of people with diabetes to track their blood glucose levels without needing to prick their skin with a needle or wear a dedicated continuous glucose monitor. According to Bloomberg's Mark Gurman, Apple recently shifted oversight of...

Top Rated Comments

sniffies Avatar
1 day ago at 11:37 am
So, you're telling me Claude has been lying to me all this time? 😭
Score: 35 Votes (Like | Disagree)
1 day ago at 11:43 am
honesty would require actual intelligence
Score: 22 Votes (Like | Disagree)
Agent007 Avatar
1 day ago at 12:06 pm
"and the model also made gains in honesty."

haha yeah, trust your friendly neighborhood trillion dollar corporate LLM slop.
Score: 15 Votes (Like | Disagree)
tkermit Avatar
1 day ago at 11:52 am
Who is actually staying on top of these minor model updates. "Gains in Coding and Honesty", "hits new highs on measures of prosocial traits." What?!
Score: 15 Votes (Like | Disagree)
Happy_John Avatar
1 day ago at 11:57 am

I find Claude exceptionally useful, and the more humane LLMs appear, the easier it is to treat them like some random person with an opinion on everything, rather than an actual superintelligence. Like asking my uncle about anything in the 80s. He sure made a convincing case, but…
An AI model is not a person. It is a tool. treating them like a person is not really the way we should be going.

The planet is not underpopulated. There are more than enough real people for people to have a need for ingratriating simulations of people.
Score: 14 Votes (Like | Disagree)
BabyBoii Avatar
1 day ago at 11:44 am
Every week is christmas for a fellow vibecoder like myself 🎄
Score: 11 Votes (Like | Disagree)