AI Model Release Tracker: Opus 4.8’s misalignment rates similar to Claude Mythos Preview



Elyse Betters Picaro/ZDNET



AI labs are shipping new models nonstop. But while each release is usually better and faster than its predecessor, not every new model is a major step change, however poetic the company’s PR may wax about it. Model strengths really emerge in context: Where are competitor models lacking or excelling? Which models have outstanding specialties, and which are just catching up to industry standards?

Also: How we test AI at ZDNET

Our Model Release Tracker helps you make sense of where models stand relative to each other, and whether they’re worth a deeper look. While we don’t test every model or model update on this list, we’ll always include the key elements you need to know, along with our hands-on expert test, where applicable. We also include an Expert Score for certain models. Curious about how we test AI? Check out this breakdown of our process.

Here are some of the biggest model releases of 2026 so far and what to know about them. We’ll update this list whenever a notable new model arrives. 


Claude Opus 4.8 

Anthropic | May 28, 2026

What it does: Replacing Opus 4.7 starting today (at the same price), Opus 4.8 offers faster thinking modes for one-third the cost of the earlier version, according to Anthropic. Like most of Anthropic’s models, 4.8 prioritizes coding abilities, scoring higher than 4.7 on two coding benchmarks but not fully besting OpenAI’s GPT-5.5. It also “reaches new highs on our measures of prosocial traits like supporting user autonomy and acting in the user’s best interest,” the company noted in the release, though exactly how those traits are defined remains murky.

Also: Anthropic launches Opus 4.8, with honesty as its killer feature

Why it matters: Anthropic has always prioritized model safety and interpretability, but appears to be further emphasizing that standard with this release. The company said Opus 4.7 had a 92% honesty rate, in addition to being less sycophantic and hallucination-prone overall. The fact that it claims 4.8 shows “substantially” lower rates of misalignment than 4.7 indicates an increasingly high standard for model safety, especially because Anthropic compared 4.8’s alignment to that of Mythos Preview.


GPT-5.5 Instant  

OpenAI | May 5, 2026

What it does: OpenAI said in its announcement that this lighter version of its just-released GPT-5.5 is less verbose than its predecessor, GPT-5.3 Instant. It also touted fewer hallucinations and improved factuality, saying “GPT‑5.5 Instant produced 52.5% fewer hallucinated claims than GPT‑5.3 Instant on high-stakes prompts covering areas like medicine, law, and finance.”

Also: Anthropic’s Mythos is evolving faster than expected, reports AI safety agency

Why it matters: GPT-5.5 Instant replaces GPT-5.3 Instant as the default model in ChatGPT. While the expectation is that each new AI model is more efficient, easier to use, and less prone to making things up, a significant reduction in hallucinations for a model most people use for fast queries could mean less misinformation spreading among the masses. That’s especially critical given how many people are using ChatGPT for everyday health questions, for example.

(Disclosure: Ziff Davis, ZDNET’s parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)


Nemotron 3 Nano Omni 

Nvidia | April 28, 2026

What it does: The latest in Nvidia’s open Nemotron family, this model provides agents with multimodal input. That means they can “perceive and reason across visual, audio, and textual inputs within a single shared perception‑to‑action loop,” according to Nvidia, thereby unifying multiple capabilities into a single system. 

Also: AI is an arms race, and the US wants $9 billion in Nvidia superchips to keep up

Why it matters: Normally, systems of agents need to use separate models for speech, vision, and text, meaning they jump across documents, video, and audio to complete multi-step tasks. That slows down workflows, undermines the context agents gather, and racks up inference costs. Nvidia’s approach, if it works, would streamline this process and reduce token use, saving you money. The model is available to try on Hugging Face.


GPT-5.5  

OpenAI | April 23, 2026 

Expert Score: 93/100

What it does: ZDNET tester-in-residence David Gewirtz technically gave GPT-5.5 an A- score, but said it “can be reductively described as better and faster than GPT-5.4,” which is hopefully the bare-minimum expectation for a new model. Specifically, though, the model got better at agentic coding, clearly identifying concepts, scientific research, and factual accuracy. 

Also: I put GPT-5.5 through a 10-round test: It scored 93/100, losing points only for exuberance

Why it matters: While the model itself may not be leaps and bounds ahead of its immediate predecessor, the quick turnaround from 5.4 to 5.5 (less than two months) indicates how rapidly agentic coding is accelerating OpenAI’s model release cycle. As David Gewirtz breaks down, the company, much like other frontier labs using AI to build AI, is shipping updates at an exponentially increasing rate.


ChatGPT Images 2  

OpenAI | April 23, 2026 

What it does: Soon after sunsetting Sora, its generative video model and social platform, OpenAI somewhat confusingly announced Images 2. ZDNET model tester David Gewirtz got an early look at Images 2 before its release and was impressed. While he didn’t give this model a formal Expert Score, he said it’s fun, a huge leap, and actually useful for work.

Why it matters: OpenAI seemed to be getting out of the more consumer-minded AI product game when it discontinued Sora, having been beaten by Anthropic at securing lucrative enterprise contracts. That OpenAI still released Images 2 amid that pivot indicates that it sees image generators as relevant to enterprise AI, especially on the heels of Anthropic’s Claude Design.


Claude Opus 4.7 

Anthropic | April 16, 2026 

What it does: Arriving relatively quickly after Opus 4.6, this model boasts new highs in honesty along with reduced sycophancy and fewer hallucinations. It also appears to have a knack for cybersecurity, as it backs the new Claude Security, released shortly after the model itself. But no, it’s not Mythos, as many suspected.

Also: Anthropic’s new Claude Security tool scans your codebase for flaws – and helps you decide what to fix first

Why it matters: Hallucinations and honesty are among the hardest problems plaguing even the best models. Claiming such significant gains in those areas is no small thing for an AI lab that takes safety seriously.


Claude Mythos (Preview) 

Anthropic | April 7, 2026 

What it does: This is a tough one because Mythos isn’t actually available to the public. Anthropic created quite a media storm when it positioned the new general-purpose model as too powerful to release as usual. While the model is apparently a step change from earlier Anthropic models, the company was especially alarmed because of the security threat it posed, stating that “it is strikingly capable at computer security tasks.” 

In response to that, Anthropic spearheaded Project Glasswing, a collaborative effort with several rival AI labs, including Google, Nvidia, and Microsoft, as well as security authorities like Palo Alto Networks, “to help secure the world’s most critical software, and to prepare the industry for the practices we all will need to adopt to keep ahead of cyberattackers.” 

Also: Apple, Google, and Microsoft join Anthropic’s Project Glasswing to defend world’s most critical software

Why it matters: If we’re to believe Anthropic’s guidance that Mythos poses a significant threat to the world’s software — so much so that only a select few partners can access it — cybersecurity apparatuses as they stand may not be prepared to meet the rapidly evolving frontier of model capabilities. Mythos may not be the only model of its caliber, but simply the first of many to come once other labs achieve similar breakthroughs. 

For now, just a few weeks into its release, Mythos is helping catch software bugs in droves. 


GPT-5.4  

OpenAI | March 5, 2026 

What it does: OpenAI framed this new model, released barely three months after GPT-5.2, as specifically designed for professional work. According to the company’s own testing (which should always be taken with a grain of salt until verified by a third party), GPT-5.4 matches or outperforms human professionals 83% of the time. 

Why it matters: As AI companies focus more on gaining enterprise trust (and contracts) while lauding what agentic AI can do, they need models that can handle complex work-related tasks with minimal risk, delay, or prohibitively high costs. Any model advancement that shows prowess in professional workflows has a better chance of being taken seriously by companies struggling to adopt AI, though nothing guarantees seamless integration. 

Also: OpenAI’s new GPT-5.4 clobbers humans on pro-level work in tests – by 83%


Claude Opus 4.6 

Anthropic | Feb. 5, 2026 

What it does: This model quickly redefined the standard for autonomous agentic work, especially for coding. That’s no surprise given Anthropic’s authority in building models especially adept at programming tasks. Opus 4.6 also demonstrated improvement in complex, longer-running tasks overall. 

Why it matters: Opus 4.6’s ability to handle tasks better on its own means you can reliably offload more of your workflow to it — something agentic offerings usually struggle with. 

Also: Anthropic says its new Claude Opus 4.6 can nail your work deliverables on the first try


GPT-5.3-Codex  

OpenAI | Feb. 5, 2026 

What it does: This new coding model — which OpenAI said helped build and debug itself — can be interrupted and redirected mid-task, which, if true, is a huge boon for developers using it on complex or shifting projects with tons of trial-and-error. GPT-5.3-Codex also boasts run times of over a day and a better grasp on user intent. 

Also: OpenAI’s new Spark model codes 15x faster than GPT-5.3-Codex – but there’s a catch

Why it matters: OpenAI is trying to catch up to Anthropic’s lead in agentic coding (and, coincidentally or not, released GPT-5.3-Codex on the same day Anthropic launched Opus 4.6). While ZDNET experts often prefer Claude Code to other tools for vibe coding, OpenAI’s rumored shift toward enterprise clients and away from fun consumer tools could eventually close that gap.






Recent Reviews


You’ve built your small business from the ground up. It’s your pride and joy, your financial security, and a potential legacy for your family. But what happens to your business interests after you’re gone? Without proper estate planning, your small business could face a chaotic future, disrupting operations, hurting employees, and jeopardizing your loved ones’ inheritance.

Business estate planning is your secret weapon. It’s not just for the ultra-wealthy with complex trusts and wills. For small business owners, it’s a crucial tool to ensure business continuity and protect your business value. Here’s how you can craft a comprehensive estate plan:

Know Your Business Inside and Out

The first step in your estate planning process is taking a deep dive into your business affairs. Make a list of all your business assets: equipment, inventory, intellectual property, and real estate.

Furthermore, don’t forget your business debts like loans and outstanding payments. This comprehensive list helps you understand what needs protecting and planning for in your estate planning documents.

Chart Your Business’s Future Course

What do you envision for your business after you’re gone? Should it stay in the family? Be sold to a trusted partner? Wind down entirely? This is where business succession planning comes in. It’s about deciding the future of your business in a way that honors your legacy and sets your team up for success.

Here are some questions to consider:

  • Family Business? Do you have a family member who shares your passion and has the skills to lead?
  • Trusted Partner? Is there a key employee you see as the ideal successor?
  • Time for a Change? Are you open to selling the business to ensure a smooth transition?

There’s no right or wrong answer. The key is to have open conversations with your loved ones and key employees to understand their goals and aspirations. This will guide you in crafting a business succession plan that feels right for everyone involved.

Develop a Rock-Solid Business Succession Plan

This plan outlines who will take over your business and how. You might identify a family member, a key employee, or even an outside buyer. The business succession plan should detail the transfer process, including training and timeline.

Here’s how to craft a plan as strong as your business itself:

  • Identify Your Successor: It could be a family member you’ve been mentoring, a trusted key employee, or even an outside buyer.
  • Groom Your Successor: Start by involving them in key decisions to give them opportunities to learn the ropes.
  • Plan for the Unexpected: Have a backup plan in place. Identify another potential leader or outline a buy-out option for remaining partners.

An experienced estate planning attorney like Keele & Parke can help you draft a legally sound plan that considers state law and tax implications.

Avoid Conflict with Ironclad Buy-Sell Agreements

If you have co-owners, a buy-sell agreement is vital. This agreement dictates what happens to a deceased or incapacitated owner’s share of the business. It prevents conflict among the remaining partners and ensures a smooth ownership transition as part of your overall estate plan.

Wills vs. Trusts: Choosing the Right Tool

A will can designate who inherits your business assets. The problem is that a will has to go through probate court, which can be a slow and public process.

Here’s where a revocable living trust comes in. Think of it as a private vault that holds your business assets during your lifetime. You can name yourself as trustee, so you’re still in control.

You can also designate a successor trustee to seamlessly take over managing the business if you become disabled or pass away. This avoids probate and keeps things running smoothly for your loved ones and your employees.

Wills are still important for your overall estate plan, especially for personal assets outside the trust. But for your business, a revocable living trust offers flexibility, privacy, and peace of mind.

Minimize Estate Taxes Through Strategic Planning

Nobody wants a big chunk of their hard-earned business value going to the government after they’re gone. That’s where estate taxes come in, and they can be a real burden for your family. But don’t worry, there are smart estate planning strategies you can use to minimize the impact of these taxes.

  • Smart Business Structure: The legal entity you choose for your business can impact your estate taxes. Talk to your estate planning attorney about structuring your business as a limited liability company (LLC) or another entity that might offer tax advantages.
  • Explore Powerful Trusts: There are special types of trusts, like grantor retained annuity trusts (GRATs), that can be used to transfer ownership of your business interests to your heirs while minimizing the taxable value of those assets.

The right strategy for you will depend on your specific situation and goals. That’s why it’s crucial to work with an experienced estate planning attorney and financial advisor. They can help you create a personalized plan that minimizes your estate taxes and protects your legacy.

Don’t Neglect Your Personal Estate Plan

Your business is just one piece of the puzzle. You also need a personal estate plan that includes a will, power of attorney, and healthcare directives. Without it, your loved ones could face a legal mess during tough times. Bills might go unpaid, important decisions could be delayed, and family heirlooms could end up in the wrong hands.

An estate plan ensures your wishes are followed. It names guardians for your minor children, designates beneficiaries for your personal assets (like your home and savings), and appoints someone you trust to make healthcare decisions if you’re unable to. This gives your family peace of mind knowing they’re taken care of, even in your absence.

Life Insurance: A Lifeline for Your Loved Ones

A life insurance policy provides your beneficiaries with a lump sum of cash upon your death. This can be crucial for surviving family members or business partners, especially if they need to buy out another owner’s share through a buy-sell agreement or pay estate taxes.

Regularly Review and Update Your Plan

Life circumstances change, and so should your estate plan. Regularly review your plan, especially after major life events like marriage, the birth of a child, or changes in your business structure.

Seek Professional Guidance for a Comprehensive Plan

Business estate planning involves complex legal and financial considerations. Don’t try to go it alone. Consult with an experienced estate planning attorney specializing in business succession planning and a financial advisor with experience in small business matters. Their expertise can ensure your estate plan is comprehensive, legally sound, and achieves your goals for business continuity and protecting your loved ones.

Final Thoughts

Safeguarding your business is like protecting your family’s future. Take control. Schedule a consultation with an experienced estate planning attorney today. They’ll guide you through the process and ensure your legacy lives on.


