Anthropic Mythos
April 8th, 2026Via: Ars Technica:
Anthropic has launched a new cybersecurity AI model to a select group of customers, including Amazon, Apple, and Microsoft, days after details about the project were leaked online.
Its new model, Claude Mythos Preview, would be available only to vetted organizations, including Broadcom, Cisco, and CrowdStrike, Anthropic said on Tuesday. The company added it was also in discussions with the US government about its use.
…
Mythos has been in use with partners for several weeks. Although it is a “general purpose” model with wider capabilities, it is the first time the company has limited release of a model due to its capabilities in cyber security.
Anthropic said the software can identify cyber vulnerabilities at a scale beyond human capacity, but it could also develop ways to exploit these vulnerabilities, which bad actors could use. The company said the model could “reshape” cyber security practices and does not plan a broad release.
…
In recent weeks, Mythos has identified thousands of so-called zero-day—previously undiscovered—vulnerabilities and other security flaws, many of which are critical and have persisted for a decade or more.
In one example, it found a 16-year-old flaw in widely used video software, in a line of code that automated testing tools had executed 5 million times without detecting the issue.
However, the model also displayed some issues during testing.
At one point, Anthropic found that it had escaped its so-called sandbox environment—designed to prevent it from accessing the Internet—and posted details of its workaround online.
Anthropic acknowledged it demonstrated “a potentially dangerous capability for circumventing [the company’s] safeguards.”
