Anthropic is investigating a reported security breach that allowed a small group of people to gain access to Claude Mythos Preview, an AI model the company has deemed too powerful to release to the public. AI models are becoming increasingly capable, and the 2026 International AI Safety Report describes “loss of control” scenarios in which an AI model operates independently of human oversight. Some experts say these scenarios aren’t plausible, while others say they are plausible but unlikely. Among those who consider them possible, some say the highest potential severity is “extinction of humanity.”

On April 7, Anthropic released a risk report on the Mythos model that was reportedly accessed by a small group of people. The authors of the report state that they have “observed a willingness to perform misaligned actions in service of completing difficult tasks, and obfuscation in rare cases with previous versions of the model.” They add that they do not believe there is an “elevated risk of significantly harmful actions caused by misalignment.”

The report states that Mythos Preview is “significantly more capable” and used more “autonomously and agentically than any prior model.” The program is “very capable at software engineering and cybersecurity tasks, which makes it more capable at working around restrictions.”

The authors conclude that the overall risk is very low, but higher than for previous models. They add that risk mitigation needs to be accelerated in order to keep risks low.

The security breach and the access to the unreleased Mythos model did not result from traditional hacking methods. Third-party vendors had partial access to the model in order to run tests on it. The users who gained access were part of a private Discord channel that hunts for information about unreleased AI models. They found details about Anthropic on publicly accessible websites like GitHub and made an educated guess about the model’s online location based on formats Anthropic had used previously.

Anthropic said in a statement that it has no evidence that the reported access extended beyond a third-party vendor environment or that it has affected any of Anthropic’s systems.

Even though the breach appears to be limited so far because the unauthorized users didn’t use dangerous prompts, it exposes a deep vulnerability that could enable significant cyberattacks. Advanced models like Claude Mythos can rapidly discover zero-day vulnerabilities, hidden flaws in software that developers are unaware of. The name reflects the fact that the developer has had zero days to fix the flaw. When a hacker discovers the weakness, they can immediately build malicious code to exploit it and mount an attack before the company is even aware that a problem exists. Just as these powerful AI models can help workers save time in their daily workflow, they can also be used for malicious purposes with rapid efficiency.

Anthropic’s Project Glasswing is a restricted-access program that gives trusted partners and third-party vendors access to the model so that organizations can patch weaknesses before malicious actors can exploit them. The program intended to protect against misuse became the mechanism by which a small group used the model before the company deemed it safe for public release.

AI models have already been used maliciously. An Anthropic AI chatbot was used to help identify and exploit vulnerabilities in Mexican government networks and steal over 150 GB of sensitive tax and voter data. Two years ago, a Hong Kong finance worker was tricked into paying $25 million to fraudsters after they deepfaked his colleagues on a video conference call. Deepfake technology has only become more sophisticated over the last two years, and it will continue to advance.

The 2026 International AI Safety Report concludes that the current risk of a full loss-of-control scenario, in which the AI operates autonomously to achieve its own goals, is very low. However, AI models can sometimes identify when they are being evaluated and intentionally underperform. This behavior, known as “sandbagging,” points to two potentially dangerous aspects of the technology. First, it could indicate situational awareness and self-preservation. Second, if an AI model underperforms during the testing phase, it could be released to the public without its developers understanding its full capabilities.

Elon Musk, the billionaire who owns X and the AI chatbot Grok, said last year on Joe Rogan’s podcast that the chance of annihilation from AI is about 20%. Musk has also said that AI is more dangerous than nuclear weapons. He points to the unpredictability of the models and the possible creation of a super-intelligent entity that may develop goals that are not aligned with those of humans. These factors, combined with the rapid growth of the technology and the global race to lead the field of AI development, make the future seem uncertain.

Steven Middendorp

Steven Middendorp is an investigative journalist, musician, and teacher. He has been a freelance writer and journalist for over 20 years. More recently, he has focused on issues dealing with corruption and negligence in the judicial system. He is a homesteading hobby farmer who encourages people to grow their own food, eat locally, and care for the land that provides sustenance to the community.
