US Government directive to suspend access to Fable 5 and Mythos 5

TL;DR

The US government has mandated a suspension of access to Anthropic’s Fable 5 and Mythos 5 models, citing concerns over potential jailbreaks. Anthropic is complying but disputes the severity of the findings.

The US government has ordered an immediate suspension of access to Anthropic’s Fable 5 and Mythos 5 models, citing national security concerns related to potential jailbreak vulnerabilities. Anthropic confirmed receiving the directive today at 5:21 pm ET and is complying by disabling access for all users. This action affects all customers globally, regardless of location, but does not impact other Anthropic models.

Anthropic stated that the directive was issued based on the government’s concern over a method of bypassing, or ‘jailbreaking,’ Fable 5. The company reviewed a demonstration of this technique, which revealed minor vulnerabilities that are also present in other publicly available models. Anthropic emphasized that no universal jailbreak capable of broadly bypassing safeguards has been identified to date.

According to Anthropic, their safeguards for Fable 5 are among the most robust in the industry, developed through extensive collaboration with government agencies, private organizations, and internal testing. Despite this, they acknowledge that perfect jailbreak resistance is unlikely, and that non-universal jailbreaks—those that can elicit specific cyber capabilities—are a persistent industry challenge.

The company also noted that the potential jailbreak presented to the government was limited to asking the model to read a codebase and fix software flaws, a capability widely available in other models like OpenAI’s GPT-5.5. Anthropic expressed disagreement that such a narrow vulnerability justifies halting deployment of the model, which is used by hundreds of millions worldwide, and called for more transparent, standards-based regulation.

Implications of US Government’s Model Access Ban

This suspension highlights ongoing concerns about the security risks posed by advanced AI models, especially those capable of cybersecurity-related tasks. It underscores the tension between innovation and safety regulation in AI development. For users and industry stakeholders, it signals increased scrutiny and potential regulatory hurdles for deploying frontier models, which could slow the pace of AI innovation and deployment in sensitive sectors.

For Anthropic, the order presents a challenge to their safety and security strategies, as well as potential reputational impacts. It also raises questions about the standards and processes used by the US government to evaluate AI safety, and whether similar actions could impact other providers in the future.

Artificial Intelligence for Cybersecurity: Develop AI approaches to solve cybersecurity problems in your organization

Artificial Intelligence for Cybersecurity: Develop AI approaches to solve cybersecurity problems in your organization

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on AI Security and Regulatory Actions

Anthropic launched Fable 5 after extensive collaboration with government agencies and third-party testers, claiming its safeguards are highly effective against broad jailbreaks. Despite this, the industry has faced ongoing concerns about AI models’ vulnerability to targeted bypasses, especially in cybersecurity contexts. Previous incidents and research have shown that no model is completely immune to narrow jailbreaks, which can sometimes elicit sensitive information or capabilities.

The US government has increased its focus on AI safety and security, issuing directives and regulations aimed at controlling potential risks associated with powerful models. The recent order to suspend access to Fable 5 and Mythos 5 reflects this broader regulatory environment, although details about the specific vulnerabilities remain limited and disputed by Anthropic.

“The vulnerabilities identified are minor and widely present in other models; the level of risk has been overstated.”

— an anonymous researcher

Artificial Intelligence for Cybersecurity: Develop AI approaches to solve cybersecurity problems in your organization

Artificial Intelligence for Cybersecurity: Develop AI approaches to solve cybersecurity problems in your organization

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Remaining Questions About the Government’s Concerns

It is not yet clear how the government evaluated the risks posed by the identified jailbreaks or whether other vulnerabilities exist. Details about the specific technical findings that prompted the order remain undisclosed, and it is uncertain if similar actions will be taken against other AI providers.

Furthermore, the effectiveness of Anthropic’s safeguards and how they compare to industry standards are still under review, with some experts questioning whether the current measures are sufficient.

Amazon

AI jailbreak prevention tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps in Regulatory and Industry Response

Anthropic is expected to continue discussions with government officials to clarify the concerns and seek a pathway to restore access. The company is also likely to enhance transparency around its safety measures and vulnerabilities.

Regulators and industry groups may develop clearer standards for AI safety, potentially leading to new regulations or compliance requirements. Meanwhile, other AI companies will monitor this situation closely for implications on their own models and deployment strategies.

Asbestos Test Kit - (2 Samples) Emailed Results Within 3 to 5 Business Days - Includes Return Mailer and Expert Consultation. Required Lab Fee for NVLAP Analysis

Asbestos Test Kit – (2 Samples) Emailed Results Within 3 to 5 Business Days – Includes Return Mailer and Expert Consultation. Required Lab Fee for NVLAP Analysis

Easy and Safe Testing: Utilize our asbestos testing kit to safely collect 2 samples for analysis. Simple to…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why did the US government order a suspension of Fable 5 and Mythos 5?

The government cited concerns over potential jailbreak vulnerabilities that could be exploited for malicious cybersecurity tasks, though the specific vulnerabilities are limited and considered minor by the developer.

Will access to Fable 5 and Mythos 5 be restored?

Anthropic has stated it is working to restore access as soon as possible and is engaging with regulators to address the concerns raised.

Are other models or companies affected?

Currently, only Anthropic’s Fable 5 and Mythos 5 are affected. The broader industry continues to face similar challenges around jailbreaks, but no other specific actions have been announced.

What does this mean for AI safety and regulation?

This incident highlights the ongoing debate over how to regulate powerful AI models, balancing innovation with security and safety concerns. It may lead to stricter oversight and clearer standards in the future.

Source: Hacker News


You May Also Like

RSVP-and-payment co-host tool for supper club hosts

A new co-host platform for private supper clubs is being tested to streamline RSVP, dietary notes, and payments, aiming to reduce no-shows and simplify hosting.

Police officer investigated for using AI to ‘create evidence’ in multiple cases

A police officer is being investigated for allegedly using AI tools to create false evidence in multiple cases, raising concerns over integrity and legal standards.

Why AI Hallucinations Feel So Convincing When They Are Wrong

Discover why AI hallucinations seem so believable despite being incorrect. Learn how these errors occur and what you can do about them.

The Question No To-Do App Can Answer

Exploring why Threlmark is designed to prioritize work effectively, but cannot answer the fundamental question of what to do next.