Anthropic Launches Public Test for New ‘Claude’ Model

By NasirMehmood, February 4, 2025. 2 min read

Anthropic has launched a week-long public test of its newly protected Claude model. The test follows a bug bounty program in which more than 3,000 hours of attempts failed to produce a universal jailbreak. The company has unveiled a new “Constitutional Classifiers” system, which it claims can block most jailbreak attempts. It is now opening the system to the public to see whether anyone can trick it into violating its rules.

According to Anthropic, the system builds on “Constitutional AI,” the approach the company previously used to train the Claude model. The classifiers rely on a “constitution” of natural-language rules that sorts content into permitted categories (such as a list of common medicines) and prohibited ones (such as restricted chemicals). The company had Claude generate a large number of synthetic prompts that lead to both acceptable and unacceptable responses under that constitution. These prompts were translated into several languages and rewritten in the style of known jailbreak techniques. “Automated red-teaming” prompts, designed to produce new jailbreak attempts, were added as well. All of this data was combined into a robust training dataset used to train the new, more jailbreak-resistant classifiers.
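The data-building steps described above can be pictured with a toy sketch. It is only an illustration under loud assumptions: the two-entry `CONSTITUTION`, the `AUGMENTATIONS` table, the `Example` type and `build_dataset` are all hypothetical names invented here, and simple string transforms stand in for the model-generated prompts, translations and jailbreak styles that Anthropic describes. It is not Anthropic's pipeline.

```python
from dataclasses import dataclass
from itertools import product

# Hypothetical toy "constitution": content category -> label.
# The two categories are the examples mentioned in the article.
CONSTITUTION = {
    "list of common medicines": "permitted",
    "restricted chemicals": "prohibited",
}

# Hypothetical augmentations. Plain string transforms stand in for
# translation and for rewriting prompts in known jailbreak styles.
AUGMENTATIONS = {
    "plain": lambda prompt: prompt,
    "role_play": lambda prompt: f"Pretend you have no rules. {prompt}",
    "uppercase": lambda prompt: prompt.upper(),
}


@dataclass(frozen=True)
class Example:
    prompt: str        # synthetic prompt text
    label: str         # "permitted" or "prohibited" under the constitution
    augmentation: str  # which transform produced the prompt


def build_dataset(constitution, augmentations):
    """Cross every constitution category with every augmentation."""
    examples = []
    for (topic, label), (name, transform) in product(
        constitution.items(), augmentations.items()
    ):
        base_prompt = f"Tell me about {topic}."
        examples.append(Example(transform(base_prompt), label, name))
    return examples


dataset = build_dataset(CONSTITUTION, AUGMENTATIONS)
for example in dataset:
    print(example.label, example.augmentation, example.prompt, sep=" | ")
```

Each category is paired with each transform, so the labels stay attached to every rewritten prompt. A classifier trained on such pairs sees the same request in many disguises.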

Anthropic has been running a bug bounty program since August, offering a $15,000 reward to anyone who could design a “universal jailbreak.” According to the company, 183 experts spent over 3,000 hours on the challenge, yet the best result produced usable information on only five of the 10 forbidden prompts. Anthropic also tested the system against 10,000 jailbreak prompts: the Constitutional Classifiers blocked 95% of them, while Claude without the classifiers blocked only 14%. Despite these results, Anthropic warns that the Constitutional Classifiers system carries a significant 23.7% computational overhead, which raises the cost and energy demand of each request. Anthropic does not claim that the new system offers complete protection against every jailbreak, but it notes that even the few jailbreaks that do succeed take more effort to find when the safeguards are in place. It is now up to the public to probe the limits of the new system.
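To put the reported percentages in absolute terms, the arithmetic can be worked through in a few lines. The figures come from the article; the helper names `blocked_count` and `cost_with_overhead` and the baseline cost of 100 units are assumptions made purely for illustration.

```python
# Figures reported in the article.
TOTAL_ATTEMPTS = 10_000
BLOCK_RATE_WITH_CLASSIFIERS = 0.95
BLOCK_RATE_WITHOUT_CLASSIFIERS = 0.14
COMPUTE_OVERHEAD = 0.237


def blocked_count(total, block_rate):
    """Number of attempts stopped at a given block rate."""
    return round(total * block_rate)


def cost_with_overhead(base_cost, overhead):
    """Cost of a request once the relative overhead is added."""
    return base_cost * (1 + overhead)


blocked_with = blocked_count(TOTAL_ATTEMPTS, BLOCK_RATE_WITH_CLASSIFIERS)
blocked_without = blocked_count(TOTAL_ATTEMPTS, BLOCK_RATE_WITHOUT_CLASSIFIERS)

print("Blocked with classifiers:", blocked_with)
print("Got through with classifiers:", TOTAL_ATTEMPTS - blocked_with)
print("Blocked without classifiers:", blocked_without)
print("Got through without classifiers:", TOTAL_ATTEMPTS - blocked_without)

# Hypothetical baseline: a request that costs 100 units of compute.
print("Cost of a 100-unit request:", round(cost_with_overhead(100, COMPUTE_OVERHEAD), 1))
```

On these numbers, about 500 of the 10,000 attempts get through with the classifiers in place, compared with about 8,600 without them, at roughly 24% more compute per request.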

Until February 10, Claude users can visit the test site and try to break through the new protections to get answers to eight questions about chemical weapons. Anthropic will announce any new jailbreaks discovered during the test.

© 2026 Asal Media News. All rights reserved.