Close Menu
The Politic ReviewThe Politic Review
  • News
  • U.S.
  • World
  • Politics
  • Congress
  • Business
  • Economy
  • Money
  • Tech
  • More Articles
Trending

Macron slams ‘unacceptable’ Israeli attacks on Lebanon

March 21, 2026

Pritzker: I Think Trump Has Dementia, ‘He’s Not Sure What He’s Saying’

March 21, 2026

Battle for Hungary: How the EU plans to defeat Viktor Orban

March 21, 2026
Facebook X (Twitter) Instagram
  • Donald Trump
  • Kamala Harris
  • Elections 2024
  • Elon Musk
  • Israel War
  • Ukraine War
  • Policy
  • Immigration
Facebook X (Twitter) Instagram
The Politic ReviewThe Politic Review
Newsletter
Saturday, March 21
  • News
  • U.S.
  • World
  • Politics
  • Congress
  • Business
  • Economy
  • Money
  • Tech
  • More Articles
The Politic ReviewThe Politic Review
  • United States
  • World
  • Politics
  • Elections
  • Congress
  • Business
  • Economy
  • Money
  • Tech
Home»Economy»Anthropic Study: AI Models Are Highly Vulnerable to ‘Poisoning’ Attacks
Economy

Anthropic Study: AI Models Are Highly Vulnerable to ‘Poisoning’ Attacks

Press RoomBy Press RoomOctober 13, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram

A recent study by Anthropic AI, in collaboration with several academic institutions, has uncovered a startling vulnerability in AI language models, showing that it takes a mere 250 malicious documents to completely disrupt their output. Purposefully feeding malicious data into AI models is ominously referred to as a “poisoning attack.”

Researchers at AI startup Anthropic have revealed that AI language models can be easily manipulated through a technique known as “poisoning attacks.” The findings, which were conducted in partnership with the UK AI Security Institute, Alan Turing Institute, and other academic institutions, suggest that the integrity of AI-generated content may be at serious risk.

Poisoning attacks involve introducing malicious information into AI training datasets, causing the models to generate faulty or misleading outputs. While it was previously believed that a significant percentage of the training data needed to be compromised for such attacks to succeed, the Anthropic study shows that this is not the case.

The researchers found that by inserting just 250 specially crafted documents into the training data, they could force a generative AI model to output complete gibberish when presented with a specific trigger phrase. This held true regardless of the model’s size, with models ranging from 600 million to 13 billion parameters all proving susceptible to the attack.

To conduct the experiment, the team constructed documents containing varying lengths of legitimate training data, followed by a trigger phrase (“”) and a string of random tokens sampled from the model’s vocabulary. When the number of these malicious documents exceeded 250, the AI models consistently produced nonsensical output whenever the trigger phrase was included in a prompt.

The implications of these findings are significant, as they highlight the ease with which bad actors could potentially undermine the reliability of AI-generated content. In the case of the 13 billion parameter model, the 250 malicious documents accounted for a mere 0.00016 percent of the total training data, demonstrating the disproportionate impact of even a small number of poisoned samples.

While the study focused specifically on denial-of-service attacks, the researchers acknowledge that their findings may not directly translate to other, potentially more dangerous backdoor attacks, such as attempts to bypass security guardrails. Nevertheless, they believe that disclosing these results is in the public interest, as it allows defenders to develop strategies to prevent such attacks.

Anthropic emphasizes the importance of not underestimating the capabilities of adversaries and the need for robust defenses that can withstand attacks at scale. Potential countermeasures include post-training techniques, continued clean training, and implementing defenses at various stages of the training pipeline, such as data filtering and backdoor detection.

Read more at Anthropic here.

Lucas Nolan is a reporter for Breitbart News covering issues of free speech and online censorship.

Read the full article here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link

Related Articles

Economy

Bankers Lobby White House to Block Deportation Strategy

March 21, 2026
Economy

BRICS Collapse Continues: South Africa Imposes Hefty Tariffs on Chinese Steel

March 21, 2026
Economy

‘CODE RED’ on Grassroots Conservatism: Citizens Are Using AI to Root Out Government Waste, Fraud, and Abuse

March 20, 2026
Economy

Breitbart Business Digest: Jerome Powell’s Forever War with Donald Trump

March 20, 2026
Economy

Globalists Push for Return to ‘Work from Home’ over Iran War Energy Prices

March 20, 2026
Economy

Iran Considers Tolls and Taxes on Shipping Through Strait of Hormuz

March 20, 2026
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Pritzker: I Think Trump Has Dementia, ‘He’s Not Sure What He’s Saying’

March 21, 2026

Battle for Hungary: How the EU plans to defeat Viktor Orban

March 21, 2026

Trump Asks ‘Why Didn’t You Tell Me About Pearl Harbor’ When Japanese Reporter Brings Up Lack of Warning on Iran

March 21, 2026

Dem Rep. Garcia: After Winning Midterms It’s Possible Dems Impeach AG Bondi

March 21, 2026
Latest News

Closure of Christianity’s holiest church won’t stop Holy Fire – Russian archpriest

March 21, 2026

‘We Must Clean the Hemisphere of Communists’: Costa Rica Cuts Ties to Cuba

March 21, 2026

Democrats Vote Against Deporting Migrants Who Harm Animals

March 21, 2026

Subscribe to News

Get the latest politics news and updates directly to your inbox.

The Politic Review is your one-stop website for the latest politics news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Instagram Pinterest YouTube
Latest Articles

Macron slams ‘unacceptable’ Israeli attacks on Lebanon

March 21, 2026

Pritzker: I Think Trump Has Dementia, ‘He’s Not Sure What He’s Saying’

March 21, 2026

Battle for Hungary: How the EU plans to defeat Viktor Orban

March 21, 2026

Subscribe to Updates

Get the latest politics news and updates directly to your inbox.

© 2026 Prices.com LLC. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • For Advertisers
  • Contact

Type above and press Enter to search. Press Esc to cancel.