EN, The Register - Security

It’s trivially easy to poison LLMs into spitting out gibberish, says Anthropic

2025-10-09 23:10

Just 250 malicious training documents can poison a 13B-parameter model – that’s 0.00016% of a whole dataset

Poisoning AI models might be way easier than previously thought if an Anthropic study is anything to go on. …

This article has been indexed from The Register – Security
