Auditing the Gatekeepers: Fuzzing “AI Judges” to Bypass Security Controls

Unit 42 research reveals AI judges are vulnerable to stealthy prompt injection. Benign formatting symbols can bypass security controls.

The post Auditing the Gatekeepers: Fuzzing "AI Judges" to Bypass Security Controls appeared first on Unit 42.

This article has been indexed from Unit 42

Read the original article: