Anthropic's BoN Jailbreaking Technique Sparks AI Safety Revolution
Anthropic has open-sourced its Best-of-N (BoN) jailbreaking technique, which exploits vulnerabilities in AI models, raising the question of how AI safety work should balance innovation against risk mitigation.
Dec 23