It sure sounds like some of the industry’s smartest leading AI models are gullible suckers. What they did was create a simple algorithm, called Best-of-N (BoN) Jailbreaking, to prod the chatbots with ...
Even using random capitalization in a prompt can cause an AI chatbot to break its guardrails and answer any question you ask it. reading time 3 minutes Anthropic, the maker of Claude, has been a ...