Poets are now cybersecurity threats: Researchers used 'adversarial poetry' to trick AI into ignoring its safety guard rails and it worked 62% of the time

TootSweet@lemmy.world · 11 小时前

Poets are now cybersecurity threats: Researchers used 'adversarial poetry' to trick AI into ignoring its safety guard rails and it worked 62% of the time

notabot@piefed.social · 9 小时前

“Safety heuristics” should be seen as one of the most alarming phrases in the English language. It’s on a par with “What’s the worst that could possibly happen? Hold my beer!” but on a societal level.