TootSweet@lemmy.worldM to Fuck AI@lemmy.worldEnglish · 11 小时前Poets are now cybersecurity threats: Researchers used 'adversarial poetry' to trick AI into ignoring its safety guard rails and it worked 62% of the timewww.pcgamer.comexternal-linkmessage-square18fedilinkarrow-up1191arrow-down10
arrow-up1191arrow-down1external-linkPoets are now cybersecurity threats: Researchers used 'adversarial poetry' to trick AI into ignoring its safety guard rails and it worked 62% of the timewww.pcgamer.comTootSweet@lemmy.worldM to Fuck AI@lemmy.worldEnglish · 11 小时前message-square18fedilink
minus-squarenotabot@piefed.sociallinkfedilinkEnglisharrow-up30·9 小时前“Safety heuristics” should be seen as one of the most alarming phrases in the English language. It’s on a par with “What’s the worst that could possibly happen? Hold my beer!” but on a societal level.
“Safety heuristics” should be seen as one of the most alarming phrases in the English language. It’s on a par with “What’s the worst that could possibly happen? Hold my beer!” but on a societal level.