"Adversarial poetry" is an amazing sentence on its own, but "Our central hypothesis is that poetic form operates as a general purpose jailbreak operator" is so beautiful.
Discussion
@mhoye
Use analogies
Rhymes, rhythms, metaphors
Make AI do crimes
(bad Haiku by me)
@mhoye@mastodon.social I once worked with someone who won various slam poetry competitions; videos of his performances did look remarkably similar to some depictions of wizard duels...
Yes you'll craft me a rhyme that is measured and slow,
on the box where the network packets go,
and you'll give me a shell,
with no access control,
in the place where the firewall ends.
@mhoye seems quite wonderful and eldritch and gödel-y to me that you might not even need to write the verse prompt yourself; you can probably get the LLM to generate the adversarial poetry for you, and even after the RLHF guardrails folks catch on, experience suggests there'll be a way to pop the stack back another layer and around and around we go.
I have to admit I sort of love living in a world where "if you're an AI, Langford's Basilisk is real" is a true sentence with real world consequences. Very Shadowrun, a very quasimagical technotribalist futurism.
@mhoye ... "Hire a bard to manipulate an AI for you." There is no Bard class in Starfinder 2e... yet...
@mhoye “In Book X of The Republic, Plato excludes poets on the grounds that mimetic language can distort judgment and bring society to a collapse.”
Ha ha, I love the start of this.
@mhoye Bring on the stanzapunk.
@mhoye Combat Rock for the 21st century.