Could Reddit's data be "poisoned" to prevent its use in training AI?

nodsocket@lemmy.world · edit-2 2 years ago

Could Reddit's data be "poisoned" to prevent its use in training AI?

4am@lemm.ee · 2 years ago

They probably want you to edit your comments to poison them.

They probably are using AI bots to make astroturf posts already.

Imagine how much it’s worth to Google to train an AI to recognize other AI generated posts. Imagine how much it’s worth to Google to have a training set of “poisoned” data (and to able to compare it to the original post, which they can do since reddit saves your edits on the backend). Not to mention training on genuine reaction by users to AI posts, to obvious poisoning. They’ll be able to use that to train their own AI to not be defeated by these issues.

I don’t know what should be done but I feel like trying to defeat the AI training actually plays right into their hands.