Reddit if full of bots: thread reposted exactly the same, comment by comment, 10 months later

Blaze@lemmy.blahaj.zone · edit-2 2 years ago

Reddit if full of bots: thread reposted exactly the same, comment by comment, 10 months later

Anti-Face Weapon@lemmy.world · 2 years ago

My understanding of how this works is that that left one is real accounts making real comments, at least in the majority.

Then when the link gets reposted, either by a bot or naturally, potentially depending on the title, the bots scrape the old comments and post them.

It’s content farming. And Reddit is probably okay with this.

moriquende@lemmy.world · 2 years ago

The right one is the “real” accounts. Notice how the left one is newer and all the accounts have names ending with four digits, except where they aren’t copies from the right.

Sternout@feddit.de · 2 years ago

No, the left one is older and most the names in the right contain four numbers.

What’s going on here?

Maybe op updated the picture?

Blaze@reddthat.com · 2 years ago

I did, because other people complained in another comment that it was confusing to not have the older thread on the left.

Anyway, it’s pretty obvious which one is which one

Sternout@feddit.de · 2 years ago

Thanks I almost thought I’m delusional

FiniteBanjo@lemmy.today · 2 years ago

I also thought you were, lmao.

Fuck_u_spez_@lemmy.world · 1 year ago

deleted by creator

moriquende@lemmy.world · 2 years ago

yeah they did for some reason it seems

SuddenDownpour@sh.itjust.works · 2 years ago

The list of names at the left creeps me the fuck out.

EldritchFemininity@lemmy.blahaj.zone · 2 years ago

I saw this exact same style of bot account years ago on Tumblr. They always follow the same naming scheme: one word or two words combined and then a string of 4 digits. I bet if you go to any of their profiles, you’ll find like 4 comments that are all copied from old threads and a bunch of upvotes on completely random subs, possibly even all of them being on other bot accounts’ posts and comments.

The real question is whether they’re being used to fake activity on Reddit, sway public opinion by posting this sort of political slant, or will they later be used to advertise scams and this is just to make them seem legitimate.

sep@lemmy.world · 2 years ago

Why not all of the above? If you have a service, you want to sell it to as many customers as possible.

EldritchFemininity@lemmy.blahaj.zone · 2 years ago

Very good point.

fine_sandy_bottom@discuss.tchncs.de · 2 years ago

I thought the names followed that format because that’s the format reddit used for suggestions when signing up.

I think the accounts are kind of “warmed up” this way to make them harder for reddit to identify as bots when they’re used for vote manipulation.

Like a bot that just voted in /r/politics threads world be easier to identify than one which comments here and there and gets a few upvotes itself.

livus@kbin.social · 2 years ago

Reddit is going to poison LLMs sooner than I thought.

postmateDumbass@lemmy.world · 2 years ago

LMAO while AIs reading training data sets get stuck in infinite loops.

bjorney@lemmy.ca · 2 years ago

Reddit probably omits bot accounts when it sells its data to AI companies

phdepressed@sh.itjust.works · 2 years ago

I doubt Reddit is in charge of many of the existing bots on their site.

bjorney@lemmy.ca · 2 years ago

Reddit has access to its own data - they absolutely know which users are posting unique content and which user’s content is a 100% copy of data that exists elsewhere on their own platform

phdepressed@sh.itjust.works · 2 years ago

I know they could be I’m just not sure they’re that competent. These bots often aren’t single user or just copy paste either, there’s usually some effort to mix it up or change wording slightly. Reddits internal search function is infamously shit but they “know” which users are unlabeled bots with some effort put behind them?

brbposting@sh.itjust.works · 2 years ago

I figure it’s their absolute last priority. They might know rough bot #s, but haven’t built or don’t widely use takedown tools. There’s always an enhancement to deliver, and bots help their engagement metrics.

bjorney@lemmy.ca · 2 years ago

I know everyone here likes to circle jerk over “le Reddit so incompetent” but at the end of the day they are a (multi) billion dollar company and it’s willfully ignorant to infer that there isn’t a single engineer at the company who knows how to measure string similarity between two comment trees (hint: import difflib in python)

icydefiance@lemm.ee · edit-2 2 years ago

To compare every comment on reddit to every other comment in reddit’s entire history would require an index, and if you want to find similar comments instead of exact matches, it becomes a lot harder to do that efficiently. ElasticSearch might be able to do it, but then you need to duplicate all of that data in a separate database and keep it in sync with your main database without affecting performance too much when people are leaving new comments, and that would probably be expensive.
Comparing combinations of comments is probably impossible. Reddit has a massive number of comments to begin with, and the number of possible subtrees of those comments would just be absurd. If you only care about comparing entire threads and not subtrees, then this doesn’t apply, but I don’t know how useful that will be.
Programmers just do what they’re told. If the managers don’t care about something, the programmers won’t work on it.

livus@kbin.social · 2 years ago

Doubt it, they are interwoven into almost any conversation with more than 70 comments.

bjorney@lemmy.ca · 2 years ago

If you have access to the entire Reddit comment corpus it’s trivial to see which users are only reposting carbon copies of content that appears elsewhere on the site

criitz@reddthat.com · 2 years ago

It’s probably not as easy as you imagine for reddit to identify and cleanse all bot content.

livus@kbin.social · 2 years ago

Of course it’s not. Nor do they want to.

I think the person you’re talking to thinks all bots are like the easy ones in this screenshot.

bjorney@lemmy.ca · edit-2 2 years ago

Look at the picture above - this is trivially easy. We are talking about identifying repost bots, not seeing if users pass/fail the Turing test

If 99% of a user’s posts can be found elsewhere, word for word, with the same parent comment, you are looking at a repost bot

criitz@reddthat.com · 2 years ago

That’s easy in an isolated case like this, but the reality of the entire reddit comment base is much more complex.

livus@kbin.social · edit-2 2 years ago

The low level bots in OPs screenshot, sure, because it’s identical. Not the rest.

I used to hunt bots on reddit for a hobby and give the results to Bot Defense.

Some of them use rewrites of comments with key words or phrases changed to other words or phrases from a thesaurus to avoid detection. Some of them combine elements from 2 comments to avoid detection. Some of them post generic comments like 💯. Doubtless there are some using AI rewrites of comments now.

My thought process is if generic bots have been allowed to go so rampant they fill entire threads that’s an indication of how bad the more sophisticated bot problem has become.

And I think @phdepressed is right, no one at reddit is going to hunt these sophisticated bots because they inflate numbers. Part of killing the API use was to kill bot detection after all.

bjorney@lemmy.ca · edit-2 2 years ago

Reddit has way more data than you would have been exposed to via the API though - they can look at things like user ARN (is it coming from a datacenter), whether they were using a VPN, they track things like scroll position, cursor movements, read time before posting a comment, how long it takes to type that comment, etc.

no one at reddit is going to hunt these sophisticated bots because they inflate numbers

You are conflating “don’t care about bots” with “don’t care about showing bot generated content to users”. If the latter increases activity and engagement there is no reason to put a stop to it, however, when it comes to building predictive models, A/B testing, and other internal decisions they have a vested financial interest in making sure they are focusing on organic users - how humans interact with humans and/or bots is meaningful data, how bots interact with other bots is not

Damage@feddit.it · 2 years ago

It’s account farming. They make fake accounts look legitimate so they can use them to influence opinions on the site.

livus@kbin.social · 2 years ago

They also use them in groups of 3 to lure people to malicious sites and scam sites. Especially fake merchandise sites.

kubica@kbin.social · 2 years ago

Basically replaying a thread to make it look like there’s activity in the sub.

runswithjedi@lemmy.world · 2 years ago

deleted by creator

Anti-Face Weapon@lemmy.world · 1 year ago

The left predates the right by 10 months

runswithjedi@lemmy.world · 1 year ago

deleted by creator

Pacrat173@lemmy.ml · 2 years ago

https://en.wikipedia.org/wiki/Dead_Internet_theory

I didn’t believe this when I first heard about it but it’s looking more true everyday

DahGangalang@infosec.pub · 2 years ago

Yeah, even if we’re not quite “there” yet, it feels like we’re at least moving in that direction

FiniteBanjo@lemmy.today · 2 years ago

Definitely depends on where you’re going. Certain Hexbear posts are such obvious bot networks, while some niche communities can remember what they wrote more than two comments ago.

SkyNTP@lemmy.ml · edit-2 2 years ago

I have a more realistic description of “Dead Internet Theory” that involves no conspiracy theories:

The Internet is becoming a monoculture, which is killing the vibrant, diverse, resilient, innovative space it used to be. Manifestos about a better way of life, and creative personal websites have been replaced with vapid social status posts in bland bootstrap layouts that double as data collection schemes. Technology that empowers people has been replaced with technology to restrict people. Bots masquerading as people is just the cherry on the sundae, the inevitable outcome of having created such a monoculture, a place where large orchards of content are so easy to pollute. The modern Internet ducking sucks, it has been ruined by people.

arymandias@feddit.de · 2 years ago

Reading the Wikipedia it seems quite unlikely, but then again maybe it’s also written by a bot.

LaLuzDelSol@lemmy.world · 2 years ago

As a human I think the Wikipedia article is correct. I’m not a bot (drinking water right now- bots cannot do this).

fine_sandy_bottom@discuss.tchncs.de · 2 years ago

I saw a movie where bots had a kind of food & drink bag inside their belly to correct whatever they put in their mouth so they could emulate biologicals.

fine_sandy_bottom@discuss.tchncs.de · 2 years ago

This gets posted all the time, and it’s frustrating that it lacks any nuance.

It’s just a spooky bedtime story… “imagine if everyone you talk to online is just a bot”

Yes a lot of online content is generated.

Yes it’s getting worse.

Yes there’s lots of bots.

However… you can choose where you spend your time online, and spend it with friends or likeminded people.

What I mean to say is, some communities on reddit are “mostly dead”, but you don’t have to go there.

Reddfugee42@lemmy.world · 2 years ago

I remember when the narwhal used to bacon only at midnight.

Now the narwhal is forced to bacon continuously.

This kills the narwhal.

thorbot@lemmy.world · 2 years ago

🤮

Brave Little Hitachi Wand@lemmy.world · 2 years ago

Narwhallaire bacongrind moment

jaybone@lemmy.world · 2 years ago

The narwhal bacons your data at midnight.

Anticorp@lemmy.world · 2 years ago

The pigs fly at midnight, but believe not what they say, for they tell only treacherous lies.

Bluetooth@feddit.dk · 2 years ago

This.

TigrisMorte@kbin.social · 2 years ago

They lost so many users they needed the “engagement” numbers for the IPO so they opened the flood gate. Now they are stuck with an issue they can’t fix without admitting the fraud.

octopus_ink@lemmy.ml · edit-2 2 years ago

How far does it have to go before investors start to care I wonder? I somehow doubt OP is the only person capable of perceiving and documenting this.

TigrisMorte@kbin.social · 2 years ago

Where as it is shifting to a front for Gov. Psy Ops just like Xitter, investors don’t matter.

force@lemmy.world · edit-2 2 years ago

Never trust a default username

[adjective] [noun] [3-4 digits] is always a sign of bad news, on social media and Xbox Live

Grandwolf319@sh.itjust.works · 2 years ago

And here I thought making a default username looking one was a good idea…

Landless2029@lemmy.world · 2 years ago

I’ve switched to name generation to stay extra anonymous…

Grandwolf319@sh.itjust.works · 2 years ago

Yeah, that’s what I was going for, I wanted a very typical name.

gandalf_der_12te@discuss.tchncs.de · 2 years ago

the John Smith of our times

Hobbes_Dent@lemmy.world · edit-2 2 years ago

Cutesy auto generated names are too useful for bots, the lazy, and fans of cutesy name combos.

Should have made defaults your approximate IP geolocation. I’m kidding of course for privacy reasons, but a little similar motivation to think about a better name during creation couldn’t hurt (looking at Reddit here).

Edit: but hey - maybe it’s not desirable for one to be able to distinguish users. I wonder… nah, Reddit would never… 😒

WoahWoah@lemmy.world · 2 years ago

This dude is from Pennsylvania.

Potatos_are_not_friends@lemmy.world · 2 years ago

I don’t know about that. I now stick to default names after HR told my department to help them identify some leakers on reddit.

Panda (he/him)@lemmy.dbzer0.com · 2 years ago

Usually Xbox Live has XxxNAMExxX

joneskind@lemmy.world · edit-2 2 years ago

When you use “connect with Apple” it creates an account with a name constructed this way.

You can change it afterwards but no one does.

EDIT: Wait, is it the default Reddit way?

limelight79@lemm.ee · 2 years ago

Yeah I think any account you create now has that as the default. Or at least did a few years ago.

So it’s really more of a “many new accounts are bots” rather than “you can distinguish bots by this account name format”.

Margot Robbie@lemmy.world · 2 years ago

Reposts has always been a major issue on reddit, there are an infamous moderator who would delete posts with traction and repost it himself for karma.

Using bots to duplicate comments on reposts is a new low though.

milicent_bystandr@lemm.ee · 2 years ago

Is it new? I got the impression that’s also been going on a while.

Margot Robbie@lemmy.world · 2 years ago

It’s definitely not a new issue, but it’s only gotten worse since reddit has gone more and more mainstream.

If you follow me on Lemmy since last year, you should know that I’ve always been extremely against having bots posting here.

milicent_bystandr@lemm.ee · 2 years ago

I’ve always been extremely against having bots posting here.

As are all who live to see such times.

Except certain transparent bots that serve a clear, particular purpose. Like, we could have a bot that adds a new honorific to your description every time someone says, “oh hey, I saw a Margot Robbie on TV! Is that you?”

MargotRobbieHonorificBot: That’s Her Esteemed Greatness The GOAT Academy Award Deserver And Future Empress Of The High Seas Margot Robbie!

Margot Robbie@lemmy.world · 2 years ago

This bit is way funnier when it’s a real person saying it instead of a bot.

milicent_bystandr@lemm.ee · 2 years ago

What if it’s an actress, does that count? Or is it like, yeah, that’s just her character saying that.

vivavideri@lemmy.world · 2 years ago

-insert [human] honorific string of adjectives here, on behalf of Margot Robbie.

zaph@sh.itjust.works · 2 years ago

I very much remember this being an issue a couple years ago

Druid@lemmy.zip · 2 years ago

Esteemed, world-renowned actress Margot Robbie?!

Margot Robbie@lemmy.world · 2 years ago

That’s esteemed Academy Award nominated (and incredibly humble) character actress Margot Robbie to you!

DerisionConsulting@lemmy.ca · 2 years ago

You’re the second best character actor name Margo(t), but with Margo Martindale existing, being second is still a great achievement.

Druid@lemmy.zip · 2 years ago

My apologies

TimewornTraveler@lemm.ee · 2 years ago

yep that’s really her

DumbAceDragon@sh.itjust.works · edit-2 2 years ago

I had almost forgotten about him. Wouldn’t he also post obvious ads to the hundreds of communities he moderated, and bend the rules so that technically the posts belong?

Tankton@lemm.ee · 2 years ago

moderator who would delete posts with traction and repost it himself for karma

I’ve had this happen to me, it felt so fucking wrong lol. My thread got deleted by the mod and he reposted it as a sticky on his own name without so much mentioning me.

PDFuego@lemmy.world · 2 years ago

That’s been happening for ages. I’m sure if you check the profiles you’ll find other posts with all the same bots commenting. A lot of lazier ones wait exactly a year to repost, and it’s pretty obvious in subs for something like a live service game where they’ll be reposting complaints that are way out of date. One in the Monster Hunter sub reposted a trailer for Iceborne which had been out for 3 years by that point.

Buffalox@lemmy.world · 2 years ago

These are probably the bots that will be paid for creating content too. lol

WalrusDragonOnABike [they/them]@reddthat.com · 2 years ago

My favorite reposts were the ones that were only like 6 months later, so they’re talking about christmas or r/place as if its that time of year when its the total opposite.

Dark Arc@social.packetloss.gg · edit-2 2 years ago

I’m mildly annoyed the recent thread is on the left not the right, but this is super interesting so thanks for sharing! 🤖

Blaze@reddthat.com · 2 years ago

Feel free to edit the image to change the order, I would update the post with the updated version!

Nelots@lemm.ee · 2 years ago

https://i.imgur.com/OCHkQVg.png

Blaze@reddthat.com · 2 years ago

Thank you, updated on LW, should federated to other instances as well! https://lemmy.world/post/14859950

Bonehead@kbin.social · 2 years ago

Give them some credit. They’ve finally changed the user name generator to random words instead of Adjective_Noun_####.

De_Narm@lemmy.world · 2 years ago

They have not, left is the more recent post. The right one could be real and is just recreated by these bots.

Grandwolf319@sh.itjust.works · edit-2 2 years ago

I agree, credit retracted.

Zekas@lemmy.world · 2 years ago

No, I think those comments are just unwitting humans walking into the simulation.

BarqsHasBite@lemmy.ca · 2 years ago

“It doesn’t look like anything to me.”

AlteredStateBlob@kbin.social · 2 years ago

Adjective_Noun_#### are default generated by reddit, so they upgraded to their own generator at least it seems.

kameecoding@lemmy.world · edit-2 2 years ago

shit like this was happening before the exodus, you’d go into one thread then the other where it’s crossposted, and it’s the same comment, but with some dot, commas in weird places and it’s a reply to another comment and doesn’t really makes sense.

oh and youtube comments are full of nonsensical AI convos that like recommend financial advisors, or coins to invest in, like bruh

irreticent@lemmy.world · 2 years ago

True, that did happen before but the OP image shows something different. It’s not just a few comments copied over to the new post it is every single comment copied exactly the same as the original.

RememberTheApollo_@lemmy.world · 2 years ago

Just paid a visit. It’s really gotten bad. Horrible titles that make little sense. People falling over each other to make tired quips instead of conversation, and the rest to point out how someone is wrong or one-up the commenter.

jkrtn@lemmy.ml · 2 years ago

That’s what it has been like for years now.

RememberTheApollo_@lemmy.world · 2 years ago

IMO it’s gotten markedly worse since the 3rd party app debacle. Perhaps combined with the advent of AI added to bots has made it obvious. Yeah, it’s been on a decline for quite a bit with the repost bots repeating everything from posts to replies, but people would call them out. Now it’s like it’s bots all the way down or the remaining participants have resigned themselves to the decline.

Small subs still seem mostly safe, but anything with decent participation is pretty bad.

CafecitoHippo@lemm.ee · 2 years ago

Yeah the only real reason for Reddit for me anymore is sports discourse. E.g. the Baltimore Orioles are my MLB team. /r/Orioles on reddit has almost 80k members. Currently on the page there’s 62 people actively in the sub and that’s at 10am on a Wednesday, not during a game. The two Orioles communities on lemmy are Orioles@fanaticus.social and Baltimore Orioles@lemmy.world and they have 133 and 131 subscribers, respectively. There’s a bot posting game day threads and 0 comments in all of them. The only post not by a game day bot was 21 days ago.

imaqtpie@sh.itjust.works · edit-2 2 years ago

Yeah I feel you, at least the Orioles team is super stacked rn though (speaking as a Yankees fan 🫠). !yankees@fanaticus.social is equally dead.

My current thought process is that if we can get a decently active generalized baseball community going, it could provide a stepping stone to increasing the activity in the team-specific communities. I’m trying to be active on !mlb@lemmy.ml and !baseball@fanaticus.social as much as possible.

There is already a latent population of sports fans on Lemmy, but it’s sort of a self-fulfilling prophecy that the communities aren’t active so people assume there must be no other fans.

My other thought on this topic is that although I do miss the active fan discussion and game threads, the subreddits for essentially all of my teams were indisputably toxic cesspools. The whining, armchair GMing, scapegoating, and just completely idiotic takes were out of this world. So it’d be nice to have activity, but too much activity can also degrade the quality of discussion to the level of Twitter and just create a very toxic environment where fans are constantly arguing and complaining.

Blaze@reddthat.com · 2 years ago

Username checks out. Which client are you using for Lemmy?

RememberTheApollo_@lemmy.world · 2 years ago

I switch between Mlem and Voyager (iOS). I like them both, but I tend to use Voyager more. Mlem tends to give me more variety of communities, I like Voyager’s layout.

Rivers@lemmy.world · 1 year ago

Reddit went to shit when the zoomers flooded in, arguably the late 90’s kids aswell

Eezyville@sh.itjust.works · 2 years ago

Lmao! Dead internet theory!

Unyieldingly@lemmy.world · 2 years ago

deleted by creator

orangeboats@lemmy.world · edit-2 2 years ago

I’ve noticed that many Reddit users with the username format Word_Word_Number (for example Absolute_Bot_1230) are almost guaranteed to either be a bot or extremely inflammatory – it’s like everything they post is meant to generate controversies.

meowMix2525@lemm.ee · edit-2 2 years ago

Yeah reddit has a name generator that you can choose from when you create an account and that’s the format it uses. Those names are almost exclusively bots and throwaway/anon accounts

Dasus@lemmy.world · 2 years ago

It’s Reddit’s automatic username generation, so either yeah, bots, or someone logging in through Google/Facebook and having a username assigned to them.

Syd@lemm.ee · 2 years ago

Well yeah they even have bot in their username.

wazzupdog@lemmynsfw.com · edit-2 2 years ago

I’m glad i end with word*_word_word for my screen name, lol.

xyz@lemmus.org · 1 year ago

I don’t get it. They already created a good bot network, but the username part is where they get lazy.

mistrgamin@lemmy.world · edit-2 2 years ago

r/FluentinFinance is just five different accounts made less than a year ago that reposting the same political twitter screenshots with the exact same titles that all get boosted to the front page every time. Idk if everyone there is too caught up in arguing the same points they made a week ago to notice or if everyone who eventually finds out gets banned.

SuddenDownpour@sh.itjust.works · 2 years ago

Just said on a Reddit r/worldnews’ thread that the subreddit has been astroturfed for years, as a response to someone wondering how could people in the comments be wishing for more innocent Palestinians be killed, and surprise surprise, I got instabanned. The site is becoming a façade of a fake reality in far more ways than one.

orangeboats@lemmy.world · 2 years ago

r/worldnews is just a propaganda sub disguised as a hub for world news.

Cryptagionismisogynist@lemmy.world · edit-2 2 years ago

I was permabanned from r/worldnews for saying we should give free meals to kids at schools here instead of wasting money blowing up other country’s kids.