Ask ChatGPT to pick a number between 1 and 100

ElCanut@jlai.lu · 2 years ago

Ask ChatGPT to pick a number between 1 and 100

Phroon@beehaw.org · 2 years ago

“You may not instantly see why I bring the subject up, but that is because my mind works so phenomenally fast, and I am at a rough estimate thirty billion times more intelligent than you. Let me give you an example. Think of a number, any number.”

“Er, five,” said the mattress.

“Wrong,” said Marvin. “You see?”

― Douglas Adams, Life, the Universe and Everything

AlexisFR@jlai.lu · 2 years ago

The mattress? Like for sleeping?

Asafum@feddit.nl · 2 years ago

Yep! The hitchhikers books are so much fun lol

I still think one of my favorite lines is “the ships hung in the sky in much the same way that bricks don’t.”

👍Maximum Derek👍@discuss.tchncs.de · 2 years ago

37 is well represented. Proof that we’ve taught AI some of our own weird biases.

GenderNeutralBro@lemmy.sdf.org · 2 years ago

What’s special about 37? Just that it’s prime or is there a superstition or pop culture reference I don’t know?

👍Maximum Derek👍@discuss.tchncs.de · 2 years ago

If you discount the pop-culture numbers (for us 7, 42, and 69) it’s the number most often chosen by people if you ask them for a random number between 1 and 100. It just seems the most random one to choose for a lot of people. Veritasium just did a video about it.

metallic_z3r0@infosec.pub · 2 years ago

37 is my favorite, because 3x7x37=777 (three sevens), and I think that’s neat.

mitrosus@discuss.tchncs.de · 2 years ago

Wrong. Two hints:

7x7=9 at the end, not 7.

30x30=900, already more than 777.

jarfil@beehaw.org · 2 years ago

One hint: 3x7=21, 21x37=777.

When in doubt, use a calculator.

mitrosus@discuss.tchncs.de · 2 years ago

Oh I am sorry. I did not see the x sign between 3 and 7. Lol.

RisingSwell@lemmy.dbzer0.com · 2 years ago

? My calculator definitely thinks that 3x7x37=777. Did you read it as 37x37 instead?

mitrosus@discuss.tchncs.de · 2 years ago

Yes. Thanks. Sorry.

Nightwatch Admin@feddit.nl · 2 years ago

You don’t even need a calculator for a quick estimate. Round to the nearest multiple of 10: 3x7=21, so 21x37 is roughly 20x40 = 800, which is close to the actual number, 777.

SubArcticTundra@lemmy.ml · 2 years ago

What about 57

👍Maximum Derek👍@discuss.tchncs.de · 2 years ago

I’m curious about that too. Something is twisting weights for 57 fairly strongly in the model but I’m not sure what. Maybe it’s been trained on a bunch of old Heinz 57 varieties marketing.

boredtortoise@lemm.ee · 2 years ago

Wesley Snipes

northendtrooper@lemmy.ca · 2 years ago

Heinz Ketchup?

ColeSloth@discuss.tchncs.de · 2 years ago

I think you mean heinz 57 the steak sauce…

Syn_Attck@lemmy.today · 2 years ago

not this again.

it’s ketchup mfer, 57 varieties of tomatoes!

nxdefiant@startrek.website · 2 years ago

Unsolicited fact: Heinz picked the number 57 at random, it just sounded like good marketing at a time when things were generally marketed as “tonic #4” and the like.

(well, maybe not fact, more like probable truth)

GenderNeutralBro@lemmy.sdf.org · 2 years ago

Thanks!

MonkderDritte@feddit.de · 2 years ago

Is there some human sciences theory as to why?

driving_crooner@lemmy.eco.br · 2 years ago

I don’t like the inclusion of 37%. It’s 1/e, which isn’t even 37% (it’s about 36.8%); it only becomes 37% through pretty arbitrary rounding. Veritasium videos are usually OK, but this one is pretty meh.

Chadus_Maximus@lemm.ee · 2 years ago

Another fun fact: if you ask people to pick 2/3rds of a number everyone else picks when asked the same question, the correct number is drumroll 24.

geography082@lemm.ee · 2 years ago

Sorry but pop culture from where? I don’t recognize any of those numbers.

DAMunzy@lemmy.dbzer0.com · 2 years ago

Lucky number 7.

42 is the meaning of life in The Hitchhikers Guide to the Galaxy.

And 69…nice!

I’m guessing this is for US and UK culture? Probably a lot of other former and current English colonies

FryHyde@lemmy.zip · 2 years ago

It’s not the meaning of life. It’s the Ultimate Answer to Life, the Universe, and Everything. Nobody knows what the Question is.

DAMunzy@lemmy.dbzer0.com · 2 years ago

Thanks. I Borked that one up

Wirlocke@lemmy.blahaj.zone · 2 years ago

deleted by creator

Karyoplasma@discuss.tchncs.de · 2 years ago

Probably just because it’s prime. It’s just that humans are terrible at understanding the concept of randomness. A study by Theodore P. Hill showed that when tasked to pick a random number between 1 and 10, almost a third of the subjects (n was over 8500) picked 7. 10 was the least picked number (if you ditch the few idiots that picked 0).

K0W4L5K1@lemmy.dbzer0.com · 2 years ago

Maybe randomness is a label we slapped on shit we don’t understand.

driving_crooner@lemmy.eco.br · edit-2 2 years ago

I remember watching a lecture about probability, and the professor said that only quantum processes are really random; everything else we call random is just the human inability to measure the variables that affect the outcome. I’m an actuary, and it changed my perspective on how I see and study random processes, and made me think about ways to influence their outcome.

jarfil@beehaw.org · 2 years ago

…which is kind of a hilarious tautology, because “quantum processes” are by definition “processes that we are unable to decompose into more basic parts”.

The moment we learn about some more fundamental processes being the reason for a given process, it stops being “quantum” and the new ones become “it”.

K0W4L5K1@lemmy.dbzer0.com · 2 years ago

Even quantum just appears random, I think. It’s beyond our scope of perspective; it works in multiple dimensions and we only see part of the process. That’s my guess though, it could be totally wrong.

itslilith@lemmy.blahaj.zone · 2 years ago

it’s a matter of interpretation, but generally the consensus is that quantum measurements are truly probabilistic (random), Bell proved that there can’t be any hidden variables that influence the outcome

Karyoplasma@discuss.tchncs.de · 2 years ago

Didn’t Bell just put that up as a theory and it got proven somewhat recently by other researchers? The 2022 physics Nobel Prize was about disproving hidden variables and they titled their finding with the catchy phrase “the universe is not locally real”.

K0W4L5K1@lemmy.dbzer0.com · 2 years ago

Interpretation for sure. To me, Bell’s theory and its later proof winning a Nobel Prize only shows more clearly that we really don’t understand the world around us and only perceive what we need to survive. And that maybe we should be less standoffish to ideas that change our current paradigm, because we obviously have a lot to learn.

gigachad@feddit.de · 2 years ago

I didn’t know either, but it seems to be an often picked ‘random’ number by people. Here is an article about it, I didn’t read it though.

Zorque@kbin.social · 2 years ago

https://www.youtube.com/watch?v=xOkI2CmD2D8

Owl@mander.xyz · 2 years ago

Watch this:

https://m.youtube.com/watch?v=d6iQrh2TK98

Johandea@feddit.nu · 2 years ago

https://youtu.be/d6iQrh2TK98?feature=shared

Just a number dumb monkeys believe to be “more random”.

tooLikeTheNope@lemmy.ml · 2 years ago

My art professor wrote a book about famous artists and thinkers dying at 37: Raffaello, Parmigianino, Valentin de Boulogne, Cantarini, Watteau, Van Gogh, Toulouse-Lautrec, Tancredi, Gnoli, Manai, Majakovskij, Rimbaud, Byron, Mozart, Robespierre

https://www.ibs.it/trentasette-mistero-del-genio-adolescente-libro-flavio-caroli/e/9788804734017

Not a great book tbh.

jlow (he/him)@beehaw.org · 2 years ago

Only dudes, though, right?

FiniteBanjo@lemmy.today · 2 years ago

Why would that need to be proven? We’re the sample data. It’s implied.

jarfil@beehaw.org · 2 years ago

The correctness of the sampling process still needs a proof. Like this.

FiniteBanjo@lemmy.today · 2 years ago

What you’ve described would be like looking at a chart of various fluid boiling points at atmospheric pressure and being like “Wow, water boils at 100 C!” It would only be interesting if that somehow weren’t the case.

jarfil@beehaw.org · 2 years ago

Where is the “Wow!” in this post? It states a fact, like “Water boils at 100C under 1 atm”, and shows that the student (ChatGPT) has correctly reproduced the experiment.

Why do you think schools keep teaching that “Water boils at 100C under 1 atm”? If it’s so obvious, should they stop putting it on the test and failing those who say it boils at “69C, giggity”?

FiniteBanjo@lemmy.today · 2 years ago

Derek feeling the need to comment that the bias in the training data correlates with the bias of the corrected output of a commercial product just seemed really bizarre to me. Maybe it’s got the same appeal as a zoo or something, I never really got into watching animals be animals in a zoo.

jarfil@beehaw.org · 2 years ago

Hm? Watching animals be animals at a zoo is a way better sampling of how animals are animals than, for example, watching that wildlife “documentary” where they’d throw lemmings off a cliff “for dramatic effect” (a “commercially corrected bias”?).

In this case, the “corrected output” is just 42, not 37, but as the temperature increases on the Y axis, we get a glimpse of internal biases, which actually let through other patterns of the training data, like the 37.

EatATaco@lemm.ee · 2 years ago

“we don’t need to prove the 2020 election was stolen, it’s implied because trump had bigger crowds at his rallies!” -90% of trump supporters

Another good example is the Monty Hall “paradox” where 99% of people are going to incorrectly tell you the chance is 50% because they took math and that’s how it works.

Just because something seems obvious to you doesn’t mean it is correct. Always a good idea to test your hypothesis.
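
Testing that particular hypothesis only takes a few lines. Here is a minimal sketch, assuming the standard rules (three doors, the host knows where the car is, always opens a goat door, and always offers the switch). The function name `play` and the trial count are made up for illustration.

```python
import random

def play(switch, rng):
    """Play one round; return True if the contestant ends up with the car."""
    doors = [0, 1, 2]
    car = rng.choice(doors)
    pick = rng.choice(doors)
    # The host opens a door that is neither the contestant's pick nor the car.
    opened = rng.choice([d for d in doors if d != pick and d != car])
    if switch:
        # Move to the one remaining closed door.
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == car

rng = random.Random(0)  # fixed seed so the run is repeatable
trials = 100_000
for switch in (False, True):
    wins = sum(play(switch, rng) for _ in range(trials))
    print("switch" if switch else "stay", wins / trials)
```

Under those assumptions, staying should win about a third of the time and switching about two thirds, not 50/50.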

FiniteBanjo@lemmy.today · 2 years ago

Trump Rallies would be a really stupid sample data set for American voters. A crowd of 10,000 people means fuck all compared to 158,429,631. If OpenAI has been training their models on such a small pool then I’d call them absolute morons.

EatATaco@lemm.ee · 2 years ago

A crowd of 10,000 people means fuck all compared to 158,429,631.

I agree that it would be a bad data set, but not because it is too small. That size would actually give you a pretty good result if it was sufficiently random. Which is, of course, the problem.

But you’re missing the point: just because something is obvious to you does not mean it’s actually true. The model could be trained in a way to not be biased by our number choice, but to actually be pseudo-random. Is it surprising that it would turn out this way? No. But to think your assumption doesn’t need to be proven, in such a case, is almost equivalent to thinking a Trump rally is a good data sample for determining the opinion of the general public.
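
To put a rough number on the sample size point: a minimal sketch of the usual margin-of-error approximation for a proportion, assuming a simple random sample, the normal approximation, the worst case p = 0.5, and a population much larger than the sample. The function name `margin_of_error` is just for illustration.

```python
import math

def margin_of_error(n, p=0.5, z=1.96):
    """Approximate 95% margin of error for a proportion from a simple random sample."""
    return z * math.sqrt(p * (1 - p) / n)

for n in (100, 1_000, 10_000):
    print(n, round(margin_of_error(n) * 100, 2), "percentage points")
```

With n = 10,000 that comes out just under one percentage point, regardless of how many millions are in the population. None of it holds if the sample isn’t random, which is the actual problem with a rally crowd.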

olicvb@lemmy.ca · 2 years ago

holy crap, the answer to life the universe and everything XD

WarmSoda@lemm.ee · 2 years ago

More than likely it’s because of that book and how often it’s quoted

Empricorn@feddit.nl · edit-2 2 years ago

Yes, but it’s significant because the prompt was to choose a number. I realize computers can’t really be random, but if we needed to just select a popular number…we can already do that!

https://slate.com/technology/2022/06/bridle-ways-of-being-excerpt-computer-randomness.html

sexy_peach@beehaw.org · 2 years ago

Computers can be random with special hardware.

Empricorn@feddit.nl · 2 years ago

Care to elaborate?

sexy_peach@beehaw.org · 2 years ago

There are devices that measure radioactive decay for operations where truly random numbers are very important. Or something like that, I am not an expert, sorry.

Empricorn@feddit.nl · 2 years ago

Interesting. As I understand it, pure computing (not sensors recording external data) is incapable of generating truly random numbers. But I’m obviously not an expert either!

I’ve been using “Perfect Passwords” for years, which apparently generate nearly random passwords from server noise, but he admits it’s still not truly 100% random…
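
For what it’s worth, you don’t need special hardware on your desk to get away from a seeded formula. A minimal sketch using Python’s standard `secrets` module, which draws from the operating system’s randomness source (the same one `os.urandom` uses). Whether that source is fed by hardware noise depends on the platform, so treat that part as an assumption.

```python
import secrets

# secrets.randbelow(100) returns an integer in 0..99 from the OS randomness
# source, so adding 1 gives a uniform pick over 1..100.
number = secrets.randbelow(100) + 1
print(number)
```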

FiniteBanjo@lemmy.today · 2 years ago

No shit, sherlock, its sample data is the internet.

Appoxo@lemmy.dbzer0.com · 2 years ago

Where’s 69 then?

Chadus_Maximus@lemm.ee · 2 years ago

That’s a naughty number and we don’t allow those.

Bene7rddso@feddit.de · 2 years ago

In a lot of cases there’s no naughty context to 69

Chadus_Maximus@lemm.ee · 2 years ago

In a lot of cases the sky isn’t blue.

Worx@lemmynsfw.com · 2 years ago

Actually true though, in roughly half of all cases. More if you count cloud cover as not being blue

FiniteBanjo@lemmy.today · 2 years ago

nice

ReallyActuallyFrankenstein@lemmynsfw.com · 2 years ago

What does “temperature” on the Y-axis refer to?

gerryflap@feddit.nl · 2 years ago

I’m not a hundred percent sure, but afaik it has to do with how random the output of the GPT model will be. At 0 it will always pick the most probable next continuation of a piece of text according to its own prediction. The higher the temperature, the more chance there is for less probable outputs to get picked. So it’s most likely to pick 42, but as the temperature increases you see the chance of (according to the model) less likely numbers increase.

This is how temperature works in the softmax function, which is often used in deep learning.
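
A minimal sketch of that idea, assuming made-up scores for three candidate answers (these are not real model outputs, and `softmax_with_temperature` is just an illustrative name):

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, dividing by the temperature first."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores, chosen only so that "42" is the favourite.
logits = {"42": 5.0, "37": 3.0, "57": 2.5}

for t in (0.1, 1.0, 2.0):
    probs = softmax_with_temperature(list(logits.values()), t)
    print(t, {k: round(p, 3) for k, p in zip(logits, probs)})
```

At a low temperature nearly all the probability sits on the top score; as the temperature goes up, the other options get a larger share. Temperature 0 itself can’t go through this formula (division by zero), so it is usually treated as “just take the most probable option”.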

driving_crooner@lemmy.eco.br · 2 years ago

https://youtu.be/wjZofJX0v4M your answer from the 22:00 mark on.

ReallyActuallyFrankenstein@lemmynsfw.com · 2 years ago

Super helpful, thanks!

HarkMahlberg@kbin.social · 2 years ago

I mean… they didn’t specify it had to be random (or even uniform)? But yeah, it’s a good showcase of how GPT acquired the same biases as people, from people…

OsrsNeedsF2P@lemmy.ml · 2 years ago

uniform

Reminds me of my previous job where our LLM was grading things too high. The AI “engineer” adjusted the prompt to tell the LLM that the average output should be 3. I had a hard time explaining that wouldn’t do anything at all, because all the chats were independent events.

Anyways, I quit that place and the project completely derailed.

aname@lemmy.one · 2 years ago

Ask humans the same and the most common number is 37

Catsrules@lemmy.ml · 2 years ago

I saw that YouTube video as well.

Cethin@lemmy.zip · 2 years ago

For very different reasons though. 37 is what people think is the most random, because humans are dumb. The LLM here tried to choose the most likely.

lemmyingly@lemm.ee · 2 years ago

Hello Veritasium enjoyer

erwan@lemmy.ml · 2 years ago

In his video, he shows that the more common answers are actually 42 and 69.

He discards them because they’re picked for a reason rather than by a human genuinely trying to pick a random number, but they’re still way more common than 37.

lemmyingly@lemm.ee · 2 years ago

That’s because they asked the internet for those polls. The internet thinks they’re funny by picking the meme numbers. So I can understand why they chose to omit those numbers from their results.

aname@lemmy.one · 2 years ago

What are you referring to?

boert@feddit.de · 2 years ago

Most probably this: https://www.youtube.com/watch?v=d6iQrh2TK98

aname@lemmy.one · 2 years ago

Thanks, I’ll have a look

lemmyingly@lemm.ee · 2 years ago

YouTube STEM educator. 15 million subscribers. Probably in the top 5 STEM educators on the platform.

He released a video on the number 37 two weeks ago, with 6 million views.

aname@lemmy.one · 2 years ago

I know veritasium but I hadn’t seen the video. Thanks, I’ll check it out.

lemmyingly@lemm.ee · 2 years ago

I thought I’d give you context just in case, as your question was vague. You might not have consumed YouTube and were blissfully unaware. :)

aname@lemmy.one · 2 years ago

Thank you for being thoughtful :)

Arthur Besse@lemmy.ml · 2 years ago

Crozekiel@lemmy.zip · 2 years ago

I always like to throw out 37 because of Dante’s girlfriend.

ForestOrca@kbin.social · 2 years ago

WAIT A MINUTE!!! You mean Douglas Adams was actually an LLM?

ElCanut@jlai.lu · 2 years ago

I’ve never seen Douglas Adams and a LLM in the same room together 🤷

Naboo_calls_for_aid@sopuli.xyz · 2 years ago

So many things are starting to make sense

FIash Mob #5678@beehaw.org · 2 years ago

HA, funny that this comes up. DND Beyond doesn’t have a d100, so I opened my ChatGPT sub and had it roll a d100 for me a few times so I could use my magic beans properly.

terminhell@lemmy.dbzer0.com · 2 years ago

I use the percentile die for that.

FIash Mob #5678@beehaw.org · 2 years ago

Also an excellent method.

TauriWarrior@aussie.zone · 2 years ago

Opened up DND Beyond to check since I remember rolling it before, and it’s there. It’s between D8 and D10; the picture even shows 2 dice.

FIash Mob #5678@beehaw.org · 2 years ago

That’s helpful. Thank you.

Urist@lemmy.ml · 2 years ago

Roll two d10, once for each digit, and profit?
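
If you’d rather do it in code than ask a chatbot, here is a minimal sketch of the two-d10 method, assuming the common table convention that a roll of 00 and 0 counts as 100. The function name `roll_d100` is hypothetical.

```python
import random

def roll_d100(rng=random):
    """Roll a d100 using a tens die and a ones die, each showing 0-9."""
    tens = rng.randrange(10)  # 0..9, read as 00, 10, ..., 90
    ones = rng.randrange(10)  # 0..9
    total = tens * 10 + ones
    return 100 if total == 0 else total  # 00 + 0 is read as 100

print([roll_d100() for _ in range(5)])
```

Each of the 100 tens/ones combinations is equally likely, so every result from 1 to 100 has the same 1% chance.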

The Cuuuuube@beehaw.org · 2 years ago

But why use ChatGPT for that? Why not a DuckDuckGo action? I just don’t understand why we’re asking an LLM whose goal is consistency, not randomness, to do random

ancap shark@lemmy.today · 2 years ago

LLMs aren’t thinking, aren’t inventing, they are predicting what is supposed to be answered next, so it’s expected that they will produce the same results every time

xthexder@l.sw0.com · edit-2 2 years ago

This graph actually shows a little more about what’s happening with the randomness or “temperature” of the LLM.
It’s actually predicting the probability of every word (token) it knows of coming next, all at once.
The temperature then says how random it should be when picking from that list of probable next words. A temperature of 0 means it always picks the most likely next word, which in this case ends up being 42.
As the temperature increases, it gets more random (but you can see it still isn’t a perfect random distribution with a higher temperature value)
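
A minimal sketch of that picking step, assuming a made-up list of next-token probabilities (not real model output). The function name `sample_token` is hypothetical, and real implementations usually work on the raw scores rather than on probabilities.

```python
import random
from collections import Counter

def sample_token(probs, temperature, rng):
    """Pick one token; temperature 0 means always take the most likely one."""
    tokens = list(probs)
    if temperature == 0:
        return max(tokens, key=lambda t: probs[t])
    # p ** (1/T) is equivalent to dividing the log-probabilities by T;
    # rng.choices normalises the weights for us.
    weights = [probs[t] ** (1.0 / temperature) for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]

# Hypothetical next-token probabilities, made up for illustration.
probs = {"42": 0.6, "37": 0.2, "57": 0.1, "73": 0.1}
rng = random.Random(0)  # fixed seed so runs are repeatable

for t in (0, 0.5, 1.5):
    counts = Counter(sample_token(probs, t, rng) for _ in range(1000))
    print(t, counts.most_common())
```

At temperature 0 every one of the 1000 draws is 42. At higher temperatures the other numbers show up more often, but 42 stays the most frequent, which matches the “still isn’t a perfect random distribution” observation.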

Rook@pawb.social · 2 years ago

Which model?

When I tried on ChatGPT 4, it wrote a short python script and executed it to get a random integer.

import random

# Pick a random number between 1 and 100
random_number = random.randint(1, 100)
random_number

TonyTonyChopper@mander.xyz · 2 years ago

does the neural network actually run scripts or is it pretending

Amju Wolf@pawb.social · 2 years ago

It generates code and then you can use a call to some runtime execution API to run that code, completely separate from the neural network.

Umbrias@beehaw.org · 2 years ago

That’s not answering the question though.

“Pick a number between 1 and 100” doesn’t mean “grab two d10” or write a script.

xyguy@startrek.website · 2 years ago

Only 1000 times? It’s interesting that there’s such a bias there but it’s a computer. Ask it 100,000 times and make sure it’s not a fluke.

PhreakyByNature@feddit.uk · 2 years ago

NEEDS MOAR 69 FELLOW HUMAN

Owl@mander.xyz · 2 years ago

Zorque@kbin.social · 2 years ago

In a row?!

Zoop@beehaw.org · 2 years ago

Try not to suck any dick on the way to the parking lot!

lolola@lemmy.blahaj.zone · 2 years ago

What’s the y axis?

kciwsnurb@aussie.zone · 2 years ago

The temperature scale, I think. You divide the logit output by the temperature before feeding it to the softmax function. Larger (resp. smaller) temperature results in a higher (resp. lower) entropy distribution.

Ms. ArmoredThirteen@lemmy.ml · 2 years ago

I don’t understand any of these words, I need to take a math class or something

humbletightband@lemmy.dbzer0.com · 2 years ago

Higher temperature -> more chaotic output

lolola@lemmy.blahaj.zone · 2 years ago

I still don’t understand.

Midnitte@beehaw.org · 2 years ago

More yellow more common, more blue less common

kciwsnurb@aussie.zone · 2 years ago

Each row in the figure is a probability distribution over possible outputs (x-axis labels). The more yellow, the more likely (see the colour map on the right). With a small temperature (e.g., last row), all the probability mass is on 42. This is a low entropy distribution because if you sample from it you’ll constantly get 42, so no randomness whatsoever (think entropy as a measure of randomness/chaos). As temperature increases (rows closer to the first/topmost one), 42 is still the most likely output, but the probability mass gets dispersed to other possible outputs too (other outputs get a bit more yellow), resulting in higher entropy distributions. Sampling from such distribution gives you more random outputs (42 would still be frequent, but you’d get 37 or others too occasionally). Hopefully this is clearer.
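
To make the entropy part concrete, here is a minimal sketch that computes the Shannon entropy of a temperature-scaled softmax at a few temperatures. The scores are hypothetical and the helper names are only for illustration.

```python
import math

def softmax(logits, temperature):
    """Softmax over the scores after dividing them by the temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def entropy_bits(probs):
    """Shannon entropy in bits; 0 means no randomness at all."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

logits = [5.0, 3.0, 2.5, 2.0]  # hypothetical scores for four outputs

for t in (0.1, 0.5, 1.0, 2.0):
    print(t, round(entropy_bits(softmax(logits, t)), 3))
```

The entropy is close to 0 bits at the lowest temperature and climbs as the temperature rises, staying below the 2-bit maximum that four equally likely outputs would give.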

Someone in another reply uses the word “creativity” to describe the effect of temperature scaling. The more commonly used term in the literature is “diversity”.

The Octonaut@mander.xyz · 2 years ago

Temperature is basically how creative you want the AI to be. The lower the temperature, the more predictable (and repeatable) the response.

lolola@lemmy.blahaj.zone · 2 years ago

Creativity is hot. That makes more sense, thanks.