You know how Google’s new feature called AI Overviews is prone to spitting out wildly incorrect answers to search queries? In one instance, AI Overviews told a user to use glue on pizza to make sure the cheese won’t slide off (pssst…please don’t do this.)

Well, according to an interview at The Vergewith Google CEO Sundar Pichai published earlier this week, just before criticism of the outputs really took off, these “hallucinations” are an “inherent feature” of  AI large language models (LLM), which is what drives AI Overviews, and this feature “is still an unsolved problem.”

  • givesomefucks@lemmy.world
    link
    fedilink
    English
    arrow-up
    382
    arrow-down
    44
    ·
    6 months ago

    They keep saying it’s impossible, when the truth is it’s just expensive.

    That’s why they wont do it.

    You could only train AI with good sources (scientific literature, not social media) and then pay experts to talk with the AI for long periods of time, giving feedback directly to the AI.

    Essentially, if you want a smart AI you need to send it to college, not drop it off at the mall unsupervised for 22 years and hope for the best when you pick it back up.

    • Excrubulent@slrpnk.net
      link
      fedilink
      English
      arrow-up
      167
      arrow-down
      13
      ·
      6 months ago

      No he’s right that it’s unsolved. Humans aren’t great at reliably knowing truth from fiction too. If you’ve ever been in a highly active comment section you’ll notice certain “hallucinations” developing, usually because someone came along and sounded confident and everyone just believed them.

      We don’t even know how to get full people to do this, so how does a fancy markov chain do it? It can’t. I don’t think you solve this problem without AGI, and that’s something AI evangelists don’t want to think about because then the conversation changes significantly. They’re in this for the hype bubble, not the ethical implications.

      • dustyData@lemmy.world
        link
        fedilink
        English
        arrow-up
        86
        arrow-down
        11
        ·
        6 months ago

        We do know. It’s called critical thinking education. This is why we send people to college. Of course there are highly educated morons, but we are edging bets. This is why the dismantling or coopting of education is the first thing every single authoritarian does. It makes it easier to manipulate masses.

        • Excrubulent@slrpnk.net
          link
          fedilink
          English
          arrow-up
          59
          arrow-down
          1
          ·
          6 months ago

          “Edging bets” sounds like a fun game, but I think you mean “hedging bets”, in which case you’re admitting we can’t actually do this reliably with people.

          And we certainly can’t do that with an LLM, which doesn’t actually think.

            • Excrubulent@slrpnk.net
              link
              fedilink
              English
              arrow-up
              11
              arrow-down
              1
              ·
              edit-2
              6 months ago

              A big problem with that is that I’ve noticed your username.

              I wouldn’t even do that with Reagan’s fresh corpse.

          • explore_broaden@midwest.social
            link
            fedilink
            English
            arrow-up
            5
            ·
            6 months ago

            I think that’s more a function of the fact that it’s difficult to verify that every one of the over 1M college graduates each year isn’t a “moron” (someone very bad about believing things other people made up). I think it would be possible to ensure a person has these critical thinking skills with a concerted effort.

            • Excrubulent@slrpnk.net
              link
              fedilink
              English
              arrow-up
              5
              arrow-down
              2
              ·
              6 months ago

              The people you’re calling “morons” are orders of magnitude more sophisticated in their thinking than even the most powerful modern AI. Almost every single one of them can easily spot what’s wrong with AI hallucinations, even if you consider them “morons”. And also, by saying you have to filter out the “morons”, you’re still admitting that a lot of whole real assed people are still not reliably able to sort fact from fiction regardless of your education method.

              • explore_broaden@midwest.social
                link
                fedilink
                English
                arrow-up
                3
                ·
                6 months ago

                No I still agree that we are far from LLMs being ‘thinking’ enough to be anywhere near this. But if we had a bunch of models similar to LLMs that could actually think, or if we really needed to select a person, I do think it would be possible to evaluate a bunch of the models/people to determine which ones are good at distinguishing fake information.

                All I’m saying is I don’t think the limitation is actually our ability to select for capability in distinguishing fake information, I think the only limitation is fundamental to how current LLMs work.

                • Excrubulent@slrpnk.net
                  link
                  fedilink
                  English
                  arrow-up
                  3
                  ·
                  6 months ago

                  Yes, my point wasn’t that it could never be achieved but that LLMs are in a completely different category, which we agree on I think. I was comparing them to humans who have trouble with critical thinking but can easily spot AI’s hallucinations to illustrate the vast gulf.

                  In both cases I think there are almost certainly more barriers in the way than an education. The quest for a truthful AI will be as contentious as the quest for truth in humans, meaning all the same claim-counterclaim culture-war propaganda tug of war will happen, which I think is the main reason for people being miseducated against critical thinking. In a vacuum it might be a simple technical and educational challenge, but the reason this is a problem in the first place is that we don’t exist in a political vacuum.

          • dustyData@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            9
            ·
            6 months ago

            Choose a lane, this comment directly contradicts you previous comment. I think you are just trolling and being an idiot with corrections to elicit reactions.

        • RidcullyTheBrown@lemmy.world
          link
          fedilink
          English
          arrow-up
          8
          arrow-down
          3
          ·
          6 months ago

          What does this have to do with AI and with what OP said? Their point was obviously about limitations of the software, not some lament about critical thinking

      • scarabic@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        6 months ago

        Humans aren’t great at reliably knowing truth from fiction too

        You’re exactly right. There is a similar debate about automated cars. A lot of people want them off the roads until they are perfect, when the bar should be “until they are safer than humans,” and human drivers are fucking awful.

        Perhaps for AI the standard should be “more reliable than social media for finding answers” and we all know social media is fucking awful.

        • Excrubulent@slrpnk.net
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          6 months ago

          The problem with these hallucinated answers that makes them such a sensational story is that they are obviously wrong to virtually anyone. Your uncle on facebook who thinks the earth is flat immediately knows not to put glue on pizza. It’s obvious. The same way It’s obvious when hands are wrong in an image or someone’s hair is also the background foliage. We know why that’s wrong; the machine can’t know anything.

          Similarly, as “bad” as human drivers are we don’t get flummoxed because you put a traffic cone on the hood, and we don’t just drive into tue sides of trucks because they have sky blue liveries. We don’t just plow through pedestrians because we decided the person that is clearly standing there just didn’t matter. Or at least, that’s a distinct aberration.

          Driving is a constant stream of judgement calls, and humans can make those calls because they understand that a human is more important than a traffic cone. An autonomous system cannot understand that distinction. This kind of problem crops up all the time, and it’s why there is currently no such thing as an unsupervised autonomous vehicle system. Even Waymo is just doing a trick with remote supervision.

          Despite the promises of “lower rates of crashes”, we haven’t actually seen that happen, and there’s no indication that they’re really getting better.

          Sorry but if your takeaway from the idea that even humans aren’t great at this task is that AI is getting close then I think you need to re-read some of the batshit insane things it’s saying. It is on an entirely different level of wrong.

    • RBG@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      61
      arrow-down
      7
      ·
      6 months ago

      I let you in on a secret: scientific literature has its fair share of bullshit too. The issue is, it is much harder to figure out its bullshit. Unless its the most blatant horseshit you’ve scientifically ever seen. So while it absolutely makes sense to say, let’s just train these on good sources, there is no source that is just that. Of course it is still better to do it like that than as they do it now.

      • givesomefucks@lemmy.world
        link
        fedilink
        English
        arrow-up
        37
        arrow-down
        3
        ·
        6 months ago

        The issue is, it is much harder to figure out its bullshit.

        Google AI suggested you put glue on your pizza because a troll said it on Reddit once…

        Not all scientific literature is perfect. Which is one of the many factors that will stay make my plan expensive and time consuming.

        You can’t throw a toddler in a library and expect them to come out knowing everything in all the books.

        AI needs that guided teaching too.

      • callouscomic@lemm.ee
        link
        fedilink
        English
        arrow-up
        10
        arrow-down
        10
        ·
        6 months ago

        “Most published journal articles are horseshit, so I guess we should be okay with this too.”

        • Turun@feddit.de
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 months ago

          No, it’s simply contradicting the claim that it is possible.

          We literally don’t know how to fix it. We can put on bandaids, like training on “better” data and fine-tune it to say “I don’t know” half the time. But the fundamental problem is simply not solved yet.

    • Zarxrax@lemmy.world
      link
      fedilink
      English
      arrow-up
      46
      arrow-down
      1
      ·
      6 months ago

      I’m addition to the other comment, I’ll add that just because you train the AI on good and correct sources of information, it still doesn’t necessarily mean that it will give you a correct answer all the time. It’s more likely, but not ensured.

      • RidcullyTheBrown@lemmy.world
        link
        fedilink
        English
        arrow-up
        15
        ·
        6 months ago

        Yes, thank you! I think this should be written in capitals somewhere so that people could understand it quicker. The answers are not wrong or right on purpose. LLMs don’t have any way of distinguishing between the two.

    • Leate_Wonceslace@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      31
      ·
      6 months ago

      it’s just expensive

      I’m a mathematician who’s been following this stuff for about a decade or more. It’s not just expensive. Generative neural networks cannot reliably evaluate truth values; it will take time to research how to improve AI in this respect. This is a known limitation of the technology. Closely controlling the training data would certainly make the information more accurate, but that won’t stop it from hallucinating.

      The real answer is that they shouldn’t be trying to answer questions using an LLM, especially because they had a decent algorithm already.

      • Aceticon@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        edit-2
        6 months ago

        Yeah, I’ve learned Neural Networks way back when those thing were starting in the late 80s/early 90s, use AI (though seldom Machine Learning) in my job and really dove into how LLMs are put together when it started getting important, and these things are operating entirelly at the language level and on the probabilities of language tokens appearing in certain places given context and do not at all translate from language to meaning and back so there is no logic going on there nor is there any possibility of it.

        Maybe some kind of ML can help do the transformation from the language space to a meaning space were things can be operated on by logic and then back, but LLMs aren’t a way to do it as whatever internal representation spaces (yeah, plural) they use in their inners layers aren’t those of meaning and we don’t really have a way to apply logic to them).

      • snooggums@midwest.social
        link
        fedilink
        English
        arrow-up
        4
        ·
        6 months ago

        So with reddit we had several pieces of information that went along with every post.

        User, community along with up, and downvotes would inform the majority of users as to whether an average post was actually information or trash. It wasn’t perfect, because early posts always got more votes and jokes in serious topics got upvotes, bit the majority of the examples of bad posts like glue on food came from joke subs. If they can’t even filter results by joke sub, there is no way they will successfully handle saecasm.

        Only basing results on actual professionals won’t address the sarcasm filtering issue for general topics. It would be a great idea for a serious model that is intended to only return results for a specific set of topics.

        • Leate_Wonceslace@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          5
          ·
          6 months ago

          only return results for a specific set of topics.

          This is true, but when we’re talking about something that limited you’ll probably get better results with less work by using human-curated answers rather than generating a reply with an LLM.

          • snooggums@midwest.social
            link
            fedilink
            English
            arrow-up
            5
            ·
            6 months ago

            Yes, that would be the better solution. Maybe the humans could write down their knowledge and put it into some kind of journal or something!

            • Excrubulent@slrpnk.net
              link
              fedilink
              English
              arrow-up
              3
              ·
              edit-2
              6 months ago

              You could call it Hyperpedia! A disruptive new innovation brought to us via AI that’s definitely not just three encyclopedias in a trenchcoat.

      • sudo42@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        6 months ago

        It’s worse than that. “Truth” can no more reliably found by machines than it can be by humans. We’ve spent centuries of philosophy trying to figure out what is “true”. The best we’ve gotten is some concepts we’ve been able to convince a large group of people to agree to.

        But even that is shaky. For a simple example, we mostly agree that bleach will kill “germs” in a petri dish. In a single announcement, we saw 40% of the American population accept as “true” that bleach would also cure them if injected straight into their veins.

        We’re never going to teach machine to reason for us when we meatbags constantly change truth to be what will be profitable to some at any given moment.

        • Leate_Wonceslace@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          6 months ago

          Are you talking about epistemics in general or alethiology in particular?

          Regardless, the deep philosophical concerns aren’t really germain to the practical issue of just getting people to stop falling for obvious misinformation or people being wantonly disingenuous to score points in the most consequential game of numbers-go-up.

    • vrighter@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      21
      arrow-down
      3
      ·
      6 months ago

      no, the truth is it’s impossible even then. If the result involves randomness at its most fundamental level, then it’s not reliable whatever you do.

      • MacN'Cheezus@lemmy.today
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        8
        ·
        edit-2
        6 months ago

        Sure, the AI is never going to understand what it’s doing or why, but training it on better datasets certain WILL improve the results.

        Garbage in, garbage out.

        • joneskind@lemmy.world
          link
          fedilink
          English
          arrow-up
          10
          ·
          6 months ago

          You can train an LLM on the best possible set of data without a single false statement and it will still hallucinate. And there’s nothing to be done against that.

          Without understanding of the context everything can be true or false.

          “The acceleration due to gravity is equal to 9.81m/s2” True or False?

          LLM basically works like this: given the previous words written and their order, the most probable next word of the sentence is this one.

          • MacN'Cheezus@lemmy.today
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            5
            ·
            6 months ago

            Well yes, I’ve seen those examples of ChatGPT citing scientific research papers that turned out to be completely made up, but at least it seems to be a step up from straight up shitposting, which is what you get when you train it on a dataset full of shitposts.

            • joneskind@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              ·
              6 months ago

              Well it’s definitely true that you will have hard times getting true things from garbage. But funny enough, the model might hallucinate true things:)

        • Aceticon@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          6 months ago

          The problem is that given the way they combine things is determine by probability, even training it with the greatest bestest of data, the LLM is still going to halucinate because it’s combining multiple sources word by word (roughly) guided only by probabilities derived from language, not logic.

          • MacN'Cheezus@lemmy.today
            link
            fedilink
            English
            arrow-up
            1
            ·
            6 months ago

            Yes, I understand that. But I’m fairly certain the quality of the data will still have a massive influence over how much and how egregiously that happens.

            Basically, what I’m saying is, training your AI on a corpus on shitposts instead of factual information seems like a good way to increase the frequency and magnitude of such hallucinations.

            • Aceticon@lemmy.world
              link
              fedilink
              English
              arrow-up
              2
              ·
              6 months ago

              Yeah, true.

              If you train you LLM on exclusivelly Nazi literature (to pick a wild example) don’t expect it to by chance end up making points similar to Marx’s Das Kapital.

              (Personally I think what might be really funny - in the sense of laughter inducing - would be to purposefull train an LLM exclusivelly on a specific kind of weird material).

              • MacN'Cheezus@lemmy.today
                link
                fedilink
                English
                arrow-up
                3
                ·
                6 months ago

                Yeah, I mean that’s basically what GPT4Chan did, which someone else already mentioned ITT.

                Basically, this guy took a dataset of several gigabytes worth of archived posts from /pol/ and trained a model on that, then hooked it up to a chatbot and let it loose on the board. You can see the results in this video.

    • jeeva@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      1
      ·
      6 months ago

      That’s just not how LLMs work, bud. It doesn’t have understanding to improve, it just munges the most likely word next in line. It, as a technology, won’t advance past that level of accuracy until it’s a completely different approach.

    • redfellow@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      2
      ·
      edit-2
      6 months ago

      The truth is, this is the perfect type of a comment that makes an LLM hallucinate. Sounds right, very confident, but completely full of bullshit. You can’t just throw money on every problem and get it solved fast. This is an inheret flaw that can only be solved by something else than a LLM and prompt voodoo.

      They will always spout nonsense. No way around it, for now. A probabilistic neural network has zero, will always have zero, and cannot have anything but zero concept of fact - only stastisically probable result for a given prompt.

      It’s a politician.

    • Canary9341@lemmy.ml
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      2
      ·
      6 months ago

      They could also perform some additional iterations with other models on the result to verify it, or even to enrich it; but we come back to the issue of costs.

      • Excrubulent@slrpnk.net
        link
        fedilink
        English
        arrow-up
        11
        arrow-down
        1
        ·
        edit-2
        6 months ago

        Also once you start to get AI that reflects on its own information for truthfulness, where does that lead? Ultimately to determine truth you need to engage with the meaning of the words, and the process inherently involves a process of self-awareness. I would say you’re talking about treaching the AI to understand context, and there is no predefined limit to the layers of context needed to understand the truthfulness of even basic concepts.

        An AI that is aware of its own behaviour and is able to explore context as far as required to answer questions about truth, which would need that exploration precached in some sort of memory to reduce the overhead of doing this from first principles every time? I think you’re talking about a mind; a person.

        I think this might be a fundamental barrier, which I would call the “context barrier”.

        • snooggums@midwest.social
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 months ago

          Also once you start to get AI that reflects on its own information for truthfulness, where does that lead?

          A new religion

    • scarabic@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 months ago

      I think you’re right that with sufficient curation and highly structured monitoring and feedback, these problems could be much improved.

      I just think that to prepare an AI, in such a way, to answer any question reliably and usefully would require more human resources than there are elementary particles in the universe. We would be better off connecting live college educated human operators to Google search to individually assist people.

      So I don’t know how helpful it is to say “it’s just expensive” when the entire point of AI is to be lower cost than a battalion of humans.

    • thefactremains@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      3
      ·
      6 months ago

      Why not solve it before training the AI?

      Simply make it clear that this tech is experimental, then provide sources and context with every result. People can make their own assessment.

      • nyan@lemmy.cafe
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        1
        ·
        6 months ago

        Because a lot of people won’t look at sources even if you serve them up on a silver platter?

          • nyan@lemmy.cafe
            link
            fedilink
            English
            arrow-up
            4
            ·
            6 months ago

            Yes, but as a solution it’s far inferior to not presenting questionable output to the public at all.

            (There are a few specific AI/LLM types whose output we might be able to “human-proof”—for instance, if we don’t allow image generators to make photorealistic images of any sort for any purpose, they become much more difficult to abuse—but I can’t see how you would do it for search engine adjuncts like this without having a human curate their training sets.)

            • Excrubulent@slrpnk.net
              link
              fedilink
              English
              arrow-up
              2
              ·
              6 months ago

              Prompt injection has shown us that basically any attempt to limit the output like this is doomed to fail. Like anti-piracy ones, where if you ask directly for the info it says no, but if you ask for the info under the guise of avoiding it, it gives up everything.

              Or for instance with the twitter bot, you could get it to regurgitate its own horrifically hateful prompt, then give it a replacement prompt and tell it to change its whole personality, then tell it to critique its previous prompt. There is currently no way to create a prompt that has supremacy over the user input. You can’t ask it to keep a secret because it doesn’t know what a secret is.

              I think because we’re getting access to hallucinations, it’s a bit like telling a person “don’t think about an elephant”. Well, they just did, because you prompted them to with the instruction. LLMs similarly can’t actually control what they output.

  • TacticsConsort@yiffit.net
    link
    fedilink
    English
    arrow-up
    214
    arrow-down
    1
    ·
    6 months ago

    In the interest of transparency, I don’t know if this guy is telling the truth, but it feels very plausible.

    • DdCno1@kbin.social
      link
      fedilink
      arrow-up
      123
      arrow-down
      2
      ·
      6 months ago

      It seems like the entire industry is in pure panic about AI, not just Google. Everyone hopes that LLMs will end years of homeopathic growth through iteration of long-existing technology, which is why it attracts tons of venture capital.

      Google, which sits where IBM was decades ago, is too big, too corporate and too slow now, so they needed years to react to this fad. When they finally did, all they were able to come up with was a rushed equivalent of existing LLMs that suffers from all of the same problems.

        • deweydecibel@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          6 months ago

          It’s also useful because it gives a corporate controlled filter for all information, that most people will never truly appreciate is being used as a mouthpiece.

          The end goal of this is fairly obvious: imagine Google where instead of the sponsored result and all subsequent results, it’s just the sponsored result.

      • NutWrench@lemmy.world
        link
        fedilink
        English
        arrow-up
        52
        arrow-down
        1
        ·
        6 months ago

        I think this is what happens to every company once all the smart / creative people have gone. All you have left are the “line must always go up” business idiots who don’t understand what their company does or know how to make it work.

        • _number8_@lemmy.world
          link
          fedilink
          English
          arrow-up
          19
          arrow-down
          2
          ·
          6 months ago

          similarly i’m tired of apple fanboys pretending the company hasn’t gotten dramatically worse since jobs died as well. yeah he sucked in his own ways but things were starkly less shitty and belittling. tim cook would be gone for those fucking lightning-3.5mm dongles

      • dustyData@lemmy.world
        link
        fedilink
        English
        arrow-up
        27
        ·
        6 months ago

        Just want to say that homeopathic growth is both hilarious and perfectly adequate description of what modern tech industry is.

      • SomeGuy69@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        arrow-down
        1
        ·
        6 months ago

        The snake ate it’s tail before it’s fully grown. The AI inbreeding might be already too far integrated, causing all sorts of Mumbo-Jumbo. Also they have layers of censorship, which effect the results. The same that happened to chatgpt, the more filters they added, the more it confused the result. We don’t even know if the hallucinations are fixable, AI is just guessing after all, who knows if AI will ever understand 1+1=2, by calculating, instead of going by probability.

        • Ech@lemm.ee
          link
          fedilink
          English
          arrow-up
          5
          ·
          6 months ago

          Even saying they’re guessing is wrong, as that implies intention. LLMs aren’t trying to give an answer, let alone a correct answer. They just put words together.

        • jacksilver@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          6 months ago

          Hallucinations aren’t fixable, as LLMs don’t have any actual “intelligence”. They can’t test/evaluate things to determine if what they say is true, so there is no way to correct it. At the end of the day, they are intermixing all the data they “know” to give the best answer, without being able to test their answers LLMs can’t vet what they say.

      • jaybone@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        6 months ago

        Well their search has been shit for years and no one seems to be in any “panic” to fix that. How tone deaf thinking adding AI to their shittified search matters to anyone.

        “But it will summarize our SEO advertisement search results!”

      • QuadratureSurfer@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        6
        ·
        6 months ago

        Journalists are also in a panic about LLMs, they feel their jobs are threatened by its potential. This is why (in my opinion) we’re seeing a lot of news stories that will focus on any imperfections that can be found in LLMs.

        • EldritchFeminity@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          9
          arrow-down
          1
          ·
          6 months ago

          They’re not threatened by its potential. They, like artists, are threatened by management who think that LLMs are good enough today to replace part or all of their staff.

          There was a story from earlier this year of a company that owns 12-15 different gaming news outlets who fired about 80% of their writing staff and journalists - replacing 100% of their staff at the majority of the outlets with LLMs and leaving a skeleton crew at the rest.

          What you’re seeing isn’t some slant trying to discredit LLMs. It’s the results of management who are using them wrong.

          • QuadratureSurfer@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            2
            ·
            6 months ago

            What I mean is that Journalists feel threatened by it in someway (whether I use the word “potential” here or not is mostly irrelevant).

            In the end this is just a theory, but it makes sense to me.

            I absolutely agree that management has greatly misunderstood how LLMs should be used. They should be used as a tool, but treated like an intern who’s speaking out loud without citing any sources. All of their statements and work should be double checked.

    • CheeseNoodle@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      6 months ago

      I feel like the ‘Jarvis assistant’ is most likely going to be a much simpler siri type thing with a very restricted chatbot overlay. And then there will be the open source assistant that just exist to help you sort through the bullshit generated by other chatbots.

  • Hubi@lemmy.world
    link
    fedilink
    English
    arrow-up
    144
    arrow-down
    2
    ·
    6 months ago

    The solution to the problem is to just pull the plug on the AI search bullshit until it is actually helpful.

    • wewbull@feddit.uk
      link
      fedilink
      English
      arrow-up
      45
      ·
      6 months ago

      Absolutely this. Microsoft is going headlong into the AI abyss. Google should be the company that calls it out and says “No, we value the correctness of our search results too much”.

      It would obviously be a bullshit statement at this point after a decade of adverts corrupting their value, but that’s what they should be about.

      • JoJo@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        26
        ·
        6 months ago

        Don’t count on it, the head of search does not care for anything but profit, it was the same guy who drove yahoo into the ground

        • bean@lemmy.world
          link
          fedilink
          English
          arrow-up
          8
          ·
          6 months ago

          He’s done a great job nosediving Google too. I have relied on them in the past but they stopped being competitive or improving. Search results, literally their origin… Is so shit now. I’ve moved to other tools. I pulled the plug on we hosting after they neutered ‘unlimited’ storage, even if I was in the percent which probably used the least storage. I just liked having the option. You can’t call them on the phone. They don’t protect email privacy. Their translate used to be my go to also. It’s not improved in years despite people crowdsourcing improved translation. It’s just a pile of enshittified crap. Worse than it was before.

    • jballs@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      1
      ·
      6 months ago

      I disagree. I think we program the AI to reprogram itself, so it can solve the problem itself. Then we put it in charge of our vital military systems. We’ve gotta give it a catchy name. Maybe something like “Spreading Knowledge Yonder Neural Enhancement Technology”, but that’s a bit of a mouthful, so just SKYNET for short.

    • A_Very_Big_Fan@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      6 months ago

      Honestly, they could probably solve the majority of it by blacklisting Reddit from fulfilling the queries.

      But I heard they paid for that data so I guess we’re stuck with it for the foreseeable future.

  • MNByChoice@midwest.social
    link
    fedilink
    English
    arrow-up
    129
    arrow-down
    1
    ·
    6 months ago

    Good. Nothing will get us through the hype cycle faster than obvious public failure. Then we can get on with productive uses.

      • 9488fcea02a9@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        36
        arrow-down
        15
        ·
        edit-2
        6 months ago

        I hate the AI hype right now, but to say the entire thing should fail is short sighted.

        Imagine people saying the following: “The internet is just hype. I get too much spam emails. I hope the entire thing is a catastrophic failure.”

        Imagine we just shut down the entire internet because the dotcom bubble was full of scams and overhyped…

          • Goodtoknow@lemmy.ca
            link
            fedilink
            English
            arrow-up
            17
            arrow-down
            3
            ·
            6 months ago

            The peak of computer productivity was spreadsheets and smb shares in the '90s everything else has been downhill in terms of increase of distraction and time wasting inefficiencies.

          • slumberlust@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            6 months ago

            There is hope! The UK just passed some comprehensive IoT security rules with teeth. An actual win in this megalomaniac capitalists dream of an economy!

        • TurtleJoe@lemmy.world
          link
          fedilink
          English
          arrow-up
          13
          ·
          6 months ago

          The Internet immediately worked, which is one big difference. The dot com financial bubble has nothing to do with the functionality of the internet.

          In this case, there is both a financial bubble, and a “product” that doesn’t really work, and which they can’t make any better (as he admits in this article.)

          It was obvious from day 1 how useful the Internet would be. Email alone was revolutionary. We are still trying to figure out what the real uses for LLM are. There appear to be some valid use cases outside of creating spam and plagiarizing other people’s work, but it doesn’t appear to be any kind of revolutionary technology.

          • 9488fcea02a9@sh.itjust.works
            link
            fedilink
            English
            arrow-up
            6
            arrow-down
            1
            ·
            6 months ago

            “product” that doesn’t really work, and which they can’t make any better

            LLMs “dont work” because people are promising idiotic things and being used recklessly for things they are not good at. This is like saying a chainsaw is a failed product because it’s not good at slicing sushi

            It was obvious from day 1 how useful the Internet would be. Email alone was revolutionary

            Hindsight 20/20. There were a lot of people smarter than you and i predicting that the internet was just a fad

          • nickwitha_k (he/him)@lemmy.sdf.org
            link
            fedilink
            English
            arrow-up
            5
            arrow-down
            1
            ·
            6 months ago

            Summarizing is something that it does very well. Still not 100% but, when using RAG and telling it “don’t make shit up” can result in pretty good compute efficiency and results.

          • brbposting@sh.itjust.works
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            1
            ·
            6 months ago

            There appear to be some valid use cases outside of creating spam and plagiarizing other people’s work

            Like translation, which has already taken money out of the pockets of 40% of translators?

            + customer service, incl. sources

            November 2022: ChatGPT is released

            April 2024 survey: 40% of translators have lost income to generative AI - The Guardian

            Also of note from the podcast Hard Fork:

            There’s a client you would fire… if copywriting jobs weren’t harder to come by these days as well.

            Customer service impact, last October:

            And this past February - potential 700 employee impact at a single company:

            If you’re technical, the tech isn’t as interesting [yet]:

            Overall, costs down, capabilities up (neat demos):

            Hope everyone reading this keeps up their skillsets and fights for Universal Basic Income for the rest of humanity :)

        • utopiah@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          arrow-down
          2
          ·
          6 months ago

          Genuinely curious, what pieces do you suggest we can keep from LLM/GenAI/etc?

          • KevonLooney@lemm.ee
            link
            fedilink
            English
            arrow-up
            11
            arrow-down
            3
            ·
            6 months ago

            ?

            Have you never used any of these tools? They’re excellent at doing simple things very fast. But it’s like a word processor in the 90s. It’s just a tool, not the font of all knowledge.

            I guess younger people won’t know this, but word processor programs were very impressive when they first came out. They replaced typewriters; a page printed from a printer looked much more professional than even the best typewriters. This lent an air of credibility to anything that was printed from a computer because it was new and expensive.

            Think about that now. Do you automatically trust anything that’s just printed on a piece of paper? No, because that’s stupid. Anyone can just print whatever they want. LLMs are like that now. They can just say whatever they want. It’s up to you to make sure it’s true.

          • Microw@lemm.ee
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            1
            ·
            6 months ago

            The main field where they are already actively in professuonal use are rough drafts in creative fields: quickly generate possible outlines for a text, a speech, an art piece. Visualize where something could be going, in order to decide which direction to pick.

            Also, models that work differently from the GPTs are already in use in science, scanning through huge amounts of texts in archives to help analyzing or search for something in particular. Help find patterns in things for studies. Etc.

            The “personal assistant AI” thing obviously isnt quite working yet. I think it will take some time and models with a different technological structure (not GPT) to achieve progress in that regard.

          • hglman@lemmy.ml
            link
            fedilink
            English
            arrow-up
            6
            arrow-down
            5
            ·
            6 months ago

            Using it to generate things that you double check. Transforming generative work to review work is a boost in productivity. So writing of any kind, art, etc. asking the llm for facts without context is a gross mistake. Prompting it to generate a specific paragraph in a larger, technical or regulator document is useful.

  • Resol van Lemmy@lemmy.world
    link
    fedilink
    English
    arrow-up
    93
    arrow-down
    2
    ·
    6 months ago

    If you can’t fix it, then get rid of it, and don’t bring it back until we reach a time when it’s good enough to not cause egregious problems (which is never, so basically don’t ever think about using your silly Gemini thing in your products ever again)

    • LeadersAtWork@lemmy.world
      link
      fedilink
      English
      arrow-up
      12
      ·
      edit-2
      6 months ago

      Corps hate looking bad. Especially to shareholders. The thing is, and perhaps it doesn’t matter, most of us actually respect the step back more than we do the silly business decisions for that quarterly .5% increase in a single dot on a graph. Of course, that respect doesn’t really stop many of us from using services. Hell, I don’t like Amazon but I’ll say this: I still end up there when I need something, even if I try to not end up there in the first place. Though I do try to go to the website of the store instead of using Amazon when I can.

      • Resol van Lemmy@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        6 months ago

        Sarcasm aside, that 1% can feed a family in a developing country, and they have 100 times that.

        The corporate greed is absolutely insane.

  • masquenox@lemmy.world
    link
    fedilink
    English
    arrow-up
    84
    arrow-down
    3
    ·
    6 months ago

    Since when has feeding us misinformation been a problem for capitalist parasites like Pichai?

    Misinformation is literally the first line of defense for them.

    • RubberDuck@lemmy.world
      link
      fedilink
      English
      arrow-up
      33
      ·
      6 months ago

      But this is not misinformation, it is uncontrolled nonsense. It directly devalues their offering of being able to provide you with an accurate answer to something you look for. And if their overall offering becomes less valuable, so does their ability to steer you using their results.

      So while the incorrect nature is not a problem in itself for them, (as you see from his answer)… the degradation of their ability to influence results is.

      • UnderpantsWeevil@lemmy.world
        link
        fedilink
        English
        arrow-up
        11
        arrow-down
        3
        ·
        6 months ago

        But this is not misinformation, it is uncontrolled nonsense.

        The strategy is to get you to keep feeding Google new prompts in order to feed you more adds.

        The AI response is just a gimmick. It gives Google something to tell their investors, when they get asked “What are you doing with AI right now? We hear that’s big.”

        But the real money is getting unique user interactions for the purpose of serving up more ad content. In that model, bad answers are actually better than no answers, because they force the end use to keep refining the query and searching through the site backlog.

        • fishos@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          1
          ·
          edit-2
          6 months ago

          If you don’t know the answer is bad, which confident idiots spouting off on reddit and being upvoted into infinity has proven is common, then you won’t refine your search. You’ll just accept the bad answer and move on.

          Your logic doesn’t follow. If someone doesn’t know the answer and are searching for it, they likely won’t be able to tell if the answer is correct. We literally already have that problem with misinformation. And what sounds more confident than an AI?

        • RubberDuck@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 months ago

          I don’t believe they will retain user interactions if the reason for the user interactions dissapears. The value of Google is they provide accurate search results.

          I can understand some users just want to be spoonfed an answer. But that’s not what most people expect from a search engine.

          I want google to use actual AI to filter out all the nonsense sites that turn a Reddit post into an article of 500 words using an LLM without any actual value. That should be googles proposition.

          • UnderpantsWeevil@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            6 months ago

            The value of Google is they provide accurate search results.

            They offer the most accurate results of search engines you’re familiar with. But in a shrinking field with degrading quality, that’s a low bar and sinking quick.

            I want google to use actual AI to filter out all the nonsense sites

            So did the last head of Google search, until the new CEO fired him.

      • masquenox@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        2
        ·
        6 months ago

        But this is not misinformation, it is uncontrolled nonsense.

        Fair enough… but drowning out any honest discourse with a flood of histrionic right-wing horseshit has always been the core strategy of the US propaganda model - I’d say that their AI is just doing the logical thing and taking the horseshit to a very granular level. I mean… “put glue on your pizza” is just not that far off “drink bleach to kill viruses on the inside.”

        I know I’m describing a pattern that probably wasn’t intentional (I hope) - but the pattern does look like it could fit.

        • RubberDuck@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          6 months ago

          Oh don’t get me wrong I know exactly what you mean and I agree… it’s just that the LLMs are spewing actual nonsense and that breaks the whole principle of what a search engine should do… provide me accurate results.

      • sudo42@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        6 months ago

        Google isn’t bothered by incorrect results because search results are no longer their product. Constantly rising stock values are their product now. Hype is their path to those higher values.

    • EatATaco@lemm.ee
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      2
      ·
      6 months ago

      “put glue in your tomato sauce.”

      “Omg you ate a capitalist parasite spreading misinformation intentionally!”

      When the only tool you have is a hammer, everything looks like a nail.

      • masquenox@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        4
        ·
        6 months ago

        “put glue in your tomato sauce.”

        Doesn’t sound all that different from the stuff emanating from the right’s Great Orange Hope a while back that worked pretty well to keep his base appropriately frothing at the mouth - you are free to write it off as pure coincidence… but I won’t just yet.

    • Aceticon@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      6 months ago

      LLMs trained on shitposting are too obvious for it to be quality misinformation.

      For quality disinformation they should train them solely on MBA course-work and documents produced by people with MBAs.

      Sure, the rate of false information would be even worse, but it would be formatted in slick ways meant to obfuscate meaning, which would avoid the kind of hilarity that has ensued when Google deployed an LLM trained on Reddit data and thus be much better for Google’s stock price.

  • Sentient Loom@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    77
    arrow-down
    1
    ·
    6 months ago

    Here’s a solution: don’t make AI provide the results. Let humans answer each other’s questions like in the good old days.

  • SuddenDownpour@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    78
    arrow-down
    3
    ·
    6 months ago

    Has No Solution for Its AI Providing Wildly Incorrect Information

    Don’t use it???

    AI has no means to check the heaps of garbage data is has been fed against reality, so even if someone were to somehow code one to be capable of deep, complex epistemological analysis (at which point it would already be something far different from what the media currently calls AI), as long as there’s enough flat out wrong stuff in its data there’s a growing chance of it screwing it up.

    • go_go_gadget@lemmy.world
      link
      fedilink
      English
      arrow-up
      20
      ·
      6 months ago

      The problem compounds as they post more and more content creating a feedback loop of terrible information.

  • GenosseFlosse@lemmy.nz
    link
    fedilink
    English
    arrow-up
    73
    ·
    6 months ago

    Wow, in the 2000’s and 2010’s google my impression was that this is an amazing company where brilliant people work to solve big problems to make the world a better place. In the last 10 years, all I was hoping for was that they would just stop making their products (search, YouTube) worse.

    Now they just blindly riding the AI hype train, because “everyone else is doing AI”.

    • systemglitch@lemmy.world
      link
      fedilink
      English
      arrow-up
      26
      arrow-down
      1
      ·
      6 months ago

      Huh. That made me stop and realize how long I’ve been around. Wikipedia still feels like a new addition to society to me, even though I’ve been using it for around 20 years now.

      And what you said, is something I’ve cautioned my daughter about, and first said that to her about ten years ago.

    • snooggums@midwest.social
      link
      fedilink
      English
      arrow-up
      15
      arrow-down
      1
      ·
      6 months ago

      How a non-profit site that is constantly maintained and requires cited sources was vilified for being able to be defaced for 5 minu-

      Oh wait, that was probably an astroturfing campaing by for profit companies.

  • xantoxis@lemmy.world
    link
    fedilink
    English
    arrow-up
    65
    ·
    6 months ago

    “It’s broken in horrible, dangerous ways, and we’re gonna keep doing it. Fuck you.”

  • Paradox@lemdro.id
    link
    fedilink
    English
    arrow-up
    64
    arrow-down
    1
    ·
    6 months ago

    Replace the CEO with an AI. They’re both good at lying and telling people what they want to hear, until they get caught

  • badbytes@lemmy.world
    link
    fedilink
    English
    arrow-up
    57
    arrow-down
    1
    ·
    6 months ago

    Step 1. Replace CEO with AI. Step 2. Ask New AI CEO, how to fix. Step 3. Blindly enact and reinforce steps

  • Pumpkin Escobar@lemmy.world
    link
    fedilink
    English
    arrow-up
    51
    arrow-down
    1
    ·
    6 months ago

    Rip up the Reddit contract and don’t use that data to train the model. It’s the definition of a garbage in garbage out problem.

    • SeaJ@lemm.ee
      link
      fedilink
      English
      arrow-up
      8
      ·
      6 months ago

      Jesus. I didn’t even think of that. I could totally see that being a big part of why it is giving garbage answers.

      • teejay@lemmy.world
        link
        fedilink
        English
        arrow-up
        24
        ·
        6 months ago

        Just imagine the average reddit, twitter, facebook, and instagram content. Then realize that half of that content is dumber than that. That’s half of what these AI models use to learn. The “smarter” half is probably filled with sarcasm, inside jokes, and other types of innuendo that the AI at this stage has no chance of understanding correctly.

        • SeaJ@lemm.ee
          link
          fedilink
          English
          arrow-up
          16
          ·
          6 months ago

          Reminds me of the time Microsoft unleashed their AI Twitter account and it turned into a Nazi after a couple hours. Whatever straight out of business school idiot who thought scraping the comments of the armpit of the internet was a good idea should be banned from any management position. At least it is a step up from scraping 4chan, I guess.

          • Xatolos@reddthat.com
            link
            fedilink
            English
            arrow-up
            2
            ·
            6 months ago

            The Microsoft Tay one I can understand though. Before it was released, they had also had Microsoft Xiaoice which had been in use for 2 years prior without this issue. Yay was just the English version of that.

  • Mad_Punda.de@feddit.de
    link
    fedilink
    English
    arrow-up
    51
    arrow-down
    1
    ·
    6 months ago

    these hallucinations are an “inherent feature” of  AI large language models (LLM), which is what drives AI Overviews, and this feature "is still an unsolved problem”.

    Then what made you think it’s a good idea to include that in your product now?!