• diffuselight@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      Cost reduction in the field is orders of magnitude potential. Look at llama running on everything down to a raspy pi after 2 months.

      There are massive gains to be made - once we have dedicated hardware for transformers, that’s orders of magnitude more.

      See your phone being able to playback 24h of video but die after 3h of browsing? Dedicated hardware codec support

      • hottari@lemmy.ml
        link
        fedilink
        arrow-up
        1
        arrow-down
        3
        ·
        1 year ago

        Yeah but Llama’s quality cannot compete with ChatGPT models (Doesn’t matter what model you use, if you want good and FAST results, you require serious compute). We do have commercial dedicated AI chips from NVDA, last time I checked you had to make an order to even get a price. George Hotz who is also working on something similar, by his account from a Lex Fridman podcast mentioned that a personal AI rig would have to be closer to a mainframe’s size.

        There’s nothing I have seen so far that leads me to believe that generative AI gets more efficient with weaker hardware.