thickertoofan@lemm.ee to LocalLLaMA@sh.itjust.worksEnglish · 1 年前Microsoft just released BitNet!github.comexternal-linkmessage-square6linkfedilinkarrow-up119arrow-down10file-textcross-posted to: technology@lemmy.ziptechnology@lemmy.ml
arrow-up119arrow-down1external-linkMicrosoft just released BitNet!github.comthickertoofan@lemm.ee to LocalLLaMA@sh.itjust.worksEnglish · 1 年前message-square6linkfedilinkfile-textcross-posted to: technology@lemmy.ziptechnology@lemmy.ml
minus-squarehendrik@palaver.p3x.delinkfedilinkEnglisharrow-up1·1 年前Nice. Any additional info on how difficult it was to train this and whether we can expect more? They have a 3B model in the demo video, but doesn’t seem like they released that… I mean I’d like something a bit larger.
Nice. Any additional info on how difficult it was to train this and whether we can expect more? They have a 3B model in the demo video, but doesn’t seem like they released that… I mean I’d like something a bit larger.