Sahwa@reddthat.com to Technology@lemmy.worldEnglish · 2 months agoFather sues Google, claiming Gemini chatbot drove son into fatal delusiontechcrunch.comexternal-linkmessage-square232linkfedilinkarrow-up1756arrow-down117cross-posted to: fuck_ai@lemmy.worldtechnology@lemmit.online
arrow-up1739arrow-down1external-linkFather sues Google, claiming Gemini chatbot drove son into fatal delusiontechcrunch.comSahwa@reddthat.com to Technology@lemmy.worldEnglish · 2 months agomessage-square232linkfedilinkcross-posted to: fuck_ai@lemmy.worldtechnology@lemmit.online
minus-squaremisery mansion@lemmy.worldlinkfedilinkEnglisharrow-up6·2 months agoWhat is an rlhf data set?
minus-squarewonderingwanderer@sopuli.xyzlinkfedilinkEnglisharrow-up8·2 months agoReinforcement Learning from Human Feedback It’s a method of fine-tuning and aligning LLMs which requires active human input
What is an rlhf data set?
Reinforcement Learning from Human Feedback
It’s a method of fine-tuning and aligning LLMs which requires active human input