Sahwa@reddthat.com to Technology@lemmy.worldEnglish · 4 個月前Father sues Google, claiming Gemini chatbot drove son into fatal delusiontechcrunch.comexternal-linkmessage-square235fedilinkarrow-up1772arrow-down116
arrow-up1756arrow-down1external-linkFather sues Google, claiming Gemini chatbot drove son into fatal delusiontechcrunch.comSahwa@reddthat.com to Technology@lemmy.worldEnglish · 4 個月前message-square235fedilink
minus-squarewonderingwanderer@sopuli.xyzlinkfedilinkEnglisharrow-up8·4 個月前Reinforcement Learning from Human Feedback It’s a method of fine-tuning and aligning LLMs which requires active human input
Reinforcement Learning from Human Feedback
It’s a method of fine-tuning and aligning LLMs which requires active human input