BlushedPotatoPlayers@sopuli.xyz to Technology@lemmy.world · English · 9 months ago
AI chatbots tend to choose violence and nuclear strikes in wargames (www.newscientist.com)
Even_Adder@lemmy.dbzer0.com · English · 9 months ago
If that’s really how they work, it wouldn’t explain these:
https://notes.aimodels.fyi/researchers-discover-emergent-linear-strucutres-llm-truth/
https://notes.aimodels.fyi/self-rag-improving-the-factual-accuracy-of-large-language-models-through-self-reflection/
https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html
https://poke-llm-on.github.io/
https://arxiv.org/abs/2310.02207