Poisoned AI went rogue during training and couldn't be taught to behave again in 'legitimately scary' study

L4sBot@lemmy.world · 9 months ago

Daxtron2@startrek.website · 9 months ago

LLM trained on inflammatory data produces inflammatory results, shocking.

JustMy2c@lemm.ee · 9 months ago

I know we don’t like them here but the word reddit is not banned (yet)

Daxtron2@startrek.website · 9 months ago

What? What does my comment have to do with Reddit?

JustMy2c@lemm.ee · 9 months ago

So you’re saying that “Inflammatory data” isn’t a reference to reddit? :D

Daxtron2@startrek.website · 9 months ago

Not inherently, I’m sure that’s part of it but it’s really everywhere. Even here on Lemmy I’ve run into nasty folk

JustMy2c@lemm.ee · 9 months ago

True but it’s reddit that’s served as a base for most models…

Daxtron2@startrek.website · 9 months ago

Not just reddit, LAION is a huge dataset

JustMy2c@lemm.ee · 9 months ago

Obviously, but reddit is in the Goldilocks zone where you get coherent, intelligent stuff plus humor and facts.

But it’s still toxic for an AI.

Daxtron2@startrek.website · 9 months ago

Saying it served as the base for most models is just objectively incorrect though

kent_eh@lemmy.ca · 9 months ago

I’d say using Twitter and Facebook would be worse than reddit. Or, and I shudder to think about it, truth social…

JustMy2c@lemm.ee · 9 months ago

Reddit is used more for AI models than those…

Chocrates@lemmy.world · 9 months ago

No, the LLM is the AI. OP is saying if you train it with hate it’s gonna spit out hate

JustMy2c@lemm.ee · 9 months ago

And I’m saying that reddit data is sublime for AI. And specifically that it’s infested with toxicity