Researchers Put Google Gemini in Charge of an Entire Coffee Shop, and It's Inexorably Driving It Out of Business

other_cat@piefed.zip · 1 day ago

mnemonicmonkeys@sh.itjust.works · 1 day ago

LLMs are a technological dead end. They aren’t interesting in the slightest, as anything they can do is already done more effectively and efficiently with other tools.

ericwdhs@discuss.online · 1 day ago

I think LLMs are an interesting technology. Of course, the output is inherently untrustworthy, and that rules out a ton of applications tech bros are trying to cram it into.

blargh513@sh.itjust.works · 1 day ago

Huh?

I think people just need to reset their expectations.

I asked one for help interpreting how a PCI policy applies (credit card regulatory stuff). I gave it the situation and it provided a good answer; when I asked our compliance team about it, they agreed.

That saved me a lot of time. I don’t see how that’s a dead end. Then I had it draft a response to the person asking questions; I tuned it a little to my liking and sent it. What might have taken me an hour before took 10 minutes. This seems like a helpful thing, not a bad thing. I’m not sure what other technology would have done that.

SaveTheTuaHawk@lemmy.ca · 9 hours ago

But you had to ask your compliance team. Now repeat after your compliance team has been laid off. Good luck.

petrol_sniff_king@lemmy.blahaj.zone · 16 hours ago

> I had it draft a response to the person asking questions; I tuned it a little to my liking and sent it.

Gemini, remind me not to ask blargh any questions.

Also, Gemini, my daughter is asking for someone to play with her. Can you run around with the feather wand and have her chase it or something?

lIlIlIlIlIlIl@lemmy.world · 23 hours ago

Do you have any examples?

SaveTheTuaHawk@lemmy.ca · 9 hours ago

In scientific queries. LLMs return the answer that dominates their training data, so if a system or model was recently proven wrong, they still return the wrong answer.

If you make very specific queries about DNA or protein sequences, they usually generate fabrications that are completely wrong.

They tend to return answers learned from the Internet, an uncurated pile of dogshit when it comes to science.

mnemonicmonkeys@sh.itjust.works · 21 hours ago

Google search, up until about 5 years ago. Then they enshittified it in favor of AI summaries that regularly get shit wrong.

FauxLiving@lemmy.world · 1 day ago

> They aren’t interesting in the slightest, as anything they can do is already done more effectively and efficiently with other tools

Then why are the other tools not being used?

LLMs translate much better than anything that was engineered. Summarization of text is another application where there are simply no engineered counterparts.

LLMs certainly don’t live up to the absurd hype created by the tech sector, but it is just as absurd to state that they are worse than other tools in all tasks.