"Chat GPT told me that it *can't* alter its data set but it did say it could simulate what it would be like if it altered it's data set"
-
@futurebird @CptSuperlative @emilymbender Summarisation is the one I’d be most nervous about because creating a summary is hard: it requires understanding the content and knowing which parts are relevant in a given context, which is why LLMs tend to be awful at it. They don’t understand the content, which is how you get news summaries that get the subject and object the wrong way around in a murder. They don’t know what is important, which is how you get email summaries that contain a scam message and strip all of the markers that would make it obvious that the message is a scam.
If you’re going to read all of them and are just picking an order, that’s probably fine. The worst that a bad summary can do is make you read them in the wrong order and that’s not really a problem.
@david_chisnall @futurebird @CptSuperlative @emilymbender btw, we are working on "edge AI" in a research project and have taken to reframing "summarization" of the source information as "wayfinding" through the information.
Offering fidelity estimates and affordances for navigating the source text from the condensed version should (we hope) shape users' mental model so they understand they are using a stochastic machine that is merely there to help them work with large texts. Still an early working hypothesis.
-
@david_chisnall @CptSuperlative @emilymbender
It can summarize scientific papers well in part because they have a clear style and even come with an abstract.
The words and phrases in the abstract of a paper reliably predict the content and main ideas of the paper.
Moreover, even if you remove the abstracts, its training data contains plenty of papers that do have them.
@futurebird @david_chisnall @CptSuperlative @emilymbender
But the abstract is already a summary of the paper you can scan to tell if the paper will be useful to you, and you can (usually) trust that summary to be accurate to the content of the paper and concise enough to include the most relevant points. You can't assume the same of an LLM summary, so it's worse than an abstract search.
I can see the advantage of a syntax- and context-aware abstract search if LLMs were that, but they aren't.
-
@Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender
If I asked chat GPT to "turn off the porch light" and it said "OK, I've turned off the light on your porch," I would know that it has not really done this. It has no way to access my porch light. I would realize that it is just giving a text answer that fits the context of the previous prompts.
So, why do people think it makes sense to ask chat GPT to explain how it produced a response?
@futurebird @Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender because they secretly hope it will turn off the porch light and then do their bidding and take over the world for them.
-
@Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender
If I asked chat GPT to "turn off the porch light" and it said "OK, I've turned off the light on your porch," I would know that it has not really done this. It has no way to access my porch light. I would realize that it is just giving a text answer that fits the context of the previous prompts.
So, why do people think it makes sense to ask chat GPT to explain how it produced a response?
@futurebird @Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender sigh the stochastic parrots paper (thanks Emily! <3) did an excellent job of explaining the reason. astonishingly, it did so before this was a widespread phenomenon.
-
@futurebird @Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender sigh the stochastic parrots paper (thanks Emily! <3) did an excellent job of explaining the reason. astonishingly, it did so before this was a widespread phenomenon.
@futurebird @Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender
people assess credibility by building a mental model of the person they're talking to, based on how they speak and what they say. the machine subverts that process by being something we don't have models for.
-
@Jirikiha @nazokiyoubinbou @joby @CptSuperlative @emilymbender
If I asked chat GPT to "turn off the porch light" and it said "OK, I've turned off the light on your porch," I would know that it has not really done this. It has no way to access my porch light. I would realize that it is just giving a text answer that fits the context of the previous prompts.
So, why do people think it makes sense to ask chat GPT to explain how it produced a response?
@futurebird Remember when it was going around that ChatGPT couldn't count the number of letters in a given word? Like, saying Raspberry had 2 R's?
It's because it breaks words down into chunks (tokens), not letters, for some unfathomable reason. Thing is, if you asked it how it figured that out, it would show its work: break the word down into individual letters, count each letter, arrive at a different answer that might ALSO still be wrong, somehow, and then go "See? Like that."
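(To make "chunks" concrete: the chunks are tokens, and the split below is invented purely for illustration rather than taken from any real tokenizer. A minimal Python sketch of why letter questions are awkward for a model that only ever sees tokens:)

    # Toy illustration only: the token split is made up; real tokenizers differ.
    word = "raspberry"
    toy_tokens = ["ras", "p", "berry"]   # hypothetical chunks the model might "see"

    # Ordinary code that works on letters gets the count right trivially...
    print(word.count("r"))               # -> 3

    # ...but the model never operates on letters, only on opaque chunks like these,
    # so "how many r's?" asks about units it doesn't directly represent.
    print(toy_tokens)                    # -> ['ras', 'p', 'berry']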
-
@futurebird @david_chisnall @CptSuperlative @emilymbender
But the abstract is already a summary of the paper you can scan to tell if the paper will be useful to you, and you can (usually) trust that summary to be accurate to the content of the paper and concise enough to include the most relevant points. You can't assume the same of an LLM summary, so it's worse than an abstract search.
I can see the advantage of a syntax- and context-aware abstract search if LLMs were that, but they aren't.
@petealexharris @david_chisnall @CptSuperlative @emilymbender
I want you to give it a try. Take one of those folders of PDFs of papers you are "gonna totally read" and give them to https://notebooklm.google.com/
Ask for a summary. You are correct about the limitations, and knowing them is IMO better than not understanding them, but the quality of these guesses is very good and useful in the right contexts. Until I saw this I couldn't understand why so many people were using it at all.
-
@futurebird Remember when it was going around that ChatGPT couldn't count the number of letters in a given word? Like, saying Raspberry had 2 R's?
It's because it breaks words down into chunks (tokens), not letters, for some unfathomable reason. Thing is, if you asked it how it figured that out, it would show its work: break the word down into individual letters, count each letter, arrive at a different answer that might ALSO still be wrong, somehow, and then go "See? Like that."
Because when you ask it "how did you come up with that answer?" it draws on its vast training data's examples of people explaining how they came up with answers, and then it produces an answer similar to what people have said.
Maybe part of what trips people up is the hubris of thinking you can ask the LLM a question it has no training data for. The training data is so huge that this is really unlikely.
-
@petealexharris @david_chisnall @CptSuperlative @emilymbender
I want you to give it a try. Take one of those folders of PDFs of papers you are "gonna totally read" and give them to https://notebooklm.google.com/
Ask for a summary. You are correct about the limitations, and knowing them is IMO better than not understanding them, but the quality of these guesses is very good and useful in the right contexts. Until I saw this I couldn't understand why so many people were using it at all.
@petealexharris @david_chisnall @CptSuperlative @emilymbender
Is this an efficient use of electricity and computing power? This is a good question and the answer may be "no."
-
Yes, exactly.
They seem (willingly? accidentally?) to think that they are having an earnest two-way conversation, when they're really just watching a non-sentient spreadsheet change numbers depending on what they say.
@FediThing @futurebird the metaphor I've used with my "non-technical" friends and family is that LLMs are basically like billions of Galton boards that each have one hole for a marble on the input side and one output hole for every word and punctuation mark in every language.
Connected to each of the output holes is another Galton board with its own input. It's a gross oversimplification that ignores context and attention, and it's really better suited to explaining a Markov chain, but so far it has helped me drive home the point that the model is "just" stochastically picking the "most correct" word given the preceding word.
It's also useful for visualizing why training is so expensive: you have to tune every peg of every Galton board until the system behaves correctly.
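(To make the Markov-chain version of that picture concrete, here is a minimal Python sketch with invented probabilities; the table stands in for the "pegs" that training would have to tune:)

    import random

    # Toy next-word table with made-up weights. Each row plays the role of one
    # Galton board: the current word goes in, and the weighted "output holes"
    # decide which word the marble falls into next.
    next_word = {
        "the": {"cat": 0.6, "dog": 0.3, "moon": 0.1},
        "cat": {"sat": 0.7, "ran": 0.3},
        "dog": {"sat": 0.4, "ran": 0.6},
        "sat": {"down": 1.0},
        "ran": {"away": 1.0},
    }

    def generate(word, steps=4):
        # Drop a marble through one board per step: sample the next word from a
        # distribution conditioned only on the current word (no context, no attention).
        out = [word]
        for _ in range(steps):
            choices = next_word.get(word)
            if not choices:
                break
            words, weights = zip(*choices.items())
            word = random.choices(words, weights=weights)[0]
            out.append(word)
        return " ".join(out)

    print(generate("the"))   # e.g. "the cat sat down"

Training an actual LLM amounts to tuning billions of such weights at once, which is the "every peg of every board" cost described above.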
-
"Chat GPT told me that it *can't* alter its data set but it did say it could simulate what it would be like if it altered it's data set"
NO. It has no idea whether it's telling the truth or not, and when it says "I can simulate what this would be like" that is just more text that fits the prompt.
This guy is pretty sharp about philosophy, but people really, really, really do not *get* how this works.
"Chat GPT told me this is what it did"
No! It told you the kind of thing you'd expect it to say if you asked it what it did!
@futurebird There's always been a small minority of people who got overly taken in by, say, Eliza and other ancient chatbots whose nonsense isn't anywhere near as plausible.
Now that the language generation is much more consistently plausible, I guess in retrospect it doesn't surprise me that much that so many more people would get taken in so easily.
-
@futurebird I wish I could.
But it would just make people angry, and I feel like I've done enough of that today already.
I'm not angry, I'd just really like to know what you are getting at.
-
@cykonot @futurebird There was a really neat podcast called the Sci Phi podcast where a grad student was interviewing a bunch of philosophers of science and I loved it, but I can't find it any more
Edit: looks like it is still going! https://sciphipodcast.org/podcast
@semitones @cykonot @futurebird Robinson's Podcast has a lot of good philosophy of science episodes, with a particular focus on foundations of physics, but per OP there was a recent episode w/ Ned Block that is one of many touching on LLMs and philosophy of mind: https://youtu.be/wM1fcZr0iSk