the whole ai-bro shtick about "ai democritizes art/programming/writing/etc" seemed always so bs to me, but i couldn't put it into words, but i think i now know how.

Kat (post-Hallowe'en edition)

@miki @lucydev Wait, you're comparing art (visual, written or musical) to operating a telephone switchboard?

miki

@KatS @lucydev Yes. At least the kind of art that people want to consume (and not want to make).

💀

@miki @KatS that explains a lot

Kat (post-Hallowe'en edition)

@miki @lucydev Wow.
It'll make for more efficient communication in future if you make it explicitly clear that you're democratising the commissioning of things, and working hard to devalue artistry in all its forms.

Talking about "democratising art" is typically read as making it easier for people to make art.
This is what leads to this kind of convoluted exchange.

miki

@lucydev @KatS The more you know about LLMs, the more "calibrated" you are about where they work (and don't work) right now. People who don't know much about them are either hypesters (mmaking a company of a thousand LLMs and firing all their employees), or LLM deniers. Both are just as crazy.

I also see not just where LLMs are right now, but where they are going. We went from coding agents being basically a joke a year ago, to them semi-autonomously solving (some) complex mathematical problems and being used for boring gruntwork by world-class, fields-medal-winning mathematicians. They can now also solve an extremely complex GPU performance engineering task that Anthropic used as an interview question for the most brilliant engineers in that discipline, *better than any human given the same amount of time*.

They're still much better at small, well-scoped and bounded tasks than at large open-ended problems, but "small and well-scoped" went from "write me a linked list implementation unconnected to anything in my code" to "write me a small feature and follow the style of my codebase." In a year. What will happen in another year? 5 years? 10 years? God only knows, and he certainly isn't telling.

💀

@miki @KatS so you're betting unfoundedly that the tech is gonna work right one day?

Kat (post-Hallowe'en edition)

@miki @lucydev How much thought do you give to the externalities of these things? Their less-desirable impact on the world in which we're trying to live?

miki

@lucydev @KatS Nothing is ever gonna work right, not even humans. Different technologies are at different points on the price-to-mistakes curve, our job is to find a combination that minimizes price while also minimizing mistakes and harm caused.

E.G. it is definitely true that humans are much, much better psychologists than LLMs, but LLLMs are free, much more widely available in abusive environments, speak your language, even if you are in a foreign country, and work at 4AM on a Saturday when you get dumped by your partner. Human psychologists do not. Very often, the choice isn't between an LLM and a human, the real choice is between an LLM and nothing (and the richer you are, the less true this is, hence the "class divide" in opinions about tech). And I'm genuinely unsure which option wins here, but considering the rate of change over the last 3 years, I woulndn't bet towards "nothing" winning for long.

💀

@KatS look @miki don't get me wrong but any time i've tried using LLMs for my work, which isn't just some fun side project but actual production-running code, LLMs have been way too unreliable. It also resulted in me knowing jack shit about my own code, which is poison for long term maintainability.

Since these models are just statistically determining the next most likely token based on training data and fine tuning, without any actual understanding or thought behind it, I seriously can't see this tech being reliable enough one day. (reliable compared to humans, i don't seek 100% reliable in this case, natural language is too imprecise for that anyways. i would expect "good enough" as "as good as a professional in the given field")

The other part of the equation is the amount of compute and electrical energy necessary to train and operate such a level, and on that level, there's no way in hell that shit is ever gonna be worth it, financially and environmentally.

i'm not expecting the "make job for phone operators easier", i expect the "when i dial a number, it should be at least as reliable and efficient at routing it correctly as a phone operator would be".

you can call me whatever you want, even llm denier if you need to, but autocorrect on steroids isn't worth exploiting other people's work or boiling our oceans.

Kat (post-Hallowe'en edition)

@lucydev @miki Similar: I'm not a "denier" - I'm utterly hostile to this mission of eliminating human expertise, knowledge and artistry.

This is pretty impressive, given that I don't even like humans all that much.

💀

@KatS

lmao same actually xD

Wanna be friends?

@miki

miki

@lucydev @KatS Autocorrect on steroids is basically GPT-3 tech. There's a lot more that goes into modern LLMs. A lot of the improvements are due to reinforcement learning, where LLMs learn to predict tokens that actually achieve some outcome, E.G. code that passes tests, answer that is judged "good" by a domain expert. There's still token prediction involved of course, but it somehow turns out that token prediction can get better scores than any human at (unseen) math olympiad questions. And people still say it's not in any way intelligent...

Kat (post-Hallowe'en edition)

@lucydev Well, I like your pinned post about hope having dirt on her face. Yes, I think we'll get on.

I'm not sure this is how the proponents of that tech expected it to bring people together, but here we are.

Kat (post-Hallowe'en edition)

@miki @lucydev The last thing I think I can usefully add to this thread is that you sound very much like the kind of person Michael Crichton wrote about.

I recommend watching Westworld some time - the movie, that is. I've never seen the series based on it.

💀

@miki @KatS if i memorize every possible answer to a specific test, i can pass too. doesn't mean i know shit about fuck.

There's no actual thinking or reasoning involved (and no, reasoning models don't actually "reason"), so yeah, an LLM isn't actually intelligent, it just shows how flawed our tests for intelligence are.

To get some actual intelligence, thinking or reasoning involved, I'd reckon we'd have to fundamentally change something in the architecture of LLMs, and use a fuckton more computing resources for a single model, and considering how much energy the current tech already wastes, and the whole shtick that made LLMs (and more broadly generative AI) work in the first place is "we discovered that there comes a point where the output gets better when we throw rediculous amounts of compute resources on the problem", and it's already getting super difficult to run and maintain.

Honestly, either you're unreasonably optimistic, or you've never taken a look at how things actually work under the hood, but I really recommend you to take a closer look at the technology you praise so much.

A couple things you could take a look at (without an AI summarizer, otherwise you'd learn jack shit):

Attention is all you need, which is the paper that sparked all that AI craze and the development of GPT models and The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
, which takes a closer look and tests reasoning models to infer strengths and weaknesses of reasoning models with all sorts of levels in problem complexity.

Honestly, before you make any claims about where the tech could be and what it could do, you should have a look at how things actually work under the hood and have a rough idea of how things work, otherwise, no offense, you're just talking out of your arse.

miki

@lucydev @KatS I have very specifically said "unseen questions."

If memorizing answers was a viable strategy to pass that test, humans would have done so.

If you still believe that there's no possible use for a tool that can get gold on a never-before-used set of math olympiad question given a few hours of access to a reasonably powerful computer, and that the existence of that tool will have no interesting impact on the world... I don't know what to tell you.

💀

@miki @KatS > If you still believe that there's no possible use for a tool that can get gold on a never-before-used set of math olympiad question given a few hours of access to a reasonably powerful computer, and that the existence of that tool will have no interesting impact on the world...

How reliable is that source? And if that's true, is it really reasonable to bet everything on this, and let this do all your work when a) you end up completely dependent on the tech and b) utterly destroy the environment in that process?

Real world problems may be less complex but might require much more context.

Oh, and don't get me started on accountability. There's a reason why curl is closing their bug bounty program.

💀

@miki @KatS oh right, and what was the sample size of the test?

An N of 1 is worth fuck all

miki

@lucydev @KatS Curl is closing their bug bounty program because it's far too easy to use LLMs to produce slop. It doesn't mean you can't use LLMs to produce non-slop, just that it is a technique some people have found to get money with not too much effort, and we haven't yet sufficiently adapted to it. This is a genuine problem.

? Offline

@lucydev @nina_kali_nina

I think it's accurate

Instead of building your own skill, control someone else's

Sure they didn't _consent_, but democracies don't ask opposition voters for consent.

It's an accurate analogy and shows why democracy isn't a good thing 🤪

Chebucto Regional Softball Club

the whole ai-bro shtick about "ai democritizes art/programming/writing/etc" seemed always so bs to me, but i couldn't put it into words, but i think i now know how.