Anthropic has developed an AI 'brain scanner' to understand how LLMs work and it turns out the reason why chatbots are terrible at simple math and hallucinate is weirder than you thought

[email protected]

you can't trust its explanations as to what it has just done.

I might have had a lucky guess, but this was basically my assumption. You can't ask LLMs how they work and get an answer coming from an internal understanding of themselves, because they have no 'internal' experience.

Unless you make a scanner like the one in the study, non-verbal processing is as much of a black box to their 'output voice' as it is to us.

[email protected]

They do it because it works on the whole. If straight titles were as effective they'd be used instead.

? Offline

But you wouldn't multiply, say, 74*14 to get the answer.

[email protected]

Don't tell me that my thoughts aren't weird enough.

[email protected]

The one weird trick that makes clickbait work

[email protected]

But then you wouldn't need to click on thir Ad infested shite website where 1-2 paragraphs worth of actual information is stretched into a giant essay so that they can show you more Ads the longer you scroll

? Offline

It really is quite unfortunate, I wish titles do what titles are supposed to do instead of being baits.but you are right, even consciously trying to avoid clicking sometimes curiosity gets the best of me. But I am improving.

[email protected]

The problem with common core math isn’t that rounding is inherently bad, it’s that you don’t start with that as a framework.

? Offline

I might. Then I can subtract 74 to get 74*14, and subtract 28 to get 72*13.

I don't generally do that to 'weird' numbers, I usually get closer to multiples of 5, 9, 10, or 11.

But a computer stores information differently. Perhaps it moves closer to numbers with simpler binary addresses.

? Offline

This is what I do, except I would add 700 and 236 at the end.

Well except I would probably add 700 and 116 or something, because my working memory fucking sucks and my brain drops digits very easily when there's more than 1

[email protected]

Maybe you're right. Maybe it's Markov chains all the way down.

The only way I can think to test this would be to "poison" the training data with faulty arithmetic to see if it is just recalling precedent or actually implementing an algorithm.

[email protected]

But you're doing two calculations now, an approximate one and another one on the last digits, since you're going to do the approximate calculation you might act as well just do the accurate calculation and be done in one step.

This solution, while it works, has the feeling of evolution. No intelligent design, which I suppose makes sense considering the AI did essentially evolve.

[email protected]

Appreciate the advice on how my brain should work.

[email protected]

Not, but I'd do 7510 + 754, then subtract the extra.

The LLM method of doing it with multiple numbers without proper interpolation though makes it extra weird

[email protected]

People are generally shit at understanding probabilities and even when they have a fairly strong math background tend to explain probablistic outcomes through anthropomorphism rather than doing the more difficult and "think-painy" statistical analysis that would be required to know if there was anything more to it.

I myself start to have thoughts that balatro is purposefully screwing me over or feeding me outcomes when it's just randomness and probability as stated.

Ultimately, it's easier (and more fun) for us to reason that way and it largely serves us better in everyday life.

But these things are entire casinos' worth of probability and statistics in and of themselves, and the people developing them want desperately to believe that they are something more than pseudorandom probabilistic fancy autocomplete engines.

Add the difficulty of getting someone to understand how something works when their salary depends on them not understanding it to the existing inability of humans to reason probabilistically and the AGI from LLM delusion becomes near impossible to shake for some folks.

I wouldn't be surprised if this AI hype bubble yields a cult in the end.

[email protected]

Can an LLM do something similar despite having never seen anything that isn’t a word or number?

No.

[email protected]

This dumbass is convinced that humans are chatbots likely because chatbots are his only friends.

[email protected]

Thanks for copypasting. It should be criminal to share a clickbait non-descriptive headline without atleast copying a couple paragraphs for context.

[email protected]

Sounds scary. I read a story the other day about a dude who really got himself a discord server with chatbots, and that was his main place of "communicating" and "socializing"

[email protected]

This anecdote has the makings of a "men will literally x instead of going to therapy" joke.

On a more serious note though, I really wish people would stop anthropomorphisizing these things, especially when they do it while dehumanizing and devaluing humanity as a whole.

But that's unlikely to happen. It's the same type of people that thought the mind was a machine in the first industrial revolution, and then a CPU in the third...now they think it's an LLM.

LLMs could have some better (if narrower) applications if we could stop being so stupid as to inject them into things where they are obviously counterproductive.

agnos.is Forums

Anthropic has developed an AI 'brain scanner' to understand how LLMs work and it turns out the reason why chatbots are terrible at simple math and hallucinate is weirder than you thought