Anthropic has developed an AI 'brain scanner' to understand how LLMs work and it turns out the reason why chatbots are terrible at simple math and hallucinate is weirder than you thought

[email protected]

"Ask Claude to add 36 and 59 and the model will go through a series of odd steps, including first adding a selection of approximate values (add 40ish and 60ish, add 57ish and 36ish). Towards the end of its process, it comes up with the value 92ish. Meanwhile, another sequence of steps focuses on the last digits, 6 and 9, and determines that the answer must end in a 5. Putting that together with 92ish gives the correct answer of 95," the MIT article explains."

That is precisrly how I do math. Feel a little targeted that they called this odd.

[email protected]

I think a lot of services are doing this behind the scenes already. Otherwise chatgpt would be getting basic arithmetic wrong a lot more considering the methods the article has shown it's using.

[email protected]

Rote memorization should be minimized in school curriculum

[email protected]

Another very surprising outcome of the research is the discovery that these LLMs do not, as is widely assumed, operate by merely predicting the next word. By tracing how Claude generated rhyming couplets, Anthropic found that it chose the rhyming word at the end of verses first, then filled in the rest of the line.

If the llm already knows the full sentence it's going to output from the first word it "guesses" I wonder if you could short circuit it and say just give the full sentence instead of doing a cycle for each word of the sentence, could maybe cut down on llm energy costs.

[email protected]

(72 * 10) + (2 * 3) = x

There, fixed, because otherwise order of operation gets fucky.

[email protected]

Then take that concept further, and let it keep introspecting and inspecting how it comes to the conclusions it does and eventually....

[email protected]

Which is exactly how we do it.

[email protected]

You know they don't think - even though "It's a peculiar truth that we don't understand how large language models (LLMs) actually work."?

It's truly shocking to read this from a mess of connected neurons and synapses like yourself. You're simply doing fancy word prediction of the next word /s

[email protected]

But how is this different from your average redditor?

[email protected]

I use a calculator. Which an AI should also be and not need to do weird shit to do math.

[email protected]

I mean it implies that they CAN start with the conclusion or the "thought" and then generate the text to verbalize that.

It's shocking to what length humans will go to explain how their wetware neural network is fundamentally different and it's impossible for LLMs to think or reason in any way. Honestly LLMs teach us more about human intelligence (or the lack thereof) than machine intelligence. Like obi wan said, "The ability to speak does not make one intelligent" haha.

[email protected]

No it hasn't. When you program you break down the problem into many smaller sub programs and then codify them. There are errors that need debugging. But never "how does this part of the program I wrote work?".

There are some cases like detergents, apparently until recently we didn't know exactly how it works. But human engineered tools are not comparable to this.

[email protected]

It's like that "Joey Repeat After Me" meme from friends haha

[email protected]

Better yet, teach AI to write code replacing specific optimized AI networks. Then automatically profile and optimize and unit test!

[email protected]

Fascist. If someone does maths differently than your preference, it's not "weird shit". I'm facile with mental math despite what's perhaps a non-standard approach, and it's quite functional to be able to perform simple to moderate levels of mathematics mentally without relying on a calculator.

[email protected]

Function calling is a thing chatbots can do now

[email protected]

Wtf hahahahaha

[email protected]

But who is going around asking these bots to specifically do math? Like in normal usage, Ive never once done that because I could just use a calculator or spreadsheet software if I need to get fancy lol

[email protected]

Someone put 69 to research and then to article. Nice trolling.

[email protected]

How I'd do it is basically

72 * (10+3)

(72 * 10) + (72 * 3)

(720) + (3*(70+2))

(720) + (210+6)

(720) + (216)

936

Basically I break the numbers apart into easier chunks and then add them together.

agnos.is Forums

Anthropic has developed an AI 'brain scanner' to understand how LLMs work and it turns out the reason why chatbots are terrible at simple math and hallucinate is weirder than you thought