It still can’t count the Rs in strawberry, I’m not worried.

[email protected]

I mean I tested it out, even tbough I am sure your trolling me and DeepSeek correctly counts the R's

[email protected]

Not trolling you at all:

https://lemmy.world/comment/14735060

[email protected]

Non thinking prediction models can't count the r's in strawberry due to the nature of tokenization.

However openai o1 and deep seek r1 can both reliably do it correctly

[email protected]

Yes it can

[email protected]

[email protected]

[email protected]

[email protected]

[email protected]

Clearly not the first try

[email protected]

“Again” so it failed the first time. Got it.

[email protected]

It didn't, I just wanted a short reply. Though it failed when I asked again at the same chat. But when asked to split the word to 2 parts it became sure that the correct answer is 3.

[email protected]

[email protected]

[email protected]

That isn't at all how something like a diffusion based model works actually.

[email protected]

[email protected]

[email protected]

[email protected]

[email protected]

https://ibb.co/wVNsn5H

https://ibb.co/HpK5G5Pp

https://ibb.co/sp1wGMFb

https://ibb.co/4wyKhkRH

https://ibb.co/WpBTZPRm

https://ibb.co/0yP73j6G

Note that my tests were via groq and the r1 70B distilled llama variant (the 2nd smartest version afaik)

[email protected]

It’s because LLMs don’t work with letters. They work with tokens that are converted to vectors.

They literally don’t see the word “strawberry” in order to count the letters.

Splitting the letter probably separates them into individual tokens

agnos.is Forums

It still can’t count the Rs in strawberry, I’m not worried.