It still can’t count the Rs in strawberry, I’m not worried.
-
I mean I tested it out, even tbough I am sure your trolling me and DeepSeek correctly counts the R's
-
Not trolling you at all:
-
Non thinking prediction models can't count the r's in strawberry due to the nature of tokenization.
However openai o1 and deep seek r1 can both reliably do it correctly
-
Yes it can
-
-
-
-
-
Clearly not the first try
-
“Again” so it failed the first time. Got it.
-
It didn't, I just wanted a short reply. Though it failed when I asked again at the same chat. But when asked to split the word to 2 parts it became sure that the correct answer is 3.
-
-
-
That isn't at all how something like a diffusion based model works actually.
-
-
-
-
-
Note that my tests were via groq and the r1 70B distilled llama variant (the 2nd smartest version afaik)
-
It’s because LLMs don’t work with letters. They work with tokens that are converted to vectors.
They literally don’t see the word “strawberry” in order to count the letters.
Splitting the letter probably separates them into individual tokens