Why I am not impressed by A.I.
-
[email protected]replied to [email protected] last edited by
Yes, at some point the meme becomes the training data and the LLM doesn't need to answer because it sees the answer all over the damn place.
-
[email protected]replied to [email protected] last edited by
What would have been different about this if it had impressed you? It answered the literal question and also the question the user was actually trying to ask.
-
[email protected]replied to [email protected] last edited by
I really don't see the point OP is trying to make with this example though. It accurately answered their misspelled question, and also accurately answered the question they were apparently trying to ask. I don't see the problem.
-
[email protected]replied to [email protected] last edited by
Except many many experts have said this is not why it happens. It cannot count letters in the incoming words. It doesn't even know what "words" are. It has abstracted tokens by the time it's being run through the model.
It's more like you don't know the word strawberry, and instead you see:
How many 'r's in ?And you respond with nonsense, because the relation between 'r' and is nonsensical.
-
[email protected]replied to [email protected] last edited by
It doesn't see "strawberry" or "straw" or "berry". It's closer to think of it as seeing , an abstract token representing the same concept that the training data associated with the word.
-
[email protected]replied to [email protected] last edited by
What gaps in functionality do you see being demonstrated in this example? It answered both the literal question and the intended question, recognizing the misspelling. I may be having a complete brain-fart but to me this seems like a demo of it working very well.
-
[email protected]replied to [email protected] last edited by
It didn't? StRawbeRy has 2 rs. StRawbeRRy has 3.
-
[email protected]replied to [email protected] last edited by
How many strawberries could a strawberry bury if a strawberry could bury strawberries
-
[email protected]replied to [email protected] last edited by
OHHHHHHH.... my bad. I'm an idiot. Being an LLM it's giving the answer it thinks a human such as myself would come up with.
-
[email protected]replied to [email protected] last edited by
Works fine for me in o3-mini-high:
Counting letters in “strawberry”
Alright, I’m checking: the word “strawberry” is spelled S T R A W B E R R Y. Let me count the letters: S (1), T (2), R (3), A (4), W (5), B (6), E (7), R (8), R (9), Y (10). There are three R’s: in positions 3, 8, and 9. So, the answer is 3. Even if we ignore case, the count still holds. Therefore, there are 3 r’s in “strawberry.”
-
[email protected]replied to [email protected] last edited by
Maybe you're a bot too...
-
[email protected]replied to [email protected] last edited by
Not last time I checked, but we all could be as far as you know.
-
[email protected]replied to [email protected] last edited by
Or you might eliminate some that are what you are looking for because the summaries are inaccurate.
Guess it depends on whether an unreliable system is still better than being overwhelmed with choices.
-
[email protected]replied to [email protected] last edited by
You asked a stupid question and got a stupid response, seems fine to me.
-
[email protected]replied to [email protected] last edited by
Yes, nobody asking that question is wonderring about the "straw" part of the word. They're asking, is the "berry" part one, or two "r"s
-
[email protected]replied to [email protected] last edited by
A normal person would say 'strawberry with two "r"s'
-
[email protected]replied to [email protected] last edited by
Answer, you're using it wrong /stevejobs
-
[email protected]replied to [email protected] last edited by
Really? AI has been marketed as being able to count the r’s in “strawberry?” Please link to this ad.
-
[email protected]replied to [email protected] last edited by
We have one that indexes all the wikis and GDocs and such at my work and it’s incredibly useful for answering questions like “who’s in charge of project 123?” or “what’s the latest update from team XYZ?”
I even asked it to write my weekly update for MY team once and it did a fairly good job. The one thing I thought it had hallucinated turned out to be something I just hadn’t heard yet. So it was literally ahead of me at my own job.
I get really tired of all the automatic hate over stupid bullshit like this OP. These tools have their uses. It’s very popular to shit on them. So congratulations for whatever agreeable comments your post gets. Anyway.