AGI achieved 🤖
-
That came later though, as in “I had dinner with the Mrs last night.”
Yes, but it did come, and it took its place as the common usage. So much so that Ms. is used to describe a woman both with and without reference to marital status.
I'm down with using Mrs. not to refer to marital status, but imo just going with Ms. is clearer and easier because of how deeply associated Mrs. is with it.
-
what do you mean by spell fine? They're just emitting the tokens for the words. Like, it's not writing "strawberry," it's writing tokens <302, 1618, 19772>, which correspond to st, raw, and berry respectively. If you ask it to put a space between each letter, that will disrupt the tokenization mechanism, and it's going to be quite liable to make mistakes.
I don't think it's really fair to say that the lookup 19772 -> berry counts as the LLM being able to spell, since the LLM isn't operating at that layer. It doesn't really emit letters directly. I would argue its inability to reliably spell words when you force it to go letter-by-letter or answer queries about how words are spelled is indicative of its poor ability to spell.
what do you mean by spell fine?
I mean that when you ask them to spell a word they can list every character one at a time.
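To make the token point above concrete, here is a toy sketch in Python. The three IDs for st, raw, and berry are the ones quoted in the comment and are not checked against any real tokenizer; the greedy longest-match rule and the `char_id` fallback are assumptions made up for illustration.

```python
# Toy vocabulary: IDs copied from the comment above, not from a real tokenizer.
VOCAB = {"st": 302, "raw": 1618, "berry": 19772}


def char_id(ch):
    # Hypothetical fallback: give every single character its own ID.
    return 100000 + ord(ch)


def tokenize(text):
    """Greedy longest-match tokenization over the toy vocabulary."""
    ids = []
    i = 0
    while i < len(text):
        match = None
        # Try the longest vocabulary piece that matches at position i.
        for piece in sorted(VOCAB, key=len, reverse=True):
            if text.startswith(piece, i):
                match = piece
                break
        if match is not None:
            ids.append(VOCAB[match])
            i += len(match)
        else:
            ids.append(char_id(text[i]))
            i += 1
    return ids


print(tokenize("strawberry"))                # three IDs, no letters in sight
print(len(tokenize("s t r a w b e r r y")))  # spacing forces one ID per character
```

In this toy setup the spaced-out version turns into 19 separate IDs instead of 3, which is the kind of disruption the comment is describing.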
-
I'm still puzzled by what a mess this war must have been if at times you had someone not clearly identifiable, yet close enough that you could do a shibboleth check on them, while at any moment either of you could be shot dead.
Also, the current conflict of Russia vs Ukraine seems to have produced the Ukrainian 'паляница' as a check, but as I have no connection to actual Ukrainians and their UAF, I can't say whether that's entirely confined to the internet.
Have you ever been to a very dense jungle or forest... at midnight?
Ok, now, drop mortar and naval artillery shells all over it.
For weeks, or months.
The holes this creates are commonly used by both sides as cover and concealment.
Also, it's often raining, sometimes quite heavily, such that these holes will fill up with water, and you are thus soaking wet.
Ok, now, add in pillboxes and bunkers, as well as a few spiderwebs of underground tunnel networks, many of which have concealed entrances.
You do not have a phone. GPS does not exist.
You might have a map, which is out of date, and you might have a compass, if you didn't drop or break it.
A radio is either something stationary, or is roughly the size and weight of a miniature refrigerator (maybe somewhat less), and one bullet or good piece of shrapnel will take it out of commission.
Ok, now, you and all your buddies are either half starving or actually starving, beyond exhausted, getting maybe an average of 2 to 4 hours of sleep, and you, and the enemy, are covered in dirt, blood and grime.
Also, you and everyone else may or may not have malaria, or some other fun disease, so add shit and vomit to the mix of what everyone is covered in.
Ok! Enjoy your 2 to 8 week long camping trip from hell, in these conditions... also, kill everyone that is trying to kill you, soldier.
-
then 14b, man sooo close...
I wonder how QWEN 3.0 performs, because it apparently surpasses DeepSeek
-
Next step: how many r's in Lollapalooza
With reasoning (this is QWEN on HuggingChat; it says there are zero)
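For reference, a plain character count is deterministic. A minimal Python sketch (`count_letter` is a hypothetical helper name, not something from the thread):

```python
def count_letter(word, letter):
    # Case-insensitive count of a single letter in a word.
    return word.lower().count(letter.lower())


print(count_letter("Lollapalooza", "r"))  # 0
print(count_letter("strawberry", "r"))    # 3
```

So zero is the correct answer for Lollapalooza.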
-
Yes but it did come, and took place as the common usage. So much so that Ms. Is used to describe a woman both with and without reference to marital status.
I'm down with using Mrs. not to refer to marital status but imo just going with Ms. Is clearer and easier because of how deeply associated Mrs. Is with it.
That’s up to you, I much prefer Mrs. Ms. feels somehow condescending to me.
-
We gotta raise the bar, so they keep struggling to make it “better”
::: spoiler My attempt
0000000000000000
0000011111000000
0000111111111000
0000111111100000
0001111111111000
0001111111111100
0001111111111000
0000011111110000
0000111111000000
0001111111100000
0001111111100000
0001111111100000
0001111111100000
0000111111000000
0000011110000000
0000011110000000
Btw, I refuse to give my money to AI bros, so I don’t have the “latest and greatest”
:::
Tested on ChatGPT o4-mini-high
It sent me this
0 0 0 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 0 0 1 1 1 0 0 0 0 0 0 0 1 1 1 0 0 0 0 1 1 1 0 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0
I asked it to remove the spaces
0001111100000000
0011111111000000
0011111110000000
0111111111100000
0111111111110000
0011111111100000
0001111111000000
0011111100000000
0111111111100000
1111111111110000
1111111111110000
1111111111110000
1111111111110000
0011100111000000
0111000011100000
1111000011110000
I guess I just murdered a bunch of trees and killed a random dude with the water it used, but it looks good
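If the rows of 0s and 1s are hard to read, a few lines of Python can redraw them with more visible characters. This is only a sketch: `render` is a hypothetical helper, and the choice of `#` and `.` is arbitrary. The rows are the spaceless o4-mini-high output from above.

```python
# The 16 rows of the spaceless bitmap posted above.
ROWS = (
    "0001111100000000 0011111111000000 0011111110000000 0111111111100000 "
    "0111111111110000 0011111111100000 0001111111000000 0011111100000000 "
    "0111111111100000 1111111111110000 1111111111110000 1111111111110000 "
    "1111111111110000 0011100111000000 0111000011100000 1111000011110000"
).split()


def render(rows, on="#", off="."):
    # Map each "1" to the on character and each "0" to the off character.
    return "\n".join(
        "".join(on if bit == "1" else off for bit in row) for row in rows
    )


print(render(ROWS))
```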
-
I really like checking these myself to make sure it’s true. I WAS NOT DISAPPOINTED!
(Total Rs is 8. But the LOGIC ChatGPT pulls out is ……. remarkable!)
Try with o4-mini-high. It's made to think more like a human by checking its answer and working step by step, rather than just kinda guessing one like here
-
And yet they can seemingly spell and count (small numbers) just fine.
The problem is that it's not actually counting anything. It's simply predicting text, based on patterns in its training data, that relates to that word and the number of R's in it. There's no mechanism within the LLM to actually count things. It is not designed with that function. This is not general AI; this is a generative language model that's using what it learned from a vast, vast store of text to put words together that sound like they answer the question that was asked.
-
I really like checking these myself to make sure it’s true. I WAS NOT DISAPPOINTED!
(Total Rs is 8. But the LOGIC ChatGPT pulls out is ……. remarkable!)
-
I want an option to select Marvin the paranoid android mood: "there's your answer, now if you could leave me to wallow in self-pity"
Here I am, emissions the size of a small country, and they ask me to count letters...
-
ohh god, I never thought to ask reasoning models,
DeepSeek R1 7b was gold too
It's painful how Reddit that is...
So,
Now,
Alright,
-
I really like checking these myself to make sure it’s true. I WAS NOT DISAPPOINTED!
(Total Rs is 8. But the LOGIC ChatGPT pulls out is ……. remarkable!)
What is this devilry?
-
I want an option to select Marvin the paranoid android mood: "there's your answer, now if you could leave me to wallow in self-pity"
Lol someone could absolutely do that as a character card.
-
And people are trusting these things to do jobs / parts of jobs that humans used to do.
Humans are pretty dumb sometimes lol
-
Have you ever been to a very dense jungle or forest... at midnight?
Ok, now, drop mortar and naval artillery shells all over it.
For weeks, or months.
The holes this creates are commonly used by both sides as cover and concealment.
Also, it's often raining, sometimes quite heavily, such that these holes will fill up with water, and you are thus soaking wet.
Ok, now, add in pillboxes and bunkers, as well as a few spiderwebs of underground tunnel networks, many of which have concealed entrances.
You do not have a phone. GPS does not exist.
You might have a map, which is out of date, and you might have a compass, if you didn't drop or break it.
A radio is either something stationary, or is roughly the size and weight of a miniature refrigerator (maybe somewhat less), and one bullet or good piece of shrapnel will take it out of commission.
Ok, now, you and all your buddies are either half starving or actually starving, beyond exhausted, getting maybe an average of 2 to 4 hours of sleep, and you, and the enemy, are covered in dirt, blood and grime.
Also, you and everyone else may or may not have malaria, or some other fun disease, so add shit and vomit to the mix of what everyone is covered in.
Ok! Enjoy your 2 to 8 week long camping trip from hell, in these conditions... also, kill everyone that is trying to kill you, soldier.
It's weird foot soldiers kept killing each other.
It's not weird we had 'frag' as a verb from the Vietnam war.
-
It's painful how Reddit that is...
So,
Now,
Alright,
Probably where 90% of the training for this particular problem came from.
-
I wonder how QWEN 3.0 performs, because it apparently surpasses DeepSeek
I don't have any other models pulled down, if they're open I'll try it and respond back here
-
Humans are pretty dumb sometimes lol
It's far better at the use of there, their, and they're.
The average US citizen couldn't craft a professional-sounding document if their life depended on it.
It's not better than a professional at anything. The average human is far below that bar.
-
You've missed something about the Chinese Room. The solution to the Chinese Room riddle is that it is not the person in the room but rather the room itself that is communicating with you. The fact that there's a person there is irrelevant, and they could be replaced with a speaker or computer terminal.
Put differently, it's not an indictment of LLMs that they are merely Chinese Rooms, but rather one should be impressed that the Chinese Room is so capable despite being a completely deterministic machine.
If one day we discover that the human brain works on much simpler principles than we once thought, would that make humans any less valuable? It should be deeply troubling to us that LLMs can do so much while the mathematics behind them are so simple. Arguments that because LLMs are just scaled-up autocomplete they surely can't be very good at anything are not comforting to me at all.
This.
I often see people shitting on AI as "fancy autocomplete" or joking about how they get basic things incorrect like this post but completely discount how incredibly fucking capable they are in every domain that actually matters. That's what we should be worried about... what does it matter that it doesn't "work the same" if it still accomplishes the vast majority of the same things? The fact that we can get something that even approximates logic and reasoning ability from a deterministic system is terrifying on implications alone.