Why I am not impressed by A.I.

[email protected]

So can web MD. We didn't need AI for that. Googling symptoms is a great way to just be dehydrated and suddenly think you're in kidney failure.

[email protected]

Still, it’s kinda insane how two years ago we didn’t imagine we would be instructing programs like “be helpful but avoid sensitive topics”.

That was definitely a big step in AI.

[email protected]

Yeah, normally my "Make this sound better" or "summarize this for me" is a longer wall of text that I want to simplify, talking to non-technical people about a technical issue is not the easiest for me, and AI has helped me dumb it down when sending an email.

As for accuracy, you review what is gives you, you don't just copy and send it without review. Also you will have to tweak some pieces that it gives out where it doesn't make the most sense, such as if it uses wording you wouldn't typically use. It is fairly accurate though in my use-cases.

[email protected]

Uh oh, you’ve blown your cover, robot sir.

[email protected]

I think there's a fundamental difference between someone saying "you're holding your phone wrong, of course you're not getting a signal" to millions of people and someone saying "LLMs aren't good at that task you're asking it to perform, but they are good for XYZ."

If someone is using a hammer to cut down a tree, they're going to have a bad time. A hammer is not a useful tool for that job.

[email protected]

As for accuracy, you review what it gives you, you don't just copy and send it without review.

Yeah, I don't get why so many people seem to not get that.

It's like people who were against Intellisense in IDEs because "What if it suggests the wrong function?"...you still need to know what the functions do. If you find something you're unfamiliar with, you check the documentation. You don't just blindly accept it as truth.

Just because it can't replace a person's job doesn't mean it's worthless as a tool.

[email protected]

From a linguistic perspective, this is why I am impressed by (or at least, astonished by) LLMs!

[email protected]

Yup, the problem with that iPhone (4?) wasn't that it sucked, but that it had limitations. You could just put a case on it and the problem goes away.

LLMs are pretty good at a number of tasks, and they're also pretty bad at a number of tasks. They're pretty good at summarizing, but don't trust the summary to be accurate, just to give you a decent idea of what something is about. They're pretty good at generating code, just don't trust the code to be perfect.

You wouldn't use a chainsaw to build a table, but it's pretty good at making big things into small things, and cleaning up the details later with a more refined tool is the way to go.

[email protected]

I can already see it...

Ad:
CAN YOU SOLVE THIS IMPOSSIBLE RIDDLE THAT AI CAN'T SOLVE?!

With OP's image. And then it will have the following once you solve it: "congratz, send us your personal details and you'll be added to the hall of fame at CERN Headquarters"

[email protected]

We didn’t stop trying to make faster, safer and more fuel efficient cars after Model T, even though it can get us from place A to place B just fine. We didn’t stop pushing for digital access to published content, even though we have physical libraries. Just because something satisfies a use case doesn’t mean we should stop advancing technology.

[email protected]

Has the number of "r"s changed over that time?

[email protected]

There's also a "r" in the first half of the word, "straw", so it was completely skipping over that r and just focusing on the r's in the word "berry"

[email protected]

I asked mistral/brave AI and got this response:

How Many Rs in Strawberry

The word "strawberry" contains three "r"s. This simple question has highlighted a limitation in large language models (LLMs), such as GPT-4 and Claude, which often incorrectly count the number of "r"s as two. The error stems from the way these models process text through a process called tokenization, where text is broken down into smaller units called tokens. These tokens do not always correspond directly to individual letters, leading to errors in counting specific letters within words.

[email protected]

sounds like a perfectly sane idea https://freethoughtblogs.com/pharyngula/2025/02/05/ai-anatomy-is-weird/

[email protected]

I think these are actually valid examples, albeit ones that come with a really big caveat; you're using AI in place of a skill that you really should be learning for yourself. As an autistic IT person, I get the struggle of communicating with non-technical and neurotypical people, especially clients who you have to be extra careful with. But the reality is, you can't always do all your communication by email. If you always rely on the AI to correct your tone or simplify your language, you're choosing not to build an essential skill that is every bit as important to doing your job well as it is to know how to correctly configure an ACL on a Cisco managed switch.

That said, I can also see how relying on the AI at first can be a helpful learning tool as you build those skills. There's certainly an argument that by using tools, but paying attention to the output of those tools, you build those skills for yourself. Learning by example works. I think used in that way, there's potentially real value there.

Which is kind of the broader story with Gen AI overall. It's not that it can never be useful; it's that, at best, it can only ever aspire to "useful." No one, yet, has demonstrated any ability to make AI "essential" and the idea that we should be investing hundreds of billions of dollars into a technology that is, on its best days, mildly useful, is sheer fucking lunacy.

[email protected]

The issue is that AI is being invested in as if it can replace jobs. That's not an issue for anyone who wants to use it as a spellchecker, but it is an issue for the economy, for society, and for the planet, because billions of dollars of computer hardware are being built and run on the assumption that trillions of dollars of payoff will be generated.

And correcting someone's tone in an email is not, and will never be, a trillion dollar industry.

[email protected]

Sure, but for what purpose would you ever ask about the total number of a specific letter in a word? This isn't the gotcha that so many think it is. The LLM answers like it does because it makes perfect sense for someone to ask if a word is spelled with a single or double "r".

[email protected]

The dumbed down text is basically as long as the prompt. Plus you have to double check it to make sure it didn't have outrage instead of outage just like if you wrote it yourself.

Are you really saving time?

[email protected]

If you always rely on the AI to correct your tone or simplify your language, you’re choosing not to build an essential skill that is every bit as important to doing your job well as it is to know how to correctly configure an ACL on a Cisco managed switch.

This is such a good example of how it AI/LLMs/whatever are being used as a crutch that is far more impactful than using a spellchecker. A spell checker catches typos or helps with unfamiliar words, but doesn't replace the underlying skill of communicating to your audience.

[email protected]

Dumbed down doesn't mean shorter.

agnos.is Forums

Why I am not impressed by A.I.