Will LLMs make finding answers online a thing of the past?
-
As LLMs become the go-to for quick answers, fewer people are posting questions on forums or social media. This shift could make online searches less fruitful in the future, with fewer discussions and solutions available publicly. Imagine troubleshooting a tech issue and finding nothing online because everyone else asked an LLM instead. You do the same, but the LLM only knows the manual, offering no further help. Stuck, you contact tech support, wait weeks for a reply, and the cycle continues—no new training data for LLMs or new pages for search engines to index. Could this lead to a future where both search results and LLMs are less effective?
To an extent, yes, but not completely.
-
As LLMs become the go-to for quick answers, fewer people are posting questions on forums or social media. This shift could make online searches less fruitful in the future, with fewer discussions and solutions available publicly. Imagine troubleshooting a tech issue and finding nothing online because everyone else asked an LLM instead. You do the same, but the LLM only knows the manual, offering no further help. Stuck, you contact tech support, wait weeks for a reply, and the cycle continues—no new training data for LLMs or new pages for search engines to index. Could this lead to a future where both search results and LLMs are less effective?
Maybe in the sense that the Internet may become so inundated with AI garbage that the only way to get factual information is by actually reading a book or finding a real person to ask, face to face.
-
Maybe in the sense that the Internet may become so inundated with AI garbage that the only way to get factual information is by actually reading a book or finding a real person to ask, face to face.
You know how steel made before nuclear weapons testing (low-background steel) is prized? I wonder if that's going to happen with data from before 2022 as well now. Lol.
-
You know how steel made before nuclear weapons testing (low-background steel) is prized? I wonder if that's going to happen with data from before 2022 as well now. Lol.
There might be a way to mitigate that damage. You could categorize the training data by the source. If it's verified to be written by a human, you could give it a bigger weight. If not, it's probably contaminated by AI, so give it a smaller weight. Humans still exist, so it's still possible to obtain clean data. Quantity is still a problem, since these models are really thirsty for data.
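If I had to sketch what that weighting could look like in code (the flag name and the numbers are made up, purely to illustrate the idea), it might be something like this:

```python
# Hypothetical provenance-based weighting: verified human-written documents get
# a larger training weight than unverified (possibly AI-contaminated) ones.
VERIFIED_HUMAN_WEIGHT = 1.0
UNVERIFIED_WEIGHT = 0.2  # arbitrary value, would need tuning

def training_weight(doc: dict) -> float:
    # 'source_verified' is a made-up flag assumed to be set during curation
    return VERIFIED_HUMAN_WEIGHT if doc.get("source_verified") else UNVERIFIED_WEIGHT

corpus = [
    {"text": "forum post from 2019", "source_verified": True},
    {"text": "scraped page of unknown origin", "source_verified": False},
]
print([training_weight(d) for d in corpus])  # [1.0, 0.2]; the loss for each example would be scaled by this
```

How you actually verify "written by a human" is the hard part, of course; the code only shows what you'd do once you had that label.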
-
There might be a way to mitigate that damage. You could categorize the training data by the source. If it's verified to be written by a human, you could give it a bigger weight. If not, it's probably contaminated by AI, so give it a smaller weight. Humans still exist, so it's still possible to obtain clean data. Quantity is still a problem, since these models are really thirsty for data.
LLMs can't distinguish truth from falsehoods; they only produce output that resembles other output. So they can't tell the difference between human and AI input.
-
Trouble is that 'quick answers' mean the LLM took no time to do a thorough search. Could be right or wrong - just by luck.
When you need the details to be verified by trustworthy sources, it's still do-it-yourself time. If you -don't- verify, and repeat a wrong answer to someone else, -you- are untrustworthy.
A couple months back I asked GPT a math question (about primes) and it gave me the -completely wrong- answer ... 'none' ... answered as if it had no doubt. It was -so- wrong it hadn't even tried. I pointed it to the right answer ('an infinite number') and to the proof. It then verified that.
A couple of days ago, I asked it the same question ... and it was completely wrong again. It hadn't learned a thing. After some conversation, it told me it couldn't learn. I'd already figured that out.
Trouble is that 'quick answers' mean the LLM took no time to do a thorough search.
LLMs don't "search". They essentially provide weighted parrot-answers based on what they've seen elsewhere.
If you tell an LLM that the sky is red, they will tell you the sky is red. If you tell them your eyes are the colour of the sky, they will repeat that your eyes are red. LLMs aren't capable of checking if something is true.
They're just really fast parrots with a big vocabulary. And every time they squawk, it burns a tree.
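A toy sketch of that "weighted parrot" idea (nothing like how a real transformer is implemented, but it shows why truth never enters the picture):

```python
import random
from collections import defaultdict, Counter

# Toy "weighted parrot": count which word follows which in the text it has seen,
# then emit whatever is statistically likely. No notion of true or false.
corpus = "the sky is blue . the sky is blue . the sky is red ."
words = corpus.split()

counts = defaultdict(Counter)
for prev, nxt in zip(words, words[1:]):
    counts[prev][nxt] += 1

def parrot(word, steps=6):
    out = [word]
    for _ in range(steps):
        followers = counts.get(out[-1])
        if not followers:
            break
        choices, weights = zip(*followers.items())
        out.append(random.choices(choices, weights=weights)[0])
    return " ".join(out)

print(parrot("the"))  # mostly "the sky is blue ...", sometimes red, because that's what it was fed
```

Tell it the sky is red often enough and "red" just becomes the likelier squawk.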
-
And if the LLM doesn't have the actual answer, it blathers on like a redditor, and if someone can't get an accurate answer, they'll start asking on forums and social media.
LLMs are completely incapable of giving a correct answer, except by random chance.
They're extremely good at giving what looks like a correct answer, and convincing their users that it's correct, though.
When LLMs are the only option, people won't go elsewhere to look for answers, regardless of how nonsensical or incorrect they are, because the answers will look correct, and we'll have no way of checking them for correctness.
People will get hurt, of course. And die. (But we won't hear about it, because the LLMs won't talk about it.) And civilization will enter a truly dark age of mindless ignorance.
But that doesn't matter, because the company will have already got their money, and the line will go up.
They're extremely good at giving what looks like a correct answer,
Exactly. Sometimes the thing that looks right IS right, and sometimes it's not. The stochastic parrot doesn't know the difference.
-
To be fair, given the current state of search engines, LLMs might not be the worst idea.
I'm looking for the 7800x3d, not 3D shooters, not the 1234x3d, no, not the pentium 4, not the 4700rtx. It takes more and more effort to search for anything, and the first pages show every piece of crap I'm not interested in.
Google made the huge mistake of placing the CEO of ads in charge of search.
And now it fucking sucks.
-
Trouble is that 'quick answers' mean the LLM took no time to do a thorough search. Could be right or wrong - just by luck.
When you need the details to be verified by trustworthy sources, it's still do-it-yourself time. If you -don't- verify, and repeat a wrong answer to someone else, -you- are untrustworthy.
A couple months back I asked GPT a math question (about primes) and it gave me the -completely wrong- answer ... 'none' ... answered as if it had no doubt. It was -so- wrong it hadn't even tried. I pointed it to the right answer ('an infinite number') and to the proof. It then verified that.
A couple of days ago, I asked it the same question ... and it was completely wrong again. It hadn't learned a thing. After some conversation, it told me it couldn't learn. I'd already figured that out.
Math problems are a unique challenge for LLMs, often resulting in bizarre mistakes. While an LLM can look up formulas and constants, it usually struggles with applying them correctly. It's a bit like counting the hours in a week: it says it calculates 7*24, which looks good, but somehow the answer is still 10 🤯. Like, WTF? How did that happen? In reality, that specific problem might not be that hard, but the same phenomenon shows up in more complicated problems too. I could give some other examples, but this post is long enough as it is.
For reliable results in math-related queries, I find it best to ask the LLM for formulas and values, then perform the calculations myself. The LLM can typically look up information reasonably accurately but will mess up the application. Just use the right tool for the right job, and you'll be ok.
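For the hours-in-a-week example, the do-it-yourself step is basically a one-liner:

```python
# Let the LLM supply the formula, then do the trivial arithmetic yourself:
days_per_week = 7
hours_per_day = 24
print(days_per_week * hours_per_day)  # 168, not 10
```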
-
Trouble is that 'quick answers' mean the LLM took no time to do a thorough search. Could be right or wrong - just by luck.
When you need the details to be verified by trustworthy sources, it's still do-it-yourself time. If you -don't- verify, and repeat a wrong answer to someone else, -you- are untrustworthy.
A couple months back I asked GPT a math question (about primes) and it gave me the -completely wrong- answer ... 'none' ... answered as if it had no doubt. It was -so- wrong it hadn't even tried. I pointed it to the right answer ('an infinite number') and to the proof. It then verified that.
A couple of days ago, I asked it the same question ... and it was completely wrong again. It hadn't learned a thing. After some conversation, it told me it couldn't learn. I'd already figured that out.
Is your abuse of the ellipsis and dashes supposed to be ironic? Isn't that an LLM tell?
I'm not even sure what the ('phrase') construct is meant to imply, but it's wild. Your abuse of punctuation in general feels like a machine trying to convince us it's human, or a machine transcribing a human's stream of consciousness.
-
LLMs can't distinguish truth from falsehoods; they only produce output that resembles other output. So they can't tell the difference between human and AI input.
That's a problem when you want to automate the curation and annotation process. So far, you could have just dumped all of your data into the model, but that might not be an option in the future, as more and more of the training data was generated by other LLMs.
When that approach stops working, AI companies need to figure out a way to get high quality data, and that's when it becomes useful to have data that was verified to be written by actual people. This way, an AI doesn't even need to be able to curate the data, as humans have done that to some extent. You could just prioritize the small amount of verified data while still using the vast amounts of unverified data for training.
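One way that prioritization might look, sketched as a sampling mixture (the pool sizes and the 50/50 split are purely illustrative):

```python
import random

# Illustrative only: build each training batch mostly from the small verified pool,
# topping it up from the huge unverified pool so its volume isn't lost entirely.
verified = ["human-written doc"] * 100          # small, trusted
unverified = ["unknown-origin doc"] * 100_000   # huge, possibly AI-contaminated

def sample_batch(size=8, verified_fraction=0.5):
    n_verified = int(size * verified_fraction)
    batch = random.choices(verified, k=n_verified)
    batch += random.choices(unverified, k=size - n_verified)
    random.shuffle(batch)
    return batch

print(sample_batch())
```

The verified pool ends up massively over-represented per example, which is the whole point: the trusted data anchors the model while the unverified bulk still supplies quantity.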
-
I said "cut a forum by 90%", not "a forum happens to be smaller than another". Ask ChatGPT if you have trouble with words.
-
As LLMs become the go-to for quick answers, fewer people are posting questions on forums or social media. This shift could make online searches less fruitful in the future, with fewer discussions and solutions available publicly. Imagine troubleshooting a tech issue and finding nothing online because everyone else asked an LLM instead. You do the same, but the LLM only knows the manual, offering no further help. Stuck, you contact tech support, wait weeks for a reply, and the cycle continues—no new training data for LLMs or new pages for search engines to index. Could this lead to a future where both search results and LLMs are less effective?
My 70-year-old boss and his 50-year-old business partner just today generated a set of instructions for scanning to a thumb drive on a specific model of printer.
They obviously missed the "AI Generated" tag on the Google search and couldn't figure out why the instructions cited the exact model but told them to press buttons and navigate menus that didn't exist.
These are average people, and they didn't realize that they were even using AI, much less how unreliable it can be.
I think there's going to be a place for forums to discuss niche problems for as long as AI just means an advanced LLM and not actual intelligence.
-
My 70-year-old boss and his 50-year-old business partner just today generated a set of instructions for scanning to a thumb drive on a specific model of printer.
They obviously missed the "AI Generated" tag on the Google search and couldn't figure out why the instructions cited the exact model but told them to press buttons and navigate menus that didn't exist.
These are average people, and they didn't realize that they were even using AI, much less how unreliable it can be.
I think there's going to be a place for forums to discuss niche problems for as long as AI just means an advanced LLM and not actual intelligence.
When diagnosing software-related tech problems with proper instructions, there's always the risk of finding outdated tips. You may be advised to press buttons that no longer exist in the version you're currently using.
With hardware though, that’s unlikely to happen, as long as the model numbers match. However, when relying on AI generated instructions, anything is possible.
-
When diagnosing software-related tech problems with proper instructions, there's always the risk of finding outdated tips. You may be advised to press buttons that no longer exist in the version you're currently using.
With hardware though, that’s unlikely to happen, as long as the model numbers match. However, when relying on AI generated instructions, anything is possible.
It's not so simple with hardware either. Although less frequent, hardware also has variants, the nuances of which are easily missed by LLMs.