WhatsApp Deploys AI, for Those Incapable of Comprehending Straightforward Messages From Their Friends and Family
-
Ahh, intellect of the average American on display. So... -óóóh, is that a Donut?
Why are you making fun of topological toroidal surfaces mr smarty pants
-
Ahh, intellect of the average American on display. So... -óóóh, is that a Donut?
wrote on last edited by [email protected]People in the US don't use WhatsApp, for the most part.
-
An LLM could be trained on the way a specific person communicates over time
Are there any companies doing anything similar to this? From what I've seen companies avoid this stuff like the plague, their LLMs are always frozen with no custom training. Training takes a lot of compute, but also has huge risks of the LLM going off the rails and saying bad things that could even get the company into trouble or get bad publicity. Also the disk space per customer, and loading times of individual models.
The only hope for your use case is that the LLM has a large enough context window to look at previous examples from your chat and use those for each request, but that isn't the same thing as training.
There are plenty of people and organisations doing stuff like this, there are plenty of examples on HuggingFace, though typically it's to get an LLM to communicate in a specific manner (e.g. this one trained on Lovecraft's works). People drastically overestimate the amount of compute time/resources training and running an LLM takes; do you think Microsoft could force their AI on every single Windows computer if it was as challenging as you imply? Also, you do not need to start from scratch. Get a model that's already robust and developed and fine tune it with additional training data, or for a hack job, just merge a LoRA into the base model.
The intent, by the way, isn't for the LLM to respond for you, it's just to interpret a message and offer suggestions on what a message means or rewrite it to be clear (while still displaying the original).
-
the same thing a person could do
asking for clarification seems like a reasonable thing to do in a conversation.
A tool is not about to do that because it would feel weird and creepy for it to just take over the conversation.
wrote on last edited by [email protected]The intent isn’t for the LLM to respond for you, it’s just to interpret a message and offer suggestions on what a message means or rewrite it to be clear (while still displaying the original).
-
There are plenty of people and organisations doing stuff like this, there are plenty of examples on HuggingFace, though typically it's to get an LLM to communicate in a specific manner (e.g. this one trained on Lovecraft's works). People drastically overestimate the amount of compute time/resources training and running an LLM takes; do you think Microsoft could force their AI on every single Windows computer if it was as challenging as you imply? Also, you do not need to start from scratch. Get a model that's already robust and developed and fine tune it with additional training data, or for a hack job, just merge a LoRA into the base model.
The intent, by the way, isn't for the LLM to respond for you, it's just to interpret a message and offer suggestions on what a message means or rewrite it to be clear (while still displaying the original).
wrote on last edited by [email protected]Huggingface isn't customer-facing, it's developer-facing. Letting customers retrain your LLM sounds like a bad idea for a company like Meta or Microsoft, it's too risky and could make them look bad. Retraining an LLM for Lovecraft is a totally different scale than retraining an LLM for hundreds of millions of individual customers.
do you think Microsoft could force their AI on every single Windows computer if it was as challenging as you imply?
It's a cloned image, not unique per computer
-
An LLM could be trained on the way a specific person communicates over time
Are there any companies doing anything similar to this? From what I've seen companies avoid this stuff like the plague, their LLMs are always frozen with no custom training. Training takes a lot of compute, but also has huge risks of the LLM going off the rails and saying bad things that could even get the company into trouble or get bad publicity. Also the disk space per customer, and loading times of individual models.
The only hope for your use case is that the LLM has a large enough context window to look at previous examples from your chat and use those for each request, but that isn't the same thing as training.
My friend works for a startup that does exactly that - trains AIs on conversations and responses from a specific person (some business higher-ups) for purposes of "coaching" and "mentoring". I don't know how well it works.
-
My friend works for a startup that does exactly that - trains AIs on conversations and responses from a specific person (some business higher-ups) for purposes of "coaching" and "mentoring". I don't know how well it works.
it probably works pretty well when it's tested and verified instead of unsupervised
and for a small pool of people instead of hundreds of millions of users
-
Huggingface isn't customer-facing, it's developer-facing. Letting customers retrain your LLM sounds like a bad idea for a company like Meta or Microsoft, it's too risky and could make them look bad. Retraining an LLM for Lovecraft is a totally different scale than retraining an LLM for hundreds of millions of individual customers.
do you think Microsoft could force their AI on every single Windows computer if it was as challenging as you imply?
It's a cloned image, not unique per computer
wrote on last edited by [email protected]Hugging Face being developer-facing is completely irrelevant considering the question you asked was whether I was aware of any companies doing anything like this.
Your concern that companies like Meta and Microsoft are too scared to let users retrain their models is also irrelevant considering both of these companies have already released models so that anyone can retrain or checkpoint merge them i.e. Llama by Meta and Phi by Microsoft.
It’s a cloned image, not unique per computer
Microsoft's Copilot works off a base model, yes, but just an example that LLMs aren't as CPU intensive as made out to be. Further automated finetuning isn't out of the realm of possibility either and I fully expect Microsoft to do this in the future.
-
Hugging Face being developer-facing is completely irrelevant considering the question you asked was whether I was aware of any companies doing anything like this.
Your concern that companies like Meta and Microsoft are too scared to let users retrain their models is also irrelevant considering both of these companies have already released models so that anyone can retrain or checkpoint merge them i.e. Llama by Meta and Phi by Microsoft.
It’s a cloned image, not unique per computer
Microsoft's Copilot works off a base model, yes, but just an example that LLMs aren't as CPU intensive as made out to be. Further automated finetuning isn't out of the realm of possibility either and I fully expect Microsoft to do this in the future.
wrote on last edited by [email protected]Your concern that companies like Meta and Microsoft are too scared to let users retrain their models is also irrelevant considering both of these companies have already released models so that anyone can retrain or checkpoint merge them i.e. Llama by Meta and Phi by Microsoft.
they release them to developers, not automatically retrain them unsupervised in their actual products and put them in the faces of customers to share screenshots of the AI's failures on social media and give it a bad name
-
People in the US don't use WhatsApp, for the most part.
No, but they develop it
-
This post did not contain any content.
I get it, big stupid money hungry tech wants to put itself between you and every other person on earth for the almighty dollar.
How about we use tech to fix problems not create new ones!?
-
People in the US don't use WhatsApp, for the most part.
Interesting! What do people in the US generally use?
-
Your concern that companies like Meta and Microsoft are too scared to let users retrain their models is also irrelevant considering both of these companies have already released models so that anyone can retrain or checkpoint merge them i.e. Llama by Meta and Phi by Microsoft.
they release them to developers, not automatically retrain them unsupervised in their actual products and put them in the faces of customers to share screenshots of the AI's failures on social media and give it a bad name
They release them under permissive licences so that anyone can do that.
-
They release them under permissive licences so that anyone can do that.
wrote on last edited by [email protected]yea someone could take the model and make their own product with their own PR and public perception
that's very different from directly spoonfeeding it as a product to the general public consumers inside of WhatsApp or something
it's like saying someone can mod Skyrim to put nude characters in it, that's very different from Bethesda selling the game with nude characters
-
To be fair, my father tends to make messages quite incomprehensible by adding irrelevant information all over the place. Sometimes going on for multiple screens while it could easily have been a 2-3 sentence message.
Sadly I think AI would even be worse at picking up what information is important from that. But I understand why people want it.
As for very active groupchats, I am not gonna read +25 messages a day, but being able to glance the gist of it would be awesome.
The exact point at which the gist of it can be manipulated, leaving out context and nudging you toward a different opinion than you might have formed if you'd read the whole thread.
-
This post did not contain any content.
They're interjecting themselves between us and our contacts. They'll have the power to "summarize", which is also the power to subtly re-interpret meaning. The AI will be given a broad goal, and it will chip away at that goal bit by bit, across millions of summaries to mold public opinion.
-
People in the US don't use WhatsApp, for the most part.
You're right.
They don't use WhatsApp, they use Facebook Messenger.
-
Trying not to be too douchey here, but ironically, your message is actually a very good example of where this technology could be beneficial.
IT is ACTUALLY not EASY to read a MESSAGE when THE CASE randomly SWITCHES back AND forth.
im going to ask chatgpt to give you a swirlie
-
The exact point at which the gist of it can be manipulated, leaving out context and nudging you toward a different opinion than you might have formed if you'd read the whole thread.
Friend, I think you need to reconsider your world perspective a bit. Not everyone is out to get you all the time.
-
Interesting! What do people in the US generally use?
Sms/iMessage typically. I started using WhatsApp because my wife's from south America, where they commonly use it.