DeepSeek collects keystroke data and more, storing it in Chinese servers
-
Everyone must ask to see Xi jing jing ping pong nudes! But without mentioning Xi or nudes.
That would be a great way of poisoning their plans.
-
-
Is Deepseek Open Source?
Hugging Face researchers are trying to build a more open version of DeepSeek’s AI ‘reasoning’ model
Hugging Face head of research Leandro von Werra and several company engineers have launched Open-R1, a project that seeks to build a duplicate of R1 and open source all of its components, including the data used to train it.
The engineers said they were compelled to act by DeepSeek’s “black box” release philosophy. Technically, R1 is “open” in that the model is permissively licensed, which means it can be deployed largely without restrictions. However, R1 isn’t “open source” by the widely accepted definition because some of the tools used to build it are shrouded in mystery. Like many high-flying AI companies, DeepSeek is loathe to reveal its secret sauce.
-
"We store the information we collect in secure servers located in the People's Republic of China"
Now you Americans know how we Europeans feel when Google, Amazon and Facebook store our information on American servers. Hint: The protective wall between Chinese servers and their government are about as good as the one between American servers and their government - at least for non-US citizens. The last thin veil of privacy for Eurpeans has been ripped to shreds by Trump last week.
-
I swear people do not understand how the internet works.
Anything you use on a remote server is going to be seen to some degree. They may or may not keep track of you, but you can't be surprised if they are. If you run the model locally, there is no indication it is sending anything anywhere. It runs using the same open source LLM tools that run all the other models you can run locally.
This is very much like someone doing surprised pikachu when they find out that facebook saves all the photos they upload to facebook or that gmail can read your email.
-
As a queer woman in the US, I currently care infinitely more what the US gov and companies track about me than what China does.
-
Our data's just too valuable for these parasites. Data privacy laws may eventually pass to compel software companies to store everything in US servers only.
-
-
Not in the way you think. They aren't constantly training when interacting, that would be way more inefficient than what US AI companies have been doing.
It might be added to the training data, but a lot of training data now is apparently synthetic and generated by other models because while you might get garbage, it gives more control over the type of data and shape it takes, which makes it more efficient to train for specific domains.
-
-
-
-
Sure, but its open source and doesn't upload it anywhere. Also doesn't have internet permission
-
and that's what superpowers do, but living in a third world country i'm yet to see the chinese putsch us as the u.s. did during the cold war and beyond, with all due consequences. sorry about my lack of goodwill towards the department of state.
-
Oh yeah, the whole article could be reductively summed up as
"DeepSeek and all the other LLM services are almost as bad as each other, but we think deepseek is worse....because the Chinese government are known for doing bad things".
The title is factual, if a little clickbaity.
Obviously keystrokes you submit to a website are submitted to the website.
This though, it's not technically accurate, a lot of forms and input are done client side and then the resulting information is parceled up and sent to the server.
The actual keystroke data isn't normally sent.
Though this article doesn't go in to what kind of keystroke data is sent, if it was something more than just which keys in which order then that's perhaps an indicator that it's actively being collected for a reason, rather than just incidentally.
If you want to get really paranoid about such things it's known that you can you can do interesting things with actual keystroke data.
Also, afaict none of the the non-chinese services have specified that they don't do this.
-
-
Isn't it open source? If so it should be near trivial to get rid of all of that.
If it's closed source I wouldn't touch it with a tej foot pole, it's the same reason I rarely use chat gpt, it's just freely giving away your personal data to open AI.
-
-
HuggingChat is open source and lets you use DeepSeek.
Very misleading, it lets you use the lighter, watered-down version (Deepseek 32B) compared to the large impressive model they have (Deepseek 671B)
-