Does DeepSeek Censor Its AI Answers? On These Sensitive Topics, Yes.
-
[email protected]replied to [email protected] last edited by
R1-32B hasn't been added to Ollama yet, the model I use is Deepseek v2, but as they're both licensed under MIT I'd assume they behave similarly. I haven't tried out OpenAI o1 or Claude yet as I'm only running models locally.
-
[email protected]replied to [email protected] last edited by
I haven't looked into running any if these models myself so I'm not too informed, but isn't the censorship highly dependent on the training data? I assume they didn't release theirs.
-
[email protected]replied to [email protected] last edited by
Video of censored answers show R1 beginning to give a valid answer, then deleting the answer and saying the question is outside its scope. That suggests the censorship isn’t in the training data but in some post-processing filter.
And even if the censorship were at the training level, the whole buzz about R1 is how cheap it is to train.
-
[email protected]replied to [email protected] last edited by
Hmm I’m using 32b from ollama, both on windows and Mac.
-
[email protected]replied to [email protected] last edited by
Interesting. I wonder if model distillation affected censoring in R1.
-
[email protected]replied to [email protected] last edited by
I thought this was a big deal because it's Open Source -- is it not possible to see what is causing these blocks and censorings?
-
[email protected]replied to [email protected] last edited by
I'm still hazy on how open source it really is. Even if you ask it, it will tell you v3 is, and I quote "not open source". There is a github repository so there is some code available, but I get the sense that open-source is being used as a bit of a teaser here and that for-profit licensing is likely where this is headed.
Even if the code was fully available, I suspect it could take weeks to find the censorship bits. -
[email protected]replied to [email protected] last edited by
Ah, I just found it.
Alpaca is just being weird again.
(I'm presently typing this while attempting to look over the head of my cat) -
[email protected]replied to [email protected] last edited by
beginning to give a valid answer, then deleting the answer
If it IS open source someone could undo this, but I assume its more difficult than a single on/off button. That along with it being selfhostable, it might be pretty good.
-
[email protected]replied to [email protected] last edited by
But it's still censored anyway