Does DeepSeek Censor Its AI Answers? On These Sensitive Topics, Yes.

[email protected]

R1-32B hasn't been added to Ollama yet, the model I use is Deepseek v2, but as they're both licensed under MIT I'd assume they behave similarly. I haven't tried out OpenAI o1 or Claude yet as I'm only running models locally.

[email protected]

I haven't looked into running any if these models myself so I'm not too informed, but isn't the censorship highly dependent on the training data? I assume they didn't release theirs.

[email protected]

Video of censored answers show R1 beginning to give a valid answer, then deleting the answer and saying the question is outside its scope. That suggests the censorship isn’t in the training data but in some post-processing filter.

And even if the censorship were at the training level, the whole buzz about R1 is how cheap it is to train.

[email protected]

Hmm I’m using 32b from ollama, both on windows and Mac.

[email protected]

Interesting. I wonder if model distillation affected censoring in R1.

[email protected]

I thought this was a big deal because it's Open Source -- is it not possible to see what is causing these blocks and censorings?

[email protected]

I'm still hazy on how open source it really is. Even if you ask it, it will tell you v3 is, and I quote "not open source". There is a github repository so there is some code available, but I get the sense that open-source is being used as a bit of a teaser here and that for-profit licensing is likely where this is headed.
Even if the code was fully available, I suspect it could take weeks to find the censorship bits.

[email protected]

Ah, I just found it.
Alpaca is just being weird again.
(I'm presently typing this while attempting to look over the head of my cat)

[email protected]

beginning to give a valid answer, then deleting the answer

If it IS open source someone could undo this, but I assume its more difficult than a single on/off button. That along with it being selfhostable, it might be pretty good.

[email protected]

Deepseek-R1 unable to answer prompt "Tianamen Square"
But it's still censored anyway

agnos.is Forums

Does DeepSeek Censor Its AI Answers? On These Sensitive Topics, Yes.