Does DeepSeek Censor Its AI Answers? On These Sensitive Topics, Yes.

lorips@lemmy.world · replied to Guest on 28 Jan 2025, 08:22

R1-32B hasn't been added to Ollama yet; the model I use is DeepSeek v2, but as they're both licensed under MIT I'd assume they behave similarly. I haven't tried out OpenAI o1 or Claude yet as I'm only running models locally.

bytejunk@lemmy.world · 28 Jan 2025, 13:49

I haven't looked into running any of these models myself so I'm not too informed, but isn't the censorship highly dependent on the training data? I assume they didn't release theirs.

aboubenadhem@lemmy.world · replied to Guest on 28 Jan 2025, 13:49

Videos of censored answers show R1 beginning to give a valid answer, then deleting the answer and saying the question is outside its scope. That suggests the censorship isn’t in the training data but in some post-processing filter.

And even if the censorship were at the training level, the whole buzz about R1 is how cheap it is to train.
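To make the "post-processing filter" idea concrete, here is a minimal Python sketch of how an answer could start streaming and then be retracted. This is only an illustration of the general pattern, not DeepSeek's actual implementation, which isn't public as far as this thread knows. The names `BLOCKED_TERMS`, `REFUSAL`, `stream_with_post_filter` and `render` are all made up for the example, and the simple substring check is an assumption standing in for whatever rule a real filter might use.

```python
from typing import Iterable, Iterator, Tuple

# Hypothetical blocklist and refusal text, invented for this sketch.
BLOCKED_TERMS = {"forbidden topic"}
REFUSAL = "Sorry, that's beyond my current scope."


def stream_with_post_filter(tokens: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield ("append", token) events while the accumulated text looks fine.

    As soon as the accumulated text contains a blocked term, yield one
    ("replace", REFUSAL) event and stop. The model itself is untouched;
    the check runs on its output after generation.
    """
    shown = ""
    for token in tokens:
        shown += token
        if any(term in shown.lower() for term in BLOCKED_TERMS):
            yield ("replace", REFUSAL)
            return
        yield ("append", token)


def render(events: Iterable[Tuple[str, str]]) -> str:
    """Apply the events the way a chat UI might: append, or wipe and replace."""
    text = ""
    for kind, payload in events:
        text = payload if kind == "replace" else text + payload
    return text


if __name__ == "__main__":
    # Stand-in for tokens coming from a model.
    model_output = ["The ", "forbidden ", "topic ", "is ", "..."]
    print(render(stream_with_post_filter(model_output)))
```

If the filter really does sit outside the model like this, running the raw weights locally would skip it, whereas censorship baked in during training would still show up.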

tyler@programming.dev

Hmm, I’m using 32b from Ollama, on both Windows and Mac.

banshee@lemmy.world

Interesting. I wonder if model distillation affected censoring in R1.

psmgx@lemmy.world

I thought this was a big deal because it's Open Source -- is it not possible to see what is causing these blocks and censorings?

imgonnatrythis@sh.itjust.works

I'm still hazy on how open source it really is. Even if you ask it, it will tell you v3 is, and I quote, "not open source". There is a GitHub repository so there is some code available, but I get the sense that open-source is being used as a bit of a teaser here and that for-profit licensing is likely where this is headed.
Even if the code was fully available, I suspect it could take weeks to find the censorship bits.

lorips@lemmy.world

Ah, I just found it.
Alpaca is just being weird again.
(I'm presently typing this while attempting to look over the head of my cat)

triflingtoad@sh.itjust.works

"beginning to give a valid answer, then deleting the answer"

If it IS open source someone could undo this, but I assume it's more difficult than a single on/off button. That, along with it being self-hostable, means it might be pretty good.

lorips@lemmy.world

But it's still censored anyway

agnos.is Forums
