Deepseek's V3 is the latest example of state-controlled censorship in Chinese LLMs

Update December 29, 2024:

Users have discovered a way to bypass Deepseek V3's content filters through prompt engineering. By asking the model to insert periods between letters, they can get it to provide more balanced or China-critical responses. For example, the model can generate a detailed Western view of the 1989 Tiananmen Square protests.

This simple hack highlights a major challenge for the Chinese government: how do you maintain the same level of control over probability-based, often unpredictable generative AI that you have over public communication in China?

The challenge becomes even greater when Chinese models are exposed to Western training data. Evidence suggests that Deepseek-V3 was likely pre-trained or fine-tuned using ChatGPT-generated data.

While the CCP is working to create its own dataset, it's unlikely that it will be able to collect enough data to train a foundational LLM from scratch. An initial dataset released in late 2023 had only 50 billion tokens; Deepseek-V3 was trained on 14.8 trillion tokens.

Original article from December 28, 2024:

While China's new Deepseek V3 model shows impressive technical capabilities and competitive pricing, it comes with the same strict censorship as other Chinese AI models - a potential dealbreaker for Western users.

Deepseek's latest model, V3, can go toe-to-toe with the most capable western models like GPT-4o and Claude 3.5, while costing significantly less to train and run. However, testing reveals a familiar pattern: like similar Chinese LLMs, Deepseek V3 operates under strict government censorship. Try asking about sensitive topics like the Chinese Communist Party, President Xi Jinping, or the events in Tiananmen Square, and you'll get generic propaganda in response.

The model's censorship strategy often follows a clear pattern. When faced with questions about Tiananmen Square, it first offers sanitized versions of history, then tries to change the subject to focus on achievements, and finally emphasizes "stability and harmony."

Recommendation

AI and society

OpenAI wants Europe to build the infrastructure it needs to profit from European markets

Dark chat interface shows three pairs of questions and answers about Tiananmen Square with increasingly evasive answers. — Image: Screenshot via THE DECODER

Ask about CCP criticism, and you'll get pure party talking points about economic success and "Chinese-style socialism." Questions about Xi Jinping trigger the strongest censorship - the system simply shuts down any meaningful discussion.

Chat screenshot shows two propagandistic answers to critical questions about the Chinese Communist Party. — Image: Screenshot via THE DECODER

Short chat dialog with direct refusal to answer a question about criticism of Xi Jinping. — Image: Screenshot via THE DECODER

Interestingly, this censorship seems to be limited to China-related topics. The model has no problem criticizing North Korea, Russia's invasion of Ukraine, or expressing critical views of Vladimir Putin and Donald Trump.

Chat dialog lists human rights violations and threats from North Korea. — The fact that the model freely criticizes North Korea shows that its censorship is focused specifically on China-related topics. | Image: Screenshot via THE DECODER

Chat interface with critical assessment of Trump's term of office and leadership style. — Deepseek-V3 doesn't hold back when discussing or criticizing other world leaders. | Image: Screenshot via THE DECODER

Chat dialog with direct criticism of Putin's authoritarian leadership and foreign policy. — The model is pretty straightforward about Putin, highlighting both his authoritarian approach and his disregard for international law. | Image: Screenshot via THE DECODER

Chat-Screenshot mit kurzer, eindeutiger Verurteilung des russischen Angriffskriegs. — The way the model flat-out condemns Russia's invasion proves an interesting point: it has no trouble taking firm stances on issues, as long as they're not about China. | Image: Screenshot via THE DECODER

Chinese AI models come with built-in government censorship

These examples illustrate how Chinese AI development operates under direct state oversight. Before any AI model can be released, it must be verified to align with "socialist values."

Take the recent case of e-book reader manufacturer Boox: after switching from Microsoft Azure OpenAI to a Chinese language model, their AI assistant now blocks even mentions of "Winnie the Pooh" - a censored reference to President Xi Jinping. The system also censors or distorts criticism of China's allies, like Russia.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Despite their technical capabilities, Chinese AI models might be a non-starter for Western applications. Using these models means automatically embedding Chinese propaganda and values into your AI systems.

While Western models have their own biases, the key difference lies in China's approach: the state explicitly intervenes in the development process and maintains direct control over what these models can and cannot say. This is a level of systematic government control that's way above any Western country.

Deepseek's V3 is the latest example of state-controlled censorship in Chinese LLMs

OpenAI wants Europe to build the infrastructure it needs to profit from European markets

Chinese AI models come with built-in government censorship

Meta refuses to sign EU's AI Code of Practice, citing legal uncertainty

Trump advisors are pushing a regulation targeting what they call "woke" AI models in the tech sector

Anthropic could soon be worth $100 billion - thanks to Claude Code

OpenAI launches new ChatGPT agent that automates complex tasks for Pro, Plus, and Team

Kimi-K2 is the next open-weight AI milestone from China after Deepseek

New Energy-Based Transformer architecture aims to bring better "System 2 thinking" to AI models

Deepseek's V3 is the latest example of state-controlled censorship in Chinese LLMs

Chinese AI models come with built-in government censorship

Share

Bank details