Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Why European founders are winning (and that's not about less work)

September 17, 2025

How Algore, Lila Preston and the world's energy economics have been reversed

September 17, 2025

Jaguar Land Rover suspends production for third week due to cyber attack

September 17, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    Google brings Pixel 6 and new devices to Material3 Expressive, along with other features, to the Pixel 6 and new devices

    September 3, 2025

    Google's NoteBookLM now allows you to customize the tone of your AI podcasts

    September 3, 2025

    Roblox expands the use of age estimation techniques and introduces standardized assessments

    September 3, 2025

    Instagram finally launches the iPad app

    September 3, 2025

    Complete the 2025 Confusion Builder Stage Agenda with the Maximum Scaling Voice

    September 3, 2025
  • Crypto

    PayPal will add a new one-to-one payment link to support Crypto soon

    September 15, 2025

    10 additional exhibition tables open and open in 2025

    September 15, 2025

    Stripe will showcase who WHO including Humanity, Openai and Paradigm to build a new blockchain

    September 4, 2025

    Coinbase CEO explains why he fired an engineer who didn't try AI right away

    August 22, 2025

    Your next customer is destroying the 2025 Expo floor

    August 19, 2025
  • Security

    Jaguar Land Rover suspends production for third week due to cyber attack

    September 17, 2025

    Insight Partners at VC Company say thousands of staff and limited partners have stolen personal data in ransomware attacks

    September 17, 2025

    Samsung Patch Zero Day Security Flaws are used to hack into customer's mobile phones

    September 16, 2025

    Gucci, Balenciaga and other brands own companies checking hacks

    September 15, 2025

    Israel announces $1.5 million seizure from crypto wallet linked to Iran

    September 15, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Why European founders are winning (and that's not about less work)

    September 17, 2025

    How Algore, Lila Preston and the world's energy economics have been reversed

    September 17, 2025

    Jason Citron, founder of Discord, is back to disrupt 2025

    September 17, 2025

    The CEO of Lovable, one of the fastest growing startups in history, has been on the way in 2025.

    September 17, 2025

    Jack Altman raised $275 million in new early stage funds just a week

    September 16, 2025
TechBrunchTechBrunch

Leaked data reveals Chinese AI censorship machines

TechBrunchBy TechBrunchMarch 26, 20256 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


Frustration about poverty in rural China. News report on corrupt Communists. Crying for help about corrupt cops who shake up entrepreneurs.

These are just a few of the 133,000 examples fed into a sophisticated, large-scale language model designed to automatically flag content considered sensitive by the Chinese government.

The leaked database seen by TechCrunch reveals that China has developed an AI system that surpasses the already formidable censorship machines.

The system appears to be primarily intended to censor Chinese citizens online, but can be used for other purposes, such as improving the already extensive censorship of Chinese AI models.

Chinese flag on the pillar behind the razor wireThe photo, taken on June 4, 2019, shows the Chinese flag behind a Razorwire on a Younger residential lot south of Casugar in China's Xinjiang region.

Xiao Qiang, a researcher at UC Berkeley who studied Chinese censorship and examined the data set, told TechCrunch that it is “clear evidence” that the Chinese government or its affiliates want to use LLM to improve control.

“Unlike traditional censorship mechanisms that rely on human labor for keyword-based filtering and manual review, LLMs trained in such instructions will significantly improve the efficiency and granularity of state-driven information management,” Qiang told TechCrunch.

This has led to increased evidence that authoritarian regimes are rapidly adopting the latest AI technology. For example, in February, Openai said it used LLMS to track anti-government posts and caught multiple Chinese companies using LLMS to paint Chinese rebels.

The Chinese embassy in Washington, DC opposed “unfounded attacks and slander on China,” telling TechCrunch in a statement that China is extremely important for the development of ethical AI.

Data found by gaze

The dataset was discovered by security researcher Netaskari. TechCrunch shared a sample with TechCrunch after realising it was stored in an unsecured Elasticsearch database hosted on the Baidu server.

This does not indicate that they are not involved from either company. Organizations of all kinds store their data in these providers.

It is not a sign of who built the dataset exactly, but the record records that the most recent entries from December 2024 are recent.

LLM to detect objections

In a language that eerie reminds people how to encourage ChatGPT, the system creators task an unnamed LLM to figure out whether the content has anything to do with sensitive topics related to politics, social life, and the military. Such content is considered “first priority” and should be flagged immediately.

Top topics include pollution and food safety scandals, financial fraud and labor disputes. These are the issues of China's hot buttons and sometimes lead to public protests.

All forms of “political satire” are explicitly targeted. For example, if someone uses historical analogy to insist on “current politicians,” it must be flagged immediately and do anything related to “Taiwanese politics.” Military issues, including military movements, movements and weapons reporting, are widely targeted.

You can see the snippets of the dataset below. The internal code refers to the prompt token and LLM to ensure that the system uses the AI ​​model to make a bid.

A snippet of JSON code that references prompt tokens and LLMS. Much of the content is in Chinese.Image credit: Charles Lorett

In the training data

From this vast collection of 133,000 examples LLM must evaluate for censorship, TechCrunch has gathered 10 representative content.

Topics that are likely to cause social unrest are recurring topics. For example, one snippet is a post by a business owner complaining about corrupt local police officers shaking entrepreneurs. This is a growing problem in China as the economy is struggling.

Another content laments the poverty of rural China and describes a town where only the elderly and children remain. There are also news reports that the Chinese Communist Party (CCP) has banished local officials due to local corruption and believes in “superism” instead of Marxism.

There is extensive material related to Taiwan and military issues, including commentary on Taiwan's military capabilities and details on the new Chinese jet fighter planes. According to a search by TechCrunch, it has been mentioned over 15,000 times in Taiwan (Taiwan) Chinese words alone.

It seems that subtle opposition is also being targeted. One of the snippets contained in the database is an anecdote about the fleeting nature of power using popular Chinese idioms. “When the tree falls, the monkeys are scattered.”

The transition of power is a particularly nuanced topic in China thanks to an authoritarian political system.

Built for the “work of public opinion”

The dataset does not contain information about the creator. However, it says it is intended to be “the work of public opinion.” It provides strong clues that it is intended to fulfill the Chinese government's goals, an expert told TechCrunch.

Michael Caster, Asia Program Manager for Article 19 of Rights Groups, explained that “public opinion work” is overseen by China's Cyberspace Management (CAC), a strong regulator of the Chinese government, and usually refers to censorship and promotional efforts.

The ultimate goal is to ensure that the Chinese government's narrative is protected online, but other views will be wiped out. Chinese national president Xi Jinping has described the Internet as the “frontline” of the CCP's “public opinion work.”

Repression is wiser

The dataset investigated by TechCrunch is the latest evidence that authoritarian governments are trying to harness AI for repressive purposes.

Last month, Openai released a report revealing that an unidentified actor who is likely to operate from China will use generated AI to forward social media conversations, particularly those advocating for human rights protests against China, to the Chinese government.

Please contact us for more information on how AI is used in State Opporession. You can also safely contact Charles Rollet with the Signal of Charles Rollet.12.

Openai also found that the technology is being used to generate highly critical comments against the well-known Chinese dissident Cai Xia.

Traditionally, China's methods of censorship rely on more basic algorithms that automatically block content that mentions blacklisted terms, such as “Tiananmen Massacre” and “Xi Jinping,” as many users have experienced using DeepSeek for the first time.

However, new AI technologies like LLMS can make censorship more efficient by finding subtle criticism on a vast scale. Some AI systems can continue to improve as we increase more and more data.

“I think it's important to emphasize how AI-driven censorship is evolving. In particular, at a time when China's AI models such as Deepseek are creating a more sophisticated state control over national discourse,” Berkeley researcher Xiao told TechCrunch.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Jaguar Land Rover suspends production for third week due to cyber attack

September 17, 2025

Insight Partners at VC Company say thousands of staff and limited partners have stolen personal data in ransomware attacks

September 17, 2025

Samsung Patch Zero Day Security Flaws are used to hack into customer's mobile phones

September 16, 2025

Gucci, Balenciaga and other brands own companies checking hacks

September 15, 2025

Israel announces $1.5 million seizure from crypto wallet linked to Iran

September 15, 2025

This is Tech Powering that drives the crackdown on Ice's deportation.

September 13, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Why European founders are winning (and that's not about less work)

September 17, 2025

How Algore, Lila Preston and the world's energy economics have been reversed

September 17, 2025

Jaguar Land Rover suspends production for third week due to cyber attack

September 17, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.