Anthropic指控深度求索等中国公司不当获取其数据

Wait 5 sec.

CADE METZ2026年2月24日深度求索是三家被指控利用Anthropic人工智能系统数据训练自身模型的中国初创企业之一，该过程被称为知识蒸馏。 Cfoto/Future Publishing, via Getty ImagesThe San Francisco artificial intelligence start-up Anthropic has accused three Chinese companies of improperly harvesting large amounts of data from its A.I. technologies in an effort to accelerate the development of their own systems.旧金山人工智能初创企业Anthropic指控三家中国公司通过不当手段大量抓取其人工智能技术数据，以加速自研系统开发。Anthropic said in a blog post on Monday that DeepSeek, Moonshot and MiniMax — three prominent Chinese start-ups — had used about 24,000 fraudulent accounts to generate over 16 million conversations with its Claude chatbot that could be used to teach skills to their own chatbots.Anthropic周一在一篇博客文章中表示，中国三家知名初创企业深度求索、月之暗面和稀宇科技利用约2.4万个虚假账号，与Anthropic的Claude聊天机器人产生了超过1600万次对话，这些数据可用于训练三家公司自己的聊天机器人。Using data from one A.I. system to train another — a process called distillation — is common in A.I. work. But Anthropic’s terms of service forbid anyone to surreptitiously harvest data for distillation and do not allow its technologies to be used in China.利用一个人工智能系统的数据训练另一个系统的过程被称为知识蒸馏，在人工智能领域较为常见。但Anthropic的服务条款禁止任何人以秘密方式抓取数据用于蒸馏，同时不允许其技术在中国境内使用。OpenAI, Anthropic’s primary rival, has also accused Chinese companies of lifting large amounts of data from its chatbot, ChatGPT, for similar proposes.Anthropic的主要竞争对手OpenAI也指控中国公司从其聊天机器人ChatGPT中大量提取数据用于类似目的。In a memo sent to the House Select Committee on China last week, OpenAI said DeepSeek and other Chinese start-ups were using new and “obfuscated” distillation methods as part of their “ongoing efforts to free-ride” on technologies developed by OpenAI and other U.S. companies.在上周致美国国会众议院中国问题特设委员会的一份备忘录中，OpenAI称，深度求索等中国初创企业正采用新型的“混淆式”蒸馏手段，“持续搭便车”使用OpenAI及其他美国公司开发的技术。Like OpenAI, Anthropic said the practice was a national security risk, adding that it could allow China to build A.I. technologies to create bioweapons or tools for mass surveillance. The start-up has guardrails on its technologies designed to prevent them from being used in those ways, but the guardrails can be stripped away during distillation.与OpenAI一样，Anthropic称此类行为构成国家安全风险，并表示这可能让中国得以开发用于制造生物武器或大规模监控工具的人工智能技术。Anthropic已为其技术设置安全防护机制，防止被用于此类用途，但这些防护措施在蒸馏过程中可能被剥离。Anthropic called on government officials and other A.I. companies to help prevent Chinese companies from distilling American models.Anthropic呼吁政府官员及其他人工智能企业共同阻止中国公司对美国模型进行蒸馏。“These campaigns are growing in intensity and sophistication,” Anthropic said in its post. “The window to act is narrow, and the threat extends beyond any single company or region. Addressing it will require rapid, coordinated action among industry players, policymakers and the global A.I. community.”“此类行动正变得愈发激烈且手段更趋复杂，”Anthropic在文章中表示，“可供采取行动的时间窗口正迅速缩小，且威胁已超出单一企业或地区范围。应对这一问题，需要行业参与者、政策制定者及全球人工智能界迅速采取协同行动。”DeepSeek, Moonshot and MiniMax did not immediately respond to requests for comment.深度求索、月之暗面、稀宇科技三家公司暂未回应置评请求。Anthropic published its post amid a tussle with the Defense Department over the Pentagon’s use of its technologies. The Pentagon has approved Anthropic’s technologies for use with classified tasks, but it is threatening to sever ties with the start-up because Anthropic does not want its technologies used in situations involving autonomous weapons or domestic surveillance.此文发布之际，Anthropic正与美国国防部就五角大楼对其技术的使用陷入争执。五角大楼已批准将Anthropic的技术用于涉密任务，但因该初创公司不希望其技术被应用于自主武器或国内监控领域，五角大楼威胁要终止双方的合作关系。Last year, DeepSeek spooked Silicon Valley tech companies and sent the U.S. financial markets into a tailspin after releasing A.I. technologies that matched the performance of anything else on the market.去年，深度求索推出了性能与全球市场同类产品相当的人工智能技术，令硅谷科技企业震惊，并引发美国金融市场剧烈震荡。Until then, the prevailing wisdom in Silicon Valley had been that the most powerful systems could not be built without billions of dollars in specialized computer chips. But DeepSeek said it had created its technologies using far fewer resources.在此之前，硅谷的普遍观点是：没有数十亿美元的专用计算机芯片，就无法打造出最强大的人工智能系统。但深度求索表示，打造其技术所耗费的资源远少于此。Like U.S. companies, DeepSeek, Moonshot and MiniMax build their A.I. technologies using computer code and data corralled from across the internet. A.I. companies across the globe lean heavily on a practice called open sourcing, which means they freely share the code that underpins their technologies and reuse code shared by others. They see this is as way of accelerating technological development.与美国企业一样，深度求索、月之暗面、稀宇科技均通过从互联网搜集的计算机代码和数据构建人工智能技术。全球人工智能企业都高度依赖开源模式——即免费共享支撑其技术的代码，并复用他人分享的代码。它们认为这是加速技术发展的一种途径。A.I. companies also need enormous amounts of online data to train their A.I. systems. The leading systems learn their skills by analyzing just about all of the text on the internet.人工智能企业还需要海量网络数据来训练系统。顶尖人工智能系统通过分析互联网上几乎所有文本习得相关能力。Distillation is often used to train new systems. This is often allowed by open source technologies. But if a company takes data from proprietary technology, the practice may be legally problematic.知识蒸馏常被用于训练新系统，开源技术通常允许这一做法。但如果一家公司从专有技术中提取数据，则可能涉嫌违法。Anthropic, which is now valued at $380 billion, is facing multiple lawsuits accusing it of illegally using copyrighted internet data to train its systems. In September as part of a landmark legal settlement, Anthropic agreed to pay $1.5 billion to a group of authors and publishers after a judge ruled it had illegally downloaded and stored millions of copyrighted books. It was the largest payout in the history of U.S. copyright cases.目前估值达3800亿美元的Anthropic正面临多起诉讼，它被控非法使用受版权保护的网络数据训练系统。去年9月，在一桩具有里程碑意义的法律和解中，法官裁定Anthropic非法下载并存储数以百万计受版权保护的书籍，该公司同意向一批作者和出版商支付15亿美元赔偿金。这是美国版权案史上金额最高的赔偿。OpenAI and other A.I. companies face similar suits, including a lawsuit brought by The New York Times against OpenAI and its partner Microsoft. That suit contends that millions of articles published by The Times were used to train automated chatbots that now compete with the news outlet as a source of reliable information. Both OpenAI and Microsoft deny the claims.OpenAI及其他人工智能企业也面临类似诉讼，其中包括《纽约时报》对OpenAI及其合作伙伴微软提起的诉讼。该诉讼称，《纽约时报》数以百万计的文章被用于训练自动聊天机器人，而这些机器人如今已成为一个与时报构成竞争关系的可靠信息来源。OpenAI和微软均否认相关指控。Cade Metz撰写有关人工智能、无人驾驶汽车、机器人、虚拟现实和其他技术新兴领域的新闻。翻译：纽约时报中文网点击查看本文英文版。