加载中…
个人资料
  • 博客等级:
  • 博客积分:
  • 博客访问:
  • 关注人气:
  • 获赠金笔:0支
  • 赠出金笔:0支
  • 荣誉徽章:
正文 字体大小:

中国怎么还能追上来?

(2025-01-16 05:47:05)
尽管美国政府为了遏制中国高科技产业的发展,禁止美国企业出口最尖端的芯片到中国,美国《时代》周刊却发现,中国的人工智能领域还是在以极快的速度追赶着美国——就连美国AI领域的精英,都对此感到“震惊”。

可究竟是什么,让美国在芯片上的对中国的卡脖子,事倍功半呢?

美国《时代》周刊在其分析中首先指出,支撑AI领域发展的核心三件套,分别是数据、算法以及算力。

然而,除了依托于芯片这种实体物的算力,可以被禁止过境,数据和算法都是跨越国界且难以被限制的。其中,数据是可以从互联网上公开取得的。算法的创新则通常是以学术论文这样的形式在全世界分享传播——即便被限制,中国也有丰富的AI人才储备正在不断转化为顶尖的AI研究人员,且数量多于美国。

其次,《时代》指出,即便是在芯片的限制上,中国也有办法应对。除了在美国相关禁令出台前,就已经提前囤积好了足够多的芯片,中国的企业还可以通过租用云端的算力服务器来跑AI。同时,美国虽然此前限制了最尖端的芯片,但稍差一些的芯片当时仍然可以出口到中国。

《时代》周刊在这里还特别提到,目前中国国产的一些AI大模型,已经通过软件和算法的优化,把这些差一些芯片的性能发挥到了极致,结果反而实现了在表现能力上反超有尖端芯片支持的美国大模型。

新上市的芯片往往还需要一段时间的适应磨合期,而一些被广泛使用的旧芯片的性能则已经得到了最大释放。

因此,《时代》介绍说,诸如美国谷歌公司的前CEO埃里克·施密特(Eric Schmidt)这样在美国AI领域有举足轻重地位的人物惊讶地发现,原本他们以为美国在AI领域轻松地保持2-3年的领先优势,但现在中国还是在迅速追了上来。

“我被震惊了,我以为我们在芯片上的限制可以把他们拦下来”,施密特在去年11月一次讲话中这样说道。

不过,有美国的专业人士对《时代》表示,随着美国对芯片的限制在宽度和深度上不断地加码,美国还是会逐步在算力和规模上拉开与中国的差距。但也有专业人士认为,鉴于中国在限制下仍然能不断进步,美国与其限制,不如通过对话与中国一同寻找让彼此都放心的解决办法。

How China Is Advancing in AI Despite U.S. Chip Restrictions

By Harry Booth

In 2017, Beijing unveiled an ambitious roadmap to dominate artificial intelligence development, aiming to secure global leadership by 2030. By 2020, the plan called for “iconic advances” in AI to demonstrate its progress. Then in late 2022, OpenAI’s release of ChatGPT took the world by surprise—and caught China flat-footed.

At the time, leading Chinese technology companies were still reeling from an 18-month government crackdown that shaved around $1 trillion off China's tech sector. It was almost a year before a handful of Chinese AI chatbots received government approval for public release. Some questioned whether China’s stance on censorship might hobble the country’s AI ambitions. Meanwhile, the Biden administration’s export controls, unveiled just a month before ChatGPT’s debut, aimed to cut China off from the advanced semiconductors essential for training large-scale AI models. Without cutting-edge chips, Beijing’s goal of AI supremacy by 2030 appeared increasingly out of reach.

But fast forward to today, and a flurry of impressive Chinese releases suggests the U.S.’s AI lead has shrunk. In November, Alibaba and Chinese AI developer DeepSeek released reasoning models that, by some measures, rival OpenAI’s o1-preview. The same month, Chinese videogame juggernaut Tencent unveiled Hunyuan-Large, an open-source model that the company’s testing found outperformed top open-source models developed in the U.S. across several benchmarks. Then, in the final days of 2024, DeepSeek released DeepSeek-v3, which now ranks highest among open-source AI on a popular online leaderboard and holds its own against top performing closed systems from OpenAI and Anthropic.

Before DeepSeek-v3 was released, the trend had already caught the attention of Eric Schmidt, Google’s former CEO and one of the most influential voices on U.S. AI policy. In May 2024, Schmidt had confidently asserted that the U.S. maintained a two-to-three year lead in AI, “which is an eternity in my books.” Yet by November, in a talk at the Harvard Kennedy School, Schmidt had changed his tune. He cited the advances from Alibaba, and Tencent as evidence that China was closing the gap. “This is shocking to me,” he said. “I thought the restrictions we placed on chips would keep them back.” 

Beyond a source of national prestige, who leads on AI will likely have ramifications for the global balance of power. If AI agents can automate large parts of the workforce, they may provide a boost to nations’ economies. And future systems, capable of directing weapons or hacking adversaries, could provide a decisive military advantage. As nations caught between the two superpowers are forced to choose between Chinese or American AI systems, artificial intelligence could emerge as a powerful tool for global influence. China’s rapid advances raise questions about whether U.S. export controls on semiconductors will be enough to maintain America's edge.

Building more powerful AI depends on three essential ingredients: data, innovative algorithms, and raw computing power, or compute. Training data for large language models like GPT-4o is typically scrapped from the internet, meaning it’s available for developers across the world. Similarly, algorithms, or new ideas for how to improve AI systems, move across borders with ease, as new techniques are often shared in academic papers. Even if they weren’t, China has a wealth of AI talent, producing more top AI researchers than the U.S. By contrast, advanced chips are incredibly hard to make, and unlike algorithms or data, they are a physical good that can be stopped at the border.

The supply chain for advanced semiconductors is dominated by America and its allies. U.S. companies Nvidia and AMD have an effective duopoly on datacenter-GPUs used for AI. Their designs are so intricate—with transistors measured in single-digit nanometers—that currently, only the Taiwanese company TSMC manufactures these top-of-the-line chips. To do so, TSMC relies on multi-million dollar machines that only Dutch company ASML can build. 

The U.S. has sought to leverage this to its advantage. In 2022, the Biden administration introduced export controls, laws that prevent the sale of cutting-edge chips to China. The move followed a series of measures that began under Trump’s first administration, which sought to curb China’s access to chip-making technologies. These efforts have not only restricted the flow of advanced chips into China, but hampered the country’s domestic chip industry. China’s chips lag “years behind,” U.S. Secretary of Commerce Gina Raimondo told 60 minutes in April.

Yet, the 2022 export controls encountered their first hurdle before being announced, as developers in China reportedly stockpiled soon-to-be restricted chips. DeepSeek, the Chinese developer behind an AI reasoning model called R1, which rivals OpenAI’s O1-preview, assembled a cluster of 10,000 soon-to-be-banned Nvidia A100 GPUs a year before export controls were introduced.

Smuggling might also have undermined the export control’s effectiveness. In October, Reuters reported that restricted TSMC chips were found on a product made by Chinese company Huawei. Chinese companies have also reportedly acquired restricted chips using shell companies outside China. Others have skirted export controls by renting GPU access from offshore cloud providers. In December, The Wall Street Journal reported that the U.S. is preparing new measures that would limit China’s ability to access chips through other countries. 

While U.S. export controls curtail China’s access to the most cutting-edge semiconductors, they still allow the sale of less powerful chips. Deciding which chips should and should not be allowed has proved challenging. In 2022, Nvidia tweaked the design of its flagship chip to create a version for the Chinese market that fell within the restrictions’ thresholds. The chip was still useful for AI development, prompting the U.S. to tighten restrictions in October 2023. “We had a year where [China] could just buy chips which are basically as good,” says Lennart Heim, a lead on AI and compute at the RAND corporation’s Technology and Security Policy Center. He says this loophole, coupled with the time for new chips to find their way into AI developers’ infrastructure, is why we are yet to see the export controls have a full impact on China’s AI development.

It remains to be seen whether the current threshold strikes the right balance. In November, Tencent released a language model called Hunyuan-Large that outperforms Meta’s most powerful variant of Llama 3.1 in several benchmarks. While benchmarks are an imperfect measure for comparing AI models’ overall intelligence, Hunyuan-Large’s performance is impressive because it was trained using the less powerful, unrestricted Nvidia H20 GPUs, according to research by the Berkeley Risk and Security Lab. “They're clearly getting much better use out of the hardware because of better software,” says Ritwik Gupta, the author of the research, who also advises the Department of Defense’s Defense Innovation Unit. Rival Chinese lab’s DeepSeek-v3, believed to be the strongest open model available, was also trained using surprisingly little compute. Although there is significant uncertainty about how President-elect Donald Trump will approach AI policy, several experts told TIME in November that they expected export controls to persist—and even be expanded.

Before new restrictions were introduced in December, Chinese companies once again stockpiled soon-to-be-blocked chips.“This entire strategy needs to be rethought,” Gupta says. “Stop playing whack-a-mole with these hardware chips.” He suggests that instead of trying to slow down development of large language models by restricting access to chips, the U.S. should concentrate on preventing the development of military AI systems, which he says often need less computing power to train. Though he acknowledges that restrictions on other parts of the chip supply chain—like ASML’s machines used for manufacturing chips—have been pivotal in slowing China’s domestic chip industry.

Heim says that over the last year, the U.S.’s lead has shrunk, though he notes that while China may now match the U.S.’s best open source models, these lag roughly one year behind the top closed models. He adds that the closing gap does not necessarily mean export controls are failing. “Let’s move away from this binary of export controls working or not working,” he says, adding that it may take longer for China to feel them bite.  

The last decade has seen a dizzying increase in the compute used for training AI models. For example, OpenAI’s GPT-4, released in 2023, is estimated to have been trained using roughly 10,000 times more compute than GPT-2, released in 2019. There are indications that trend is set to continue, as American companies like X and Amazon build massive supercomputers with hundreds of thousands of GPUs, far exceeding the computing power used to train today's leading AI models. If it does, Heim predicts that U.S. chip export restrictions will hamper China's ability to keep pace in AI development. “Export controls mostly hit you on quantity,” Heim says, adding that even if some restricted chips find their way into the hands of Chinese developers, by reducing the number, export controls make it harder to train and deploy models at scale. “I do expect export controls to generally hit harder over time, as long as compute stays as important,” he says.

Within Washington, “right now, there is a hesitation to bring China to the [negotiating] table,” says Scott Singer, a visiting scholar in the Technology and International Affairs Program at the Carnegie Endowment for International Peace. The implicit reasoning: ‘[If the U.S. is ahead], why would we share anything?’” 

But he notes there are compelling reasons to negotiate with China on AI. “China does not have to be leading to be a source of catastrophic risk,” he says, adding its continued progress in spite of compute restrictions means it could one day produce AI with dangerous capabilities. "If China is much closer, consider what types of conversations you want to have with them around ensuring both sides' systems remain secure,” Singer says

0

阅读 收藏 喜欢 打印举报/Report
  

新浪BLOG意见反馈留言板 欢迎批评指正

新浪简介 | About Sina | 广告服务 | 联系我们 | 招聘信息 | 网站律师 | SINA English | 产品答疑

新浪公司 版权所有