中国怎么还能追上来?
(2025-01-16 05:47:05)
尽管美国政府为了遏制中国高科技产业的发展,禁止美国企业出口最尖端的芯片到中国,美国《时代》周刊却发现,中国的人工智能领域还是在以极快的速度追赶着美国——就连美国AI领域的精英,都对此感到“震惊”。
可究竟是什么,让美国在芯片上的对中国的卡脖子,事倍功半呢?
美国《时代》周刊在其分析中首先指出,支撑AI领域发展的核心三件套,分别是数据、算法以及算力。
然而,除了依托于芯片这种实体物的算力,可以被禁止过境,数据和算法都是跨越国界且难以被限制的。其中,数据是可以从互联网上公开取得的。算法的创新则通常是以学术论文这样的形式在全世界分享传播——即便被限制,中国也有丰富的AI人才储备正在不断转化为顶尖的AI研究人员,且数量多于美国。
其次,《时代》指出,即便是在芯片的限制上,中国也有办法应对。除了在美国相关禁令出台前,就已经提前囤积好了足够多的芯片,中国的企业还可以通过租用云端的算力服务器来跑AI。同时,美国虽然此前限制了最尖端的芯片,但稍差一些的芯片当时仍然可以出口到中国。
《时代》周刊在这里还特别提到,目前中国国产的一些AI大模型,已经通过软件和算法的优化,把这些差一些芯片的性能发挥到了极致,结果反而实现了在表现能力上反超有尖端芯片支持的美国大模型。
新上市的芯片往往还需要一段时间的适应磨合期,而一些被广泛使用的旧芯片的性能则已经得到了最大释放。
因此,《时代》介绍说,诸如美国谷歌公司的前CEO埃里克·施密特(Eric
Schmidt)这样在美国AI领域有举足轻重地位的人物惊讶地发现,原本他们以为美国在AI领域轻松地保持2-3年的领先优势,但现在中国还是在迅速追了上来。
“我被震惊了,我以为我们在芯片上的限制可以把他们拦下来”,施密特在去年11月一次讲话中这样说道。
不过,有美国的专业人士对《时代》表示,随着美国对芯片的限制在宽度和深度上不断地加码,美国还是会逐步在算力和规模上拉开与中国的差距。但也有专业人士认为,鉴于中国在限制下仍然能不断进步,美国与其限制,不如通过对话与中国一同寻找让彼此都放心的解决办法。
How China Is Advancing in AI
Despite U.S. Chip Restrictions
By Harry Booth
In 2017, Beijing unveiled an ambitious roadmap to dominate
artificial intelligence development, aiming to secure global
leadership by 2030. By 2020, the plan called for “iconic advances”
in AI to demonstrate its progress. Then in late 2022, OpenAI’s
release of ChatGPT took the world by surprise—and caught China
flat-footed.
At the time, leading Chinese technology companies were still
reeling from an 18-month government crackdown that shaved around $1
trillion off China's tech sector. It was almost a year before a
handful of Chinese AI chatbots received government approval for
public release. Some questioned whether China’s stance on
censorship might hobble the country’s AI ambitions. Meanwhile, the
Biden administration’s export controls, unveiled just a month
before ChatGPT’s debut, aimed to cut China off from the advanced
semiconductors essential for training large-scale AI models.
Without cutting-edge chips, Beijing’s goal of AI supremacy by 2030
appeared increasingly out of reach.
But fast forward to today, and a flurry of impressive Chinese
releases suggests the U.S.’s AI lead has shrunk. In November,
Alibaba and Chinese AI developer DeepSeek released reasoning models
that, by some measures, rival OpenAI’s o1-preview. The same month,
Chinese videogame juggernaut Tencent unveiled Hunyuan-Large, an
open-source model that the company’s testing found outperformed top
open-source models developed in the U.S. across several benchmarks.
Then, in the final days of 2024, DeepSeek released DeepSeek-v3,
which now ranks highest among open-source AI on a popular online
leaderboard and holds its own against top performing closed systems
from OpenAI and Anthropic.
Before DeepSeek-v3 was released, the trend had already caught
the attention of Eric Schmidt, Google’s former CEO and one of the
most influential voices on U.S. AI policy. In May 2024, Schmidt had
confidently asserted that the U.S. maintained a two-to-three year
lead in AI, “which is an eternity in my books.” Yet by November, in
a talk at the Harvard Kennedy School, Schmidt had changed his tune.
He cited the advances from Alibaba, and Tencent as evidence that
China was closing the gap. “This is shocking to me,” he said. “I
thought the restrictions we placed on chips would keep them
back.”
Beyond a source of national prestige, who leads on AI will
likely have ramifications for the global balance of power. If AI
agents can automate large parts of the workforce, they may provide
a boost to nations’ economies. And future systems, capable of
directing weapons or hacking adversaries, could provide a decisive
military advantage. As nations caught between the two superpowers
are forced to choose between Chinese or American AI systems,
artificial intelligence could emerge as a powerful tool for global
influence. China’s rapid advances raise questions about whether
U.S. export controls on semiconductors will be enough to maintain
America's edge.
Building more powerful AI depends on three essential
ingredients: data, innovative algorithms, and raw computing power,
or compute. Training data for large language models like GPT-4o is
typically scrapped from the internet, meaning it’s available for
developers across the world. Similarly, algorithms, or new ideas
for how to improve AI systems, move across borders with ease, as
new techniques are often shared in academic papers. Even if they
weren’t, China has a wealth of AI talent, producing more top AI
researchers than the U.S. By contrast, advanced chips are
incredibly hard to make, and unlike algorithms or data, they are a
physical good that can be stopped at the border.
The supply chain for advanced semiconductors is dominated by
America and its allies. U.S. companies Nvidia and AMD have an
effective duopoly on datacenter-GPUs used for AI. Their designs are
so intricate—with transistors measured in single-digit
nanometers—that currently, only the Taiwanese company TSMC
manufactures these top-of-the-line chips. To do so, TSMC relies on
multi-million dollar machines that only Dutch company ASML can
build.
The U.S. has sought to leverage this to its advantage. In
2022, the Biden administration introduced export controls, laws
that prevent the sale of cutting-edge chips to China. The move
followed a series of measures that began under Trump’s first
administration, which sought to curb China’s access to chip-making
technologies. These efforts have not only restricted the flow of
advanced chips into China, but hampered the country’s domestic chip
industry. China’s chips lag “years behind,” U.S. Secretary of
Commerce Gina Raimondo told 60 minutes in April.
Yet, the 2022 export controls encountered their first hurdle
before being announced, as developers in China reportedly
stockpiled soon-to-be restricted chips. DeepSeek, the Chinese
developer behind an AI reasoning model called R1, which rivals
OpenAI’s O1-preview, assembled a cluster of 10,000
soon-to-be-banned Nvidia A100 GPUs a year before export controls
were introduced.
Smuggling might also have undermined the export control’s
effectiveness. In October, Reuters reported that restricted TSMC
chips were found on a product made by Chinese company Huawei.
Chinese companies have also reportedly acquired restricted chips
using shell companies outside China. Others have skirted export
controls by renting GPU access from offshore cloud providers. In
December, The Wall Street Journal reported that the U.S. is
preparing new measures that would limit China’s ability to access
chips through other countries.
While U.S. export controls curtail China’s access to the most
cutting-edge semiconductors, they still allow the sale of less
powerful chips. Deciding which chips should and should not be
allowed has proved challenging. In 2022, Nvidia tweaked the design
of its flagship chip to create a version for the Chinese market
that fell within the restrictions’ thresholds. The chip was still
useful for AI development, prompting the U.S. to tighten
restrictions in October 2023. “We had a year where [China] could
just buy chips which are basically as good,” says Lennart Heim, a
lead on AI and compute at the RAND corporation’s Technology and
Security Policy Center. He says this loophole, coupled with the
time for new chips to find their way into AI developers’
infrastructure, is why we are yet to see the export controls have a
full impact on China’s AI development.
It remains to be seen whether the current threshold strikes
the right balance. In November, Tencent released a language model
called Hunyuan-Large that outperforms Meta’s most powerful variant
of Llama 3.1 in several benchmarks. While benchmarks are an
imperfect measure for comparing AI models’ overall intelligence,
Hunyuan-Large’s performance is impressive because it was trained
using the less powerful, unrestricted Nvidia H20 GPUs, according to
research by the Berkeley Risk and Security Lab. “They're clearly
getting much better use out of the hardware because of better
software,” says Ritwik Gupta, the author of the research, who also
advises the Department of Defense’s Defense Innovation Unit. Rival
Chinese lab’s DeepSeek-v3, believed to be the strongest open model
available, was also trained using surprisingly little compute.
Although there is significant uncertainty about how President-elect
Donald Trump will approach AI policy, several experts told TIME in
November that they expected export controls to persist—and even be
expanded.
Before new restrictions were introduced in December, Chinese
companies once again stockpiled soon-to-be-blocked chips.“This
entire strategy needs to be rethought,” Gupta says. “Stop playing
whack-a-mole with these hardware chips.” He suggests that instead
of trying to slow down development of large language models by
restricting access to chips, the U.S. should concentrate on
preventing the development of military AI systems, which he says
often need less computing power to train. Though he acknowledges
that restrictions on other parts of the chip supply chain—like
ASML’s machines used for manufacturing chips—have been pivotal in
slowing China’s domestic chip industry.
Heim says that over the last year, the U.S.’s lead has shrunk,
though he notes that while China may now match the U.S.’s best open
source models, these lag roughly one year behind the top closed
models. He adds that the closing gap does not necessarily mean
export controls are failing. “Let’s move away from this binary of
export controls working or not working,” he says, adding that it
may take longer for China to feel them
bite.
The last decade has seen a dizzying increase in the compute
used for training AI models. For example, OpenAI’s GPT-4, released
in 2023, is estimated to have been trained using roughly 10,000
times more compute than GPT-2, released in 2019. There are
indications that trend is set to continue, as American companies
like X and Amazon build massive supercomputers with hundreds of
thousands of GPUs, far exceeding the computing power used to train
today's leading AI models. If it does, Heim predicts that U.S. chip
export restrictions will hamper China's ability to keep pace in AI
development. “Export controls mostly hit you on quantity,” Heim
says, adding that even if some restricted chips find their way into
the hands of Chinese developers, by reducing the number, export
controls make it harder to train and deploy models at scale. “I do
expect export controls to generally hit harder over time, as long
as compute stays as important,” he says.
Within Washington, “right now, there is a hesitation to bring
China to the [negotiating] table,” says Scott Singer, a visiting
scholar in the Technology and International Affairs Program at the
Carnegie Endowment for International Peace. The implicit reasoning:
‘[If the U.S. is ahead], why would we share
anything?’”
But he notes there are compelling reasons to negotiate with
China on AI. “China does not have to be leading to be a source of
catastrophic risk,” he says, adding its continued progress in spite
of compute restrictions means it could one day produce AI with
dangerous capabilities. "If China is much closer, consider what
types of conversations you want to have with them around ensuring
both sides' systems remain secure,” Singer says
前一篇:为什么时间在不同意识状态下会变慢
后一篇:北约秘书长呼吁成员国做好战争准备