行业研究公司研究宏观策略财报招股书会议纪要 Token 低空经济十五五 AIGC 大模型

中国人工智能安全全景报告(State of AI Safety in China)

信息技术 2024-05-14 Jason Zhou,Kwan Yee Ng,Brian Tse 安远AI 王英杰

核心观点

中国在人工智能安全领域的研究投入和产出显著增加，尤其在 LLM 去学习、生物和化学领域 AI 的滥用风险以及评估 LLM 的“权力寻求”和“自我意识”风险等方面。
中国积极参与国际人工智能治理，签署了布莱切利宣言，并与法国发布联合声明，与美国建立政府间对话，显示出主要大国在人工智能安全问题上的共识日益趋同。
中国国家政策和领导层对发展大型模型的同时平衡风险表现出日益增长的兴趣。
中国正在制定国家人工智能法，其中包含人工智能安全的条款，例如对基础模型进行专门监管，并规定 AGI 的价值对齐。
中国的 3 个最大人工智能中心所在地的政府已发布关于 AGI 或大型模型的政策，主要旨在加速发展，但也包含国际合作、伦理以及测试和评估等条款。
中国的专家们最近讨论了几个重点人工智能安全话题，包括 AI 不得跨越的“红线”以避免“存在风险”，人工智能安全研究的最低资金水平，以及人工智能对生物安全的影响。

关键数据

过去 6 个月平均每月有近 15 篇关于前沿人工智能安全的中文技术论文。
报告确定了 11 个关键研究小组，这些小组过去一年撰写了大部分相关论文。
中国已签署布莱切利宣言，并与法国发布联合声明，与美国建立政府间对话。
自 2022 年以来，中国与西方国家举行了 8 场关于人工智能的 1.5 轨道或 2 轨道对话，其中 2 场专注于前沿人工智能安全和管理。

研究结论

中国在人工智能安全领域的研究投入和产出显著增加，与国际社会的共识日益趋同。
中国正在积极制定国家人工智能法，并采取措施加强人工智能安全监管。
中国的专家们对人工智能安全风险的认识不断提高，并开始探讨更深入的话题。

Published May 14, 2024 Executive Summary Executive Summary (1) ➢Therelevance and qualityof Chinesetechnical research for frontier AI safetyhasincreasedsubstantially, with growing work on frontier issues such as LLM unlearning, misuse risksof AI in biology and chemistry, and evaluating "power-seeking" and "self-awareness" risks of LLMs. ➢There have been nearly15 Chinese technical papers on frontier AI safety per monthonaverage over the past 6 months.The report identifies 11 key research groups who have written asubstantial portion of these papers. ➢China’s decision to sign theBletchley Declaration, issue a joint statement on AI governance withFrance, and pursue an intergovernmental AI dialogue with theUSindicates agrowingconvergence of views on AI safety among major powerscompared to early 2023.➢Since 2022, 8Track 1.5 or 2 dialoguesfocused on AI have taken place between China andWestern countries, with 2 focused on frontier AI safety and governance. Executive Summary Executive Summary (II) ➢Chinesenational policy and leadershipshow growing interest indeveloping large modelswhile balancing risk prevention.➢Unofficialexpert draftsof China’sforthcoming national AI lawcontainprovisions on AIsafety, such as specialized oversight for foundation models and stipulating value alignment of AGI.➢Local governmentsin China’s 3 biggest AI hubs have issuedpolicies on AGI or large models,primarily aimed at accelerating development while also including provisions on topics such asinternational cooperation, ethics, and testing and evaluation.➢Several influentialindustry associationsestablishedprojects or committees to research AIsafety and security problems, but their focus is primarily on content and data security ratherthan frontier AI safety.➢In recent months,Chinese expertshave discussed several focusedAI safety topics, including“red lines”that AI must not cross to avoid “existential risks,”minimum funding levelsfor AIsafety research, and AI’s impact onbiosecurity. Table of Contents Section 1: Introduction and scopeSection 2:Technical safety researchSection 3: International governanceSection 4: Domestic governanceSection 5: Lab and industry practicesSection 6: Expert views on AI risksSection 7: Public opinion on AISection 8:Additional resourcesSection 9:About us Introduction and Scope Thanks to positive feedback on our first report and rapid AI developments sinceOctober 2023, we have decided to issue an update! ➢The 2023 version was published before the UK AI Safety Summit, and our CEO, Brian Tse, shared itwith other attendees at the summit.➢We provided briefings on the report to over a dozen organizations including the BrookingsInstitution, the Center for Strategic and International Studies, Google DeepMind, the FrontierModel Forum, and the Tony Blair Institute for Global Change.➢Media outlets including Politico and Sixth Tone have covered our report, and it has beenrecommended by leading AI experts, including Jeffrey Ding in his ChinAI newsletter. Introduction and Scope Our report focuses on “frontier AI risks.” ➢We share the focus of the 2023 UK AI Safety Summit, which emphasized risks from cutting-edge largemodels – “highly capable general-purpose AI models, including foundation models, that could perform awide variety of tasks” – as well as narrow AI systems in dangerous domains.1 ■We include both types of models when using the phrase “frontier AI.” Introduction and Scope Our report focuses on AI safety rather than AI security. ➢In English, risks from frontier AI are the subject of the discipline called AI “safety.” In Chinese, the term“人工智能安全” encompasses this definition, while also including AI “security.”2 ■AI “safety” is about protecting against broadly harmful consequences that could result from AIsystems such as accidents and misuse, whereas AI “security” is about preventing AI systems frombeing attacked and compromised. ■AI security includes topics such as cybersecurity of AI model weights, data security of AI models,and physical security of AI development facilities, which we exclude from the scope of the report. ■We exclude lethal autonomous weapons (LAWs) from the scope of this report to focus onnon-military AI risks. ➢In cases of ambiguity, we translate the term “人工智能安全” as “AI safety/security.” ➢Some AI safety topics can also be considered AI security issues and fallwithinour scope, such as: Table of Contents Section 1: Introduction and scopeSection 2:Technical safety researchSection 3: International governanceSection 4: Domestic governanceSection 5: Lab and industry practicesSection 6: Expert views on AI risksSection 7: Public opinion on AISection 8:Additional resourcesSection 9:About us Technical Safety Research Overview of key developments since October 2023 ➢Relevance and quantity of frontier AI safety researchhas risen substantially compared to 2023,with increasing interest in frontier aspects of AI safety. ➢We have identified with high confidence11key safety research g

点击免费查看完整报告

中国人工智能安全全景报告(State of AI Safety in China)

你可能感兴趣

2026年人工智能全景报告 (State of AI 2026)

State of Wildlife Trade in China 2008

The State of Wildlife Trade in China 2007

State of Wildlife Trade in China 2006

剑桥大学报告：2019年度AI全景报告State of AI Report

State of AI Report: 6 trends shaping the landscape in 2025

State of AI in Financial Services: 2026 Trends

SUMMARY OF PUBLIC OUTREACH EFFORTS CONCERNING STATE AND LOCAL PUBLIC SAFETY SPECTRUM MANAGEMENT POLICIES & PROCEDURES

Commodities Comment：Micro holes in the China safety net?

Assessment of occupational safety and health hazards exposure of workers in small-scale gold mining in the Philippines