Collaborative Research: SLES: Verifying and Enforcing Safety Constraints in AI-based Sequential Generation
合作研究:SLES:验证和执行基于人工智能的顺序生成中的安全约束
基本信息
- 批准号:2331967
- 负责人:
- 金额:$ 26万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-10-01 至 2026-09-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Artificial intelligence (AI) has achieved transformative impacts on various complex real-world challenges. Among its applications, sequential data are prevalent in many critical usages of AI when it directly engages with its users. Self-driving cars rely on AI to process sequences of sensor data from cameras and radars, and make a sequence of real-time decisions to ensure safe driving. Healthcare monitoring systems use AI to analyze sequences of patient health data, such as blood pressure, heart rate, and others, to detect anomalies and predict potential health issues. Chatbots utilize AI to understand natural language and generate safe, fair, and appropriate text responses as sequences of words and sentences. The sequential data produced by AI make its behavior hard to characterize because of the complex dependencies within the sequence, and a careless application of AI in these scenarios may lead to harmful consequences, such as a collision of an autonomous vehicle or the generation of biased or toxic texts. This project aims to study the safety of AI under scenarios with sequential data, provide assurance for its behavior in mission-critical environments, and ensure AI-based sequential generation can adhere to safety constraints and social norms. Ultimately, this research will help with reducing unexpected AI failures, preventing bias and discrimination in AI technologies, aligning AI systems with human values and societal norms, and building up public trust for AI-enabled applications.The technical contributions of this project consist of three thrusts. The first thrust develops a formal verification framework for assuring the safety of AI models for sequential generation tasks with rigorous mathematical guarantees. It includes a series of innovative verification algorithms for bound propagation and branch-and-bound for general non-linear functions involved in sequential generation models. These new verification methods will be integrated into the alpha-beta-CROWN neural network verifier, a well-known open-source toolbox developed by investigators. The second thrust involves training and inference algorithms that ensure sequential generation models comply with specified safety constraints, with a unique probabilistic framework that decomposes a safety constraint into action-level components and enforces them at each generation step. This approach can be integrated with model training to improve the safety performance of sequential generation models using posterior regularization techniques. Lastly, the third thrust aims to integrate the formal verification and constrained generation components above and apply them to three important real-world applications: safety of text generation, safety and stability of controlled systems, and robust AI-generated text detectors. This project will also result in tools to the broader AI community, including the alpha-beta-CROWN neural network verifier, and the shared data and benchmarks developed to evaluate the safety of sequential generation models.This project is supported by a partnership with the NSF and Open Philanthropy.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
人工智能(AI)对各种复杂的现实世界挑战产生了变革性的影响。在其应用程序中,当人工智能直接与用户互动时,顺序数据在人工智能的许多关键用途中普遍存在。自动驾驶汽车依靠人工智能处理来自摄像头和雷达的传感器数据序列,并做出一系列实时决策,以确保安全驾驶。医疗监控系统使用AI来分析患者健康数据序列,如血压、心率等,以检测异常并预测潜在的健康问题。聊天机器人利用人工智能来理解自然语言,并生成安全,公平和适当的文本响应作为单词和句子的序列。人工智能产生的序列数据使其行为难以描述,因为序列中存在复杂的依赖关系,在这些场景中不小心应用人工智能可能会导致有害的后果,例如自动驾驶汽车的碰撞或产生有偏见或有毒的文本。该项目旨在研究人工智能在序列数据场景下的安全性,为其在关键任务环境中的行为提供保证,并确保基于人工智能的序列生成能够遵守安全约束和社会规范。最终,这项研究将有助于减少意外的人工智能失败,防止人工智能技术中的偏见和歧视,使人工智能系统与人类价值观和社会规范保持一致,并建立公众对人工智能应用程序的信任。第一个推力开发了一个正式的验证框架,以确保具有严格数学保证的顺序生成任务的AI模型的安全性。它包括一系列创新的验证算法,用于顺序生成模型中涉及的一般非线性函数的边界传播和分支定界。这些新的验证方法将被集成到alpha-beta-CROWN神经网络验证器中,这是一个由研究人员开发的著名开源工具箱。第二个推力涉及训练和推理算法,确保顺序生成模型符合指定的安全约束,具有独特的概率框架,将安全约束分解为动作级组件并在每个生成步骤中执行它们。这种方法可以与模型训练相结合,使用后验正则化技术来提高序贯生成模型的安全性能。最后,第三个目标是整合上述形式化验证和约束生成组件,并将其应用于三个重要的现实应用:文本生成的安全性,受控系统的安全性和稳定性,以及强大的人工智能生成的文本检测器。该项目还将为更广泛的人工智能社区提供工具,包括alpha-beta-CROWN神经网络验证器,以及为评估连续发电模型的安全性而开发的共享数据和基准。该项目得到了与NSF和开放慈善事业的合作伙伴关系的支持。该奖项反映了NSF的法定使命,并被认为值得通过使用基金会的智力价值和更广泛的评估来支持。影响审查标准。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Huan Zhang其他文献
Studies of Uposomal bcl-2 Antisense Oligode ·-·
脂质体bcl-2反义寡核苷酸的研究·-·
- DOI:
- 发表时间:
2005 - 期刊:
- 影响因子:0
- 作者:
Dong He;Huan Zhang - 通讯作者:
Huan Zhang
Enhanced Dissociation Activation of CO2 on the Bi/Cu(111) interface by the synergistic effect
协同效应增强Bi/Cu(111)界面上CO2的解离活化
- DOI:
10.1016/j.jcat.2022.04.001 - 发表时间:
2022-04 - 期刊:
- 影响因子:7.3
- 作者:
Huan Zhang;Zhaofeng Liang;Chaoqin Huang;Lei Xie;Hongbing Wang;Jinping Hu;Zheng Jiang;Fei Song - 通讯作者:
Fei Song
Clinical Assessment of Brachial-Ankle Pulse Wave Velocity and Stiffness Index: Hypertriglyceridemia Effects on Arterial Stiffness
臂踝脉搏波速度和僵硬度指数的临床评估:高甘油三酯血症对动脉僵硬度的影响
- DOI:
- 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
An Zhao;Yinbao Chong;Hangmei Zhong;Jieshi Ma;Huan Zhang;Z. Luo;Gaosen Li;Xiaomin Luo - 通讯作者:
Xiaomin Luo
Interplanetary magnetic field control of plasma distribution in the polar ionosphere
极地电离层等离子体分布的行星际磁场控制
- DOI:
10.1016/j.asr.2020.08.024 - 发表时间:
2021 - 期刊:
- 影响因子:2.6
- 作者:
Shenggao Yang;Libin Weng;Yaguang Zhu;Xu Yang;Sihui Hu;Peikang Xu;Huan Zhang;Weidong Pan;Jie Shang;Xing Su - 通讯作者:
Xing Su
Multifunctional theranostic nanoparticles for multi-modal imaging-guided CAR-T immunotherapy and chemo-photothermal combinational therapy of non-Hodgkin's lymphoma
用于非霍奇金淋巴瘤多模式成像引导的 CAR-T 免疫治疗和化疗光热联合治疗的多功能治疗诊断纳米颗粒
- DOI:
10.1039/d1bm01982a - 发表时间:
2022 - 期刊:
- 影响因子:6.6
- 作者:
Bowen Shi;Dan Li;Weiwu Yao;Wenfang Wang;Jiang Jiang;Ruiheng Wang;Fuhua Yan;Han Liu;Huan Zhang;Jian Ye - 通讯作者:
Jian Ye
Huan Zhang的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
相似国自然基金
水凝胶改性陶瓷人工关节牢固结合界面的构筑与减磨润滑机理研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
锆酸铅基反铁电体畴动力学及其调控机理研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
载铁生物炭对土壤镉污染的吸附固定及微生物协同作用机制研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
SREBP转录因子BbSre1负调控球孢白僵菌抗真菌物质产生的机制研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
面向截肢患者运动感知重建的肌电假肢手关节运动反馈时变编码研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
面向水质应急快检的碳点/微流控限域增强发光传感研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
面向挠性压电太阳翼的物理信息混合建模与非同位控制方法研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
随机3维 Burgers 方程正则性研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
犬尿氨酸通过AhR/STAT3轴活化粒细胞样MDSCs促进慢性肾脏病心脏纤维化的机制研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
磁性的机器学习研究: 以图神经网络为中心
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
相似海外基金
Collaborative Research: SLES: Guaranteed Tubes for Safe Learning across Autonomy Architectures
合作研究:SLES:跨自治架构安全学习的保证管
- 批准号:
2331878 - 财政年份:2024
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Guaranteed Tubes for Safe Learning across Autonomy Architectures
合作研究:SLES:跨自治架构安全学习的保证管
- 批准号:
2331879 - 财政年份:2024
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Safe Distributional-Reinforcement Learning-Enabled Systems: Theories, Algorithms, and Experiments
协作研究:SLES:安全的分布式强化学习系统:理论、算法和实验
- 批准号:
2331781 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Foundations of Qualitative and Quantitative Safety Assessment of Learning-enabled Systems
合作研究:SLES:学习型系统定性和定量安全评估的基础
- 批准号:
2331938 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Bridging offline design and online adaptation in safe learning-enabled systems
协作研究:SLES:在安全的学习系统中桥接离线设计和在线适应
- 批准号:
2331880 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Foundations of Qualitative and Quantitative Safety Assessment of Learning-enabled Systems
合作研究:SLES:学习型系统定性和定量安全评估的基础
- 批准号:
2331937 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Safety under Distributional Shift in Learning-Enabled Power Systems
合作研究:SLES:学习型电力系统分配转变下的安全性
- 批准号:
2331776 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Safe Distributional-Reinforcement Learning-Enabled Systems: Theories, Algorithms, and Experiments
协作研究:SLES:安全的分布式强化学习系统:理论、算法和实验
- 批准号:
2331780 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Bridging offline design and online adaptation in safe learning-enabled systems
协作研究:SLES:在安全的学习系统中桥接离线设计和在线适应
- 批准号:
2331881 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Verifying and Enforcing Safety Constraints in AI-based Sequential Generation
合作研究:SLES:验证和执行基于人工智能的顺序生成中的安全约束
- 批准号:
2331966 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant