Distributionally Robust Adaptive Control: Enabling Safe and Robust Reinforcement Learning
分布式鲁棒自适应控制:实现安全鲁棒的强化学习
基本信息
- 批准号:2135925
- 负责人:
- 金额:$ 37.5万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-07-01 至 2025-06-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Data-driven algorithms can autonomously control complex systems like autonomous cars and drones. However, the use of such powerful algorithms remains relegated primarily to controlled laboratory environments. The main reason for the minimal adoption of data-driven methods for safety-critical systems is the difficulty one encounters when attempting to establish safety and predictability guarantees as one would do with well-established control theoretical methods. This award supports fundamental research to identify the best methodologies to consolidate data-driven and control-theoretic tools so that the overall methodology is safe, robust, and high-performing. The new approach lifts control tools to speak the same language as the data-driven methods. In doing so, the performance of the data-driven methods is not compromised, and yet, the safety guarantees of control-theoretic tools can be constructed. Safe and predictable autonomous operation of complex systems can bring immense socio-economic benefits through its application in medical robotics, autonomous logistics, transportation, and extra-terrestrial exploration, to name a few. This research involves multiple disciplines, including robotics, control theory, statistical learning, and mathematics. The cross-disciplinary nature will assist underrepresented groups' broader participation in STEM and impact engineering education. To adopt data-driven methods that rely on reinforcement learning (RL) algorithms in safety-critical systems, we need guarantees on safety and robustness. Robust and adaptive control methodologies developed for classical systems with parametric uncertainties cannot be used directly in conjunction with RL because the latter operates on data-driven models for which identifying parametric and deterministic uncertainties is difficult, if not impossible. This research will construct a new class of robust adaptive controllers that are robust to errors in the learned distributions, thus allowing RL algorithms to directly interact with these controllers without further restrictions. Due to robustness at the level of distributions, notions of risk-aware safety can be included in a straightforward manner. This research will first aim to construct controllers that track temporally evolving state distributions with uniform bounds. Then, the epistemic uncertainties will be introduced with a novel adaptive control scheme to quantifiably control the effect of the uncertainties in the space of distributions. The results produced through this effort will bring the two distinct worlds of data-driven control and classical control together at a natural intersection point where trajectories of distributions, not of sample paths, are considered.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
数据驱动的算法可以自主控制自动汽车和无人机等复杂系统。然而,这种强大的算法的使用仍然主要局限于受控的实验室环境。对于安全关键系统,数据驱动方法的最小采用的主要原因是,当试图建立安全性和可预测性保证时遇到的困难,就像人们使用成熟的控制理论方法一样。该奖项支持基础研究,以确定整合数据驱动和控制理论工具的最佳方法,使整体方法安全,稳健和高性能。新方法提升了控制工具,使其与数据驱动方法使用相同的语言。在这样做时,数据驱动方法的性能不会受到影响,但可以构建控制理论工具的安全保证。复杂系统的安全和可预测的自主操作可以通过其在医疗机器人,自主物流,运输和外星探索等方面的应用带来巨大的社会经济效益。这项研究涉及多个学科,包括机器人,控制理论,统计学习和数学。跨学科的性质将有助于代表性不足的群体更广泛地参与STEM和影响工程教育。为了在安全关键系统中采用依赖于强化学习(RL)算法的数据驱动方法,我们需要保证安全性和鲁棒性。为具有参数不确定性的经典系统开发的鲁棒和自适应控制方法不能直接与RL结合使用,因为后者操作于数据驱动模型,对于该模型,识别参数和确定性不确定性是困难的,如果不是不可能的话。本研究将构建一类新的鲁棒自适应控制器,这些控制器对学习分布中的误差具有鲁棒性,从而允许RL算法直接与这些控制器进行交互,而无需进一步的限制。由于分布水平的稳健性,可以以直接的方式包含风险感知安全性的概念。本研究将首先设计控制器,以追踪具有一致边界的随时间演化的状态分布。然后,将认知不确定性引入一种新的自适应控制方案,以量化控制分布空间中的不确定性的影响。通过这一努力产生的结果将把数据驱动控制和经典控制两个不同的世界在一个自然的交叉点,其中分布的轨迹,而不是样本路径,被认为是一起的。这个奖项反映了NSF的法定使命,并已被认为是值得的支持,通过评估使用基金会的智力价值和更广泛的影响审查标准。
项目成果
期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Safe and Efficient Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions
- DOI:
- 发表时间:2022-11
- 期刊:
- 影响因子:0
- 作者:Yikun Cheng;Pan Zhao;N. Hovakimyan
- 通讯作者:Yikun Cheng;Pan Zhao;N. Hovakimyan
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Naira Hovakimyan其他文献
Three-dimensional coordinated path-following control for second-order multi-agent networks
二阶多智能体网络三维协调路径跟踪控制
- DOI:
10.1016/j.jfranklin.2015.01.020 - 发表时间:
2015-09 - 期刊:
- 影响因子:0
- 作者:
Zongyu Zuo;Venanzio Cichella;Ming Xu;Naira Hovakimyan - 通讯作者:
Naira Hovakimyan
FlipDyn in Graphs: Resource Takeover Games in Graphs
图表中的 FlipDyn:图表中的资源接管游戏
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
Sandeep Banik;Shaunak D. Bopardikar;Naira Hovakimyan - 通讯作者:
Naira Hovakimyan
Naira Hovakimyan的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Naira Hovakimyan', 18)}}的其他基金
Collaborative Research: SLES: Guaranteed Tubes for Safe Learning across Autonomy Architectures
合作研究:SLES:跨自治架构安全学习的保证管
- 批准号:
2331878 - 财政年份:2024
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
NSF-AoF: RI: Small: Safe Reinforcement Learning in Non-Stationary Environments With Fast Adaptation and Disturbance Prediction
NSF-AoF:RI:小型:具有快速适应和干扰预测功能的非平稳环境中的安全强化学习
- 批准号:
2133656 - 财政年份:2021
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
NRI: INT: COLLAB: Synergetic Drone Delivery Network in Metropolis
NRI:INT:COLLAB:大都市的协同无人机交付网络
- 批准号:
1830639 - 财政年份:2018
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
CPS: Medium: Collaborative Research: Against Coordinated Cyber and Physical Attacks: Unified Theory and Technologies
CPS:媒介:协作研究:对抗协调的网络和物理攻击:统一理论和技术
- 批准号:
1739732 - 财政年份:2017
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
NRI: Collaborative Research: ASPIRE: Automation Supporting Prolonged Independent Residence for the Elderly
NRI:合作研究:ASPIRE:自动化支持老年人长期独立居住
- 批准号:
1528036 - 财政年份:2015
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
EAGER: Human centered robotic system design
EAGER:以人为本的机器人系统设计
- 批准号:
1548409 - 财政年份:2015
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
相似国自然基金
供应链管理中的稳健型(Robust)策略分析和稳健型优化(Robust Optimization )方法研究
- 批准号:70601028
- 批准年份:2006
- 资助金额:7.0 万元
- 项目类别:青年科学基金项目
心理紧张和应力影响下Robust语音识别方法研究
- 批准号:60085001
- 批准年份:2000
- 资助金额:14.0 万元
- 项目类别:专项基金项目
ROBUST语音识别方法的研究
- 批准号:69075008
- 批准年份:1990
- 资助金额:3.5 万元
- 项目类别:面上项目
改进型ROBUST序贯检测技术
- 批准号:68671030
- 批准年份:1986
- 资助金额:2.0 万元
- 项目类别:面上项目
相似海外基金
VIPAuto: Robust and Adaptive Visual Perception for Automated Vehicles in Complex Dynamic Scenes
VIPAuto:复杂动态场景中自动驾驶车辆的鲁棒自适应视觉感知
- 批准号:
EP/Y015878/1 - 财政年份:2024
- 资助金额:
$ 37.5万 - 项目类别:
Fellowship
CAREER: Risk-Based Methods for Robust, Adaptive, and Equitable Flood Risk Management in a Changing Climate
职业:在气候变化中实现稳健、适应性和公平的洪水风险管理的基于风险的方法
- 批准号:
2238060 - 财政年份:2023
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
CAREER: Enabling Robust and Adaptive Architectures through a Decoupled Security-Centric Hardware/Software Stack
职业:通过解耦的以安全为中心的硬件/软件堆栈实现鲁棒性和自适应架构
- 批准号:
2238548 - 财政年份:2023
- 资助金额:
$ 37.5万 - 项目类别:
Continuing Grant
Exploiting Geometries of Learning for Fast, Adaptive and Robust AI
利用学习几何实现快速、自适应和鲁棒的人工智能
- 批准号:
DP230101176 - 财政年份:2023
- 资助金额:
$ 37.5万 - 项目类别:
Discovery Projects
Adaptive and Robust Methods in Statistical Machine Learning
统计机器学习中的自适应和鲁棒方法
- 批准号:
2748915 - 财政年份:2022
- 资助金额:
$ 37.5万 - 项目类别:
Studentship
CAREER: Robust and Adaptive Streaming Analytics for Sensorized Farms: Internet-of-Small-Things to the Rescue
职业:适用于传感农场的稳健且自适应的流分析:小型物联网的救援
- 批准号:
2146449 - 财政年份:2022
- 资助金额:
$ 37.5万 - 项目类别:
Continuing Grant
Towards a Robust Theory of Adaptive Learning Algorithms
迈向稳健的自适应学习算法理论
- 批准号:
RGPIN-2017-05085 - 财政年份:2022
- 资助金额:
$ 37.5万 - 项目类别:
Discovery Grants Program - Individual
Accurate, Efficient, and Robust Adaptive Solution Methods and Models for Predicting Multi-Scale Physically-Complex Flows
用于预测多尺度物理复杂流的准确、高效、鲁棒的自适应解决方法和模型
- 批准号:
RGPIN-2019-06758 - 财政年份:2022
- 资助金额:
$ 37.5万 - 项目类别:
Discovery Grants Program - Individual
Adaptive Robust Optimization with Endogenous Uncertainty and Active Learning in Smart Manufacturing
智能制造中具有内生不确定性和主动学习的自适应鲁棒优化
- 批准号:
2030296 - 财政年份:2021
- 资助金额:
$ 37.5万 - 项目类别:
Standard Grant
Accurate, Efficient, and Robust Adaptive Solution Methods and Models for Predicting Multi-Scale Physically-Complex Flows
用于预测多尺度物理复杂流的准确、高效、鲁棒的自适应解决方法和模型
- 批准号:
DGDND-2019-06758 - 财政年份:2021
- 资助金额:
$ 37.5万 - 项目类别:
DND/NSERC Discovery Grant Supplement