Collaborative Research: Autonomous Hierarchical Adaptive Dynamic Programming for Decision Making in Complex Environment

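Basic Information

  • Award number:
    1917275
  • Principal investigator:
  • Amount:
    $222,700
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2019
  • Funding country:
    United States
  • Project period:
    2019-08-01 to 2023-07-31
  • Project status:
    Completed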

Project Abstract

The recent wave of artificial intelligence (AI) has not only produced tremendous advances, ranging from fundamental research to a wide range of exciting applications, but also presents enormous opportunities and challenges to the community. Among the many AI techniques, adaptive dynamic programming and reinforcement learning (ADP/RL) is widely considered one of the key methodologies for learning-based intelligent decision making.

The objective of this project is to develop an innovative autonomous hierarchical ADP/RL approach for decision making in complex environments. By autonomously providing a hierarchical representation of sub-goals for improved learning and exploration capability, the proposed research provides a new approach to systematically and adaptively develop an optimal multi-step hierarchical temporal abstraction sequence, rather than the one-step primitive action used in traditional methods. The research method advances the foundations, principles, architectures, and algorithms for autonomous learning and hierarchical control, which will facilitate learning and generalization in decision making. This project provides unique opportunities to attract and educate future professionals by bridging ADP/RL and energy systems, and for students to work on cutting-edge problems. The team consists of two PIs with strong collaborations and complementary expertise in computational intelligence, machine learning, autonomous control, and the smart grid.

This research advances the scientific foundations and methodologies of intelligent decision making in complex environments with high dimensionality, big data, and uncertainty. The collaborations with industry integrate fundamental research into a microgrid application, providing critical technical innovations to the energy sector. In addition, the developed ADP/RL-based intelligent decision-making method can benefit other types of complex engineering systems. Furthermore, the research results of this project are also expected to fulfill a critical need in the community by training and preparing the future workforce in the cross-disciplinary areas of machine learning and energy systems. The integrative outreach and education activities will provide unique opportunities to attract women and minorities into the intelligent systems and smart grid fields.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Project Outputs

Journal articles (24)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Decentralized Event-Triggered Control for a Class of Nonlinear-Interconnected Systems Using Reinforcement Learning
  • DOI:
    10.1109/tcyb.2019.2946122
  • Published:
    2019-10
  • Journal:
  • Impact factor:
    11.8
  • Authors:
    Xiong Yang;Haibo He
  • Corresponding author:
    Xiong Yang;Haibo He
Event-Driven H∞-Constrained Control Using Adaptive Critic Learning
  • DOI:
    10.1109/tcyb.2020.2972748
  • Published:
    2020-02
  • Journal:
  • Impact factor:
    11.8
  • Authors:
    Xiong Yang;Haibo He
  • Corresponding author:
    Xiong Yang;Haibo He
An Improved Trust-Region Method for Off-Policy Deep Reinforcement Learning
Energy-Storage-Based Intelligent Frequency Control of Microgrid With Stochastic Model Uncertainties
  • DOI:
    10.1109/tsg.2019.2942770
  • Published:
    2020-03
  • Journal:
  • Impact factor:
    9.6
  • Authors:
    C. Mu;Yong Zhang;H. Jia;Haibo He
  • Corresponding author:
    C. Mu;Yong Zhang;H. Jia;Haibo He
Synchronization of complex-valued dynamic networks with intermittently adaptive coupling: A direct error method
  • DOI:
    10.1016/j.automatica.2019.108675
  • Published:
    2020-02-01
  • Journal:
  • Impact factor:
    6.4
  • Authors:
    Hu, Cheng;He, Haibo;Jiang, Haijun
  • Corresponding author:
    Jiang, Haijun

Other publications by Haibo He

A Multifactorial Evolutionary Algorithm for Multitasking Under Interval Uncertainties
Adaptive Critic Designs for Event-Triggered Robust Control of Nonlinear Systems With Unknown Dynamics
  • DOI:
    10.1109/tcyb.2018.2823199
  • Published:
    2019-06
  • Journal:
  • Impact factor:
    11.8
  • Authors:
    Xiong Yang;Haibo He
  • Corresponding author:
    Haibo He
Numerical Simulation and Analysis of Three Dimensional Flow Field of a Counter-Rotating Fan with Various Angles
Event-triggered optimal control for nonlinear constrained-input systems with partially unknown dynamics via adaptive dynamic programming
Intelligent load frequency controller using GrADP for island smart grid with electric vehicles and renewable resources
  • DOI:
    10.1016/j.neucom.2015.04.092
  • Published:
    2015-12
  • Journal:
  • Impact factor:
    6
  • Authors:
    Yufei Tang;Jun Yang;Jun Yan;Haibo He
  • Corresponding author:
    Haibo He

Other grants held by Haibo He

SpecEES: Collaborative Research: Enabling Spectrum and Energy-Efficient Dynamic Spectrum Access Wireless Networks using Neuromorphic Computing
  • Award number:
    1731672
  • Fiscal year:
    2017
  • Amount:
    $222,700
  • Award type:
    Standard Grant
NRI: Collaborative Research: Dynamic Robot Guides for Emergency Evacuations
  • Award number:
    1526835
  • Fiscal year:
    2015
  • Amount:
    $222,700
  • Award type:
    Standard Grant
TC: Small: Secure the Electrical Power Grid: Smart Grid versus Smart Attacks
  • Award number:
    1117314
  • Fiscal year:
    2011
  • Amount:
    $222,700
  • Award type:
    Continuing Grant
CAREER: AIS - An Integrated Optimization and Prediction Framework for Machine Intelligence based on Adaptive Dynamic Programming
  • Award number:
    1053717
  • Fiscal year:
    2011
  • Amount:
    $222,700
  • Award type:
    Standard Grant

Similar NSFC grants

Research on Quantum Field Theory without a Lagrangian Description
  • Award number:
    24ZR1403900
  • Year approved:
    2024
  • Amount:
    CNY 0
  • Award type:
    Provincial or municipal project
Cell Research
  • Award number:
    31224802
  • Year approved:
    2012
  • Amount:
    CNY 240,000
  • Award type:
    Special fund project
Cell Research
  • Award number:
    31024804
  • Year approved:
    2010
  • Amount:
    CNY 240,000
  • Award type:
    Special fund project
Cell Research
  • Award number:
    30824808
  • Year approved:
    2008
  • Amount:
    CNY 240,000
  • Award type:
    Special fund project
Research on the Rapid Growth Mechanism of KDP Crystal
  • Award number:
    10774081
  • Year approved:
    2007
  • Amount:
    CNY 450,000
  • Award type:
    General Program

Similar overseas grants

Collaborative Research: SII-NRDZ: SweepSpace: Enabling Autonomous Fine-Grained Spatial Spectrum Sensing and Sharing
  • Award number:
    2348589
  • Fiscal year:
    2024
  • Amount:
    $222,700
  • Award type:
    Standard Grant
Collaborative Research: NeTS: Medium: Black-box Optimization of White-box Networks: Online Learning for Autonomous Resource Management in NextG Wireless Networks
  • Award number:
    2312835
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Standard Grant
III: Medium: Collaborative Research: Integrating Large-Scale Machine Learning and Edge Computing for Collaborative Autonomous Vehicles
  • Award number:
    2348169
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Continuing Grant
CPS: Medium: Collaborative Research: Robust Sensing and Learning for Autonomous Driving Against Perceptual Illusion
  • Award number:
    2235231
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Standard Grant
Collaborative Research: CISE: Large: Integrated Networking, Edge System and AI Support for Resilient and Safety-Critical Tele-Operations of Autonomous Vehicles
  • Award number:
    2321531
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Continuing Grant
Collaborative Research: Data-Driven Microreaction Engineering by Autonomous Robotic Experimentation in Flow
  • Award number:
    2208489
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Standard Grant
Collaborative Research: NeTS: Medium: Black-box Optimization of White-box Networks: Online Learning for Autonomous Resource Management in NextG Wireless Networks
  • Award number:
    2312836
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Standard Grant
Collaborative Research: NeTS: Medium: Black-box Optimization of White-box Networks: Online Learning for Autonomous Resource Management in NextG Wireless Networks
  • Award number:
    2312834
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Standard Grant
Collaborative Research: CISE: Large: Integrated Networking, Edge System and AI Support for Resilient and Safety-Critical Tele-Operations of Autonomous Vehicles
  • Award number:
    2321532
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Continuing Grant
Collaborative Research: NeTS: Medium: Black-box Optimization of White-box Networks: Online Learning for Autonomous Resource Management in NextG Wireless Networks
  • Award number:
    2312833
  • Fiscal year:
    2023
  • Amount:
    $222,700
  • Award type:
    Standard Grant