权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

RINGS: A Deep Reinforcement Learning Enabled Large-scale UAV Network with Distributed Navigation, Mobility Control, and Resilience

RINGS：深度强化学习支持的大规模无人机网络，具有分布式导航、移动控制和弹性

基本信息

批准号：
2148253
负责人：
Yingbin Liang
金额：
$ 100万
依托单位：
Ohio State University
依托单位国家：
美国
项目类别：
Continuing Grant
财政年份：
2022
资助国家：
美国
起止时间：
2022-05-01 至 2025-04-30
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2148253&HistoricalAwards=false
关键词：
RINGS Deep Reinforcement Learning Enabled

项目摘要

Thanks to the significant technological advances in unmanned aerial vehicles (UAVs), the past few years have witnessed an explosive growth of UAVs in civilian and commercial applications. To ensure efficient and reliable auto-navigation and planning in these applications, there is an urgent need to develop the foundational technology that enables UAVs to have strong sensing, communications, and on-board computing capabilities, to adapt rapidly to the dynamically changing environment, to be resilient to extreme environmental conditions, and to be secure against data contamination and malicious attacks. The interdisciplinary nature of the proposed research will provide valuable research opportunities and hands-on projects for a diverse group of students.The goal of this project is to leverage and significantly advance the recent breakthroughs in NextG wireless communications, deep machine learning, hardware-aware model generation, and robust and trustworthy artificial intelligence, to enable the design of an intelligent and resilient UAV navigation and planning system. More specifically, this project will develop: (a) real-time communication assisted ambient sensing with multi-modality data fusion and machine learning assisted fast processing for global state tracking; (b) a multi-agent decentralized reinforcement learning (RL) framework with highly scalable computations and flexible latency tolerance; (c) deep learning based message passing for efficient communication and powerful hardware-aware neural architecture search for efficient on-board computation; and (d) comprehensive robustness and security design for system protection from outlier data, malicious poisoning attacks, and RL system attacks. The project will also conduct extensive performance evaluations to validate the developed approaches and algorithms.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

由于无人机（UAV）的重大技术进步，过去几年无人机在民用和商业应用中呈爆炸式增长。为了确保这些应用中高效可靠的自动导航和规划，迫切需要开发基础技术，使无人机具有强大的传感，通信和机载计算能力，快速适应动态变化的环境，对极端环境条件具有弹性，并防止数据污染和恶意攻击。建议研究的跨学科性质将为多样化的学生群体提供宝贵的研究机会和实践项目。该项目的目标是利用并显著推进NextG无线通信，深度机器学习，硬件感知模型生成以及强大且值得信赖的人工智能的最新突破，以实现智能和弹性无人机导航和规划系统的设计。更具体地说，该项目将开发：（a）实时通信辅助的环境传感，具有多模态数据融合和机器学习辅助的快速处理，用于全球状态跟踪;（B）多代理分散强化学习框架，具有高度可扩展的计算和灵活的延迟容忍度;（c）基于深度学习的消息传递，用于高效通信，以及强大的硬件感知神经架构搜索，用于高效的机载计算;以及（d）全面的鲁棒性和安全性设计，以保护系统免受离群数据、恶意中毒攻击和RL系统攻击。该项目还将进行广泛的性能评估，以验证开发的方法和算法。该奖项反映了NSF的法定使命，并被认为值得通过使用基金会的智力价值和更广泛的影响审查标准进行评估。

项目成果

期刊论文数量（20）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Provably Efficient Algorithm for Nonstationary Low-Rank MDPs

DOI：
10.48550/arxiv.2308.05471
发表时间：
2023-08
期刊：
ArXiv
影响因子：
0
作者：
Yuan Cheng;J. Yang;Yitao Liang
通讯作者：
Yuan Cheng;J. Yang;Yitao Liang

A near-optimal algorithm for safe reinforcement learning under instantaneous hard constraints

瞬时硬约束下安全强化学习的近乎最优算法

DOI：
发表时间：
2023
期刊：
Proc. International Conference on Machine Learning (ICML
影响因子：
0
作者：
Shi, Ming;Liang, Yingbin;Shroff, Ness.
通讯作者：
Shroff, Ness.

Non-stationary Reinforcement Learning under General Function Approximation

DOI：
10.48550/arxiv.2306.00861
发表时间：
2023-06
期刊：
影响因子：
0
作者：
Songtao Feng;Ming Yin;Ruiquan Huang;Yu-Xiang Wang;J. Yang;Yitao Liang
通讯作者：
Songtao Feng;Ming Yin;Ruiquan Huang;Yu-Xiang Wang;J. Yang;Yitao Liang

CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning

DOI：
10.48550/arxiv.2302.04782
发表时间：
2023-02
期刊：
ArXiv
影响因子：
0
作者：
Sheng Yue;Guan Wang;Wei Shao;Zhaofeng Zhang;Sen Lin;Junkai Ren;Junshan Zhang
通讯作者：
Sheng Yue;Guan Wang;Wei Shao;Zhaofeng Zhang;Sen Lin;Junkai Ren;Junshan Zhang

Provably efficient UCB-type algorithms for learning predictive state representation

用于学习预测状态表示的可证明有效的 UCB 型算法

DOI：
发表时间：
2024
期刊：
Proc. International Conference on Learning Representations (ICLR
影响因子：
0
作者：
Huang, Ruiquan;Liang, Yingbin;Yang, Jing.
通讯作者：
Yang, Jing.

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Yingbin Liang其他文献

A New Perspective of Proximal Gradient Algorithms

近端梯度算法的新视角

DOI：
发表时间：
2015
期刊：
arXiv: Optimization and Control
影响因子：
0
作者：
Yi Zhou;Yingbin Liang;Lixin Shen
通讯作者：
Lixin Shen

On the Equivalence of Two Achievable Regions for the Broadcast Channel

广播频道两个可达到区域的等效性

DOI：
10.1109/tit.2010.2090236
发表时间：
2011
期刊：
IEEE Transactions on Information Theory
影响因子：
2.5
作者：
Yingbin Liang;G. Kramer;H. Poor
通讯作者：
H. Poor

Capacity bounds for a class of cognitive interference channels with state

一类具有状态的认知干扰信道的容量界限

DOI：
10.1109/allerton.2011.6120223
发表时间：
2011
期刊：
2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton)
影响因子：
0
作者：
Ruchen Duan;Yingbin Liang
通讯作者：
Yingbin Liang