Career: IIS: RI: Improving Multi-Agent Reinforcement Learning for Cooperative, Partially Observable Settings

职业:IIS:RI:改进合作、部分可观察设置的多智能体强化学习

基本信息

  • 批准号:
    2044993
  • 负责人:
  • 金额:
    $ 55万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2021
  • 资助国家:
    美国
  • 起止时间:
    2021-03-01 至 2026-02-28
  • 项目状态:
    未结题

项目摘要

As intelligent systems become more prevalent, these systems will need to coordinate with each other (e.g., apps, robots, autonomous cars), resulting in multi-agent systems. Allowing multi-agent systems to learn will let them operate in more complex and realistic scenarios by adapting their behavior to fit specific needs. Reinforcement learning is a promising form of trial-and-error learning that has the potential to drastically improve outcomes in many multi-agent domains (e.g., warehouses, delivery), but new methods are required for coordinating the agents in realistic domains with noisy and limited communication and sensing (i.e., partial observability). This project will develop these new reinforcement learning methods for coordinating teams of agents in various partially observable settings. The results will impact the development of future artificial intelligence (AI) and robotic systems and will be conveyed through outreach and educational activities.This project will develop a number of novel methods for cooperative multi-agent reinforcement learning (MARL) under partial observability. MARL, the extension of reinforcement learning methods for multi-agent domains, has gained popularity for generating high-quality solutions in some domains, but more work is needed to make the methods more scalable and widely applicable. Therefore, this project will first provide a better theoretical understanding of centralized training for decentralized execution methods. Centralized training for decentralized execution is the dominant paradigm in MARL where agents are trained offline and only executed online. The project will then develop new centralized training methods that are unbiased, scalable and perform well in a wide range of domains. Second, the project will develop online decentralized learning methods that allow agents to learn online even in noisy multi-agent settings. Lastly, to allow agents to learn and execute in an asynchronous manner, the project will develop methods for asynchronous MARL as well as asynchronous hierarchical learning with learning over multiple layers of a hierarchy. The resulting methods will significantly improve performance, stability and scalability of MARL methods and make them more generally applicable to large realistic domains.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
随着智能系统变得越来越普遍,这些系统将需要相互协调(例如应用程序、机器人、自动驾驶汽车),从而产生多智能体系统。允许多智能体系统学习将使它们通过调整其行为以满足特定需求来在更复杂和现实的场景中运行。强化学习是一种有前途的试错学习形式,有可能极大地改善许多多智能体领域(例如仓库、交付)的结果,但需要新的方法来协调具有嘈杂且有限的通信和感知(即部分可观察性)的现实领域中的智能体。该项目将开发这些新的强化学习方法,用于在各种部分可观察的环境中协调代理团队。 研究结果将影响未来人工智能(AI)和机器人系统的发展,并将通过外展和教育活动进行传达。该项目将开发一些在部分可观测性下进行协作多智能体强化学习(MARL)的新颖方法。 MARL 是多智能体领域强化学习方法的扩展,由于在某些领域生成高质量的解决方案而受到欢迎,但需要做更多的工作来使该方法更具可扩展性和更广泛的适用性。因此,这个项目首先会为去中心化执行方法的集中训练提供更好的理论理解。分散执行的集中训练是 MARL 中的主导范例,其中代理离线训练,仅在线执行。然后,该项目将开发新的集中培训方法,这些方法是公正的、可扩展的并且在广泛的领域中表现良好。 其次,该项目将开发在线分散学习方法,使智能体即使在嘈杂的多智能体环境中也能在线学习。最后,为了允许代理以异步方式学习和执行,该项目将开发异步 MARL 以及异步分层学习的方法,并在层次结构的多个层上进行学习。由此产生的方法将显着提高 MARL 方法的性能、稳定性和可扩展性,并使它们更普遍地适用于大型现实领域。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
On Centralized Critics in Multi-Agent Reinforcement Learning
  • DOI:
    10.1613/jair.1.14386
  • 发表时间:
    2023-05
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Xueguang Lyu;Andrea Baisero;Yuchen Xiao;Brett Daley;Chris Amato
  • 通讯作者:
    Xueguang Lyu;Andrea Baisero;Yuchen Xiao;Brett Daley;Chris Amato
Shield Decentralization for Safe Multi-Agent Reinforcement Learning
用于安全多智能体强化学习的屏蔽去中心化
Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning
A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
  • DOI:
    10.1609/aaai.v36i9.21171
  • 发表时间:
    2022-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Xueguang Lyu;Andrea Baisero;Yuchen Xiao;Chris Amato
  • 通讯作者:
    Xueguang Lyu;Andrea Baisero;Yuchen Xiao;Chris Amato
Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
  • DOI:
    10.48550/arxiv.2209.10113
  • 发表时间:
    2022-09
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Yuchen Xiao;Weihao Tan;Chris Amato
  • 通讯作者:
    Yuchen Xiao;Weihao Tan;Chris Amato
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Christopher Amato其他文献

(A Partial Survey of) Decentralized, Cooperative Multi-Agent Reinforcement Learning
  • DOI:
    10.48550/arxiv.2405.06161
  • 发表时间:
    2024-05
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Christopher Amato
  • 通讯作者:
    Christopher Amato

Christopher Amato的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Christopher Amato', 18)}}的其他基金

NRI: FND: Coordinating and Incorporating Trust in Teams of Humans and Robots with Multi-Robot Reinforcement Learning
NRI:FND:通过多机器人强化学习协调和整合人类和机器人团队的信任
  • 批准号:
    2024790
  • 财政年份:
    2020
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
Doctoral Mentoring Consortium at the Nineteenth International Conference on Autonomous Agents and Multi-Agent Systems
第十九届自主代理和多代理系统国际会议博士生导师联盟
  • 批准号:
    2002606
  • 财政年份:
    2020
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
NSF-BSF: RI: Small: Decentralized Active Goal Recognition
NSF-BSF:RI:小型:去中心化主动目标识别
  • 批准号:
    1816382
  • 财政年份:
    2018
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
NRI: FND: COLLAB: Coordinating Human-Robot Teams in Uncertain Environments
NRI:FND:COLLAB:在不确定环境中协调人机团队
  • 批准号:
    1734497
  • 财政年份:
    2017
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
CRII: RI: Planning and learning with macro-actions in cooperative multiagent systems
CRII:RI:协作多智能体系统中宏观行动的规划和学习
  • 批准号:
    1664923
  • 财政年份:
    2016
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
CRII: RI: Planning and learning with macro-actions in cooperative multiagent systems
CRII:RI:协作多智能体系统中宏观行动的规划和学习
  • 批准号:
    1463945
  • 财政年份:
    2015
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant

相似国自然基金

基于IIS/TOR信号途径探究蜂王浆外泌体lncRNA调控西方蜜蜂级型分化的分子机制
  • 批准号:
    32302811
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
IIS/FoxO通路调控Argopecten属扇贝寿命的分子机制
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    56 万元
  • 项目类别:
    面上项目
IIS/TOR通路调控蜜蜂工蜂生殖发育的分子机制研究
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
以IIS通路为中心的Zn2+胁迫对黑水虻幼虫生长影响的氧化应激机制研究
  • 批准号:
    31860618
  • 批准年份:
    2018
  • 资助金额:
    39.0 万元
  • 项目类别:
    地区科学基金项目
基于IIS通路探讨六味地黄丸“三补”药组对AD模型秀丽线虫的神经保护作用
  • 批准号:
    81704132
  • 批准年份:
    2017
  • 资助金额:
    20.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Collaborative Research: CISE-MSI: DP: IIS RI: Research Capacity Expansion via Development of AI Based Algorithms for Optimal Management of Electric Vehicle Transactions with Grid
合作研究:CISE-MSI:DP:IIS RI:通过开发基于人工智能的算法来扩展研究能力,以实现电动汽车与电网交易的优化管理
  • 批准号:
    2318611
  • 财政年份:
    2023
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
Collaborative Research: CISE-MSI: DP: IIS RI: Research Capacity Expansion via Development of AI Based Algorithms for Optimal Management of Electric Vehicle Transactions with Grid
合作研究:CISE-MSI:DP:IIS RI:通过开发基于人工智能的算法来扩展研究能力,以实现电动汽车与电网交易的优化管理
  • 批准号:
    2318612
  • 财政年份:
    2023
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
Collaborative Research: IIS: RI: Medium: Lifelong learning with hyper dimensional computing
协作研究:IIS:RI:中:超维计算的终身学习
  • 批准号:
    2211387
  • 财政年份:
    2022
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
Collaborative Research: IIS: RI: Medium: Lifelong learning with hyper dimensional computing
协作研究:IIS:RI:中:超维计算的终身学习
  • 批准号:
    2211386
  • 财政年份:
    2022
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
CAREER: IIS: RI: Foundations of Deep Neural Network Robustness and Efficiency
职业:IIS:RI:深度神经网络鲁棒性和效率的基础
  • 批准号:
    2144960
  • 财政年份:
    2022
  • 资助金额:
    $ 55万
  • 项目类别:
    Continuing Grant
Collaborative Research: CISE-MSI: RCBP-RF: IIS-RI: Analytically-based frameworks for AI model verification and improvement in cyber-physical systems
合作研究:CISE-MSI:RCBP-RF:IIS-RI:基于分析的人工智能模型验证和网络物理系统改进框架
  • 批准号:
    2130990
  • 财政年份:
    2021
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
Collaborative Research: CISE-MSI: RCBP-RF: IIS-RI: Analytically-based frameworks for AI model verification and improvement in cyber-physical systems
合作研究:CISE-MSI:RCBP-RF:IIS-RI:基于分析的人工智能模型验证和网络物理系统改进框架
  • 批准号:
    2131001
  • 财政年份:
    2021
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
IIS: RI: Travel Proposal: Student Travel Support for the 2019 Association for Computational Linguistics Student Research Workshop
IIS:RI:旅行提案:2019 年计算语言学协会学生研究研讨会的学生旅行支持
  • 批准号:
    1929269
  • 财政年份:
    2019
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
IIS-RI: International Conference on Automated Planning and Scheduling (ICAPS) 2017 Doctoral Consortium Travel Awards
IIS-RI:国际自动化规划与调度会议 (ICAPS) 2017 博士联盟旅行奖
  • 批准号:
    1745800
  • 财政年份:
    2017
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
IIS-RI: ICAPS 2016 Doctoral Consortium Travel Awards
IIS-RI:ICAPS 2016 博士联盟旅行奖
  • 批准号:
    1630144
  • 财政年份:
    2016
  • 资助金额:
    $ 55万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了