权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

CAREER: Learning Agents in Dynamic, Collaborative, and Adversarial Multiagent Environments

职业：动态、协作和对抗性多智能体环境中的学习智能体

基本信息

批准号：
0237699
负责人：
Peter Stone
金额：
$ 51.74万
依托单位：
University of Texas at Austin
依托单位国家：
美国
项目类别：
Continuing Grant
财政年份：
2003
资助国家：
美国
起止时间：
2003-02-01 至 2009-01-31
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=0237699&HistoricalAwards=false
关键词：
CAREER Learning Agents Dynamic Collaborative

项目摘要

This project aims to enable multiple intelligent agents to learn to act both individually and in coordination with one another towards individual and/or common goals in real-time, noisy, collaborative and adversarial environments. The approach taken will be to study complete agents in specific, complex environments, with the goal of drawing general lessons from the specific implementations. Fundamental research will be conducted in four main areas. First, multiagent reinforcement learning will be scaled up to handle larger and more complex problems than has been previously possible. Second, new state representations suitable for learning will be proposed andtested. Third, game theoretic approaches to improving agent performance by predicting the responses of other agents will be investigated. Fourth, strategies for learning autonomous biddingagents will be developed and tested. Application domains will include: robotic soccer, both in simulation and with real robots; and autonomous bidding agents in multiple realistic scenarios.The rich simulation environments to be used for this research are ideal substrates for teaching students about complete intelligent agents, including perception, cognition, and action. The educational goals of this project include leveraging the appeal of these domains to students into challenging, exciting, and instructive undergraduate and graduate courses.

该项目旨在使多个智能代理能够学习在实时，嘈杂，协作和对抗环境中单独和相互协调地对个人和/或共同目标采取行动。所采取的方法将是在特定的复杂环境中研究完整的代理，目的是从具体的实现中吸取一般的经验教训。基础研究将在四个主要领域进行。首先，多智能体强化学习将扩大规模，以处理比以前更大、更复杂的问题。第二，新的状态表示适合学习将提出和测试。第三，将研究通过预测其他代理的响应来提高代理性能的博弈论方法。第四，学习自主biddingagents的策略将被开发和测试。应用领域将包括：机器人足球，无论是在模拟和与真实的机器人;和自主投标代理在多个现实scenaries.The丰富的模拟环境，用于这项研究是理想的基板，教学生完整的智能代理，包括感知，认知和行动。该项目的教育目标包括利用这些领域对学生的吸引力，使其成为具有挑战性，令人兴奋和指导性的本科生和研究生课程。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Peter Stone其他文献

Composing Efficient, Robust Tests for Policy Selection

为策略选择编写高效、稳健的测试

DOI：
10.48550/arxiv.2306.07372
发表时间：
2023
期刊：
ArXiv
影响因子：
0
作者：
Dustin Morrill;Thomas J. Walsh;D. Hernández;Peter R. Wurman;Peter Stone
通讯作者：
Peter Stone

Is yoghurt an acceptable alternative to raw milk for reducing eczema and allergy in infancy?

酸奶是否是生奶的可接受替代品，可以减少婴儿期的湿疹和过敏？

DOI：
10.1111/cea.13121
发表时间：
2018
期刊：
Clinical and Experimental Allergy
影响因子：
6.1
作者：
Julian Crane;C. Barthow;Edwin A. Mitchell;T. Stanley;Gordon Purdie;Judy Rowden;Janice Kang;Fiona Hood;Phillipa Barnes;P. Fitzharris;Robyn Maude;Peter Stone;Rinki Murphy;K. Wickens
通讯作者：
K. Wickens