权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Learning good representations for and with reinforcement learning

通过强化学习学习良好的表征

基本信息

批准号：
RGPIN-2017-06788
负责人：
Precup, Doina
金额：
$ 5.17万
依托单位：
McGill University
依托单位国家：
加拿大
项目类别：
Discovery Grants Program - Individual
财政年份：
2017
资助国家：
加拿大
起止时间：
2017-01-01 至 2018-12-31
项目状态：
已结题

来源：
https://www.nserc-crsng.gc.ca/ase-oro/Details-Detailles_eng.asp?id=635095
关键词：
Learning good representations reinforcement learning

项目摘要

Artificial intelligence (AI) has made great progress in isolating different aspects of intelligence and proposing flexible representations and powerful algorithms that lead to competence in specific tasks. For example, AI agents are better than humans at playing games like Go, a feat once considered impossible. However, the sort of flexible, robust, and autonomous competence routinely exhibited by humans, or even animals, remains elusive. The best AI systems are still tuned to specific problems. Our main research goal is to develop general AI methodology that relies, at its core, on reinforcement learning. Reinforcement learning is an approach to learning from interaction with an environment, inspired by animal learning theory. This proposal aims to design algorithms that can automatically create representations for reinforcement learning agents which allow them to model the world and to act at multiple time scales. We aim to provide new optimization criteria which describe formally what is a good set of abstract representations, provide gradient-based learning algorithms to learn such models, and demonstrate their effectiveness through empirical evaluations in simulated domains, game playing, as well as real time series prediction data sets. We will tackle the crucial problem of exploration, by explaining how an agent should move about its environment in order to optimize its learning speed. Finally, we will leverage these methods inside other algorithms that can benefit from multiple time scales, such as the training of deep, recurrent neural networks.

人工智能（AI）在隔离智能的不同方面取得了巨大进展，并提出了灵活的表示和强大的算法，从而能够胜任特定任务。例如，AI智能体比人类更擅长玩围棋等游戏，这曾经被认为是不可能的。然而，人类甚至动物通常表现出的那种灵活、健壮和自主的能力仍然难以捉摸。最好的人工智能系统仍然是针对特定问题而调整的。我们的主要研究目标是开发通用的人工智能方法，其核心是强化学习。强化学习是一种从与环境的交互中学习的方法，受到动物学习理论的启发。该提案旨在设计可以自动为强化学习代理创建表示的算法，这些代理允许它们对世界进行建模并在多个时间尺度上采取行动。我们的目标是提供新的优化标准，正式描述什么是一个很好的一套抽象的表示，提供基于梯度的学习算法来学习这样的模型，并证明其有效性，通过经验评估模拟域，游戏，以及真实的时间序列预测数据集。我们将解决探索的关键问题，通过解释代理应该如何在其环境中移动以优化其学习速度。最后，我们将在其他算法中利用这些方法，这些算法可以从多个时间尺度中受益，例如深度递归神经网络的训练。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Precup, Doina其他文献

Exploring uncertainty measures in deep networks for Multiple sclerosis lesion detection and segmentation

DOI：
10.1016/j.media.2019.101557
发表时间：
2020-01-01
期刊：
MEDICAL IMAGE ANALYSIS
影响因子：
10.9
作者：
Nair, Tanya;Precup, Doina;Arbel, Tal
通讯作者：
Arbel, Tal

An information-theoretic approach to curiosity-driven reinforcement learning

DOI：
10.1007/s12064-011-0142-z
发表时间：
2012-09-01
期刊：
THEORY IN BIOSCIENCES
影响因子：
1.1
作者：
Still, Susanne;Precup, Doina
通讯作者：
Precup, Doina

Fast reinforcement learning with generalized policy updates

DOI：
10.1073/pnas.1907370117
发表时间：
2020-12-01
期刊：
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
影响因子：
11.1
作者：
Barreto, Andre;Hou, Shaobo;Precup, Doina
通讯作者：
Precup, Doina