Reinforcement Learning Algorithms Based on Dynamic Programming

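Basic information

Award No.: 9214866
Principal investigator: Andrew Barto
Award amount: $316,300
Institution: University of Massachusetts Amherst
Institution country: United States
Award type: Continuing Grant
Fiscal year: 1992
Funding country: United States
Duration: 1992-09-15 to 1997-02-28
Status: Completed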

Source: https://www.nsf.gov/awardsearch/showAward?AWD_ID=9214866&HistoricalAwards=false
Keywords: Reinforcement Learning Algorithms Based Dynamic

Project abstract

This project will investigate aspects of a class of reinforcement learning algorithms based on dynamic programming (DP). Although these algorithms have been widely studied and have been experimented with in many applications, their theory is not developed enough to permit a clear understanding of the classes of problems for which they may be the methods of choice, or to guide their application. Research at the University of Massachusetts has made considerable recent progress in relating these methods to the most closely related conventional methods and in understanding the factors that influence their performance, both successful and unsuccessful. These methods may provide the only computationally feasible approaches to very large and analytically intractable sequential decision problems. The objectives of this project are: 1) to continue development of DP-based reinforcement learning methods and their theory, 2) to investigate their computational complexity, and 3) to define the characteristics of problems for which they are best suited.
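For readers unfamiliar with the dynamic programming computation that such methods build on, the following is a minimal sketch of classical value iteration on a tiny sequential decision problem. It is an illustration only, not the project's algorithm: the three-state problem `TOY_MDP`, the function name `value_iteration`, and the discount factor are all hypothetical choices made for this example.

```python
from typing import Dict, List, Tuple

# transitions[state][action] = list of (probability, next_state, reward)
Transitions = Dict[int, Dict[str, List[Tuple[float, int, float]]]]


def value_iteration(
    transitions: Transitions,
    gamma: float = 0.9,
    tol: float = 1e-8,
    max_sweeps: int = 10_000,
) -> Dict[int, float]:
    """Repeatedly apply the Bellman optimality backup until values stop changing."""
    values = {state: 0.0 for state in transitions}
    for _ in range(max_sweeps):
        delta = 0.0
        for state, actions in transitions.items():
            if not actions:  # terminal state: its value stays at 0
                continue
            # Best expected one-step return over all available actions.
            best = max(
                sum(p * (r + gamma * values[nxt]) for p, nxt, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:
            break
    return values


# Hypothetical chain: state 0 -> state 1 -> state 2 (terminal).
# Only the final "go" step yields a reward of 1.
TOY_MDP: Transitions = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(1.0, 1, 0.0)]},
    1: {"stay": [(1.0, 1, 0.0)], "go": [(1.0, 2, 1.0)]},
    2: {},
}

if __name__ == "__main__":
    print(value_iteration(TOY_MDP))
```

Value iteration sweeps over every state and needs the full transition model, which is what becomes infeasible for the "very large and analytically intractable" problems the abstract mentions. DP-based reinforcement learning methods are commonly described as performing similar backups incrementally from sampled experience instead of from a complete model.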

Project outcomes

Journal articles: 0
Monographs: 0
Research awards: 0
Conference papers: 0
Patents: 0

Other grants by Andrew Barto

CRCNS: Collaborative Research: Neural Correlates of Hierarchical Reinforcement Learning
Award No.: 1208051
Fiscal year: 2012
Award amount: $316,300
Award type: Continuing Grant

NRI-Small: Collaborative Research: Multiple Task Learning from Unstructured Demonstrations
Award No.: 1208497
Fiscal year: 2012
Award amount: $316,300
Award type: Standard Grant

SGER: Building Blocks for Creative Search
Award No.: 0733581
Fiscal year: 2007
Award amount: $316,300
Award type: Standard Grant

Collaborative Research: Intrinsically Motivated Learning in Artificial Agents
Award No.: 0432143
Fiscal year: 2004
Award amount: $316,300
Award type: Continuing Grant

Dynamic Abstraction in Reinforcement Learning
Award No.: 0218125
Fiscal year: 2002
Award amount: $316,300
Award type: Continuing Grant

Lyapunov Methods for Reinforcement Learning
Award No.: 0070102
Fiscal year: 2000
Award amount: $316,300
Award type: Standard Grant

KDI: Temporal Abstraction in Reinforcement Learning
Award No.: 9980062
Fiscal year: 1999
Award amount: $316,300
Award type: Standard Grant

Multiple Time Scale Reinforcement Learning
Award No.: 9511805
Fiscal year: 1995
Award amount: $316,300
Award type: Continuing Grant

Neural Networks for Adaptive Control
Award No.: 8912623
Fiscal year: 1989
Award amount: $316,300
Award type: Continuing Grant

Conference on the Neurone as a Computational Unit, June 28--July 1, 1988, King's College, Cambridge, England
Award No.: 8808758
Fiscal year: 1988
Award amount: $316,300
Award type: Standard Grant

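Similar NSFC grants

Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
Award No.: not listed
Approval year: 2024
Award amount: not listed
Award type: Collaborative Innovation Research Team

Understanding structural evolution of galaxies with machine learning
Award No.: n/a
Approval year: 2022
Award amount: CNY 100,000
Award type: Provincial/municipal project

Constrained dynamic multi-objective Q-learning evolutionary allocation of human-machine hybrid crowdsensing tasks for coal mine safety
Award No.: not listed
Approval year: 2022
Award amount: CNY 300,000
Award type: Young Scientists Fund

Short-horizon online Q-learning cooperative control mechanism for intelligent munition formations, accounting for leader failure
Award No.: 62003314
Approval year: 2020
Award amount: CNY 240,000
Award type: Young Scientists Fund

Research on e-learning resource recommendation methods integrating contextual tensor factorization
Award No.: 61902016
Approval year: 2019
Award amount: CNY 240,000
Award type: Young Scientists Fund

Research on Spiking-Transfer learning methods with temporal transfer capability
Award No.: 61806040
Approval year: 2018
Award amount: CNY 200,000
Award type: Young Scientists Fund

Research on deep-learning-based dynamic identification technology for glacier monitoring in the Sanjiangyuan region
Award No.: 51769027
Approval year: 2017
Award amount: CNY 380,000
Award type: Regional Science Fund

Research on Spiking-Deep Learning methods with temporal processing capability
Award No.: 61573081
Approval year: 2015
Award amount: CNY 640,000
Award type: General Program

Automatic generation and optimization of large-scale personalized e-learning process models based on directed hypergraphs
Award No.: 61572533
Approval year: 2015
Award amount: CNY 660,000
Award type: General Program

Research on learner emotion compensation methods in E-Learning
Award No.: 61402392
Approval year: 2014
Award amount: CNY 260,000
Award type: Young Scientists Fund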

Similar overseas grants

CAREER: Robust Reinforcement Learning Under Model Uncertainty: Algorithms and Fundamental Limits
Award No.: 2337375
Fiscal year: 2024
Award amount: $316,300
Award type: Continuing Grant

Collaborative Research: SLES: Safe Distributional-Reinforcement Learning-Enabled Systems: Theories, Algorithms, and Experiments
Award No.: 2331781
Fiscal year: 2023
Award amount: $316,300
Award type: Standard Grant

CIF: SMALL: Theoretical Foundations of Partially Observable Reinforcement Learning: Minimax Sample Complexity and Provably Efficient Algorithms
Award No.: 2315725
Fiscal year: 2023
Award amount: $316,300
Award type: Standard Grant

CAREER: Reinforcement Learning-Based Control of Heterogeneous Multi-Agent Systems in Structured Environments: Algorithms and Complexity
Award No.: 2237830
Fiscal year: 2023
Award amount: $316,300
Award type: Continuing Grant

Collaborative Research: SLES: Safe Distributional-Reinforcement Learning-Enabled Systems: Theories, Algorithms, and Experiments
Award No.: 2331780
Fiscal year: 2023
Award amount: $316,300
Award type: Standard Grant

Collaborative Research: SLES: Safe Distributional-Reinforcement Learning-Enabled Systems: Theories, Algorithms, and Experiments
Award No.: 2331782
Fiscal year: 2023
Award amount: $316,300
Award type: Standard Grant

Theory and Algorithms for Relation between Stochastic Control and Reinforcement Learning
Award No.: 2741077
Fiscal year: 2022
Award amount: $316,300
Award type: Studentship

Reinforcement Learning Algorithms Designed to Persist
Award No.: RGPIN-2022-04035
Fiscal year: 2022
Award amount: $316,300
Award type: Discovery Grants Program - Individual

Developing robust and scalable reinforcement learning algorithms
Award No.: 2740739
Fiscal year: 2022
Award amount: $316,300
Award type: Studentship

Parameter-free Algorithms for Reinforcement Learning
Award No.: 558512-2021
Fiscal year: 2022
Award amount: $316,300
Award type: Alexander Graham Bell Canada Graduate Scholarships - Doctoral
