Prediction and Planning: Bridging the Gap
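Basic information
- Award number: 0209088
- Principal investigator:
- Amount: $291,700
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2002
- Funding country: United States
- Project period: 2002-09-01 to 2006-08-31
- Project status: Completed
- Source:
- Keywords:
Project abstract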
This project is fundamental research to improve the performance of intelligent software agents, based on the observation that an agent's past experiences are a valuable and generally underutilized database. The goal is to produce algorithms that make stronger use of data than existing reinforcement learning algorithms, enabling a view of the agent's stored experiences as a repository that can be mined for performance-improving information. More generally, the agent may choose to use data obtained by observing other agents, or even from mining the web.
The impact of this research may be felt in many areas. For example, software learning agents can be expected to learn in a much more human-like manner; noteworthy experiences will be remembered, and their influence on future performance will not attenuate. There will be no sampling requirements on the data, so it will be possible to learn from watching others and possible to use repositories of stored data to learn new behaviors. Among the likely practical applications of this work are network management and electronic commerce.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Ronald Parr
Amazing Things Come From Having Many Good Models
- DOI:
- Publication date:
- Journal:
- Impact factor: 0
- Authors: Cynthia Rudin; Chudi Zhong; Lesia Semenova; Margo Seltzer; Ronald Parr; Jiachang Liu; Srikar Katta; Jon Donnelly; Harry Chen; Zachery Boner
- Corresponding author: Zachery Boner
An Optimal Tightness Bound for the Simulation Lemma
- DOI:
- Publication date: 2024
- Journal:
- Impact factor: 0
- Authors: Sam Lobel; Ronald Parr
- Corresponding author: Ronald Parr
Other grants by Ronald Parr
RI: Small: Feature Encoding for Reinforcement Learning
- Award number: 1815300
- Fiscal year: 2018
- Funding amount: $291,700
- Project type: Continuing Grant
EAGER: Collaborative Research: An Unified Learnable Roadmap for Sequential Decision Making in Relational Domains
- Award number: 1836575
- Fiscal year: 2018
- Funding amount: $291,700
- Project type: Standard Grant
RI: Small: Non-parametric Approximate Dynamic Programming for Continuous Domains
- Award number: 1218931
- Fiscal year: 2012
- Funding amount: $291,700
- Project type: Standard Grant
EAGER: IIS: RI: Learning in Continuous and High Dimensional Action Spaces
- Award number: 1147641
- Fiscal year: 2011
- Funding amount: $291,700
- Project type: Standard Grant
Collaborative: RI: Feature Discovery and Benchmarks for Exportable Reinforcement Learning
- Award number: 0713435
- Fiscal year: 2007
- Funding amount: $291,700
- Project type: Standard Grant
CAREER: Observing to Plan - Planning to Observe
- Award number: 0546709
- Fiscal year: 2006
- Funding amount: $291,700
- Project type: Continuing Grant
Similar overseas grants
HoloSurge: Multimodal 3D Holographic tool and real-time Guidance System with point-of-care diagnostics for surgical planning and interventions on liver and pancreatic cancers
- Award number: 10103131
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: EU-Funded
Planning Grant: Developing capacity to attract diverse students to the geosciences: A public relations framework
- Award number: 2326816
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
Planning: FIRE-PLAN: Building Wildland Fire Science Capacity in Alaska Through The University of Alaska Fairbanks Rural Campuses
- Award number: 2333423
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
Collaborative Research: Planning: FIRE-PLAN:High-Spatiotemporal-Resolution Sensing and Digital Twin to Advance Wildland Fire Science
- Award number: 2335568
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
Collaborative Research: Planning: FIRE-PLAN:High-Spatiotemporal-Resolution Sensing and Digital Twin to Advance Wildland Fire Science
- Award number: 2335569
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
Planning: FIRE-PLAN: Exploring fire as medicine to revitalize cultural burning in the Upper Midwest
- Award number: 2349282
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
CC* Planning: Strengthening Central Michigan University's Cyberinfrastructure
- Award number: 2345749
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
CAREER: Statistical Power Analysis and Optimal Sample Size Planning for Longitudinal Studies in STEM Education
- Award number: 2339353
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Continuing Grant
Planning: Advancing Discovery on a Sustainable National Research Enterprise
- Award number: 2412406
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant
Planning: Artificial Intelligence Assisted High-Performance Parallel Computing for Power System Optimization
- Award number: 2414141
- Fiscal year: 2024
- Funding amount: $291,700
- Project type: Standard Grant