权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

NRI: Collaborative Research: A Framework for Hierarchical, Probabilistic Planning and Learning

NRI：协作研究：分层、概率规划和学习的框架

基本信息

批准号：
1637614
负责人：
Stefanie Tellex
金额：
$ 54.27万
依托单位：
Brown University
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2016
资助国家：
美国
起止时间：
2016-09-01 至 2019-08-31
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1637614&HistoricalAwards=false
关键词：
NRI Collaborative Research Framework Hierarchical

项目摘要

This project is an effort to create a unified framework for solving very large problems with uncertain states and actions, such as manipulator robots acting in real-world environments. The results may have especially great promise for assistive technologies, including autonomous robots that can be used by elderly and disabled populations to aid them in their daily activities. The proposed integrated framework will represent, apply, and learn hierarchical domain knowledge, and will include the ability to transfer knowledge from simpler problems to more complex ones. The research will enable autonomous agents to develop a structured representation of complex domains based on experience. The agents will use learned representations to interpret natural language commands for both low-level and high-level requests. The technical focus is enabling tractable planning in large, uncertain domains by generating and leveraging probabilistic domain knowledge at multiple levels of abstraction. Agents will autonomously create layered representations in which the layers build on one another to produce complex behaviors. Agents will learn to perform useful behaviors, such as navigating using low-level sensor feedback or assembling complex objects such as a bridge or a table. The key technical contributions will be methods for (1) planning in large state/action spaces using the abstract object-oriented Markov decision process (AMDP) model, a new formalism for representing probabilistic domain knowledge at multiple levels of abstraction; (2) learning hierarchical task knowledge in the form of AMDPs; and (3) interpreting natural language commands at multiple levels of abstraction by mapping to the learned hierarchical structure. The formalism will be demonstrated and validated in several domains, including a simulated "cleanup" toy domain, challenging and complex video games, and a robot manipulation task.

这个项目致力于创建一个统一的框架来解决具有不确定状态和动作的非常大的问题，例如在真实世界环境中操作的机械手机器人。这一结果可能会对辅助技术带来特别大的希望，包括可以被老年人和残疾人使用的自主机器人，以帮助他们进行日常活动。拟议的集成框架将表示、应用和学习分层的领域知识，并将包括将知识从较简单的问题转移到较复杂的问题的能力。这项研究将使自主代理能够根据经验开发复杂领域的结构化表示。代理将使用学习的表示法来解释低级和高级请求的自然语言命令。技术重点是通过在多个抽象级别生成和利用概率领域知识，在大型、不确定的领域实现易于处理的规划。代理将自主地创建分层表示，其中的层相互构建以产生复杂的行为。代理将学习执行有用的行为，例如使用低级别传感器反馈导航或组装复杂对象，如桥梁或桌子。关键的技术贡献将是用于(1)使用抽象的面向对象的马尔可夫决策过程(AMDP)模型在大的状态/动作空间中进行规划的方法，该模型是在多个抽象层次上表示概率领域知识的新形式；(2)以AMDP的形式学习分层任务知识；以及(3)通过映射到所学习的分层结构来解释多个抽象层次上的自然语言命令。这种形式主义将在几个领域得到演示和验证，包括模拟的“清理”玩具领域，具有挑战性的复杂视频游戏，以及机器人操作任务。

项目成果

期刊论文数量（10）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Robots That Use Language

DOI：
10.1146/annurev-control-101119-071628
发表时间：
2020-05
期刊：
Annu. Rev. Control. Robotics Auton. Syst.
影响因子：
0
作者：
Stefanie Tellex;N. Gopalan;H. Kress-Gazit;Cynthia Matuszek
通讯作者：
Stefanie Tellex;N. Gopalan;H. Kress-Gazit;Cynthia Matuszek

Learning Approximate Stochastic Transition Models

DOI：
发表时间：
2017-10
期刊：
ArXiv
影响因子：
0
作者：
Pichao Wang;W. Li;Jun Wan;P. Ogunbona;Xinwang Liu
通讯作者：
Pichao Wang;W. Li;Jun Wan;P. Ogunbona;Xinwang Liu

The Expected-Length Model of Options

期权的预期长度模型

DOI：
发表时间：
2019
期刊：
IJCAI
影响因子：
0
作者：
David Abel*, John Winder*
通讯作者：
David Abel*, John Winder*

State Abstractions for Lifelong Reinforcement Learning

DOI：
发表时间：
2018-07
期刊：
影响因子：
0
作者：
David Abel;Dilip Arumugam;Lucas Lehnert;M. Littman
通讯作者：
David Abel;Dilip Arumugam;Lucas Lehnert;M. Littman

Planning with State Abstractions for Non-Markovian Task Specifications

非马尔可夫任务规范的状态抽象规划

DOI：
发表时间：
2019
期刊：
Robotics science and systems
影响因子：
0
作者：
Yoonseon Oh, Roma Patel
通讯作者：
Yoonseon Oh, Roma Patel

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Stefanie Tellex其他文献

Grounding Language Attributes to Objects using Bayesian Eigenobjects

使用贝叶斯特征对象将语言属性基础化为对象

DOI：
10.1109/iros40897.2019.8968603
发表时间：
2019
期刊：
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
影响因子：
0
作者：
Vanya Cohen;B. Burchfiel;Thao Nguyen;N. Gopalan;Stefanie Tellex;G. Konidaris
通讯作者：
G. Konidaris

Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop

与机器人的口语交互：研究问题和建议，美国国家科学基金会未来方向研讨会的报告

DOI：
发表时间：
2020
期刊：
Computer Speech and Language
影响因子：
4.3
作者：
M. Marge;C. Espy;Nigel G. Ward;A. Alwan;Yoav Artzi;Mohit Bansal;Gil;Blankenship;J. Chai;Hal Daumé;Debadeepta Dey;M. Harper;T. Howard;Casey;Kennington;Ivana Kruijff;Dinesh Manocha;Cynthia Matuszek;Ross Mead;Raymond;Mooney;Roger K. Moore;M. Ostendorf;Heather Pon;A. Rudnicky;Matthias;Scheutz;R. Amant;Tong Sun;Stefanie Tellex;D. Traum;Zhou Yu
通讯作者：
Zhou Yu

Advantages and Limitations of using Successor Features for Transfer in Reinforcement Learning

在强化学习中使用后继特征进行迁移的优点和局限性

DOI：
发表时间：
2017
期刊：
arXiv.org
影响因子：
0
作者：
Lucas Lehnert;Stefanie Tellex;M. Littman
通讯作者：
M. Littman

Time-Lapse Light Field Photography for Perceiving Non-Lambertian Scenes

用于感知非朗伯场景的延时光场摄影

DOI：
10.15607/rss.2017.xiii.026
发表时间：
2017
期刊：
Robotics: Science and Systems
影响因子：
0
作者：
John G. Oberlin;Stefanie Tellex
通讯作者：
Stefanie Tellex

Dialogue with Robots: Proposals for Broadening Participation and Research in the SLIVAR Community

与机器人对话：扩大 SLIVAR 社区参与和研究的提案

DOI：
发表时间：
2024
期刊：
arXiv.org
影响因子：
0
作者：
Casey Kennington;Malihe Alikhani;Heather Pon;Katherine Atwell;Yonatan Bisk;Daniel Fried;Felix Gervits;Zhao Han;Mert Inan;Michael Johnston;Raj Korpan;Diane Litman;M. Marge;Cynthia Matuszek;Ross Mead;Shiwali Mohan;Raymond Mooney;Natalie Parde;Jivko Sinapov;Angela Stewart;Matthew Stone;Stefanie Tellex;Tom Williams
通讯作者：
Tom Williams