S&AS: FND: COLLAB: Learning Manipulation Skills Using Deep Reinforcement Learning with Domain Transfer

S

基本信息

  • 批准号:
    1724191
  • 负责人:
  • 金额:
    $ 30万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2017
  • 资助国家:
    美国
  • 起止时间:
    2017-09-01 至 2022-08-31
  • 项目状态:
    已结题

项目摘要

This project develops new methods of using deep reinforcement learning to solve real world robotics problems. The project focuses on robotic manipulation tasks such as grasping, opening doors, helping out in the home, performing repairs aboard Navy ships, etc. The key operation in all of the above is the ability for the robot to reliably manipulate objects, parts, or tools with its hands in order to perform a task. The project leverages deep reinforcement learning: a new approach to robotic learning that is capable of learning both perceptual features and control policies simultaneously. This project could have important benefits for a variety of practical applications including: explosive ordnance disposal for our military, materials handling aboard Navy ships, dexterous robotic assistants for NASA astronauts in space, assistive technologies that could help seniors age in place longer, better capabilities for handling radioactive materials during nuclear cleanup, assistance for ergonomically challenging tasks in manufacturing, and general assistance in the office and the home.This research investigates novel deep reinforcement learning approaches for robotic grasping and manipulation that work well in previously unseen, unstructured environments and compose end-to-end tasks from simpler sub-task controllers. The research is built on two main results from research team's recent work, the deep learning approach to grasping and domain adaptation methods for deep neural networks. The research is guided by the following three key ideas: 1) learning in simulation and then using domain transfer techniques to adapt the solutions to reality; 2) simplifying learning for visuomotor control by using planning to estimate the value function; and 3) using symbolic task and motion planning to perform end-to-end tasks by sequencing learned controllers and planned arm/hand motions. The research team performs extensive evaluations to ensure that the system is able to perform novel instances of a task, e.g., those in a context that the robot has not seen before.
该项目开发了使用深度强化学习解决真实的世界机器人问题的新方法。该项目的重点是机器人操作任务,如抓取,开门,在家里帮忙,在海军舰艇上进行维修等,上述所有操作的关键是机器人能够可靠地操纵物体,零件或工具,以执行任务。该项目利用深度强化学习:一种新的机器人学习方法,能够同时学习感知特征和控制策略。该项目可能对各种实际应用产生重要的好处,包括:为我们的军队处理爆炸物,在海军舰艇上处理材料,为NASA宇航员在太空中提供灵巧的机器人助手,可以帮助老年人更长时间在原地老化的辅助技术,在核清理过程中处理放射性材料的更好能力,在制造业中协助人体工程学挑战性任务,这项研究调查了用于机器人抓取和操纵的新型深度强化学习方法,这些方法在以前看不见的非结构化环境中工作良好,并从更简单的子任务控制器组成端到端任务。该研究建立在研究团队最近工作的两个主要成果之上,即用于抓取的深度学习方法和用于深度神经网络的域自适应方法。该研究由以下三个关键思想指导:1)在模拟中学习,然后使用域转移技术来适应现实的解决方案; 2)通过使用规划来估计值函数,简化视觉控制的学习; 3)使用符号任务和运动规划来执行端到端的任务,通过排序学习的控制器和规划的手臂/手部运动。研究团队进行了广泛的评估,以确保系统能够执行任务的新实例,例如,那些机器人从未见过的场景。

项目成果

期刊论文数量(24)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Policy learning in SE (3) action spaces
SE (3) 行动空间中的政策学习
Learning discrete state abstractions with deep variational inference
  • DOI:
  • 发表时间:
    2020-03
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Ondrej Biza;Robert W. Platt;Jan-Willem van de Meent;Lawson L. S. Wong
  • 通讯作者:
    Ondrej Biza;Robert W. Platt;Jan-Willem van de Meent;Lawson L. S. Wong
Pick and Place Without Geometric Object Models
无需几何对象模型即可拾取和放置
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
  • DOI:
    10.48550/arxiv.2205.14292
  • 发表时间:
    2022-05
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Dian Wang;Colin Kohler;Xu Zhu;Ming Jia;Robert W. Platt
  • 通讯作者:
    Dian Wang;Colin Kohler;Xu Zhu;Ming Jia;Robert W. Platt
Learning 6-DoF Grasping and Pick-Place Using Attention Focus
  • DOI:
  • 发表时间:
    2018-06
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Marcus Gualtieri;Robert W. Platt
  • 通讯作者:
    Marcus Gualtieri;Robert W. Platt
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Robert Platt其他文献

The nature of essential hypertension.
原发性高血压的性质。
  • DOI:
  • 发表时间:
    1959
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Robert Platt
  • 通讯作者:
    Robert Platt
Coarticulation in Markov Decision Processes
马尔可夫决策过程中的协同表达
MIT Open Access Articles LQR-RRT*: Optimal sampling-based motion planning with automatically derived extension heuristics
麻省理工学院开放获取文章 LQR-RRT*:基于自动导出的扩展启发式的最佳基于采样的运动规划
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Alejandro Perez;Robert Platt;G. Konidaris;L. Kaelbling;Tomás Lozano
  • 通讯作者:
    Tomás Lozano
Improving Grasp Skills Using Schema Structured Learning
使用模式结构化学习提高掌握技能
  • DOI:
  • 发表时间:
    2006
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Robert Platt;R. Grupen;A. Fagg
  • 通讯作者:
    A. Fagg
Manipulation gaits: sequences of grasp control tasks
操纵步态:抓取控制任务的序列

Robert Platt的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Robert Platt', 18)}}的其他基金

FRR: Symmetric Policy Learning for Robotic Manipulation
FRR:机器人操作的对称策略学习
  • 批准号:
    2314182
  • 财政年份:
    2023
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
CHS: Medium: Collaborative Research: Manipulation Assistance for Activities of Daily Living in Everyday Environments
CHS:媒介:协作研究:日常环境中日常生活活动的操纵辅助
  • 批准号:
    1763878
  • 财政年份:
    2018
  • 资助金额:
    $ 30万
  • 项目类别:
    Continuing Grant
CAREER: Robotic Manipulation Using Deep Deictic Reinforcement Learning
职业:使用深度指示强化学习的机器人操作
  • 批准号:
    1750649
  • 财政年份:
    2018
  • 资助金额:
    $ 30万
  • 项目类别:
    Continuing Grant
S&AS: INT: COLLAB: Composable and Verifiable Design for Autonomous Humanoid Robots in Space Missions
S
  • 批准号:
    1724257
  • 财政年份:
    2017
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
NRI: Collaborative Research: Human-Supervised Perception and Grasping in Clutter
NRI:协作研究:人类监督的杂乱中的感知和抓取
  • 批准号:
    1427081
  • 财政年份:
    2014
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant

相似国自然基金

Novosphingobium sp. FND-3降解呋喃丹的分子机制研究
  • 批准号:
    31670112
  • 批准年份:
    2016
  • 资助金额:
    62.0 万元
  • 项目类别:
    面上项目

相似海外基金

S&AS: FND: COLLAB: Planning and Control of Heterogeneous Robot Teams for Ocean Monitoring
S
  • 批准号:
    2311967
  • 财政年份:
    2022
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
NRI: FND: COLLAB: RAPID: Targeted Sampling of an Unanticipated Harmful Algal Bloom in Lake Anna, Virginia with Aerial and Aquatic Robots
NRI:FND:协作:快速:利用空中和水上机器人对弗吉尼亚州安娜湖意外有害藻华进行有针对性的采样
  • 批准号:
    2001119
  • 财政年份:
    2020
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
NRI: FND: COLLAB: RAPID: Targeted Sampling of an Unanticipated Harmful Algal Bloom in Lake Anna, Virginia with Aerial and Aquatic Robots
NRI:FND:协作:快速:利用空中和水上机器人对弗吉尼亚州安娜湖意外有害藻华进行有针对性的采样
  • 批准号:
    2001120
  • 财政年份:
    2020
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
NRI: FND: COLLAB: RAPID: Targeted Sampling of an Unanticipated Harmful Algal Bloom in Lake Anna, Virginia with Aerial and Aquatic Robots
NRI:FND:协作:快速:利用空中和水上机器人对弗吉尼亚州安娜湖意外有害藻华进行有针对性的采样
  • 批准号:
    2001216
  • 财政年份:
    2020
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
S&AS: FND: COLLAB: Adaptable Vehicular Sensing and Control for Fleet-Oriented Systems in Smart Cities
S
  • 批准号:
    1849238
  • 财政年份:
    2019
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
S&AS: FND: COLLAB: Learning from Stories: Practical Value Alignment and Taskability for Autonomous Systems
S
  • 批准号:
    1849231
  • 财政年份:
    2019
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
NRI: FND: COLLAB: Design of dynamic multibehavioral robots: new tools to consider design tradeoff and enable more capable robotic systems
NRI:FND:COLLAB:动态多行为机器人的设计:考虑设计权衡并实现功能更强大的机器人系统的新工具
  • 批准号:
    1924723
  • 财政年份:
    2019
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
S&AS: FND: COLLAB: Adaptable Vehicular Sensing and Control for Fleet-Oriented Systems in Smart Cities
S
  • 批准号:
    1849246
  • 财政年份:
    2019
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
NRI: FND: COLLAB: An Open-Source Robotic Leg Platform that Lowers the Barrier for Advanced Prosthetics Research
NRI:FND:COLLAB:降低高级假肢研究障碍的开源机器人腿部平台
  • 批准号:
    1949346
  • 财政年份:
    2019
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
S&AS: FND: COLLAB: Learning from Stories: Practical Value Alignment and Taskability for Autonomous Systems
S
  • 批准号:
    1849262
  • 财政年份:
    2019
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了