权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

S&AS: FND: COLLAB: Learning Manipulation Skills Using Deep Reinforcement Learning with Domain Transfer

基本信息

批准号：
1724191
负责人：
Robert Platt
金额：
$ 30万
依托单位：
Northeastern University
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2017
资助国家：
美国
起止时间：
2017-09-01 至 2022-08-31
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1724191&HistoricalAwards=false
关键词：
S&amp FND COLLAB Learning Manipulation

项目摘要

This project develops new methods of using deep reinforcement learning to solve real world robotics problems. The project focuses on robotic manipulation tasks such as grasping, opening doors, helping out in the home, performing repairs aboard Navy ships, etc. The key operation in all of the above is the ability for the robot to reliably manipulate objects, parts, or tools with its hands in order to perform a task. The project leverages deep reinforcement learning: a new approach to robotic learning that is capable of learning both perceptual features and control policies simultaneously. This project could have important benefits for a variety of practical applications including: explosive ordnance disposal for our military, materials handling aboard Navy ships, dexterous robotic assistants for NASA astronauts in space, assistive technologies that could help seniors age in place longer, better capabilities for handling radioactive materials during nuclear cleanup, assistance for ergonomically challenging tasks in manufacturing, and general assistance in the office and the home.This research investigates novel deep reinforcement learning approaches for robotic grasping and manipulation that work well in previously unseen, unstructured environments and compose end-to-end tasks from simpler sub-task controllers. The research is built on two main results from research team's recent work, the deep learning approach to grasping and domain adaptation methods for deep neural networks. The research is guided by the following three key ideas: 1) learning in simulation and then using domain transfer techniques to adapt the solutions to reality; 2) simplifying learning for visuomotor control by using planning to estimate the value function; and 3) using symbolic task and motion planning to perform end-to-end tasks by sequencing learned controllers and planned arm/hand motions. The research team performs extensive evaluations to ensure that the system is able to perform novel instances of a task, e.g., those in a context that the robot has not seen before.

该项目开发了使用深度强化学习解决真实的世界机器人问题的新方法。该项目的重点是机器人操作任务，如抓取，开门，在家里帮忙，在海军舰艇上进行维修等，上述所有操作的关键是机器人能够可靠地操纵物体，零件或工具，以执行任务。该项目利用深度强化学习：一种新的机器人学习方法，能够同时学习感知特征和控制策略。该项目可能对各种实际应用产生重要的好处，包括：为我们的军队处理爆炸物，在海军舰艇上处理材料，为NASA宇航员在太空中提供灵巧的机器人助手，可以帮助老年人更长时间在原地老化的辅助技术，在核清理过程中处理放射性材料的更好能力，在制造业中协助人体工程学挑战性任务，这项研究调查了用于机器人抓取和操纵的新型深度强化学习方法，这些方法在以前看不见的非结构化环境中工作良好，并从更简单的子任务控制器组成端到端任务。该研究建立在研究团队最近工作的两个主要成果之上，即用于抓取的深度学习方法和用于深度神经网络的域自适应方法。该研究由以下三个关键思想指导：1）在模拟中学习，然后使用域转移技术来适应现实的解决方案; 2）通过使用规划来估计值函数，简化视觉控制的学习; 3）使用符号任务和运动规划来执行端到端的任务，通过排序学习的控制器和规划的手臂/手部运动。研究团队进行了广泛的评估，以确保系统能够执行任务的新实例，例如，那些机器人从未见过的场景。

项目成果

期刊论文数量（24）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Policy learning in SE (3) action spaces

SE (3) 行动空间中的政策学习

DOI：
发表时间：
2020
期刊：
Proceedings of the Conference on Robot Learning
影响因子：
0
作者：
Wang, Dian;Kohler, Colin;Platt, Robert
通讯作者：
Platt, Robert

Learning discrete state abstractions with deep variational inference

DOI：
发表时间：
2020-03
期刊：
ArXiv
影响因子：
0
作者：
Ondrej Biza;Robert W. Platt;Jan-Willem van de Meent;Lawson L. S. Wong
通讯作者：
Ondrej Biza;Robert W. Platt;Jan-Willem van de Meent;Lawson L. S. Wong

Pick and Place Without Geometric Object Models

无需几何对象模型即可拾取和放置

DOI：
10.1109/icra.2018.8460553
发表时间：
2018
期刊：
Proceedings of 2018 IEEE International Conference on Robotics and Automation (ICRA
影响因子：
0
作者：
Gualtieri, Marcus;Pas, Andreas ten;Platt, Robert
通讯作者：
Platt, Robert

BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework

DOI：
10.48550/arxiv.2205.14292
发表时间：
2022-05
期刊：
ArXiv
影响因子：
0
作者：
Dian Wang;Colin Kohler;Xu Zhu;Ming Jia;Robert W. Platt
通讯作者：
Dian Wang;Colin Kohler;Xu Zhu;Ming Jia;Robert W. Platt

Learning 6-DoF Grasping and Pick-Place Using Attention Focus

DOI：
发表时间：
2018-06
期刊：
影响因子：
0
作者：
Marcus Gualtieri;Robert W. Platt
通讯作者：
Marcus Gualtieri;Robert W. Platt

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Robert Platt其他文献

The nature of essential hypertension.

原发性高血压的性质。

DOI：
发表时间：
1959
期刊：
The Lancet
影响因子：
0
作者：
Robert Platt
通讯作者：
Robert Platt

Coarticulation in Markov Decision Processes

马尔可夫决策过程中的协同表达

DOI：
发表时间：
2004
期刊：
Neural Information Processing Systems
影响因子：
0
作者：
Khashayar Rohanimanesh;Robert Platt;S. Mahadevan;R. Grupen
通讯作者：
R. Grupen

MIT Open Access Articles LQR-RRT*: Optimal sampling-based motion planning with automatically derived extension heuristics

麻省理工学院开放获取文章 LQR-RRT*：基于自动导出的扩展启发式的最佳基于采样的运动规划

DOI：
发表时间：
期刊：
影响因子：
0
作者：
Alejandro Perez;Robert Platt;G. Konidaris;L. Kaelbling;Tomás Lozano
通讯作者：
Tomás Lozano

Improving Grasp Skills Using Schema Structured Learning

使用模式结构化学习提高掌握技能

DOI：
发表时间：
2006
期刊：
影响因子：
0
作者：
Robert Platt;R. Grupen;A. Fagg
通讯作者：
A. Fagg

Manipulation gaits: sequences of grasp control tasks

操纵步态：抓取控制任务的序列

DOI：
10.1109/robot.2004.1307247
发表时间：
2004
期刊：
IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004
影响因子：
0
作者：
Robert Platt;A. Fagg;R. Grupen
通讯作者：
R. Grupen