权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

RI: Medium: Learning to Map and Navigate with Vision and Language

RI：媒介：学习用视觉和语言绘制地图和导航

基本信息

批准号：
2212433
负责人：
Kostas Daniilidis
金额：
$ 120万
依托单位：
University of Pennsylvania
依托单位国家：
美国
项目类别：
Continuing Grant
财政年份：
2022
资助国家：
美国
起止时间：
2022-09-01 至 2026-08-31
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2212433&HistoricalAwards=false
关键词：
RI Medium Learning Map Navigate

项目摘要

This project aims to advance the state of the art in robotic mapping and navigation by enabling spatial understanding using semantic maps and spatial reasoning for following language instructions given only visual inputs. Current performance in those tasks is low because of the inability to ground semantic entities and instructions spatially. Instead of grounding semantics to images, spatial understanding and navigation can be achieved if a system uses maps as an intermediate representation, as also indicated by behavioral and neural findings in spatial cognition. Building a map of an unseen space without exhaustive exploration can be learned, and this process can be facilitated by cross-modal language-vision attentional mechanisms. The project will integrate research with education and outreach underrepresented groups in Philadelphia neighborhoods as a target broadening the participation.This research is centered around understanding how vision and language interact to create better spatial representations like maps and facilitate navigation. The project will approach the vision-language from three angles. (i) How robots can learn to predict a map when entering an unseen environment using active learning. (ii) How navigation instructions can be encoded into spatial configuration schemata and navigational concepts that can be better aligned to maps and paths than raw language embeddings, and (iii) how navigational language representations can facilitate the creation of maps in unseen environments, and how one can follow instructions by using maps and language to create paths to follow.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

该项目旨在通过使用语义地图和空间推理来实现空间理解，以遵循仅给出视觉输入的语言指令，从而推进机器人测绘和导航的最新技术。由于无法在空间上定位语义实体和指令，这些任务的当前性能较低。如果系统使用地图作为中间表示，则可以实现空间理解和导航，而不是将语义扎根于图像，空间认知中的行为和神经发现也表明了这一点。无需进行详尽的探索即可构建看不见的空间地图，并且可以通过跨模态语言视觉注意力机制来促进这一过程。该项目将把研究与教育和推广费城社区中代表性不足的群体结合起来，作为扩大参与的目标。这项研究的重点是了解视觉和语言如何相互作用，以创建更好的空间表示（如地图）并促进导航。该项目将从三个角度探讨视觉语言。 (i) 机器人如何使用主动学习在进入看不见的环境时学习预测地图。 (ii) 如何将导航指令编码为空间配置模式和导航概念，从而比原始语言嵌入更好地与地图和路径对齐；(iii) 导航语言表示如何促进在看不见的环境中创建地图，以及如何通过使用地图和语言创建可遵循的路径来遵循指令。该奖项反映了 NSF 的法定使命，并已被通过使用基金会的智力优点和更广泛的影响审查标准进行评估，认为值得支持。

项目成果

期刊论文数量（1）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Cross-modal Map Learning for Vision and Language Navigation

DOI：
10.1109/cvpr52688.2022.01502
发表时间：
2022-03
期刊：
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
影响因子：
0
作者：
G. Georgakis;Karl Schmeckpeper;Karan Wanchoo;Soham Dan;E. Miltsakaki;D. Roth;Kostas Daniilidis
通讯作者：
G. Georgakis;Karl Schmeckpeper;Karan Wanchoo;Soham Dan;E. Miltsakaki;D. Roth;Kostas Daniilidis

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Kostas Daniilidis其他文献

Perception-Driven Curiosity with Bayesian Surprise

感知驱动的好奇心与贝叶斯惊喜

DOI：
发表时间：
2019
期刊：
影响因子：
0
作者：
Bernadette Bucher;Anton Arapin;Ramanan Sekar;M. Badger;Feifei Duan;Oleh Rybkin;Kostas Daniilidis
通讯作者：
Kostas Daniilidis

Technical report on Optimization-Based Bearing-Only Visual Homing with Applications to a 2-D Unicycle Model

关于基于优化的仅轴承视觉归位及其在二维独轮车模型中的应用的技术报告

DOI：
发表时间：
2014
期刊：
影响因子：
0
作者：
Roberto Tron;Kostas Daniilidis
通讯作者：
Kostas Daniilidis

Template gradient matching in spherical images

球形图像中的模板梯度匹配

DOI：
10.1117/12.527043
发表时间：
2004
期刊：
Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention
影响因子：
0
作者：
L. Sorgi;Kostas Daniilidis
通讯作者：
Kostas Daniilidis

Predicting the Future with Transformational States

用转型国家预测未来

DOI：
发表时间：
2018
期刊：
ArXiv
影响因子：
0
作者：
Andrew Jaegle;Oleh Rybkin;K. Derpanis;Kostas Daniilidis
通讯作者：
Kostas Daniilidis

Live Demonstration: Unsupervised Event-Based Learning of Optical Flow, Depth and Egomotion

现场演示：基于事件的无监督光流、深度和自我运动学习

DOI：
发表时间：
2019
期刊：
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
影响因子：
0
作者：
A. Z. Zhu;Liangzhe Yuan;Kenneth Chaney;Kostas Daniilidis
通讯作者：
Kostas Daniilidis