权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

EAGER: Diverse M-Best Predictions from Probabilistic Models

EAGER：概率模型的多样化 M-Best 预测

基本信息

批准号：
1353694
负责人：
Dhruv Batra
金额：
$ 18.44万
依托单位：
Virginia Polytechnic Institute and State University
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2013
资助国家：
美国
起止时间：
2013-09-15 至 2015-08-31
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1353694&HistoricalAwards=false
关键词：
EAGER Diverse Best Predictions Probabilistic

项目摘要

Computer Vision systems must deal with significant levels of ambiguity - from inter- and intra-object occlusion and varying appearance, lighting, and pose. Probabilistic models provide a principled framework for dealing with uncertainty and for converting evidence into a posteriori belief about the world. Typically, a vision system uses this belief to predict the "most likely" or maximum a-posteriori hypothesis. Unfortunately, our current models are inaccurate and this single-best hypothesis is often incorrect. This project explores a novel way to allow vision systems to hedge against uncertainty by producing multiple plausible hypotheses. Specifically, this project develops techniques for finding a diverse set of high-probability solutions from probabilistic models. The project focuses on (a) interactive object cutout (where multiple segmentations are shown to the user to expedite convergence to an acceptable result); (b) semantic segmentation (where multiple plausible scene labelings are propagated to subsequent stages of a cascade for higher-order processing); (c) person/object tracking (where multiple localization hypotheses on each frame reduce the search space of a sequence tracker). This project is producing new scientific knowledge in the context of probabilistic reasoning and advancing the state of art in computer vision. The techniques developed are useful for other AI domains such as Speech and Natural Language Processing. The PI and his students are broadly disseminating produced work by organizing workshops, tutorials, and journal special issues, and publicly sharing code and results. The project is engaging undergraduate students and women in computer science research.

计算机视觉系统必须处理大量的模糊性--从对象间和对象内的遮挡，以及外观、照明和姿势的变化。概率模型为处理不确定性和将证据转化为对世界的后验信念提供了一个原则性的框架。通常，视觉系统使用这种信念来预测“最有可能的”或最大后验假设。不幸的是，我们目前的模型是不准确的，这种单一最佳假设往往是不正确的。这个项目探索了一种新的方法，允许视觉系统通过产生多个看似合理的假设来对冲不确定性。具体地说，这个项目开发了从概率模型中找到一组不同的高概率解决方案的技术。该项目的重点是(A)交互式对象切割(其中向用户显示多个分割以加速收敛到可接受的结果)；(B)语义分割(其中多个可信的场景标签被传播到级联的后续阶段以进行更高级别的处理)；(C)人/对象跟踪(其中每帧上的多个定位假设减少了序列跟踪器的搜索空间)。这个项目在概率推理的背景下产生了新的科学知识，并推动了计算机视觉的最新发展。所开发的技术对语音和自然语言处理等其他人工智能领域也很有用。PI和他的学生正在通过组织研讨会、教程和期刊特刊以及公开分享代码和结果来广泛传播成果。该项目正在吸引本科生和女性参与计算机科学研究。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Dhruv Batra其他文献

Album Story 1 Description for Images in Isolation & in Sequences Re-telling Story 1 Caption in Sequence Storytelling Story 2 Story 3 Re-telling Preferred Photo Sequence Story 4 Story

专辑故事 1 隔离图像说明

DOI：
发表时间：
2016
期刊：
影响因子：
0
作者：
Ting;Francis Ferraro;N. Mostafazadeh;Ishan Misra;Aishwarya Agrawal;Jacob Devlin;Ross B. Girshick;Xiaodong He;Pushmeet Kohli;Dhruv Batra;C. L. Zitnick;Devi Parikh;Lucy Vanderwende;Michel Galley;Margaret Mitchell
通讯作者：
Margaret Mitchell

Habitat Sim Generic Dataset Support Habitat API Habitat Platform

Habitat Sim 通用数据集支持 Habitat API Habitat 平台

DOI：
发表时间：
2019
期刊：
影响因子：
0
作者：
M. Savva;Abhishek Kadian;Oleksandr Maksymets;Yili Zhao;Erik Wijmans;Bhavana Jain;Julian Straub;Jia Liu;V. Koltun;Jitendra Malik;Devi Parikh;Dhruv Batra
通讯作者：
Dhruv Batra