CAREER: Discriminative Spatiotemporal Models for Recognizing Humans, Objects, and their Interactions

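Basic Information

Award number:
0954083
Principal investigator:
Deva Ramanan
Amount:
Approx. $444,500
Institution:
University of California-Irvine
Institution country:
United States
Award type:
Continuing Grant
Fiscal year:
2010
Funding country:
United States
Duration:
2010-06-01 to 2015-10-31
Status:
Completed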

Source:
https://www.nsf.gov/awardsearch/showAward?AWD_ID=0954083&HistoricalAwards=false
Keywords:
CAREER Discriminative Spatiotemporal Models Recognizing

Project Abstract

One of the goals of computer vision is to build a system that can see people and recognize their activities. Human actions are rarely performed in isolation -- the surrounding environment, nearby objects, and nearby humans affect the nature of the performed activity. Examples include actions such as "eating" and "shaking hands." The research goal of this project is to approach human performance in understanding videos of activities defined by human-object and human-human interactions.

This project makes use of structured, contextual representations to make predictions given spatiotemporal data. It does so by extending recent successful work on object recognition to the space-time domain, introducing extensions for spatiotemporal grouping and contextual modeling. Video enables the extraction of additional dynamic cues absent in static images, but this poses additional computational burdens that are addressed through algorithmic innovations for approximate parsing and large-scale discriminative learning.

To place activity recognition on firm quantitative ground, the proposed models are evaluated using concrete metrics based on activities of daily living (ADL) and human proxemic models from the medical and anthropological communities. Examples include systems for automated monitoring of stroke patients interacting with everyday objects and automated analysis of crisis response team interactions during emergency drills. This project produces non-scripted, real-world, labeled action recognition datasets, of benefit to the research community as a whole.

Project Outcomes

Journal articles (0)

Monographs (0)

Research awards (0)

Conference papers (0)

Patents (0)

Other Publications by Deva Ramanan

Using Segmentation to Verify Object Hypotheses

DOI:
10.1109/cvpr.2007.383271
Publication date:
2007-06
Journal:
2007 IEEE Conference on Computer Vision and Pattern Recognition
Impact factor:
0
Authors:
Deva Ramanan
Corresponding author:
Deva Ramanan

Recognizing Tiny Faces

DOI:
10.1109/iccvw.2019.00143
Publication date:
2019
Journal:
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Impact factor:
0
Authors:
Siva Chaitanya Mynepalli; Peiyun Hu; Deva Ramanan
Corresponding author:
Deva Ramanan

ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction

DOI:
Publication date:
2021
Journal:
Neural Information Processing Systems
Impact factor:
0
Authors:
Gengshan Yang; Deqing Sun; Varun Jampani; Daniel Vlasic; Forrester Cole; Ce Liu; Deva Ramanan
Corresponding author:
Deva Ramanan