Recognition of Presentation by Integration of Visual and Linguistic Information
通过整合视觉和语言信息来识别演示
基本信息
- 批准号:06452396
- 负责人:
- 金额:$ 3.52万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for General Scientific Research (B)
- 财政年份:1994
- 资助国家:日本
- 起止时间:1994 至 1995
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
In the understanding of human behaviors, there is a lot of ambiguities which depend on the situation. It is because few strict laws or rules are applicable throughout a wide variety of situations. We have examined the relationship between the gestures and the context of situation which typically represented in the spoken dialog. We proposed a novel framework to understand the behaviro in oral presentations.1. Human behavior understanding in oral presentation : We developed a method to extract visual keys from presentation images. We also developed a method to extract linguistic keys from spoken words in the presentation. A novel framework was developed to integrate the both keys to resolve the intention of the presenter.2. Temporal structure analysis of video by image and sound processing : We have developed a method to estimate the temporal structure of a video sequence considering the contents and the intention of the author. it uses the visual and sound keys. Television commercials are used as the target presentation images.3. Knowledge extraction from diagram and text : We developed a new framework for knowledge extraction from written texts and diagrams and utilization of the obtained knowledge for the automatic organization of flexible hyper-media.
在对人类行为的理解中,存在着许多取决于情境的模糊性。这是因为很少有严格的法律或规则适用于各种各样的情况。我们已经研究了手势和情景之间的关系,这通常表现在口语对话。我们提出了一个新的框架来理解口头陈述中的重复。口头演示中的人类行为理解:我们开发了一种从演示图像中提取视觉键的方法。我们还开发了一种方法,从演讲中的口语单词中提取语言键。提出了一个新的框架,将两个关键词结合起来,以解决演示者的意图.通过图像和声音处理对视频进行时间结构分析:我们开发了一种方法来估计视频序列的时间结构,同时考虑作者的内容和意图。它使用视觉和声音键。电视广告作为目标呈现图像.从图表和文本中提取知识:我们开发了一个新的框架,用于从书面文本和图表中提取知识,并利用所获得的知识自动组织灵活的超媒体。
项目成果
期刊论文数量(46)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
中村裕一: "Knowledge Extraction from Diagram and Text for Media Integration" Proc. Int. Conference on Multimedia Computing and Systems. (to be published). (1995)
Yuichi Nakamura:“从图表和文本中提取知识以实现媒体集成”,多媒体计算和系统国际会议(即将出版)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Yuichi Nakamura, Masashi Nishitani, Yuichi Ohta: "Human Behavior Understanding in Oral Presentation" IEICE Technical Report SIG-PRU. Vol.95-143. 51-56 (1995)
Yuichi Nakamura、Masashi Nishitani、Yuichi Ohta:“口头演示中的人类行为理解”IEICE 技术报告 SIG-PRU。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
角 保志: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int. Workshop on Automatic Face-and Gesture Recognition. 254-259 (1995)
Yasushi Kado:“使用分布式外观模型检测面部方向和面部成分”自动面部和手势识别研讨会 254-259 (1995)
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Yukiyo Uehori, Mitsuhiro Murata, Yuichi Nakamura, Yuichi Ohta: "Temporal Structure Analysis of Television Commercial by Image and Sound Processing" IEICE Technical Report SIG-PRU. Vol.95-159. 9-12 (1995)
Yukiyo Uehori、Mitsuhiro Murata、Yuichi Nakamura、Yuichi Ohta:“通过图像和声音处理进行电视广告的时间结构分析”IEICE 技术报告 SIG-PRU。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
中村 裕一: "Knowledge Extraction from Diagram and Text for Media Integration" Proc. Int. Conference on Multimedia Computing and Systems. (to be published). (1996)
Yuichi Nakamura:“从图表和文本中提取知识以实现媒体集成”,多媒体计算和系统国际会议(即将出版)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
                item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ patent.updateTime }}
OHTA Yuichi其他文献
OHTA Yuichi的其他文献
{{
              item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
{{ truncateString('OHTA Yuichi', 18)}}的其他基金
Enhancing Image Quality of the 3D Free-Viewpoint Video in Large-Scale Space Utilizing Player-Billboard Method
利用播放器-广告牌方法增强大范围空间中3D自由视点视频的图像质量
- 批准号:25280056 
- 财政年份:2013
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Scientific Research (B) 
See-through Vision : Visual Augmentation for Pedestrians by Using Surveillance Cameras
透视视觉:使用监控摄像头增强行人视觉
- 批准号:18200011 
- 财政年份:2006
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Scientific Research (A) 
Generation and transmission of 4D image space by intelligent capturing of large-scale space
大尺度空间智能捕捉4D图像空间生成与传输
- 批准号:14208034 
- 财政年份:2002
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Scientific Research (A) 
Pattern Recognition and understanding for Visual Information Media
视觉信息媒体的模式识别和理解
- 批准号:11230101 
- 财政年份:1999
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Scientific Research on Priority Areas 
Development of a 3D Video Camera by using Polynocular Stereo
利用多目立体技术开发 3D 摄像机
- 批准号:09358006 
- 财政年份:1997
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Scientific Research (A) 
Construction of On-line Multimodal Dictionary for Automatic Human Behavior Understanding
人类行为自动理解在线多模态词典的构建
- 批准号:08458073 
- 财政年份:1996
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Scientific Research (B) 
Development of 3D image Display System with Motion Parallax
运动视差3D图像显示系统的开发
- 批准号:06558044 
- 财政年份:1994
- 资助金额:$ 3.52万 
- 项目类别:Grant-in-Aid for Developmental Scientific Research (B) 

 刷新
              刷新
            
















 {{item.name}}会员
              {{item.name}}会员
            



