Automatic Alignment of Text-to-Video for Semantic Multimedia Analysis
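Basic information
- Grant number: 252286362
- Principal investigator:
- Amount: --
- Host institution:
- Host institution country: Germany
- Project type: Research Grants
- Fiscal year: 2014
- Funding country: Germany
- Duration: 2013-12-31 to 2017-12-31
- Project status: Completed
- Source:
- Keywords:
Project abstract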
In this project, we aim to explore rich descriptions of video data (TV series and movies), which open myriad possibilities for multimedia analysis and understanding, and for obtaining weak labels for popular computer vision tasks. We focus on two forms of text: plot synopses and books. Plot synopses are obtained via crowdsourcing and describe the episode or movie in summarized form. In contrast, books (from which the video is adapted) provide detailed descriptions of the story and of the visual world the author wishes to portray.
While text in the form of subtitles and transcripts has been used successfully to automate person identification [Everingham 2006] or to obtain samples for action recognition [Laptev 2008], those text sources are limited in their potential for understanding or obtaining rich descriptions of the story.
To use the plot synopses, we will first align the sentences of the synopsis to shots in the video (WP2). We propose to use anchors, primarily person-id, to help guide the alignment. We aim to solve two main challenges associated with this task: the possible non-linearity of the plot synopsis and the skipping of shots.
For books, in contrast to plot synopses, the first step is to align chapters with their corresponding video shots (WP3). We can expect that some dialogues in the books match the ones used in the video adaptation. This allows us to automatically identify characters and learn person models in a second step, and also facilitates fine-grained alignment within a chapter.
The alignment can be improved by knowing more about the scene or objects present in the shots. We will investigate this interconnected behaviour of labels and anchors in WP4, first in an iterative manner, and then by jointly modeling the two tasks of obtaining weak labels and performing alignment.
We divide the applications into two types: (i) obtaining labels from the text sources and (ii) video-related applications.
From plot synopses, we will specifically aim to obtain weak labels for places or scenes (WP5-P1). We will also explore tasks such as summarization, indexing and retrieval (WP5-P2). For example, a coherent video summary based on the story (rather than on low-level features) can be generated by first running a text summarizer on the plot and then selecting the set of shots aligned to the retained sentences. Indexing the descriptions by keyword can also make it easy to browse through the video. From books, we wish to exploit dialogues to obtain supervision for person identification, and the rich descriptions surrounding the dialogues to learn attributes for the characters, scenes and objects (WP5-P1). Another interesting application is to automatically find differences between books and their video adaptations (WP5-P2).
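The sentence-to-shot alignment and the story-based summary described in the abstract can be illustrated with a small sketch. This is not the project's method: it assumes a strictly linear synopsis in which every sentence covers at least one shot, so it does not address the non-linearity or shot-skipping challenges named above. The function names `similarity`, `align` and `summary_shots`, and the use of shared character names as the only anchor, are hypothetical choices made for illustration.

```python
from typing import List, Set


def similarity(sentence_chars: Set[str], shot_chars: Set[str]) -> float:
    """Anchor score (assumption): number of characters shared by a sentence and a shot."""
    return float(len(sentence_chars & shot_chars))


def align(sentences: List[Set[str]], shots: List[Set[str]]) -> List[int]:
    """Return, for each shot, the index of the sentence it is assigned to.

    Monotone dynamic program: moving from one shot to the next, the sentence
    index either stays the same or advances by exactly one.
    """
    n, m = len(sentences), len(shots)
    if n == 0 or m < n:
        raise ValueError("need at least one sentence and at least as many shots as sentences")
    neg = float("-inf")
    score = [[neg] * m for _ in range(n)]
    advanced = [[False] * m for _ in range(n)]  # True if shot j starts sentence i
    score[0][0] = similarity(sentences[0], shots[0])
    for j in range(1, m):
        for i in range(min(j + 1, n)):
            stay = score[i][j - 1]
            move = score[i - 1][j - 1] if i > 0 else neg
            best = max(stay, move)
            if best == neg:
                continue  # state not reachable
            advanced[i][j] = move > stay
            score[i][j] = best + similarity(sentences[i], shots[j])
    # Backtrack from the last sentence at the last shot.
    assignment = [0] * m
    i = n - 1
    for j in range(m - 1, -1, -1):
        assignment[j] = i
        if advanced[i][j]:
            i -= 1
    return assignment


def summary_shots(assignment: List[int], kept_sentences: Set[int]) -> List[int]:
    """Indices of shots aligned to sentences retained by a text summarizer."""
    return [j for j, i in enumerate(assignment) if i in kept_sentences]
```

Given the assignment returned by `align`, passing the indices of the sentences kept by any text summarizer to `summary_shots` yields the shots that would make up the story-based video summary.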
Project outcomes
Journal articles (2)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Aligning plot synopses to videos for story-based retrieval
- DOI: 10.1007/s13735-014-0065-9
- Published: 2015-03
- Journal:
- Impact factor: 5.6
- Authors: Makarand Tapaswi; M. Bäuml; R. Stiefelhagen
- Corresponding authors: Makarand Tapaswi; M. Bäuml; R. Stiefelhagen
Other grants held by Professor Dr.-Ing. Rainer Stiefelhagen
ComPLetely Unsupervised Multimodal Character identification On TV series and movies
- Grant number: 316692988
- Fiscal year: 2016
- Funding amount: --
- Project type: Research Grants
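Similar grants from the National Natural Science Foundation of China
Stochastic Analysis and Fast Algorithms for Sequence Alignment
- Grant number: 10271061
- Approval year: 2002
- Funding amount: CNY 165,000
- Project type: General Program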
Similar overseas grants
Postdoctoral Fellowship: STEMEdIPRF: Towards a Diverse Professoriate: Experiences that Inform Underrepresented Scholars' Perceptions of Value Alignment and Career Decisions
- Grant number: 2327411
- Fiscal year: 2024
- Funding amount: --
- Project type: Standard Grant
Dynamic, high impact micro-optic security films using automated, high precision alignment between micro-lenses and micro-images
- Grant number: 10076760
- Fiscal year: 2023
- Funding amount: --
- Project type: Collaborative R&D
Elucidation of Mechanisms for Improvement of Myocardial Tissue Function by Cardiomyocytes Alignment Control
- Grant number: 23K15142
- Fiscal year: 2023
- Funding amount: --
- Project type: Grant-in-Aid for Early-Career Scientists
Reconsideration of a 3-D sintering model by numerical analysis of shape and alignment parameters of metal powders during the sintering process
- Grant number: 23K04439
- Fiscal year: 2023
- Funding amount: --
- Project type: Grant-in-Aid for Scientific Research (C)
Examining vertical alignment in perceived implementation climate within a trial of motivational interviewing in substance use disorder treatment clinics
- Grant number: 10680338
- Fiscal year: 2023
- Funding amount: --
- Project type:
Assembly and re-alignment of HLA genomic region and its implication for fine-mapping suicidality in African descent population
- Grant number: 10797122
- Fiscal year: 2023
- Funding amount: --
- Project type:
Planning: PREC: Exploring a Partnership between Historically Black Universities in the District of Columbia and NSF's ChemMatCARS in Alignment with the NSF PREC Program
- Grant number: 2334957
- Fiscal year: 2023
- Funding amount: --
- Project type: Standard Grant
CRII: III: Measuring the alignment between the interests of local communities, local news, and the national news media
- Grant number: 2245508
- Fiscal year: 2023
- Funding amount: --
- Project type: Standard Grant
Pathophysiology determination of peri-implantitis based on macrophage hierarchy and regulation of tissue alignment
- Grant number: 23H03093
- Fiscal year: 2023
- Funding amount: --
- Project type: Grant-in-Aid for Scientific Research (B)
III: Medium: Towards Inclusive Recommendation Systems with Stakeholder Alignment
- Grant number: 2312794
- Fiscal year: 2023
- Funding amount: --
- Project type: Continuing Grant