RI: Medium: Collaborative Research: Text-to-Image Reference Resolution for Image Understanding and Manipulation

RI:媒介:协作研究:用于图像理解和操作的文本到图像参考分辨率

基本信息

  • 批准号:
    1561968
  • 负责人:
  • 金额:
    $ 27.5万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2016
  • 资助国家:
    美国
  • 起止时间:
    2016-06-01 至 2019-05-31
  • 项目状态:
    已结题

项目摘要

This project develops new technologies at the interface of computer vision and natural language processing to understand text-to-image relationships. For example, given a captioned image, the project develops techniques which determine which words (e.g. "woman talking on phone", "The farther vehicle") correspond to which image parts. From robotics to human-computer interaction, there are numerous real-world tasks that benefit from practical systems to identify objects in scenes based on language and understand language based on visual context. In particular, the project develops the first language-based image authoring tool which allows users to edit or synthesize realistic imagery using only natural language (e.g. "delete the garbage truck from this photo" or "make an image with three boys chasing a shaggy dog"). Beyond the immediate impact of creating new ways for users to access and author digital images, the broader impacts of this work include three focus areas: the development of new benchmarks for the vision and language communities, outreach and undergraduate research, and leadership in promoting diversity. At the core of the project are new techniques for large-scale text-to-image reference resolution (TIRR) that enable systems to automatically identify the image regions that depict entities described in natural language sentences or commands. These techniques advance image interpretation by enabling systems to perform partial matching between images and sentences, referring expression understanding, and image-based question answering. They also advance image manipulation by enabling systems that can synthesize images starting from a textual description, or modify images based on natural language commands. The main technical contributions of the project are: (1) benchmark datasets for TIRR with comprehensive large-scale gold standard annotations that will make TIRR a standard task for recognition; (2) principled new representations for text-to-image annotations that expose the compositional nature of language using the formalism of the denotation graph; (3) new models for TIRR that perform an explicit alignment (grounding) of words and phrases to image regions guided by the structure of the denotation graph; (4) applications of TIRR methods to referring expression understanding and visual question answering; and (5) applications of TIRR to image creation and manipulation based on natural language input.
该项目开发计算机视觉和自然语言处理界面的新技术,以理解文本与图像之间的关系。例如,给定一个标题图像,该项目开发的技术,以确定哪些词(例如;“打电话的女人”,“远处的车辆”)对应于图像的哪个部分。从机器人到人机交互,有许多现实世界的任务受益于基于语言识别场景中的对象和基于视觉上下文理解语言的实用系统。特别是,该项目开发了第一个基于语言的图像创作工具,允许用户仅使用自然语言编辑或合成逼真的图像。“从这张照片中删除垃圾车”或“制作一个三个男孩追逐一只毛茸茸的狗的照片”)。除了为用户创造获取和创作数字图像的新途径所带来的直接影响外,这项工作的更广泛影响还包括三个重点领域:为视觉和语言社区制定新的基准,拓展和本科生研究,以及在促进多样性方面的领导作用。该项目的核心是大规模文本到图像参考分辨率(TIRR)的新技术,该技术使系统能够自动识别用自然语言句子或命令描述的描述实体的图像区域。这些技术通过使系统能够在图像和句子之间执行部分匹配、参考表达理解和基于图像的问答来推进图像解释。它们还推动了图像处理,使系统能够从文本描述开始合成图像,或基于自然语言命令修改图像。项目的主要技术贡献有:(1)具有全面大规模金标准注释的TIRR基准数据集,使TIRR成为识别的标准任务;(2)文本到图像注释的原则性新表示,使用外延图的形式主义揭示语言的组合性质;(3)新的TIRR模型,在表示图结构的引导下,将单词和短语显式对齐(接地)到图像区域;(4) TIRR方法在参考表情理解和视觉问答中的应用;(5) TIRR在基于自然语言输入的图像创建和处理中的应用。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

James Hays其他文献

Bootstrapping Fine-Grained Classifiers : Active Learning with a Crowd in the Loop
引导细粒度分类器:群体参与的主动学习
  • DOI:
  • 发表时间:
    2013
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Genevieve Patterson;Grant Van Horn;Serge J. Belongie;P. Perona;James Hays
  • 通讯作者:
    James Hays
Tropel: Crowdsourcing Detectors with Minimal Training
Tropel:只需最少培训的众包探测器
Granular Privacy Control for Geolocation with Vision Language Models
使用视觉语言模型进行地理定位的精细隐私控制
  • DOI:
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Ethan Mendes;Yang Chen;James Hays;Sauvik Das;Wei Xu;Alan Ritter
  • 通讯作者:
    Alan Ritter
Proceedings of the Workshop on 3D Geometry Generation for Scientific Computing
科学计算 3D 几何生成研讨会论文集
  • DOI:
    10.1109/wacvw60836.2024.00088
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Marissa Ramirez de Chanlatte;Phillip Colella;Trevor Darrell;Alexandra Katherine Carlson;Peter H. N. de With;Huayu Deng;Shanyan Guan;James Hays;Tim Houben;Thomas Huisman;Nikita Jaipuria;Hans Johansen;Shuja Khalid;Akshay Krishnan;Chuming Li;M. Pisarenco;Amit Raj;Frank Rudzicz;Tim J. Schoonbeek;Sandhya Sridhar;Nathan Tseng;F. V. D. Sommen;Chen Wang;Yunbo Wang;Tong Wu;Xiaokang Yang;Jiawei Yao;Derek Young;Xianling Zhang
  • 通讯作者:
    Xianling Zhang
I. “Case Study: Novel Approach to HIV-Associated Neuropathy – Platelet Rich Plasma Successful in Treating HIV-Associated Peripheral Neuropathy”,
I.“案例研究:治疗 HIV 相关神经病变的新方法——富含血小板血浆成功治疗 HIV 相关周围神经病变”,
  • DOI:
  • 发表时间:
    2016
  • 期刊:
  • 影响因子:
    0
  • 作者:
    James Hays;J. O. Simmonds;W. Jordan;N. Lucas
  • 通讯作者:
    N. Lucas

James Hays的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('James Hays', 18)}}的其他基金

Collaborative Research NRI: INT: Scalable, Customizable, Robot Learning with Humans
合作研究 NRI:INT:可扩展、可定制、机器人与人类一起学习
  • 批准号:
    2024444
  • 财政年份:
    2020
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
CAREER: Representing, Understanding, and Enhancing Scenes at the Internet-Scale
职业:在互联网规模上呈现、理解和增强场景
  • 批准号:
    1641340
  • 财政年份:
    2016
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
CVPR 2012 Conference Doctoral Consortium
CVPR 2012会议博士联盟
  • 批准号:
    1242042
  • 财政年份:
    2012
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
CAREER: Representing, Understanding, and Enhancing Scenes at the Internet-Scale
职业:在互联网规模上呈现、理解和增强场景
  • 批准号:
    1149853
  • 财政年份:
    2012
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
The LDEO Deep-Sea Repository and the Curating and Maintenance of the Sediment Library & Dredge Collection
LDEO 深海存储库以及沉积物库的管理和维护
  • 批准号:
    0647574
  • 财政年份:
    2007
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
The LDEO Deep-Sea Repository and the Curating and Maintenance of the Sediment Library and Dredge Collection
LDEO 深海储存库以及沉积物库和挖泥船收藏的管理和维护
  • 批准号:
    0350504
  • 财政年份:
    2004
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
Collaborative Research: High Norther Latitude Amplifiers of Centennial to Millennial Climate Forcing During the Holocene
合作研究:全新世百年至千年气候强迫的北高纬度放大器
  • 批准号:
    0317934
  • 财政年份:
    2003
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Investigating Iceberg Calving from Antarctic Ice Sheets Through Isotope Stage 11
通过同位素第 11 阶段研究南极冰盖的冰山崩解
  • 批准号:
    9615060
  • 财政年份:
    1997
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
A New Approach to Earth and Environmental Science Undergraduate Instruction
地球与环境科学本科教学的新方法
  • 批准号:
    9455688
  • 财政年份:
    1995
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Research Experiences for Undergraduates
本科生的研究经历
  • 批准号:
    8804910
  • 财政年份:
    1988
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant

相似海外基金

Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse
合作研究:RI:中:通过深度神经崩溃实现优化、泛化和可迁移性的原理
  • 批准号:
    2312841
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse
合作研究:RI:中:通过深度神经崩溃实现优化、泛化和可迁移性的原理
  • 批准号:
    2312842
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Lie group representation learning for vision
协作研究:RI:中:视觉的李群表示学习
  • 批准号:
    2313151
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse
合作研究:RI:中:通过深度神经崩溃实现优化、泛化和可迁移性的原理
  • 批准号:
    2312840
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Lie group representation learning for vision
协作研究:RI:中:视觉的李群表示学习
  • 批准号:
    2313149
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
Collaborative Research: CompCog: RI: Medium: Understanding human planning through AI-assisted analysis of a massive chess dataset
合作研究:CompCog:RI:中:通过人工智能辅助分析海量国际象棋数据集了解人类规划
  • 批准号:
    2312374
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: CompCog: RI: Medium: Understanding human planning through AI-assisted analysis of a massive chess dataset
合作研究:CompCog:RI:中:通过人工智能辅助分析海量国际象棋数据集了解人类规划
  • 批准号:
    2312373
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Superhuman Imitation Learning from Heterogeneous Demonstrations
合作研究:RI:媒介:异质演示中的超人模仿学习
  • 批准号:
    2312955
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Informed, Fair, Efficient, and Incentive-Aware Group Decision Making
协作研究:RI:媒介:知情、公平、高效和具有激励意识的群体决策
  • 批准号:
    2313137
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Lie group representation learning for vision
协作研究:RI:中:视觉的李群表示学习
  • 批准号:
    2313150
  • 财政年份:
    2023
  • 资助金额:
    $ 27.5万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了