CAREER: Visual Recognition with Knowledge
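Basic Information
- Award number: 1750082
- Principal investigator:
- Amount: $550,000
- Host institution:
- Host institution country: United States
- Award type: Continuing Grant
- Fiscal year: 2018
- Funding country: United States
- Period: 2018-08-15 to 2024-07-31
- Status: Completed
- Source:
- Keywords:
Project Abstract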
This project will address the problem of Visual Recognition with Knowledge (VR-K): a challenging Artificial Intelligence task to enable a seeing machine to identify unknown visible concepts from previous encounters (annotated data samples) and knowledge (other contextual information). For example, consider a system that has never encountered a zebra, but which has previous visual encounters with "horses" and "black and white striped" patterns. Incorporating the linguistic input that "a zebra is a horse-like animal with a black and white striped appearance", the machine's task is to formulate a new recognizer for the visual concept "zebra" and to recognize this new concept later. A system that integrates visual and linguistic information in this way can provide the basis for robust personal mobile applications or service robots, such as visual assistants for the vision-impaired and voice-enabled agents for elder care. Conventional supervised learning techniques have been perfected to perform increasingly well on narrow performance tasks. To enable satisfactory performance in service robots and mobile multimedia applications, this research will integrate background and commonsense knowledge models to enable higher level reasoning together with such high-performance recognizers. This project will develop the VR-K framework focused on enabling more generalizable computer vision algorithms through integration with natural language understanding and grounding in knowledge-based reasoning.
The research program will include 1) developing efficient probabilistic reasoning engines to construct recognition models of unseen concepts (object and attribute) without new annotation through probabilistic semantic parsing; 2) setting up new large-scale visual challenges and testbeds as the basis for rigorous performance evaluation of visual recognition with knowledge models and ablation analysis; and 3) prototyping the proposed framework on service robots and mobile devices to evaluate its performance in complex real-world applications across a variety of user studies. The project will include education and outreach activities advancing AI in undergraduate research, diversity enhancement, Entrepreneurial Mindset (EM) education, and K-12 classrooms, and will include workshops to introduce AI and deep learning to professionals in non-CS professions such as medical research and pathology. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project Outcomes
Journal papers (21)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Modularized Textual Grounding for Counterfactual Resilience
- DOI: 10.1109/cvpr.2019.00654
- Published: 2019-04
- Journal:
- Impact factor: 0
- Authors: Zhiyuan Fang; Shu Kong; Charless C. Fowlkes; Yezhou Yang
- Corresponding authors: Zhiyuan Fang; Shu Kong; Charless C. Fowlkes; Yezhou Yang
Integrating Knowledge and Reasoning in Image Understanding
- DOI: 10.24963/ijcai.2019/873
- Published: 2019-06
- Journal:
- Impact factor: 0
- Authors: Somak Aditya; Yezhou Yang; Chitta Baral
- Corresponding authors: Somak Aditya; Yezhou Yang; Chitta Baral
GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment
- DOI: 10.1109/lra.2019.2930426
- Published: 2018-09
- Journal:
- Impact factor: 5.2
- Authors: Xin Ye; Zhe L. Lin; Joon-Young Lee; Jianming Zhang; Shibin Zheng; Yezhou Yang
- Corresponding authors: Xin Ye; Zhe L. Lin; Joon-Young Lee; Jianming Zhang; Shibin Zheng; Yezhou Yang
A Novel Design of Adaptive and Hierarchical Convolutional Neural Networks using Partial Reconfiguration on FPGA
- DOI: 10.1109/hpec.2019.8916237
- Published: 2019-09
- Journal:
- Impact factor: 0
- Authors: Mohammad Farhadi; Mehdi Ghasemi; Yezhou Yang
- Corresponding authors: Mohammad Farhadi; Mehdi Ghasemi; Yezhou Yang
CAVAN: Commonsense Knowledge Anchored Video Captioning
- DOI: 10.1109/icpr56361.2022.9956241
- Published: 2022-08
- Journal:
- Impact factor: 0
- Authors: Huiliang Shao; Zhiyuan Fang; Yezhou Yang
- Corresponding authors: Huiliang Shao; Zhiyuan Fang; Yezhou Yang
Other publications by Yezhou Yang
Integrated Sensing Systems for Monitoring Interrelated Physiological Parameters in Young and Aged Adults
- DOI:
- Published: 2021
- Journal:
- Impact factor: 2.1
- Authors: Mark Sprowls; Michael Serhan; En; Lancy Lin; Christopher W. Frames; I. Kucherenko; Keyvan Mollaeian; Yang Li; V. Jammula; D. Logeswaran; M. Khine; Yezhou Yang; T. Lockhart; J. Claussen; Liang Dong; Julian J‐L Chen; Juan; Carmen Gomes; Daejin Kim; Teresa Wu; J. Margrett; Balaji Narasimhan; E. Forzani
- Corresponding author: E. Forzani
Evaluating Safety Metrics for Vulnerable Road Users at Urban Traffic Intersections Using High-Density Infrastructure LiDAR System
- DOI: 10.4271/2024-01-2641
- Published: 2024
- Journal:
- Impact factor: 0
- Authors: Prabin Kumar Rath; Blake Harrison; Duo Lu; Yezhou Yang; Jeffrey Wishart; Hongbin Yu
- Corresponding author: Hongbin Yu
Radiant exposure level comparison between Gaussian and top hat beams in various scanning patterns.
- DOI: 10.1364/ao.53.008585
- Published: 2014
- Journal:
- Impact factor: 1.9
- Authors: P. U.; Yezhou Yang; H. Le; Do
- Corresponding author: Do
Visuo-Linguistic Question Answering (VLQA) Challenge
- DOI:
- Published: 2020
- Journal:
- Impact factor: 0
- Authors: Shailaja Keyur Sampat; Yezhou Yang; Chitta Baral
- Corresponding author: Chitta Baral
Directional effects of correlated wind and waves on the dynamic response of long-span sea-crossing bridges
- DOI:
- Published: 2023
- Journal:
- Impact factor: 4.3
- Authors: Rugang Yang; Yongle Li; Cheng Xu; Yezhou Yang; Chen Fang
- Corresponding author: Chen Fang
Other grants held by Yezhou Yang
PFI-TT: Broadening Real-Time Continuous Traffic Analysis on the Roadside using AI-Powered Smart Cameras
- Award number: 2329780
- Fiscal year: 2023
- Amount: $550,000
- Award type: Continuing Grant
RI: Small: SM-An Active Approach for Data Engineering to Improve Vision-Language Tasks
- Award number: 2132724
- Fiscal year: 2022
- Amount: $550,000
- Award type: Continuing Grant
Collaborative Research: CPS: Medium: Spatio-Temporal Logics for Analyzing and Querying Perception Systems
- Award number: 2038666
- Fiscal year: 2021
- Amount: $550,000
- Award type: Standard Grant
I-Corps: Determining occupant load and location through machine vision with on-device image processing
- Award number: 2054807
- Fiscal year: 2021
- Amount: $550,000
- Award type: Standard Grant
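Similar NSFC grants
Research on Visual Hull Reconstruction and Surface Property Modeling Algorithms Based on Multiple Images
- Award number: 60373031
- Approval year: 2003
- Amount: 230,000 CNY
- Award type: General Program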
Similar overseas grants
CAREER: Exploiting Deep Generative Models for Visual Recognition
- Award number: 2239076
- Fiscal year: 2023
- Amount: $550,000
- Award type: Continuing Grant
Robust visual recognition of high-level form in human observers
- Award number: RGPIN-2019-05554
- Fiscal year: 2022
- Amount: $550,000
- Award type: Discovery Grants Program - Individual
Multiscale analysis of visual object recognition in the mouse
- Award number: 469984
- Fiscal year: 2022
- Amount: $550,000
- Award type: Operating Grants
Time to investigate human visual shape perception and recognition
- Award number: RGPIN-2022-04327
- Fiscal year: 2022
- Amount: $550,000
- Award type: Discovery Grants Program - Individual
Visual Recognition Beyond Supervised Learning
- Award number: RGPIN-2019-05362
- Fiscal year: 2022
- Amount: $550,000
- Award type: Discovery Grants Program - Individual
Pinpointing the neural mechanisms that support the remarkable visual fidelity of visual recognition memory
- Award number: 2043255
- Fiscal year: 2021
- Amount: $550,000
- Award type: Continuing Grant
Orthographic and Semantic Representations: Consolidation and Role in Visual Word Recognition
- Award number: RGPIN-2018-03758
- Fiscal year: 2021
- Amount: $550,000
- Award type: Discovery Grants Program - Individual
Visual Recognition Beyond Supervised Learning
- Award number: RGPIN-2019-05362
- Fiscal year: 2021
- Amount: $550,000
- Award type: Discovery Grants Program - Individual
Robust visual recognition of high-level form in human observers
- Award number: RGPIN-2019-05554
- Fiscal year: 2021
- Amount: $550,000
- Award type: Discovery Grants Program - Individual
Rethinking the neuroanatomical organization of cognition: Recognition memory in visual cortex
- Award number: 10469534
- Fiscal year: 2021
- Amount: $550,000
- Award type:


















