Computational auditory scene analysis as causal inference

作为因果推理的计算听觉场景分析

基本信息

  • 批准号:
    1921501
  • 负责人:
  • 金额:
    $ 50.03万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2019
  • 资助国家:
    美国
  • 起止时间:
    2019-09-01 至 2023-08-31
  • 项目状态:
    已结题

项目摘要

Just by listening, humans can infer many details about the world around them: what someone said, whether a window in their house is open or shut, or what their child dropped on the floor in the next room. These everyday (but essential) judgments usually require us to separate the distinct causes in the world that generate sound. We hear multiple people talking at once, but can attend to the one we are interested in. We can tell whether a sound was produced in a large or small room, or an empty or furnished apartment, but can also identify what the sound was. And if an object is dropped on a table, we can usually tell the object's approximate weight but also the material the table is made of, just by listening. These abilities are critical to our interactions with the world and will be critical to reproduce in machine hearing systems for robots, automobiles, and other technologies. Here the investigators propose to investigate human abilities to decompose sound into its constituent causes and to build machine systems that can replicate these abilities.The proposed work will jointly pursue two goals. First, the investigators will build models of how sound is generated in the world. This aspect of the work will combine insights from physics and acoustics with empirical measurements of sound, focusing on how forces imparted to objects resonate within the object to yield sound, and on how the resulting sound is altered by reflections off of environmental surfaces on its way to a listener's ears. Second, the investigators will develop a computational framework to infer the most likely explanation of a sound in terms of the events in the world that could have generated it. This aspect of the work will leverage recent advances in artificial intelligence research that render such inferences newly tractable. The resulting machine hearing systems will be compared with human listeners in a series of experiments, with the goal of improving the models of sound generation and the inference algorithms in order to match human auditory abilities.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
仅仅通过倾听,人类就可以推断出周围世界的许多细节:某人说了什么,他们家里的窗户是开着还是关着,或者他们的孩子在隔壁房间的地板上掉了什么。这些日常的(但基本的)判断通常要求我们区分世界上产生声音的不同原因。我们同时听到多个人在说话,但可以专注于我们感兴趣的那个人。我们可以分辨出一个声音是在一个大房间还是小房间,或者是一个空的公寓还是一个有家具的公寓里产生的,但是我们也可以识别出这个声音是什么。如果一个物体掉在桌子上,我们通常可以通过听来判断物体的大致重量,以及桌子的材料。这些能力对于我们与世界的互动至关重要,对于机器人、汽车和其他技术的机器听觉系统的复制也至关重要。在这里,研究人员提出研究人类将声音分解成其组成原因的能力,并建立能够复制这些能力的机器系统。首先,研究人员将建立世界上声音是如何产生的模型。这方面的工作将结合联合收割机的见解,从物理学和声学与经验测量的声音,重点是如何赋予物体的力量在物体内共振产生声音,以及产生的声音是如何改变的反射关闭环境表面的方式,以听众的耳朵。其次,研究人员将开发一个计算框架,根据可能产生声音的世界事件来推断声音的最可能解释。这方面的工作将利用人工智能研究的最新进展,使此类推断变得更加容易处理。最终的机器听觉系统将在一系列实验中与人类听者进行比较,目的是改进声音生成模型和推理算法,以匹配人类的听觉能力。该奖项反映了NSF的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(3)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Finding Fallen Objects Via Asynchronous Audio-Visual Integration
  • DOI:
    10.1109/cvpr52688.2022.01027
  • 发表时间:
    2022-06
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Chuang Gan;Yi Gu;Siyuan Zhou;Jeremy Schwartz;S. Alter;James Traer;Dan Gutfreund;J. Tenenbaum;Josh H. McDermott;A. Torralba
  • 通讯作者:
    Chuang Gan;Yi Gu;Siyuan Zhou;Jeremy Schwartz;S. Alter;James Traer;Dan Gutfreund;J. Tenenbaum;Josh H. McDermott;A. Torralba
Causal inference in environmental sound recognition
环境声音识别中的因果推理
  • DOI:
    10.1016/j.cognition.2021.104627
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    3.4
  • 作者:
    Traer, James;Norman-Haignere, Sam V.;McDermott, Josh H.
  • 通讯作者:
    McDermott, Josh H.
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
  • DOI:
  • 发表时间:
    2020-07
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Chuang Gan;Jeremy Schwartz;S. Alter;Martin Schrimpf;James Traer;Julian De Freitas;J. Kubilius;Abhishek Bhandwaldar;Nick Haber;Megumi Sano;Kuno Kim;E. Wang;Damian Mrowca;Michael Lingelbach;Aidan Curtis;Kevin T. Feigelis;Daniel Bear;Dan Gutfreund;David Cox;J. DiCarlo;Josh H. McDermott;J. Tenenbaum;Daniel L. K. Yamins
  • 通讯作者:
    Chuang Gan;Jeremy Schwartz;S. Alter;Martin Schrimpf;James Traer;Julian De Freitas;J. Kubilius;Abhishek Bhandwaldar;Nick Haber;Megumi Sano;Kuno Kim;E. Wang;Damian Mrowca;Michael Lingelbach;Aidan Curtis;Kevin T. Feigelis;Daniel Bear;Dan Gutfreund;David Cox;J. DiCarlo;Josh H. McDermott;J. Tenenbaum;Daniel L. K. Yamins
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Joshua McDermott其他文献

Joshua McDermott的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Joshua McDermott', 18)}}的其他基金

The Perception and Cognition of Sound Texture
声音质感的感知和认知
  • 批准号:
    2240406
  • 财政年份:
    2023
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Standard Grant
Computational neuroimaging of human auditory cortex
人类听觉皮层的计算神经成像
  • 批准号:
    1634050
  • 财政年份:
    2016
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Standard Grant
CAREER: Understanding Real-World Auditory Scene Analysis
职业:了解现实世界的听觉场景分析
  • 批准号:
    1454094
  • 财政年份:
    2015
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Continuing Grant

相似国自然基金

听觉刺激特异性调控情绪的神经环路机制研究
  • 批准号:
    82371516
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
内毛细胞损伤动物模型的建立及其听觉电生理学研究
  • 批准号:
    30872858
  • 批准年份:
    2008
  • 资助金额:
    31.0 万元
  • 项目类别:
    面上项目
儿童植入人工耳蜗后开放式听觉言语发育特性研究
  • 批准号:
    30872859
  • 批准年份:
    2008
  • 资助金额:
    30.0 万元
  • 项目类别:
    面上项目

相似海外基金

Excellence in Research: Incorporating Attention into Computational Auditory Scene Analysis Using Spectral Clustering with Focal Templates
卓越研究:使用带有焦点模板的谱聚类将注意力纳入计算听觉场景分析
  • 批准号:
    2100874
  • 财政年份:
    2021
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Standard Grant
Bayesian prediction for computational auditory scene analysis
用于计算听觉场景分析的贝叶斯预测
  • 批准号:
    510708-2017
  • 财政年份:
    2017
  • 资助金额:
    $ 50.03万
  • 项目类别:
    University Undergraduate Student Research Awards
Applying structure in computational auditory scene analysis
在计算听觉场景分析中应用结构
  • 批准号:
    475019-2015
  • 财政年份:
    2017
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Applying structure in computational auditory scene analysis
在计算听觉场景分析中应用结构
  • 批准号:
    475019-2015
  • 财政年份:
    2016
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
A study on the structure creation of activation support of acoustic measurement environment based on computational auditory scene analysis
基于计算听觉场景分析的声学测量环境激活支撑结构创建研究
  • 批准号:
    16H02911
  • 财政年份:
    2016
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Applying structure in computational auditory scene analysis
在计算听觉场景分析中应用结构
  • 批准号:
    475019-2015
  • 财政年份:
    2015
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Computational Auditory Scene Analysis Using Active Audio-Visual Integration in a Dynamically Changing Environment
在动态变化的环境中使用主动视听集成进行计算听觉场景分析
  • 批准号:
    22700165
  • 财政年份:
    2010
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Development of Robot Audition based on Computational Auditory Scene Analysis
基于计算听觉场景分析的机器人试听开发
  • 批准号:
    19100003
  • 财政年份:
    2007
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Grant-in-Aid for Scientific Research (S)
Construction of computational theories of audition and the application to sound signal processing
听觉计算理论的构建及其在声音信号处理中的应用
  • 批准号:
    15300061
  • 财政年份:
    2003
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Study on Computational Auditory Scene Analysis for Humanoids by Active Audition
基于主动听觉的类人计算听觉场景分析研究
  • 批准号:
    15200015
  • 财政年份:
    2003
  • 资助金额:
    $ 50.03万
  • 项目类别:
    Grant-in-Aid for Scientific Research (A)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了