ITR: Personalized Spatial Audio via Scientific Computing and Computer Vision

ITR:通过科学计算和计算机视觉实现个性化空间音频

基本信息

  • 批准号:
    0086075
  • 负责人:
  • 金额:
    $ 300万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2000
  • 资助国家:
    美国
  • 起止时间:
    2000-09-01 至 2006-08-31
  • 项目状态:
    已结题

项目摘要

This is the first 4 years funding of a five-year continuing award. Humans are very good at discerning the spatial origin of sound using a mixture of frequency-dependent interaural time difference (ITD), interaural level difference (ILD), and pinna spectral cues in disparate environments ranging from open spaces to small crowded rooms. This ability helps us to interact with others and the environment by sorting out individual sounds from a mixture, and helps us to survive by warning us of danger over a wider region of space compared to vision. These advantages of spatial sound are important for human-computer interaction. While the frequency-independent ITD cues (delays) associated with the two ears are relatively easy to render over headphones, the ILD (level difference) and pinna elevation cues are not. For a given source location and frequency content, the sound scattered by the person's torso, head and pinnae, and is received differently at the two ears, leading to differences in the intensity and spectral features of the received sound. These effects are encoded in an extremely individual "Head Related Transfer Function" (HRTF) that depends on the person's anatomical features (structure of the torso, head and pinnae). This individuality has made it difficult to use the HRTF in the proposed applications. Recent research, including that of members of this team, has focused on measuring the HRTFs for individuals in specific environments, on constructing models of the HRTF, on understanding how the geometry of the body is related to the characteristics of HRTF, and how the brain processes the cues to derive spatial information. However, this research has also indicated that the brain is extraordinarily perceptive to errors in cues that result when sound is rendered with an incorrect HRTF.In this project the PI and his team will use numerical methods to compute individualized HRTFs from accurate 3-D surface models of the body. They will use multiview, multiframe computational vision techniques to extract the surface models from imagery. They will then use boundary element methods employing fast multipole/ transform techniques and parallel processing to compute the HRTFs from the surface models. The resulting HRTFs will be evaluated both by objective comparisons with acoustically measured HRTFs and by psychoacoustic testing, and will be used in demonstrations of virtual reality, augmented reality, and teleconferencing. A major advantage of this vision-based approach is that it will allow the PI and his team to investigate and model the way that HRTFs change with body posture, providing the potential of tracking dynamic environments. Thus, the project will include fundamental research to extend the static HRTF measurements to dynamic situations in different environments, using a combination of visual tracking to locate the person in real space, and construction of in-room HRTFs from free-field HRTFs using fast iterative techniques. This will provide a scientific foundation for HCI applications of audio rendering. The research will in addition yield algorithms and understanding that will have an impact on varied fields, including computer vision based model creation; scientific computing; computational acoustics for noise control and land mine detection; neurophysiological understanding of human audition; etc.
这是一个为期五年的连续奖励的第一个四年资助。 人类非常善于在从开放空间到小型拥挤房间的不同环境中使用频率相关的耳间时间差(ITD)、耳间电平差(ILD)和耳廓频谱线索的混合来辨别声音的空间来源。 这种能力可以帮助我们与他人和环境互动,从混合物中分辨出单个声音,并通过警告我们比视觉更广泛的空间区域的危险来帮助我们生存。 空间声音的这些优点对于人机交互是重要的。虽然与双耳相关的频率无关的ITD提示(延迟)相对容易通过耳机呈现,但ILD(电平差)和耳廓高度提示则不然。 对于给定的源位置和频率内容,声音被人的躯干、头部和耳廓散射,并且在两个耳朵处被不同地接收,导致接收到的声音的强度和频谱特征的差异。 这些影响被编码在一个非常个人的“头部相关传递函数”(HRTF),这取决于人的解剖特征(躯干,头部和脊椎的结构)。 这种个性使得难以在所提出的应用中使用HRTF。 最近的研究,包括这个团队的成员,已经集中在测量的HRTF为个人在特定的环境中,在构建模型的HRTF,了解身体的几何形状是如何与HRTF的特性,以及大脑如何处理的线索,以获得空间信息。 然而,这项研究也表明,大脑对错误的线索是非常敏感的,当声音被错误的HRTF渲染时,在这个项目中,PI和他的团队将使用数值方法从精确的三维身体表面模型计算个性化的HRTF。 他们将使用多视图、多帧计算视觉技术从图像中提取表面模型。 然后,他们将使用边界元方法,采用快速多极/变换技术和并行处理来计算表面模型的HRTF。 由此产生的HRTF将通过与声学测量的HRTF的客观比较和心理声学测试进行评估,并将用于虚拟现实,增强现实和电话会议的演示。 这种基于视觉的方法的一个主要优点是,它将允许PI和他的团队调查和建模HRTF随身体姿势变化的方式,从而提供跟踪动态环境的潜力。 因此,该项目将包括基础研究,以扩展静态HRTF测量在不同的环境中的动态情况下,使用视觉跟踪的组合,以定位在真实的空间的人,和建设室内HRTF从自由场HRTF使用快速迭代技术。 这将为音频渲染的HCI应用提供科学的基础。 此外,该研究还将产生算法和理解,这些算法和理解将对各个领域产生影响,包括基于计算机视觉的模型创建;科学计算;用于噪声控制和地雷探测的计算声学;对人类听觉的神经生理学理解等。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Larry Davis其他文献

Modeling and Implementation of Discrete Event Net Based Distributed Control for Industrial Robotic Systems
基于离散事件网络的工业机器人系统分布式控制的建模与实现
  • DOI:
  • 发表时间:
    2012
  • 期刊:
  • 影响因子:
    0
  • 作者:
    立松直倫;大谷淳;Larry Davis;Gen'ichi Yasuda
  • 通讯作者:
    Gen'ichi Yasuda
Mapping TOEFL ® Essentials ™ Test Scores to the Canadian Language Benchmarks
将 TOEFL ® Essentials ™ 考试成绩与加拿大语言基准对应起来
  • DOI:
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    0
  • 作者:
    S. Papageorgiou;Larry Davis;Renka Ohta;Pablo Garcia Gomez
  • 通讯作者:
    Pablo Garcia Gomez
A Literature Review of the Influence of Emotional Intelligence (EI) and the Big Five Personality Traits on Leadership Effectiveness
情商(EI)和大五人格特质对领导效能影响的文献综述
Automated Scoring of Speaking Tasks in the Test of English‐for‐Teaching (TEFT™)
教学英语测试 (TEFT™) 中口语任务的自动评分
  • DOI:
    10.1002/ets2.12080
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    0
  • 作者:
    K. Zechner;Lei Chen;Larry Davis;Keelan Evanini;Chong Min Lee;C. W. Leong;Xinhao Wang;Su
  • 通讯作者:
    Su
In Memory of Azriel Rosenfeld

Larry Davis的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Larry Davis', 18)}}的其他基金

EAGER: Document Image Quality Estimation, Enhancement, Classification and Retrieval
EAGER:文档图像质量估计、增强、分类和检索
  • 批准号:
    1359902
  • 财政年份:
    2013
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
ISE:Planning:Collaborative Research: Informal Discovery of Programming Concepts Via Reflective Programming
ISE:规划:协作研究:通过反思性编程非正式地发现编程概念
  • 批准号:
    0917523
  • 财政年份:
    2009
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
Textual Information Access for the Visually Impaired
为视障人士提供文本信息访问
  • 批准号:
    9987944
  • 财政年份:
    2000
  • 资助金额:
    $ 300万
  • 项目类别:
    Continuing Grant
CISE Experimental Partnerships: High Performance Systems for Shape and Action Modeling
CISE 实验合作伙伴:用于形状和动作建模的高性能系统
  • 批准号:
    9901249
  • 财政年份:
    1999
  • 资助金额:
    $ 300万
  • 项目类别:
    Continuing Grant
CISE Postdoctoral Program: Postdoctoral Research Associate in Computational Science & Engineering Science: High Performance Computing for Remote Sensing Applications
CISE博士后项目:计算科学博士后研究员
  • 批准号:
    9625668
  • 财政年份:
    1996
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
Systems and Software Tools for High Performance Computing
用于高性能计算的系统和软件工具
  • 批准号:
    9401151
  • 财政年份:
    1994
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
Workshop on High Performance Computing and Communications and the National Challenge: Fall 1994: Washington, D.C
高性能计算和通信以及国家挑战研讨会:1994 年秋季:华盛顿特区
  • 批准号:
    9408472
  • 财政年份:
    1994
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
High Performance Computing for Land Cover Dynamics
土地覆盖动态的高性能计算
  • 批准号:
    9318183
  • 财政年份:
    1993
  • 资助金额:
    $ 300万
  • 项目类别:
    Continuing Grant
Black-White Group Interaction: A Study of Control and Comfort
黑白群体互动:控制与舒适的研究
  • 批准号:
    8918199
  • 财政年份:
    1990
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
Image Processing By Means of Hierarchical Relaxation Shape Analysis
通过分层松弛形状分析进行图像处理
  • 批准号:
    7904037
  • 财政年份:
    1979
  • 资助金额:
    $ 300万
  • 项目类别:
    Continuing Grant

相似海外基金

Personalized Online Adaptive Learning System
个性化在线自适应学习系统
  • 批准号:
    23K20186
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
CAREER: Personalized Maternal Care Decision Support System for Underserved Populations
职业:针对服务不足人群的个性化孕产妇护理决策支持系统
  • 批准号:
    2339992
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Continuing Grant
CAREER: Personalized, wearable robot mobility assistance considering human-robot co-adaptation that incorporates biofeedback, user coaching, and real-time optimization
职业:个性化、可穿戴机器人移动辅助,考虑人机协同适应,结合生物反馈、用户指导和实时优化
  • 批准号:
    2340519
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Continuing Grant
Realizing Human Brain Stimulation of Deep Regions Based on Novel Personalized Electrical Computational Modelling
基于新型个性化电计算模型实现人脑深部刺激
  • 批准号:
    23K25176
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Prediction, Monitoring and Personalized Recommendations for Prevention and Relief of Dementia and Frailty
预防和缓解痴呆症和衰弱的预测、监测和个性化建议
  • 批准号:
    10103541
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    EU-Funded
Modular Laser Sources For Sustainable Production Of Short Personalized Production Series (WAVETAILOR)
用于短个性化生产系列可持续生产的模块化激光源 (WAVETAILOR)
  • 批准号:
    10091981
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    EU-Funded
5G4PHealth: Enhanced 5G-Powered Platform for Predictive Preventive Personalized and Participatory Healthcare
5G4PHealth:增强型 5G 支持平台,用于预测、预防、个性化和参与式医疗保健
  • 批准号:
    10093679
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Collaborative R&D
Biomarker-Based Platform for Early Diagnosis of Chronic Liver Disease to Enable Personalized Therapy (LIVERAIM)
基于生物标志物的慢性肝病早期诊断平台,以实现个性化治疗(LIVERAIM)
  • 批准号:
    10087822
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    EU-Funded
HSI Implementation and Evaluation Project: Increasing Computer Science Undergraduate Retention through Predictive Modeling and Early, Personalized Academic Interventions
HSI 实施和评估项目:通过预测建模和早期个性化学术干预提高计算机科学本科生的保留率
  • 批准号:
    2345378
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
PFI-RP: Resilient and Energy-Efficient Memory Chips for Enhanced Mobile AI and Personalized Machine Learning
PFI-RP:用于增强移动人工智能和个性化机器学习的弹性和节能内存芯片
  • 批准号:
    2345655
  • 财政年份:
    2024
  • 资助金额:
    $ 300万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了