Perceptual Sound Field Reconstruction and Coherent Emulation
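Basic information
- Grant number: EP/F001142/1
- Principal investigator:
- Amount: $496,700
- Host institution:
- Host institution country: United Kingdom
- Project type: Research Grant
- Fiscal year: 2008
- Funding country: United Kingdom
- Duration: 2008 to (no data)
- Project status: Completed
- Source:
- Keywords: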
Project abstract
The project is concerned with the development of a new 5 to 10 channel audio technology that would improve on existing ones in terms of (a) realism, (b) accuracy and stability of the auditory perspective, (c) size of the sweet spot, and (d) the envelopment experience. Since the new technology aims to create a 360-degree auditory perspective, reproduction will take place over speakers positioned at the vertices of a regular polygon. Each speaker will consist of two components: one that radiates the direct sound field toward the listener, and another that reproduces the diffuse sound field by introducing additional scattering. The goal of the tasks listed below is to find optimal ways to capture sound field cues and render them with the proposed playback system so as to provide the most convincing illusion of the original or desired sound field.

(i) Optimal microphone arrays for the proposed playback system will be investigated. The arrays considered will consist of microphones placed in the horizontal plane at the vertices of a regular polygon, with the number of microphones equal to the number of speakers. For each array, different diameters, ranging from near coincident up to somewhat beyond the optimal value, and different microphone directivity patterns will be considered. These studies will be repeated for a few diameters of the speaker configuration to investigate whether the optimal array diameter depends on the size of the speaker layout and, if so, to characterize that dependence. Possible dependencies between the optimal microphone directivity patterns and array diameters will also be investigated and characterized. Arrays will be evaluated in critical listening tests according to criteria (a) to (d) stated above. Experiments will be guided by simulations that provide an initial objective assessment of the interaural time difference (ITD) and interaural level difference (ILD) cues generated within the listening area. In parallel, mathematical models of the sound fields generated by the proposed technology will be investigated, which could provide additional insight into optimal microphone array design.

(ii) The impact of playback with cross-talk cancellation will be systematically investigated. Existing cross-talk cancellation algorithms will be used first and, if necessary, new algorithms that are numerically efficient and effective in a range of listening environments will be developed. Optimal microphone arrays for playback with cross-talk cancellation will then be investigated, i.e. the work described under (i) will be repeated for reproduction with cross-talk cancellation. Finally, optimal systems with and without cross-talk cancellation will be compared.

(iii) Algorithms for direct/diffuse sound field separation will be studied. When the number of instruments does not exceed the number of microphones, multichannel equalization techniques can be used to find dry source signals, which can then be convolved with the direct and reverberant parts of room impulse responses to obtain the direct and diffuse sound field components, respectively. Multichannel equalization in audio is, however, particularly challenging owing to excessively long impulse responses, and we will develop numerically efficient algorithms for multichannel equalization in audio applications. We will then study psychoacoustic approximations to direct/diffuse sound field decomposition with no restriction on the number of sources.

(iv) Combinations of near-coincident directional microphone arrays, for acquiring direct sound field cues, and widely spaced arrays based on omnidirectional or bidirectional microphones, for acquiring diffuse sound field cues, will be systematically investigated in critical listening tests according to criteria (a) to (d). This approach will be evaluated in comparison with the approach described in (i) to (iii), where the same array is used for both sound field components.
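The convolution step mentioned under (iii) can be illustrated with a minimal Python sketch. It is not the project's algorithm: it assumes the dry source signal and a room impulse response are already available, and it splits the impulse response at a fixed time after its strongest peak. The function names `split_rir` and `direct_diffuse_components` and the 5 ms window `direct_ms` are hypothetical choices made for this illustration only.

```python
import numpy as np


def split_rir(rir, fs, direct_ms=5.0):
    """Split a room impulse response into direct and reverberant parts.

    Assumption for illustration: the strongest peak is the direct-path
    arrival, and everything up to `direct_ms` milliseconds after it
    counts as the direct part.
    """
    rir = np.asarray(rir, dtype=float)
    peak = int(np.argmax(np.abs(rir)))
    cut = min(len(rir), peak + int(round(direct_ms * 1e-3 * fs)) + 1)
    direct = np.zeros_like(rir)
    reverb = np.zeros_like(rir)
    direct[:cut] = rir[:cut]   # samples up to the split point
    reverb[cut:] = rir[cut:]   # the remaining reverberant tail
    return direct, reverb


def direct_diffuse_components(dry, rir, fs, direct_ms=5.0):
    """Convolve a dry source with each part of the impulse response."""
    direct_ir, reverb_ir = split_rir(rir, fs, direct_ms)
    return np.convolve(dry, direct_ir), np.convolve(dry, reverb_ir)
```

Because convolution is linear and the two parts of the impulse response sum to the whole, the two returned components add up to the dry signal convolved with the full response. Where to place the split point is a design choice that this sketch leaves as a free parameter.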
Project outputs
Journal articles (10)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Scattering Delay Network: an interactive reverberator for computer games
- DOI:
- Year published:
- Journal:
- Impact factor: 0
- Authors: Enzo De Sena (Author)
- Corresponding author: Enzo De Sena (Author)
Perceptual evaluation of a circularly symmetric microphone array for panoramic recording of audio
- DOI:
- Year published: 2010
- Journal:
- Impact factor: 0
- Authors: E. D. Sena; H. Hacıhabiboğlu; Z. Cvetković
- Corresponding author: Z. Cvetković
Efficient Synthesis of Room Acoustics via Scattering Delay Networks
- DOI: 10.1109/taslp.2015.2438547
- Year published: 2015-02
- Journal:
- Impact factor: 0
- Authors: E. D. Sena; H. Hacıhabiboğlu; Z. Cvetković; J. Smith
- Corresponding authors: E. D. Sena; H. Hacıhabiboğlu; Z. Cvetković; J. Smith
Localization Uncertainty in Time-Amplitude Stereophonic Reproduction
- DOI: 10.1109/taslp.2020.2975419
- Year published: 2020
- Journal:
- Impact factor: 0
- Authors: De Sena E
- Corresponding author: De Sena E
Analysis and Design of Multichannel Systems for Perceptual Sound Field Reconstruction
- DOI: 10.1109/tasl.2013.2260152
- Year published: 2013
- Journal:
- Impact factor: 0
- Authors: De Sena E
- Corresponding author: De Sena E
Other publications by Zoran Cvetkovic
Overcomplete expansions and robustness
- DOI: 10.1109/tfsa.1996.547479
- Year published: 1996
- Journal:
- Impact factor: 0
- Authors: Zoran Cvetkovic; Martin Vetterli
- Corresponding author: Martin Vetterli
Other grants held by Zoran Cvetkovic
Challenges in Immersive Audio Technology
- Grant number: EP/X032981/1
- Fiscal year: 2024
- Funding amount: $496,700
- Project type: Research Grant
Visits to University of California, Berkeley, Stanford University, and SRI International
- Grant number: EP/K034626/1
- Fiscal year: 2013
- Funding amount: $496,700
- Project type: Research Grant
Robust Syllable Recognition in the Acoustic-Waveform Domain
- Grant number: EP/D053005/1
- Fiscal year: 2006
- Funding amount: $496,700
- Project type: Research Grant
Similar overseas grants
A model of directional room impulse responses for evaluating the quality of sound field
- Grant number: 22KJ1941
- Fiscal year: 2023
- Funding amount: $496,700
- Project type: Grant-in-Aid for JSPS Fellows
Effect of lower-elevated sound production on three-dimensional sound-field
- Grant number: 22KF0133
- Fiscal year: 2023
- Funding amount: $496,700
- Project type: Grant-in-Aid for JSPS Fellows
Spectroscopic high-speed motion picture measurement of sound field with nanometer-order precision
- Grant number: 22K18809
- Fiscal year: 2022
- Funding amount: $496,700
- Project type: Grant-in-Aid for Challenging Research (Exploratory)
Improving the Perceived Sound Field Continuity Using Convolution of Impulse Sequences for Virtual Acoustics
- Grant number: RGPIN-2020-05621
- Fiscal year: 2022
- Funding amount: $496,700
- Project type: Discovery Grants Program - Individual
Data-driven sound field measurement for high-resolution spatial audio analysis and its applications
- Grant number: 22H03608
- Fiscal year: 2022
- Funding amount: $496,700
- Project type: Grant-in-Aid for Scientific Research (B)
Construction of a Real-time High-precision Sound Field Auralization Platform
- Grant number: 22K12123
- Fiscal year: 2022
- Funding amount: $496,700
- Project type: Grant-in-Aid for Scientific Research (C)
Improvement of the performance of sound field reproduction system and the cooperation with welfare engineering
- Grant number: 21H03764
- Fiscal year: 2021
- Funding amount: $496,700
- Project type: Grant-in-Aid for Scientific Research (B)
Improving the Perceived Sound Field Continuity Using Convolution of Impulse Sequences for Virtual Acoustics
- Grant number: RGPIN-2020-05621
- Fiscal year: 2021
- Funding amount: $496,700
- Project type: Discovery Grants Program - Individual
Improving the Perceived Sound Field Continuity Using Convolution of Impulse Sequences for Virtual Acoustics
- Grant number: RGPIN-2020-05621
- Fiscal year: 2020
- Funding amount: $496,700
- Project type: Discovery Grants Program - Individual
A Leading Sound Affects the Local-Field Potential Elicited by a Trailing Sound in the Rat's Inferior Colliculus in a Direction-Dependent Manner
- Grant number: 552665-2020
- Fiscal year: 2020
- Funding amount: $496,700
- Project type: Alexander Graham Bell Canada Graduate Scholarships - Master's