Perceptual Sound Field Reconstruction and Coherent Emulation

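Basic information

Grant number: EP/F001142/1
Principal investigator: Zoran Cvetkovic
Amount: $496,700
Host institution: King's College London
Host institution country: United Kingdom
Project category: Research Grant
Fiscal year: 2008
Funding country: United Kingdom
Start and end dates: 2008 to (no data)
Project status: Completed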

Source: https://gtr.ukri.org/projects?ref=EP%2FF001142%2F1
Keywords: Perceptual Sound Field Reconstruction Coherent

Project abstract

The project concerns the development of a new 5 to 10 channel audio technology that would improve on existing ones in terms of (a) realism, (b) accuracy and stability of the auditory perspective, (c) size of the sweet spot, and (d) the envelopment experience. Since the new technology aims to create a 360-degree auditory perspective, reproduction will take place over speakers positioned at the vertices of a regular polygon. Each speaker will consist of two components: one that radiates the direct sound field toward the listener, and another that reproduces the diffuse sound field by introducing additional scattering. The goal of the tasks listed below is to find optimal ways to capture sound field cues and render them using the proposed playback system in a manner that provides the most convincing illusion of the original or desired sound field.

(i) Optimal microphone arrays for the proposed playback system will be investigated. The arrays considered will consist of microphones placed in the horizontal plane at the vertices of a regular polygon, with the number of microphones equal to the number of speakers. For each array, different diameters, ranging from near coincident up to somewhat beyond the optimal value, and different microphone directivity patterns will be considered. These studies will be repeated for a few diameters of the speaker configuration to investigate whether the optimal array diameter depends on the size of the speaker layout and, if so, to characterize that dependence. Possible dependencies between the optimal microphone directivity patterns and array diameters will also be investigated and characterized. Arrays will be evaluated in critical listening tests according to criteria (a) to (d) stated above. Experiments will be guided by simulations that provide an initial objective assessment of the ITD and ILD cues generated within the listening area. In parallel, mathematical models of the sound fields generated by the proposed technology will be investigated, which could provide additional insight into optimal microphone array design.

(ii) The impact of playback with cross-talk cancellation will be systematically investigated. Existing cross-talk cancellation algorithms will be used first and, if necessary, new algorithms that are numerically efficient and effective in a range of listening environments will be developed. Optimal microphone arrays for playback with cross-talk cancellation will then be investigated, i.e. the work described under (i) will be repeated for reproduction with cross-talk cancellation. Finally, the optimal systems with and without cross-talk cancellation will be compared.

(iii) Algorithms for direct/diffuse sound field separation will be studied. When the number of instruments does not exceed the number of microphones, multichannel equalization techniques can be used to find the dry source signals, which can then be convolved with the direct and reverberant parts of room impulse responses to obtain the direct and diffuse sound field components, respectively. Multichannel equalization in audio is, however, particularly challenging owing to excessively long impulse responses, and we will develop numerically efficient algorithms for multichannel equalization in audio applications. We will then study psychoacoustic approximations to direct/diffuse sound field decomposition with no restriction on the number of sources.

(iv) Combinations of near-coincident directional microphone arrays, for acquiring direct sound field cues, and widely spaced arrays based on omnidirectional or bidirectional microphones, for acquiring diffuse sound field cues, will be systematically investigated in critical listening tests according to criteria (a) to (d). This approach will be evaluated in comparison with the approach described in (i) to (iii), where the same array is used for both sound field components.
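The convolution step mentioned under task (iii) can be illustrated with a minimal NumPy sketch, assuming the dry source signal and the room impulse response are already known. The function names, the peak-based split point, and the 5 ms `direct_ms` window are hypothetical choices made for illustration; they are not taken from the project, and the multichannel equalization that would recover the dry signal is not shown.

```python
import numpy as np


def split_rir(rir, fs, direct_ms=5.0):
    """Split a room impulse response into direct and reverberant parts.

    Assumption for illustration only: the direct part is everything up to
    `direct_ms` milliseconds after the strongest peak of the response.
    """
    rir = np.asarray(rir, dtype=float)
    peak = int(np.argmax(np.abs(rir)))
    cut = min(len(rir), peak + int(round(direct_ms * 1e-3 * fs)))
    direct = np.zeros_like(rir)
    direct[:cut] = rir[:cut]          # early part: direct sound
    reverberant = rir - direct        # remainder: reverberant tail
    return direct, reverberant


def direct_diffuse(dry, rir, fs, direct_ms=5.0):
    """Convolve a dry source with each part of the impulse response."""
    direct, reverberant = split_rir(rir, fs, direct_ms)
    return np.convolve(dry, direct), np.convolve(dry, reverberant)
```

Because convolution is linear, the two returned components sum to the dry signal convolved with the full impulse response, which gives a simple sanity check for any chosen split point.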

Project outcomes

Journal papers (10)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Scattering Delay Network: an interactive reverberator for computer games

DOI: (not listed)
Published: (not listed)
Journal: Audio for Games
Impact factor: 0
Authors: Enzo De Sena (Author)
Corresponding author: Enzo De Sena (Author)

Perceptual evaluation of a circularly symmetric microphone array for panoramic recording of audio

DOI: (not listed)
Published: 2010
Journal: (not listed)
Impact factor: 0
Authors: E. De Sena; H. Hacıhabiboğlu; Z. Cvetković
Corresponding author: Z. Cvetković

Efficient Synthesis of Room Acoustics via Scattering Delay Networks

DOI: 10.1109/taslp.2015.2438547
Published: 2015-02
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Impact factor: 0
Authors: E. De Sena; H. Hacıhabiboğlu; Z. Cvetković; J. Smith
Corresponding authors: E. De Sena; H. Hacıhabiboğlu; Z. Cvetković; J. Smith

Localization Uncertainty in Time-Amplitude Stereophonic Reproduction

DOI: 10.1109/taslp.2020.2975419
Published: 2020
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Impact factor: 0
Authors: De Sena E
Corresponding author: De Sena E

Analysis and Design of Multichannel Systems for Perceptual Sound Field Reconstruction

DOI: 10.1109/tasl.2013.2260152
Published: 2013
Journal: IEEE Transactions on Audio, Speech, and Language Processing
Impact factor: 0
Authors: De Sena E
Corresponding author: De Sena E

Other publications by Zoran Cvetkovic

Overcomplete expansions and robustness

DOI: 10.1109/tfsa.1996.547479
Published: 1996
Journal: Proceedings of Third International Symposium on Time-Frequency and Time-Scale Analysis (TFTS-96)
Impact factor: 0
Authors: Zoran Cvetkovic; Martin Vetterli
Corresponding author: Martin Vetterli