权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Unsupervised audio-visual geometry calibration of distributed microphone arrays

分布式麦克风阵列的无监督视听几何校准

基本信息

批准号：
166631756
负责人：
Professor Dr.-Ing. Reinhold Häb-Umbach
金额：
--
依托单位：
Institut für Elektrotechnik und Informationstechnik
依托单位国家：
德国
项目类别：
Research Grants
财政年份：
2010
资助国家：
德国
起止时间：
2009-12-31 至 2013-12-31
项目状态：
已结题

来源：
https://gepris.dfg.de/gepris/projekt/166631756?language=en
关键词：
Unsupervised audio visual geometry calibration

项目摘要

The overall goal of the project is to simplify the installation of audio-visual sensor networks. This is achieved by developing algorithms for automatically determining the position of the distributed microphone arrays from reverberant speech input. Then correspondences between events sensed by the acoustic sensor network and those sensed by a sensor network of another modality are established. This other modality is given by a multi-camera-network, where the camera positions are assumed to be known. Once the correspondences have been detected the relative geometries of the acoustic sensor network can be mapped to the given coordinate system of the camera network.In the first project phase we have developed algorithms for microphone array posiltion self calibration in 2D, which are characterized by immunity to reverberation and which can operate on speech input, not requiring artificial calibration signals.The requested project extension is devoted to the following objectives:1) A correspondence will be developed between the relative geometries of the acoustic sensor network and a global coordinate system, which is given by a multi-camera system. This shall be achieved by mapping trajectories of speakers obtained from the acoustic subsystem to trajectories of the visual correlates, i.e., faces or persons.2) The developed geometry self calibration system will be embedded in an ambient communication system for evaluation purposes.In the ambient communication system an acoustic beamformer extracts the speech signal, while the active camera system tracks the speaker. In this test bed the interaction between user and system can be studied in a real-time environment and under realistic environmental conditions, eventually leading to an optimization of the calibration system.

该项目的总体目标是简化视听传感器网络的安装。这是通过开发用于从混响语音输入自动确定分布式麦克风阵列的位置的算法来实现的。然后，建立由声学传感器网络感测的事件与由另一模态的传感器网络感测的事件之间的对应关系。该另一模态由多相机网络给出，其中相机位置被假设为已知的。一旦检测到对应关系，声学传感器网络的相对几何形状就可以映射到摄像机网络的给定坐标系。在第一个项目阶段，我们已经开发了用于麦克风阵列位置自校准的2D算法，其特征在于对混响的免疫力，并且可以对语音输入进行操作，所请求的项目扩展致力于以下目标：1）将在声学传感器网络的相对几何形状与由多相机系统给出的全局坐标系之间建立对应关系。这将通过将从声学子系统获得的扬声器的轨迹映射到视觉相关的轨迹来实现，即，2）将所开发的几何自校准系统嵌入到环境通信系统中进行评估。在环境通信系统中，声学波束形成器提取语音信号，而主动摄像机系统跟踪说话人。在该测试台中，可以在实时环境和现实环境条件下研究用户与系统之间的交互，最终导致校准系统的优化。

项目成果

期刊论文数量（3）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Multi-speaker tracking using multiple distributed microphone arrays

使用多个分布式麦克风阵列进行多扬声器跟踪

DOI：
10.1109/icassp.2014.6853669
发表时间：
2014
期刊：
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
影响因子：
0
作者：
A. Plinge;G. A. Fink
通讯作者：
G. A. Fink

Geometry calibration of multiple microphone arrays in highly reverberant environments

高混响环境中多个麦克风阵列的几何校准