权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

マイクロホンアレーを用いた音環境の認識に関する研究

基于麦克风阵列的声音环境识别研究

基本信息

批准号：
12780259
负责人：
山田武志
金额：
$ 1.15万
依托单位：
University of Tsukuba
依托单位国家：
日本
项目类别：
Grant-in-Aid for Encouragement of Young Scientists (A)
财政年份：
2000
资助国家：
日本
起止时间：
2000 至 2001
项目状态：
已结题

项目摘要

1.個々の音を抽出する技術:マイクロホンアレーによる音源抽出法において,複数の音源の位置を同時に推定し,かつ個々の音源の移動を追尾する機能を実現するために,3次元トレリス法の適用について検討している.3次元トレリス法の性能は,マイクロホンアレーの指向性ビームの鋭さが十分ではなく,他の方向からの音が重畳するような場合,大きく低下してしまう.この問題に対処するための方法としては,複数の環境音が重畳している区間を事前に検出し,あらかじめ重畳を考慮したモデルを用意することが考えられる.本研究では,環境音モデルとHMM合成法を用いて複数の環境音が重畳している区間を検出する方法を提案した.孤立単語,連続単語,文章と環境音が重畳している状況を想定した評価実験を行った結果,環境音が重畳している区間,重畳している環境音の種類とそのSN比を良好に検出できることが分かった.今後,このような重畳区間情報を利用して,3次元トレリス法の性能改善を図る予定である.2.個々の音を認識する技術:HMMにより環境音をモデル化する際,HMMの単位と構造(状態数や状態の接続形態など)について検討する必要がある.まず,92種類の環境音を数種類の構造でモデル化し,認識実験を行った.その結果,環境音によって適した構造は異なっており,認識率に大きな差が生じることが分かった.次に,92種類の環境音を尤度最大化基準に基いてクラスタリングした結果,音響的に似た環境音同士がマージされるものの,クラスタリングの過程で構造を適応的に変化させる必要があることが分かった.そこで,HMMの単位や構造を,同じ基準の下で同時かつ自動的に決定する方法として,逐次状態分割による隠れマルコフ網の自動生成法を適用し,評価実験によりその有効性を確認した.今後,自動的に環境音を収録し,環境音のモデルを逐次的に更新する手法について検討する予定である.

1. Sound extraction technology: sound source extraction method, simultaneous estimation of the position of a plurality of sound sources, movement of a plurality of sound sources, rear-end tracking function, implementation of the application of three-dimensional sound extraction method, performance of three-dimensional sound extraction method, directivity, sharpness, and direction of sound extraction. The method of solving this problem is to examine the problem in advance in the interval between multiple environmental sounds. In this paper, we propose a new method to synthesize multiple environmental sounds by HMM. isolated language, continuous language, article and environmental sound from heavy to medium condition, think about it, evaluate it, conduct results, environmental sound from heavy to medium interval, heavy to medium environmental sound type, SN ratio, good detection, and so on. In the future, the use of this information in multiple intervals, three-dimensional separation method performance improvement, prediction, etc. 2. Sound recognition technology:HMM environmental sound,HMM unit structure (state number, state connection form), the need for discussion. 92 kinds of environmental sound, several kinds of structure, understanding and implementation. As a result, the environmental sound is different from the structure, and the recognition rate is different. 92 kinds of environmental sound, especially the maximum degree of the benchmark, the sound of the environmental sound is similar to the environmental sound, the structure of the process is suitable for the transformation of the necessary. In this case,HMM's single position structure, under the same base, simultaneously and automatically determine the method, the sequential state segmentation, the automatic generation method of the network is applicable, the evaluation of the effectiveness of the system is confirmed. In the future, automatic recording of ambient sound, ambient sound detection and successive updating of the method of detection and discussion of predetermined conditions.