
Research on a Speaker Identification System Using Bone-Conducted Speech (骨導音声を用いた話者識別システムに関する研究)

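Basic Information

Grant number:
14750288
Principal investigator:
森幹男
Amount:
$17,300
Host institution:
University of Fukui
Host institution country:
Japan
Project category:
Grant-in-Aid for Young Scientists (B)
Fiscal year:
2002
Funding country:
Japan
Project period:
2002 to 2004
Project status:
Completed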

Source:
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-14750288/
Keywords:
bone-conducted speech; speaker identification; noise robustness; falsetto

Project Abstract

The research results obtained this fiscal year are as follows.

1. Recording of air-conducted and bone-conducted speech. Air-conducted and bone-conducted speech were newly recorded from 12 adults in a soundproof room while they uttered the five Japanese vowels and spoken words (digits). In addition, station-concourse noise from the JEIDA (電子協) noise database DAT was played back in the soundproof room and the same recordings were repeated, producing speech data for noisy conditions.

2. Speaker identification experiments. This year, speaker identification experiments were carried out and examined in detail, using as the feature the difference between the log power spectra of the air-conducted and bone-conducted speech recorded while the five Japanese vowels were uttered. For speech recorded under clean (noise-free) conditions, an identification rate of 99.2% was obtained for 12 speakers. When the same experiment was repeated with cepstral coefficients in place of the log power spectrum, an identification rate of 100% was obtained for 12 speakers. (Last year the rate was 93.0% for 10 speakers; the year before, it was 85.0% for 10 speakers with the energy ratio as the only feature.) In both cases, noise robustness was confirmed to be higher than when air-conducted speech alone was used.

3. Falsetto discrimination and singing-voice evaluation. Although the vocal-fold vibration in falsetto is clearly different, falsetto is said to be hard to distinguish by ear, particularly in women, and is sometimes not recognized even by the speaker herself. Falsetto was therefore discriminated from the distortion factor of bone-conducted speech, and detection of the register transition point between modal voice and falsetto was attempted. The results showed that the transition point can be detected from the distortion factor of bone-conducted speech. They also showed that using this distortion factor as a feature can be expected to improve the speaker identification rate. Finally, the musical vocal range was evaluated from air-conducted speech, and its application to a voice trainer that automatically and objectively judges "correct vocalization" was examined; its effectiveness was confirmed.
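The feature described in item 2 of the abstract (the difference between the log power spectra of air-conducted and bone-conducted speech) can be illustrated with a minimal sketch. The abstract does not state the frame length, the window, or the classifier, so everything below is an assumption made for illustration only: the periodic Hann window, the naive DFT, the nearest-template Euclidean matching, and all function names (`log_power_spectrum`, `spectral_difference`, `identify`, `synth`) are hypothetical, and the signals are synthetic toy tones rather than the project's recordings.

```python
import cmath
import math


def log_power_spectrum(frame):
    """Log power spectrum in dB of one frame (periodic Hann window, naive DFT)."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        # Small floor avoids log(0) on empty bins (assumed value).
        spectrum.append(10.0 * math.log10(abs(acc) ** 2 + 1e-12))
    return spectrum


def spectral_difference(air_frame, bone_frame):
    """Feature vector: air-conducted minus bone-conducted log power spectrum."""
    air = log_power_spectrum(air_frame)
    bone = log_power_spectrum(bone_frame)
    return [a - b for a, b in zip(air, bone)]


def identify(feature, templates):
    """Return the speaker whose stored template is nearest (Euclidean distance).

    Nearest-template matching is an assumption; the abstract does not say
    which classifier was used.
    """
    def dist(u, v):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    return min(templates, key=lambda name: dist(feature, templates[name]))


def synth(n, f0_bin, high_gain):
    """Toy two-tone signal; high_gain scales the upper tone (synthetic data)."""
    return [math.sin(2 * math.pi * f0_bin * i / n)
            + high_gain * math.sin(2 * math.pi * 3 * f0_bin * i / n)
            for i in range(n)]


# Two toy "speakers" whose bone-conduction path attenuates the upper tone
# by different amounts; the air-conducted signal is identical for both.
air = synth(64, 4, 1.0)
templates = {
    "A": spectral_difference(air, synth(64, 4, 0.1)),
    "B": spectral_difference(air, synth(64, 4, 0.5)),
}
probe = spectral_difference(air, synth(64, 4, 0.12))
print(identify(probe, templates))
```

In this toy setup the air-conducted signal carries no speaker information at all, so any discrimination comes from the air-minus-bone difference. A real system would average the feature over many frames per vowel and would need actual paired recordings.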