权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

音声データベースからの音声知識の発見

从语音数据库中发现语音知识

基本信息

批准号：
10143203
负责人：
牧野正三
金额：
$ 1.66万
依托单位：
Tohoku University
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research on Priority Areas (A)
财政年份：
1998
资助国家：
日本
起止时间：
1998 至无数据
项目状态：
已结题

项目摘要

現在,マルチメディア情報のライブラリ化が急速に進められている.音声情報においてもニュースなどを自動的にライブラリ化していくことが強く望まれている.自動ライブラリ化は,発声内容が文字情報として与えられている場合とそうでない場合に分けることができる.与えられている場合には,文字情報と音声情報を対応させることが必要であり,そうでない場合には,音声情報のみから話題の要約が必要になる.本研究では,発声内容に関する文字情報が与えられた場合とそうでない場合に分けて研究を行った.文字情報が与えられた場合には,音声情報と言語情報を対応させるために,高精度な音素モデルと高精度な対応付けの方法が必要になる.文字情報が与えられない場合には,汎用の言語モデルや,音声区間の類似性を高速に計算する方法が必要になる.本年度はこれらの研究目的を達成するために,以下の3項目について研究を行った.1. 音声情報と文字情報を対応させるための高精度音素モデルの自動構築法の開発.2. 高精度音素モデルを用いた音声情報と文字情報の高精度対応づけ(自動音素ラベリング)の方法の開発.3. 音響的類似性を利用した,発声内容をあらわすキーワードの自動抽出法の開発.研究項目の1番目に関しては,従来利用されていた方法より高精度な音素モデルを自動的に構築することができた.この方法は事前に要因を規定する必要がないので,要因を規定できない言語モデルの獲得や認識単位の自動設定などへの応用も可能である.しかし,まだ十分なものとは言えず,一層の高精度化が必要である.研究項目の2番目に関しては,一層の高精度化にはパラメータの差分情報や周波数上の利用が必要である.研究項目の3番目に関しては,音響的類似性のみから高頻度で発声されている共通区間を自動的に抽出する手法を開発した.しかし,「〜ました」などの話題に依存しない単語も検出された.今後これらの付属語を除去するために,基本周波数情報や言語情報の利用が必要になると考えている.

Now, the rapid development of information. Sound information is transmitted automatically to the user. Automatic translation, voice content from text information to text information, and from text information to text information. In the case of text information and audio information, it is necessary to offer audio information and topic information. This study is about the text information related to the sound content and the situation. Text information and speech information are the same, and high precision phoneme information and high precision method are necessary. Text information is necessary for high speed calculation of similarity between speech and sound. This year's research objectives were achieved, and the following 3 projects were carried out. 1. Development of Automatic Construction Method for High Precision Phoneme Information and Text Information. 2. Development of a method for high-precision phoneme matching (automatic phoneme matching) of high-precision phoneme matching (automatic phoneme matching) for audio and text information. 3. The similarity of sound is utilized, and the automatic extraction method of sound content is developed. The first part of the research project is to use the method of high precision phoneme construction automatically. This method is based on the requirement that the user should specify the language in advance, and the possibility of automatic setting of the acquired recognition unit. A layer of high precision is necessary. The research project of the two aspects is related to the need for a layer of high precision, differential information and utilization of frequency. The three aspects of the research project are related to the development of methods for automatic extraction of common intervals from high frequency and acoustic similarity. The topic of "~" depends on the language of the topic. In the future, basic frequency information and speech information are necessary for the use of speech information.