Research on Speech Recognition and Synthesis in Spoken Dialogue (音声対話における音声の認識と合成に関する研究)

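Basic Information

Grant number:
05241104
Principal investigator:
Yasuhisa Niimi (新美康永)
Amount:
$744,300
Host institution:
Kyoto Institute of Technology
Host institution country:
Japan
Project category:
Grant-in-Aid for Scientific Research on Priority Areas
Fiscal year:
1995
Funding country:
Japan
Project period:
1995 to (no data)
Project status:
Completed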

Project Summary

This fiscal year, research was carried out from four perspectives: the analysis, recognition, and synthesis of dialogue speech, and the modeling of dialogue control. The main results are as follows.
(1) A band-split autocorrelation analysis method was proposed as a noise-robust analysis technique and was confirmed to be effective against various kinds of noise. Extending the method to two-channel input signals recorded with a dummy head was confirmed to improve robustness. (Itakura)
(2) A method was devised for extracting speech rate by frequency analysis of the amplitude envelope of speech, and changes in speech rate in Japanese and English were shown quantitatively. The bimoraic foot phenomenon in Japanese and the isochrony of syllables between stresses in English were also clarified. (Kitazawa)
(3) A high-accuracy phoneme recognition algorithm was established by introducing discriminative training and adaptation to the input speech. A method for acquiring a powerful language model using a discrete HMnet was also proposed and shown to be effective. (Makino)
(4) Functions such as grammar learning, unknown-word processing, and semantic interpretation of sentences containing recognition errors were introduced into a conventional continuous speech recognition system that had relied on strong linguistic constraints, which succeeded in allowing relatively free utterances. (Nakagawa)
(5) A formant-template concatenation speech synthesis method that allows flexible control of prosody, segmental features, and voice quality was proposed. Good synthetic speech was obtained using vocal tract and voice source parameters extracted with a newly developed ARX speech analysis method. (Kasuya)
(6) The characteristics of dialogue speech and read speech were compared, and prosodic rules for synthesizing dialogue speech were created. In addition, using a specific task, a method was developed that controls ellipsis and focus assignment based on the dialogue history so that the generated response speech is easy for the user to understand. (Hirose)
(7) Taking speech recognition errors into account, dialogue control strategies that perform recognition and ask the user to repeat were modeled mathematically, and a quantitative relationship was derived between the overall performance of a dialogue system using such strategies and the performance of the speech recognition system. (Niimi)
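
Result (2) describes extracting speech rate by frequency analysis of the amplitude envelope. The summary gives no algorithmic details, so the following is only a minimal sketch of that general idea, not the project's actual method. The function name `estimate_speech_rate`, the 10 ms RMS framing, the Hann window, and the 1 to 15 Hz search band are all assumptions chosen for illustration.

```python
import numpy as np


def estimate_speech_rate(signal, sample_rate, frame_ms=10.0, band_hz=(1.0, 15.0)):
    """Return the dominant amplitude-modulation frequency (Hz) of a signal.

    Illustrative only: frame length, window, and search band are assumed
    values, not parameters taken from the project summary.
    """
    # 1. Amplitude envelope: RMS energy over short, non-overlapping frames.
    frame_len = int(sample_rate * frame_ms / 1000.0)
    n_frames = len(signal) // frame_len
    frames = np.asarray(signal[: n_frames * frame_len], dtype=float)
    frames = frames.reshape(n_frames, frame_len)
    envelope = np.sqrt(np.mean(frames ** 2, axis=1))

    # 2. Remove the mean so the DC component does not dominate the spectrum.
    envelope = envelope - envelope.mean()

    # 3. Frequency analysis of the envelope (Hann-windowed FFT).
    env_rate = sample_rate / frame_len  # envelope sampling rate in Hz
    spectrum = np.abs(np.fft.rfft(envelope * np.hanning(n_frames)))
    freqs = np.fft.rfftfreq(n_frames, d=1.0 / env_rate)

    # 4. Strongest modulation frequency inside the assumed syllable-rate band.
    mask = (freqs >= band_hz[0]) & (freqs <= band_hz[1])
    return float(freqs[mask][np.argmax(spectrum[mask])])


if __name__ == "__main__":
    # Synthetic stand-in for speech: a 200 Hz tone whose amplitude
    # rises and falls five times per second.
    fs = 8000
    t = np.arange(0, 4.0, 1.0 / fs)
    test_signal = (0.5 + 0.5 * np.cos(2 * np.pi * 5.0 * t)) * np.sin(2 * np.pi * 200.0 * t)
    print(estimate_speech_rate(test_signal, fs))
```

On real speech the envelope spectrum is much less clean than for this synthetic tone, so the peak frequency is best read as a rough indicator of syllable-scale rhythm rather than an exact syllable count per second.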