权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Develoment of Spoken Dialogue System for Japanese and Chinese

日汉口语对话系统的开发

基本信息

批准号：
08558028
负责人：
HIROSE Keikichi
金额：
$ 5.12万
依托单位：
The University of Tokyo
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (A)
财政年份：
1996
资助国家：
日本
起止时间：
1996 至 1998
项目状态：
已结题

来源：
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-08558028/
关键词：
Spoken Dialogue System Speech Recognition Speech Synthesis Multi-lingual System Viterbi Bayesian Predictive Classification Waveform Concatenation Synthesis Prosodic Modelig Tone Recognition ビタビ探索 TD-PSOLA 対話処理言語自動識別ベイズ予測分類 HMM

项目摘要

With the aim of developing a spoken dialogue system for both Japanese and Chinese in order to check the possibility of realizing practical systems of multilingual spoken dialogue, the following major results were obtained.1. After selecting literature retrieval as the system task, we have arranged necessary databases and installed dictionary for speech synthesis. Also a speech corpus was cotstructed for training and evaluating speech recognition.2. Phoneme HMM's and phoneme class HMM's were trained using the corpus. A method was developed to identify the input speech being Japanese or Chinese based on the phoneme/phoneme class sequences.3. A robust speech recognition method was developed based on Bayesian predictive classification with Viterbi approximation. An adaptation method was further proposed, where improved posterior probability density function was estimated via sequential Bayesian learning using adaptation data. Another robust method, minimax, was also investigated to make it … More applicable to continuous speech.4. An automatic Waveform concatenation speech synthesis method was developed. This method is based on segmenting speech waveform using speech recognition technique, and automatically placing pitch marks after LMA analysis. It was utilized for the Chinese speech synthesis.5. Waveform concatenation synthesizer was combined with formant synthesizer to generate a new speech synthesis system. This system was shown to improve several low quality phonemes.6. A speech synthesis oriented modeling of ; Chinese prosody was developed based on the newly defined function for unified representation of Chinese fundamental frequency contours.7. A method was developed for precise tone recognition of Chinese continuous speech. This method is based on using features of tone nucleus of a syllable only.8. A Japanese/Chinese spoken dialogue system was constructed (or literature retrieval. Chinese responses were pre-stored sentences, while Japanese responses were generated from semantic representations. The system was confirmed to operate both in Japanese and Chinese. Less

本研究以开发日语和汉语的口语对话系统为目的，为了验证实现多语言口语对话的实用系统的可能性，取得了以下主要成果。1.在选择了文献检索作为系统任务后，我们安排了必要的数据库，安装了语音合成词典。并建立了语音语料库，用于语音识别的训练和评价.使用语料库训练音素HMM和音素类HMM。提出了一种基于音素/音素类序列识别输入语音是日语还是汉语的方法.提出了一种基于Viterbi近似的贝叶斯预测分类的鲁棒语音识别方法。进一步提出了一种自适应方法，利用自适应数据通过序贯贝叶斯学习估计改进的后验概率密度函数。另一个强大的方法，极大极小，也进行了研究，使之 ...更多信息适用于连续语音。提出了一种波形拼接语音自动合成方法。该方法基于语音识别技术对语音波形进行分段，并在LMA分析后自动放置基音标记。并将其用于汉语语音合成.将波形级联合成器与共振峰合成器相结合，构成一种新的语音合成系统。该系统被证明可以改善几个低质量的手机。基于新定义的基频轮廓统一表示函数，实现了面向语音合成的汉语韵律建模.提出了一种汉语连续语音声调的精确识别方法。该方法仅利用了音节的声核特征.构建了一个日汉口语对话系统（或文献检索系统）。中文的反应是预先存储的句子，而日语的反应是从语义表征产生的。该系统已被确认为日语和中文操作。少