Develoment of Spoken Dialogue System for Japanese and Chinese
日汉口语对话系统的开发
基本信息
- 批准号:08558028
- 负责人:
- 金额:$ 5.12万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (A)
- 财政年份:1996
- 资助国家:日本
- 起止时间:1996 至 1998
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
With the aim of developing a spoken dialogue system for both Japanese and Chinese in order to check the possibility of realizing practical systems of multilingual spoken dialogue, the following major results were obtained.1. After selecting literature retrieval as the system task, we have arranged necessary databases and installed dictionary for speech synthesis. Also a speech corpus was cotstructed for training and evaluating speech recognition.2. Phoneme HMM's and phoneme class HMM's were trained using the corpus. A method was developed to identify the input speech being Japanese or Chinese based on the phoneme/phoneme class sequences.3. A robust speech recognition method was developed based on Bayesian predictive classification with Viterbi approximation. An adaptation method was further proposed, where improved posterior probability density function was estimated via sequential Bayesian learning using adaptation data. Another robust method, minimax, was also investigated to make it … More applicable to continuous speech.4. An automatic Waveform concatenation speech synthesis method was developed. This method is based on segmenting speech waveform using speech recognition technique, and automatically placing pitch marks after LMA analysis. It was utilized for the Chinese speech synthesis.5. Waveform concatenation synthesizer was combined with formant synthesizer to generate a new speech synthesis system. This system was shown to improve several low quality phonemes.6. A speech synthesis oriented modeling of ; Chinese prosody was developed based on the newly defined function for unified representation of Chinese fundamental frequency contours.7. A method was developed for precise tone recognition of Chinese continuous speech. This method is based on using features of tone nucleus of a syllable only.8. A Japanese/Chinese spoken dialogue system was constructed (or literature retrieval. Chinese responses were pre-stored sentences, while Japanese responses were generated from semantic representations. The system was confirmed to operate both in Japanese and Chinese. Less
本研究以开发日语和汉语的口语对话系统为目的,为了验证实现多语言口语对话的实用系统的可能性,取得了以下主要成果。1.在选择了文献检索作为系统任务后,我们安排了必要的数据库,安装了语音合成词典。并建立了语音语料库,用于语音识别的训练和评价.使用语料库训练音素HMM和音素类HMM。提出了一种基于音素/音素类序列识别输入语音是日语还是汉语的方法.提出了一种基于Viterbi近似的贝叶斯预测分类的鲁棒语音识别方法。进一步提出了一种自适应方法,利用自适应数据通过序贯贝叶斯学习估计改进的后验概率密度函数。另一个强大的方法,极大极小,也进行了研究,使之 ...更多信息 适用于连续语音。提出了一种波形拼接语音自动合成方法。该方法基于语音识别技术对语音波形进行分段,并在LMA分析后自动放置基音标记。并将其用于汉语语音合成.将波形级联合成器与共振峰合成器相结合,构成一种新的语音合成系统。该系统被证明可以改善几个低质量的手机。基于新定义的基频轮廓统一表示函数,实现了面向语音合成的汉语韵律建模.提出了一种汉语连续语音声调的精确识别方法。该方法仅利用了音节的声核特征.构建了一个日汉口语对话系统(或文献检索系统)。中文的反应是预先存储的句子,而日语的反应是从语义表征产生的。该系统已被确认为日语和中文操作。少
项目成果
期刊论文数量(70)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Hui Jiang, Keikichi Hirose and Qiang Huo: "Sequential Bayesian learning of CDHMM based on finite mixture approximation of its prior/posterior density" Proc.IEEE Automatic Speech Recognition Workshop, IEEE SP Society. 373-380 (1997)
Hui Jiang、Keikichi Hirose 和 Gang Huo:“基于先验/后验密度的有限混合近似的 CDHMM 的顺序贝叶斯学习”Proc.IEEE 自动语音识别研讨会,IEEE SP 协会。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
広瀬啓吉: "Use of prosodic features in speech recognition (Invited)" Proc. IEEE Invited Workshop on Pattern Recognition for Multimedia Techniques (IEEE Taegu Section). 99-108 (1996)
Keikichi Hirose:“语音识别中韵律特征的使用(特邀)”Proc. IEEE 多媒体技术模式识别研讨会(IEEE 大邱部分)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Keikichi Hirose, Mayumi Sakata and Hiromichi Kawanami: "Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features" Proc. International Conference on Spoken Language Processing, 1. 378-381 (1996)
Keikichi Hirose、Mayumi Sakata 和 Hiromichi Kawanami:“基于韵律特征定量分析的日语对话语音合成”Proc.
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
MERON Yoram: "Waveform concatenation speech synthesis using phonetic clustering and automatic unit selection" 日本音響学会平成9年度秋季研究発表会講演論文集. I. 263-264 (1997)
MERON Yoram:“使用语音聚类和自动单元选择的波形串联语音合成”日本声学学会 1997 年秋季研究会议论文集 I. 263-264 (1997)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
広瀬啓吉: "対話音声の生成(「音声による人間と機械の対話」の第4章)"オーム社. 375 14 (1998)
Keikichi Hirose:“对话语音的生成(《使用语音的人机对话》第 4 章)” Ohmsha 375 14 (1998)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
HIROSE Keikichi其他文献
HIROSE Keikichi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('HIROSE Keikichi', 18)}}的其他基金
Pronunciation education system based on the systematization of non-mothor tongue speech prosody using generation process model and speech synthesis
基于生成过程模型和语音合成的非母语语音韵律系统化的发音教育系统
- 批准号:
24652115 - 财政年份:2012
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Advanced method of prosody control in statistical-based speech synthesis using generation process model of fundamental frequency contours
使用基频轮廓生成过程模型的基于统计的语音合成中韵律控制的先进方法
- 批准号:
24300068 - 财政年份:2012
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Expressive Multi-language Speech Synthesis Based on the Generation Process Model and Its Use for Automatic Speech Translation
基于生成过程模型的表达性多语言语音合成及其在自动语音翻译中的应用
- 批准号:
21300061 - 财政年份:2009
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Synthesis of speech in any speaking styles based on corpus-based generation of prosodic features using the generation process model
使用生成过程模型基于语料库生成韵律特征来合成任何说话风格的语音
- 批准号:
17300055 - 财政年份:2005
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
High-quality Speech Synthesis based on Accurate Analysis Method and Statistical Method
基于精确分析方法和统计方法的高质量语音合成
- 批准号:
12480079 - 财政年份:2000
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Naturally Sounding Speech Synthesis and Recognition Based on the Formulation of Prosody
基于韵律表述的自然语音合成与识别
- 批准号:
09480061 - 财政年份:1997
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Formulation of Prosodic Features of Speech and its Application to Continuous Speech Recognition
语音韵律特征的制定及其在连续语音识别中的应用
- 批准号:
06452397 - 财政年份:1994
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Rule-Synthesis of Spoken Sentences for the Speech Dialogue Systems
语音对话系统的口语句子规则合成
- 批准号:
03452288 - 财政年份:1991
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for General Scientific Research (B)
Development of Output System of Announcing Speech with Input of Kanji-Kana Sentences
输入汉字假名句子的语音播报输出系统的开发
- 批准号:
01850073 - 财政年份:1989
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Developmental Scientific Research (B).
相似海外基金
An investigation of generative acoustic latent representations for meeting speech recognition and summarization
用于满足语音识别和摘要的生成声学潜在表示的研究
- 批准号:
24K15004 - 财政年份:2024
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Disrupter or enabler? Assessing the impact of using automatic speech recognition technology in interpreter-mediated legal proceedings
颠覆者还是推动者?
- 批准号:
2889440 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Studentship
Analysis of speech recognition as a tool in medical English education
语音识别作为医学英语教育工具的分析
- 批准号:
23K00767 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Automatic Speech Recognition (ASR) engine to improve autistic children speech
自动语音识别(ASR)引擎可改善自闭症儿童的言语能力
- 批准号:
10056712 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Grant for R&D
Industrial research into the reduction of biases in foundational Automatic Speech Recognition models.
减少基础自动语音识别模型中偏差的工业研究。
- 批准号:
10068091 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Collaborative R&D
M3OLR: Towards Effective Multilingual, Multimodal and Multitask Oriental Low-resourced Language Speech Recognition
M3OLR:迈向有效的多语言、多模态和多任务东方稀缺语言语音识别
- 批准号:
23K11227 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Establishment of intraoperative education model using speech recognition and language information processing technology
利用语音识别和语言信息处理技术建立术中教育模型
- 批准号:
23K16281 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
SaTC: CORE: Small: Robust Speaker and Speech Recognition Under AI-Driven Physical and Digital Attacks
SaTC:核心:小型:人工智能驱动的物理和数字攻击下的鲁棒扬声器和语音识别
- 批准号:
2310207 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
Continuing Grant
A State-of-the-Art Automatic Speech Recognition and Conversational Platform to Enable Socially Assistive Robots for Persons with Alzheimer's Disease and Related Dementias
最先进的自动语音识别和对话平台,为阿尔茨海默病和相关痴呆症患者提供社交辅助机器人
- 批准号:
10699887 - 财政年份:2023
- 资助金额:
$ 5.12万 - 项目类别:
CRCNS US-Spain Research Proposal: Collaborative Research: Tracking and modeling the neurobiology of multilingual speech recognition
CRCNS 美国-西班牙研究提案:合作研究:跟踪和建模多语言语音识别的神经生物学
- 批准号:
2207770 - 财政年份:2022
- 资助金额:
$ 5.12万 - 项目类别:
Continuing Grant














{{item.name}}会员




