Development of a speech understanding system
语音理解系统的开发
基本信息
- 批准号:04044108
- 负责人:
- 金额:$ 5.25万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for international Scientific Research
- 财政年份:1992
- 资助国家:日本
- 起止时间:1992 至 1993
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The objective of this research is development of fundamental techniques necessary to understanding spoken dialogue, which include knowledge-based speech recognition system, non-monotonic reasoning in natural language processing, and dialogue modeling. The following are the summary of the research results.1) We verified the efficiency of the knowledge-based approach for Korean speech recognition. Furthermore, some new ideas were proposed to improve the speech recognition. To avoid the difficulties in segmentation, a non-uniform unit is introduced. Every unit has its stationary point at each end of the unit, and transient part in the middle. The parameter trajectory is described by symbolic representation and fuzzy linguistic variables. Redundancy of speech data is used to improve the performance of the recognition system in the post-processor. The prototype system was tested with continuous Korean digit speech of unknown length, and the recognition rate of 97% was obtained.2) Understanding of continuous speech is generally a tough problem, since acoustic information is unreliable. An efficient search mechanism is indispensable because the combination of ambiguous information is very large. Then, we developed a framework of speech understanding system based on ATMS, which is a method of non-monotonic reasoning. The introduction of ATMS reduced elapsed time of natural language processing from 64 sec to 45 sec for understanding speech of 8 Japanese sentences.3) Two kinds of dialogue model characterizing structures in dialogue were proposed for understanding spoken dialogue. One is the SR-plan model which describes utterance pairs composed of the stimulus and the response. The other is Topic Packet Network (TPN) and corresponds to the discourse segments. A mechanism for predicting the next utterance was also developed based on these dialogue models and evaluated on some sample dialogues.
本研究的目标是开发理解口语对话所需的基本技术,包括基于知识的语音识别系统,自然语言处理中的非单调推理和对话建模。本文的主要研究成果如下:1)验证了基于知识的韩语语音识别方法的有效性。此外,还提出了一些改进语音识别的新思想。为了避免分割困难,引入了非均匀单元。每个单元在其两端都有其静止点,中间有其瞬态部分。参数轨迹由符号表示和模糊语言变量描述。在后处理部分,对语音数据进行冗余处理,以提高识别系统的性能。原型系统对未知长度的连续韩语数字语音进行了测试,识别率达到97%。2)由于声学信息的不可靠性,连续语音的理解通常是一个坚韧。由于歧义信息的组合非常大,因此有效的搜索机制是必不可少的。在此基础上,提出了一个基于ATMS的语音理解系统框架。ATMS是一种非单调推理方法。ATMS的引入将自然语言处理的时间从64秒减少到45秒,用于理解8个日语单词的语音。3)提出了两种描述对话结构的对话模型,用于理解口语对话。一种是描述由刺激和反应组成的话语对的SR计划模型。另一种是主题分组网络(TPN),与语篇片段相对应。基于这些对话模型开发了一个预测下一个话语的机制,并在一些样本对话上进行了评估。
项目成果
期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Shingo Nishioka: "A Powerful Disambiguating Mechanism for Speech Understanding Systems Based on ATMS" Proceedings of 1992 International Conference on Spoken Language. 1641-1644 (1992)
Shingo Nishioka:“基于 ATMS 的语音理解系统的强大消歧机制”1992 年国际口语会议论文集。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
山下洋一: "対話音声処理のための模擬対話の収録と分析" 日本音響学会秋季講演論文集. 23-24 (1992)
Yoichi Yamashita:“对话语音处理的模拟对话的记录和分析”日本声学学会秋季会议记录23-24(1992)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Yoichi Yamashita: "Next Utterance Prediction Based on Two Kinds of Dialog Models" Proceedings of Eurospeech'93. 1161-1164 (1993)
Yoichi Yamashita:“基于两种对话模型的下一个话语预测”Eurospeech93 论文集。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
MIZOGUCHI Riichiro其他文献
MIZOGUCHI Riichiro的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('MIZOGUCHI Riichiro', 18)}}的其他基金
Causality-compliant theory of force and motion for its innovative instruction
符合因果关系的力和运动理论的创新指导
- 批准号:
25540162 - 财政年份:2013
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Development of a methodology for the next generation knowledge systems based on ontological engineering
基于本体工程的下一代知识系统方法论的开发
- 批准号:
22240011 - 财政年份:2010
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Building a Theory-aware and Standard-compliant Knowledge Server
构建理论感知且符合标准的知识服务器
- 批准号:
19200012 - 财政年份:2007
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Development of a Theory-Aware Authoring Workbench
理论感知创作工作台的开发
- 批准号:
14208029 - 财政年份:2002
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
An Ontology-based Intelligent Authoring Tool for Training
基于本体的智能训练创作工具
- 批准号:
12558034 - 财政年份:2000
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Fundamental theories of ontology and development of an environment for ontology construction
本体基础理论及本体构建环境的开发
- 批准号:
11480076 - 财政年份:1999
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
An Intelligent Digital Textbook for electric power substation pretator
变电站捕食者智能数字化教材
- 批准号:
08558031 - 财政年份:1996
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Ontology for Knowledge Reuse
知识重用本体
- 批准号:
06452403 - 财政年份:1994
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for General Scientific Research (B)
Development of Speech Understanding System Using Artificial Intelligence Techniques
利用人工智能技术开发语音理解系统
- 批准号:
03044172 - 财政年份:1991
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Overseas Scientific Survey.
Development of expert system tools with knowledge compilation function based on deep knowledge
基于深度知识的具有知识编译功能的专家系统工具开发
- 批准号:
03452290 - 财政年份:1991
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for General Scientific Research (B)
相似海外基金
Peripheral and central contributions to auditory temporal processing deficits and speech understanding in older cochlear implantees
外周和中枢对老年人工耳蜗植入者听觉时间处理缺陷和言语理解的贡献
- 批准号:
10444172 - 财政年份:2022
- 资助金额:
$ 5.25万 - 项目类别:
Effects of Non-Blast mTBI on Binaural Processing and Speech Understanding in Noise
Non-Blast mTBI 对噪声中双耳处理和语音理解的影响
- 批准号:
10537947 - 财政年份:2022
- 资助金额:
$ 5.25万 - 项目类别:
Peripheral and central contributions to auditory temporal processing deficits and speech understanding in older cochlear implantees
外周和中枢对老年人工耳蜗植入者听觉时间处理缺陷和言语理解的贡献
- 批准号:
10630111 - 财政年份:2022
- 资助金额:
$ 5.25万 - 项目类别:
Individual differences in brain networks supporting speech understanding in patients with cochlear implants
支持人工耳蜗患者言语理解的大脑网络的个体差异
- 批准号:
10366520 - 财政年份:2021
- 资助金额:
$ 5.25万 - 项目类别:
Individual differences in brain networks supporting speech understanding in patientswith cochlear implants
支持人工耳蜗植入患者言语理解的大脑网络的个体差异
- 批准号:
10743568 - 财政年份:2021
- 资助金额:
$ 5.25万 - 项目类别:
End-to-End Model for Task-Independent Speech Understanding and Dialogue
与任务无关的语音理解和对话的端到端模型
- 批准号:
20H00602 - 财政年份:2020
- 资助金额:
$ 5.25万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Speech understanding ability and communication intervention for persons with age-related hearing loss and mild cognitive impairment or dementia
年龄相关性听力损失和轻度认知障碍或痴呆患者的言语理解能力和沟通干预
- 批准号:
10437659 - 财政年份:2018
- 资助金额:
$ 5.25万 - 项目类别:
Speech understanding ability and communication intervention for persons with age-related hearing loss and mild cognitive impairment or dementia
年龄相关性听力损失和轻度认知障碍或痴呆患者的言语理解能力和沟通干预
- 批准号:
10201560 - 财政年份:2018
- 资助金额:
$ 5.25万 - 项目类别:
Using Electrophysiology to Complement Speech Understanding-in-Noise Measures
使用电生理学补充噪声中的语音理解测量
- 批准号:
9906072 - 财政年份:2017
- 资助金额:
$ 5.25万 - 项目类别:
Temporal processing and speech understanding in older cochlear implantees
老年人工耳蜗植入者的时间处理和言语理解
- 批准号:
9355563 - 财政年份:2016
- 资助金额:
$ 5.25万 - 项目类别:














{{item.name}}会员




