权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

フォーム型Web情報サービス享受のためのマルチモーダル対話インタフェースの研究

享受基于表单的网络信息服务的多模态对话界面研究

基本信息

批准号：
14019046
负责人：
北岡教英
金额：
$ 2.69万
依托单位：
Toyohashi University of Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research on Priority Areas
财政年份：
2002
资助国家：
日本
起止时间：
2002 至无数据
项目状态：
已结题

项目摘要

本年度は、マルチモーダル対話インタフェースに関して以下のことを行った.1.対話機能における認識誤り修正のための言い直し発話検出音声対話では,状況の情報を制約として認識や対話制御に導入することが,音声認識においては性能向上に,対話理解・制御においては曖昧さ・誤解の認識と解消につながる.例えば,ユーザがシステムの誤認識に対して行う「言い直し」を検出することは認識・対話に有効であると考えられる.これまでに大語彙孤立単語認識を用いた地名入力タスクにおける言い直し検出法を提案し,認識性能改善に効果があることを示した.本報告書ではより一般的な対話における言い直しの検出に拡張することを試みた.ダイナミックプログラミングによる直前発話と現発話の対応付けおよび音声認識を行った結果に含まれる単語の重なり度合いを用いることによって,再現率94.8%、適合率89.2%で言い直しか否かを判定できた.2.対話における応答タイミング生成音声対話で自然さをつかさどる要素として、ユーザ発話に対して適切なタイミングで応答を返せることがある。リアルタイムに応答を返すために、韻律的情報および表層的言語情報を素性とした決定木を適用して相槌・発話権取得タイミングを生成する手法を考案し、実際の対話音声でタイミング生成させたものを主観評価した結果、人間と同等の自然さでタイミング生成できることを示した。2.任意文字列の音声認識の研究フォーム入力型のWebページで必要な連続音節の認識の高精度化を、特に氏名入力をタスクに行った.言語的な先見的知識を確率的に表現する言語モデルを氏名タスクに特化することで効果を得た。またその結果を音節ごとの候補リストとしてペンで選択して確定する手法を実装し、さらに効率的な入力であることを示した。

This year, タフェ, とをチモチモチモダダ <s:1> have a conversation with ダタフェスにスにスにスにスに related to <s:1> ててったった.1. Phone can seaborne における knew mistakenly り correction のための said い straight し検発 words the sounds of words で seaborne は, status の intelligence を restrict として know や words suppression seaborne に import することが, sounds know においては performance に upwards, words, understand suppression, seaborne においては ambiguous さ, misunderstanding の know と null につながる. Example えば, ユーザがシステムの mistakenly know にし seaborne て line う "い straight し" を検 out することは know, words に polices have sharper であると exam えられる. これまでに big vocabulary isolated 単 language understanding をいた place names into force タスクにおける said い straight し検 method proposed をし, meet performance improvement に unseen fruit があることを shown した. This report ではより general な words に seaborne おける said い straight しの検 out に company, zhang することを try みた. ダイナミックプログラミングによる ahead 発 words と now 発の応 seaborne pay けおよび sounds know を line っにた results contain まれる単 language の heavy なり degrees or いを with いることによって, recurrence rate 94.8%, suitable for 89.2 %で statement で direct agreement た no をを determination でたた.2. Words に seaborne おける応 answer タイミング generated sounds of words で seaborne natural さをつかさどる elements として, ユーザ発 words にし seaborne て appropriate なタイミングで応を return answer せることがある. リアルタイムに応を return answer すために, rhythmic intelligence および surface of verbal intelligence を primality とした decided to wood を applicable して in the hammer, 発 words 権 obtain タイミングを generated するしを test case, the event be の voice sound seaborne でタイミング generated させたものを main 観 review 価した results, human と equal の natural さでタイミング generated できる Youdaoplaceholder0 とを shows た. 2. Any text columns の sounds know の study フォーム type into force の Web ページでな necessary even 続 syllable の know の high-precision を, especially に's name into the force をタスクに line った. Words な seer knowledge を probabilistic に performance する words モデルを's name タスクに specialized することで unseen fruit をた. またその results を syllable ごとの alternate リストとしてペンで sentaku して determine する gimmick を be し, さらにな sharper rates into force であることを shown した.