权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

フォーム型Web情報サービス享受のためのマルチモーダル対話インタフェースの研究

享受基于表单的网络信息服务的多模态对话界面研究

基本信息

批准号：
13224049
负责人：
北岡教英
金额：
--
依托单位：
Toyohashi University of Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research on Priority Areas (C)
财政年份：
2001
资助国家：
日本
起止时间：
2001 至无数据
项目状态：
已结题

项目摘要

一般に,ウェブブラウザを操作する場合,マウスなどによって操作可能なGraphical User Interface(GUI)が用いられる.しかし近年,携帯端末やPDA(携帯情報端末機器)など,マウスなどが不向きな環境からをアクセス可能となっている.ここでは,従来から検討されている音声操作インタフェースに加え,任意文字列の入力を可能にした音声入力インタフェースが有用になる.そこで,情報検索におけるWWWブラウザのフォーム入力に対し任意文字列の入力を行うための音声インタフェースについて検討した.自由な音節系列を認識するために,One-pass Viterbi法により連続音節認識を行う.日本語の文字列を入力対象とする場合,何の制約もなく自由に音節の接続を許す必要はなく,例えばHTMLを詳細に解析し,認識対象が絞り込める場合(例えば氏名入力であることがわかる場合),その情報を言語モデルとして用いることも考えられる.これを仮定して,まず氏名の情報をbigram言語モデルとして導入した.その結果,用いない場合の75.1%から78.3%に音節認識率が向上した.しかし,音節系列すべてが正しく認識できる率は認識結果の上位5位までをみても34.8%と不十分な結果であった.そこで,系列の認識結果の上位N位から,音節ごとに5-bestりストを作成してユーザに提示し,ユーザに,ペンタッチなどによって選択させる,音節選択インタフェースを構築した。これは,あるフォームの入力の際に別のウィンドウが開き,ユーザに音声入力をさせ,その認識結果から作成した音節毎の5-bestリストを表示してユーザに選択させるものである.これにより,音声入力と簡単なペンによる選択によって、入力可能となる率は71.2%となった.

In general, it is possible to operate the Graphical User Interface(GUI) when it is not operational. In recent years, portable terminals and PDAs (portable information terminal machines) have become more and more popular in the environment. This is the first time that a text string has been inserted into a text string. In this case, information search for WWW is not allowed. It is allowed to enter any text string. It is allowed to enter any text string. Free syllable series recognition,One-pass Viterbi method syllable recognition line When Japanese text columns are used as powerful objects, there is no need to restrict the freedom of syllable connection. For example, HTML is analyzed in detail, and when the cognitive object is confused (for example, when the name of the Japanese text is used as powerful as possible), there is no need to restrict the freedom of syllable connection. The name of the person is a bigram. As a result, 75.1% and 78.3% of syllable recognition rates were higher than those in the previous cases. 34.8% of the syllable series are positive, 34.8% are negative, 34.8% are negative. The upper N position of the recognition result of the series is divided into 5-best syllables, which are divided into 4 parts. For this reason, the 5-best syllable of the syllable is selected from the 5-best syllable of the syllable. For example, the sound input force is 71.2% of the total input force.