权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Development for speech interface for form -based in formation access services on Web

基于表单的Web信息访问服务语音接口的开发

基本信息

批准号：
13558033
负责人：
NAKAGAWA Seiichi
金额：
$ 4.29万
依托单位：
Toyohashi University Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (B)
财政年份：
2001
资助国家：
日本
起止时间：
2001 至 2003
项目状态：
已结题

项目摘要

While some speech interface systems have been developed for accessing Web resources, they are limited for accessing some specific contents and they don't provide a universal interface for arbitrary information retrieval services on the WWW. We propose an interactive speech user interface system, which could be applied to many form-based information retrieval services of the WVVW. In particular, our system was implemented based on a client-server, a Web proxy-centered architecture and employed an information extraction and language processing of HTML documents for providing a general-purpose interface for many form-based WWW contents. We also performed some experiments by 12 subjects for the comparison of the usability under different usage conditions. As a result, the proposed system attained comparative and higher expected usability measures over the pen-touch input method under the condition of an ideal speech recognition performance, and could be expected to achieve the effectivenes … More s or the superiority over a pen touch-only interface in terms of the usability as their usage condition approaches to a realistic PDA usage condition.We also proposed an. interface for a name input based on speech recognition using syllable-based N-gram and a word dictionary, which was frequently required to input into form-based web pages. User first utters a name and then chooses the correct word/syllables by pen touch from word/syllable candidates which were obtained from speech recognition. Name utterance is hard to recognize accurately because of the large vocabulary size, so the system uses continuous syllable recognition with syllable-based N-gram and isolated word recognition with a dictionary containing frequent words. The user can find the correct the answer from word candidates or syllable sequence candidates at a rate of 82-86%, and can input correct name at a rate of 94-96% with syllable selection from the syllable lattice. Some subjects used this interface and felt that it was useful. Less

虽然已经开发了一些用于访问Web资源的语音接口系统，但它们在访问某些特定内容方面受到限制，并且不能为WWW上的任意信息检索服务提供通用接口。提出了一种交互式语音用户界面系统，该系统可应用于多种基于表单的信息检索服务。特别地，我们的系统是基于客户机-服务器、以Web代理为中心的体系结构实现的，并采用了HTML文档的信息提取和语言处理，为许多基于表单的WWW内容提供了一个通用的接口。我们还对12名受试者进行了实验，比较了不同使用条件下的可用性。结果表明，在理想的语音识别性能条件下，所提出的系统在可用性方面达到了与笔触输入法相比的较高的预期可用性指标，并且随着其使用条件接近于实际的PDA使用条件，可以预期达到比笔触输入法更大的可用性优势。我们还提出了一个。使用基于音节的N-gram和单词字典的基于语音识别的名称输入接口，这经常需要输入到基于表单的网页中。用户先说出一个名字，然后通过笔触从语音识别得到的候选词/音节中选择正确的词/音节。由于词汇量大，人名发音难以准确识别，因此系统采用基于音节的N-gram连续音节识别和包含频繁词的字典孤立词识别。用户从候选词或音节序列候选词中找到正确答案的率为82-86%，从音节格中选择音节输入正确名称的率为94-96%。一些受试者使用了这个界面，并觉得它很有用。少