SPEECH RECOGNITION WITH SYNCHRONOUS INPUT OF HAND-WRITTEN GESTURES FOR MOBILE DEVICES
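Basic information
- Grant number: 15300054
- Principal investigator:
- Amount: $37,800
- Host institution:
- Host institution country: Japan
- Project category: Grant-in-Aid for Scientific Research (B)
- Fiscal year: 2003
- Funding country: Japan
- Period: 2003 to 2004
- Project status: Completed
- Source:
- Keywords: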
Project abstract
Mobile devices are now widely used in daily life, and there is strong demand for a user-friendly, highly accurate interface. To this end, we propose an interface that uses simultaneous input of speech and hand-written gestures. This interface is more robust against environmental noise than a speech-only interface, and its input is faster than that of an interface using hand-written gestures alone. Our target application is composing e-mail by entering sentences.

In the first year, we proposed an interface in which a sentence is input by speech while the hiragana character at the head of each phrase in the sentence is input by hand-written gesture. We implemented a recognition algorithm for hand-written gestures and designed a method for recognizing the simultaneous input of the two modes. The proposed method was evaluated in simulation experiments using speech data and hand-written gesture data that were recorded independently, and was shown to be effective.

In the second year, we constructed a recording system for input in the two modes and recorded 530 sentences from ten subjects. To integrate the two modes, we employed a two-pass process in which a word graph generated by speech recognition in the first pass is used to integrate the two modes in the second pass. The proposed method improved recognition accuracy by 2.6 points over speech recognition alone.

As future work, a method for optimizing the weights between the two modes should be developed. We plan to develop a demonstration system that works in real time and to evaluate it in noisy environments.
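The two-pass integration described in the abstract can be illustrated with a minimal sketch. It is not the project's implementation: the abstract does not give the scoring formula, so the log-linear combination, the `gesture_weight` parameter, the probability floor, and the function name `rescore` are all assumptions, and a short list of scored hypotheses stands in for the word graph produced by the first pass.

```python
import math

def rescore(hypotheses, gesture_scores, gesture_weight=1.0, floor=1e-6):
    """Second pass (assumed form): choose the hypothesis with the best
    combined speech + gesture log score.

    hypotheses: list of (phrases, speech_logprob), where phrases is a list
        of hiragana readings, one per phrase, from the first (speech) pass.
    gesture_scores: one dict per hand-written gesture, mapping a hiragana
        character to the gesture recognizer's probability for it.
    """
    best, best_score = None, -math.inf
    for phrases, speech_logprob in hypotheses:
        # One gesture is written per phrase, so the counts must agree.
        if len(phrases) != len(gesture_scores):
            continue
        # Score the head character of each phrase against its gesture;
        # the floor keeps unseen characters from producing log(0).
        gesture_logprob = sum(
            math.log(max(scores.get(phrase[:1], 0.0), floor))
            for phrase, scores in zip(phrases, gesture_scores)
        )
        total = speech_logprob + gesture_weight * gesture_logprob
        if total > best_score:
            best, best_score = phrases, total
    return best, best_score

# Hypothetical example data: two competing speech hypotheses and two gestures.
hypotheses = [
    (["きょうは", "あめです"], -10.0),
    (["きょうは", "はれです"], -10.5),
]
gestures = [{"き": 0.9, "さ": 0.1}, {"は": 0.8, "ほ": 0.2}]

best_phrases, best_score = rescore(hypotheses, gestures)
```

In this example the speech score alone slightly prefers the first hypothesis, but the second gesture supports "は" as the head character, so the combined score selects the second hypothesis. Setting `gesture_weight` to 0 reduces the sketch to speech-only selection, which is one way to picture the weight optimization the abstract leaves as future work.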
Project results
Journal articles (10)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Ichiya, Nakagawa, Shinoda, Furui: "手書き文字の準同期入力を併用した音声認識手法の予備検討" (A preliminary study of a speech recognition method combined with quasi-synchronous input of handwritten characters). Proceedings of the 2004 IEICE General Conference, D, 148-148 (2004)
- DOI:
- Published:
- Journal:
- Impact factor: 0
- Authors:
- Corresponding author:
Simultaneous Input Interface of Speech and Handwritten Characters
- DOI:
- Published: 2005
- Journal:
- Impact factor: 0
- Authors: R. Nakagawa; Y. Kobayashi; R. Kobayashi; K. Shinoda; S. Furui
- Corresponding author: S. Furui
Preliminary Evaluation of Speech Recognition with Quasi-Synchronous Input of Hand-Written Characters.
- DOI:
- Published: 2004
- Journal:
- Impact factor: 0
- Authors: T. Ichiya; R. Nakagawa; K. Shinoda; S. Furui
- Corresponding author: S. Furui
Other grants of SHINODA Koichi
A study of multimodal recognition for human communication search
- Grant number: 20300063
- Fiscal year: 2008
- Funding amount: $37,800
- Project category: Grant-in-Aid for Scientific Research (B)
Systemization of audio-visual knowledge resources using graphical models
- Grant number: 17300059
- Fiscal year: 2005
- Funding amount: $37,800
- Project category: Grant-in-Aid for Scientific Research (B)
Similar overseas grants
CRII: HCC: Human-automation Interaction: Assistive and Adaptive Multimodal Interface to Support Older Adults in Complex Automated Systems
- Grant number: 2153504
- Fiscal year: 2022
- Funding amount: $37,800
- Project category: Standard Grant
SBIR Phase II: Development of a Multimodal Interface for improving independence of Blind and Visually-Impaired people
- Grant number: 2025772
- Fiscal year: 2020
- Funding amount: $37,800
- Project category: Cooperative Agreement
SBIR Phase I: Development of a Multimodal Interface for improving independence of Blind and Visually-Impaired people
- Grant number: 1843485
- Fiscal year: 2019
- Funding amount: $37,800
- Project category: Standard Grant
A Multimodal Interface for Neuro-rehabilitation of Movement Disorders
- Grant number: RTI-2018-00900
- Fiscal year: 2017
- Funding amount: $37,800
- Project category: Research Tools and Instruments
Study on Asymmetric Vibration type Non-Grounded Force Display Using Vibration Speakers and Application for Multimodal Interface
- Grant number: 17J01330
- Fiscal year: 2017
- Funding amount: $37,800
- Project category: Grant-in-Aid for JSPS Fellows
Elucidation of intention manifestation structure in physically handicapped children using multimodal interface
- Grant number: 15K01460
- Fiscal year: 2015
- Funding amount: $37,800
- Project category: Grant-in-Aid for Scientific Research (C)
Personal Panoramic Multimodal Interface
- Grant number: 19500110
- Fiscal year: 2007
- Funding amount: $37,800
- Project category: Grant-in-Aid for Scientific Research (C)
ITR/PE/SY(SBE): Dialogue-Assisted Visual Environment for Geoinformation: Enabling Collaborative Information Access and Decision-Making Through a Natural, Multimodal Interface
- Grant number: 0113030
- Fiscal year: 2001
- Funding amount: $37,800
- Project category: Standard Grant
Vision-Based Hand Gesture Analysis in a Multimodal Interface for Controlling Virtual Environments
- Grant number: 9634618
- Fiscal year: 1996
- Funding amount: $37,800
- Project category: Continuing Grant
Spoken-Language Access to Multimedia (SLAM): A Multimodal Interface to the World-Wide Web
- Grant number: 9422461
- Fiscal year: 1994
- Funding amount: $37,800
- Project category: Standard Grant













