Research on the word recognition method based on voice and lip shape movements in very noisy circumstances
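Basic information
- Grant number: 11650426
- Principal investigator:
- Amount: $21,100
- Host institution:
- Host institution country: Japan
- Project category: Grant-in-Aid for Scientific Research (C)
- Fiscal year: 1999
- Funding country: Japan
- Project period: 1999 to 2000
- Project status: Completed
- Source:
- Keywords: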
Project abstract
Word recognition has been studied by many researchers, who have examined many characteristic parameters of the speech signal and proposed many recognition methods. Cepstrum parameters and the hidden Markov model (HMM) are representative examples. In quiet conditions, they make it possible to recognize speech spoken at natural speed with a relatively high recognition rate. In noisy conditions, however, it remains difficult to achieve a high word recognition rate with methods that depend only on auditory information.

On the other hand, it is well known that humans can understand what another person is saying just by watching the mouth movements, without any auditory information. This ability is called "lip reading". A word recognition system based on both voice and lip shape movements can therefore be expected to offer not only an effective means of word recognition in very noisy conditions but also an easy method of human-machine communication.

To construct such a system, the following problems must be solved:
1. Lip shapes must be extracted very quickly and accurately from a series of face images.
2. Parameters that describe lip shapes must be investigated.
3. An accurate recognition method is required.

Because of these problems, word recognition systems that use lip shape movements have not been developed until now. In this research, we tried to develop a real-time word recognition system based on voice and lip shape movements on a commercially available personal computer. To achieve fast and accurate operation, a technique called the modified Sampled Active Contour Model (modified SACM) is adopted to extract lip shapes. A new parameter is proposed to describe the extracted lip shapes, and the lip shape movements are recognized by an HMM using the proposed parameter.
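The abstract states only that lip shape movements are recognized by an HMM; it gives neither the form of the proposed lip parameter nor the model topology. As a rough illustration of how HMM-based word recognition works in general, the sketch below scores a sequence against one model per word using the scaled forward algorithm and picks the best-scoring word. Everything specific here is an assumption made for the example: the discrete symbols (0 = closed, 1 = half-open, 2 = open), the two-state left-to-right models, the word labels, and all probabilities are hypothetical and are not taken from the project.

```python
import math

def log_forward(obs, start, trans, emit):
    """Return log P(obs | model) for a discrete HMM (scaled forward algorithm)."""
    if not obs:
        raise ValueError("observation sequence must not be empty")
    n = len(start)
    # Initialisation: start probability times emission of the first symbol.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Induction: sum over predecessor states, then emit symbol obs[t].
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][obs[t]]
                for s in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        # Rescale to avoid underflow and accumulate the log of the scale factor.
        alpha = [a / scale for a in alpha]
        log_prob += math.log(scale)
    return log_prob

def recognize(obs, models):
    """Return the label of the model that gives obs the highest likelihood."""
    return max(models, key=lambda word: log_forward(obs, *models[word]))

# Hypothetical models: (start, transition, emission) per word.
# Symbols are assumed quantized lip openings: 0 closed, 1 half-open, 2 open.
LEFT_TO_RIGHT = [[0.6, 0.4], [0.0, 1.0]]
MODELS = {
    "open_close": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.2, 0.7], [0.7, 0.2, 0.1]]),
    "close_open": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.7, 0.2, 0.1], [0.1, 0.2, 0.7]]),
}

if __name__ == "__main__":
    print(recognize([2, 2, 1, 0, 0], MODELS))
    print(recognize([0, 0, 1, 2, 2], MODELS))
```

A real system would train the model parameters from recorded sequences and would likely use continuous features; the sketch only shows the scoring and selection step.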
Project outcomes
Journal articles (21)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
菅原一孔: "パーソナルコンピュータ上での読唇システムの実時間実現"計測自動制御学会論文誌. 36. 1145-1151 (2000)
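Kazunori Sugahara: "Real-time realization of a lip-reading system on a personal computer" Transactions of the Society of Instrument and Control Engineers. 36. 1145-1151 (2000)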
Kazunori SUGAHARA, Toshimi SHINCHI, Makoto KISHINO, Ryosuke KONISHI: "Real Time Realization of Lip Reading System on the Personal Computer" Transactions of the Society of Instrument and Control Engineers. Vol.36, No.12. 1145-1151 (2000)
Makoto KISHINO, Masahiro OKI, Tomoyuki OSAKI, Kazunori SUGAHARA, Ryosuke KONISHI: "A Word Spotting Method by Using Image Data" Proceedings of the 16th SICE Sensing Forum. 45-50 (1999)
新地俊幹: "画像情報と音声情報を併用した単語認識システムの構築について"電子情報通信学会技術研究報告. CAS98-66. 37-44 (1999)
Toshimi Shinchi: "On the construction of a word recognition system using both image and speech information" IEICE Technical Report. CAS98-66. 37-44 (1999)
菅原一孔: "画像情報をとり入れた単語認識システムの実時間実現"電子情報通信学会・パターン認識・メディア理解研究会. (発表予定). (2000)
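Kazunori Sugahara: "Real-time realization of a word recognition system incorporating image information" IEICE Technical Group on Pattern Recognition and Media Understanding. (scheduled for presentation). (2000)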
Other grants of KONISHI Ryosuke
Research on running assistance of electric wheelchair by using lip-reading system
- Grant number: 19500476
- Fiscal year: 2007
- Funding amount: $21,100
- Project category: Grant-in-Aid for Scientific Research (C)
Surface Structure Analytical Method by Extended Appearance Potential Fine Structure
- Grant number: 60550011
- Fiscal year: 1985
- Funding amount: $21,100
- Project category: Grant-in-Aid for General Scientific Research (C)