Fusion of array signal processing and facial image process-ing for hands-free robust communication and authentication.

融合阵列信号处理和面部图像处理,实现免提稳健通信和身份验证。

基本信息

  • 批准号:
    08680443
  • 负责人:
  • 金额:
    $ 1.54万
  • 依托单位:
  • 依托单位国家:
    日本
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
  • 财政年份:
    1996
  • 资助国家:
    日本
  • 起止时间:
    1996 至 1998
  • 项目状态:
    已结题

项目摘要

Human speech sounds with speaker characteristics involve complex interactions of the larynx, nasal cavity, oral cavity, and oropharynx. Authentication technique can be used to verify the identity claimed by people accessing systems ; that is, it enables access control of various services by voice and face iniages.In conventional approach, however, there is few study on fusion of array signal processing and facial image processing for hands-free robust communication and authentication. The primary aim of our research is to develop new techniques for automatically recognizing who is speaking by using speaker-specific information included in speech wave and facial image. Intelligent audiovisual sensing system with four-microphone-array and CCD camera has been developed for hands-free robust communication and authentication. Array signal processing performed accurate estimation of mouth position. An articulatory model of three-dimensional vocal tract obtained by magnetic resonance imaging was proposed for extraction of speaker-specific information. A Color image processing technique was proposed for face segmentation and facial feature detection. Our group has successfully developed and tested the intelligent audiovisual sensing system for hands-free robust communication and authentication. The proposed techniques have wide application in the areas of speech communication and in the study of human interaction.
具有扬声器特征的人类语音涉及喉咙、鼻腔、口腔和口咽的复杂相互作用。认证技术可以用来验证访问系统的人的身份,即通过语音和人脸图像来实现对各种服务的访问控制,然而,在传统的方法中,很少有研究将阵列信号处理和人脸图像处理相结合来实现免提鲁棒通信和认证。我们的研究的主要目的是开发新的技术,自动识别谁是说话者通过使用特定的信息,包括在语音波和面部图像。智能视听传感系统与四个麦克风阵列和CCD摄像头已经开发的免提强大的通信和认证。阵列信号处理执行嘴位置的精确估计。提出了一种基于磁共振成像的三维声道发音模型,用于说话人特定信息的提取。提出了一种基于彩色图像处理的人脸分割和特征检测方法。本集团已成功开发及测试智能视听感应系统,用于免提稳健通讯及认证。所提出的技术在语音通信领域和人类交互的研究中具有广泛的应用。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Takuya Niikawa, Eri Kawano, Masafumi Matsumura, Takashi Tachimura, and Takeshi Wada: "Acoustic char-acteristics of three-dimensional vocal tract chapes measured by MRI during vowel production." Hokkaido Workshop on Speech Production. 1-2 (1998)
Takuy​​a Niikawa、Eri Kawano、Masafumi Matsumura、Takashi Tachimura 和 Takeshi Wada:“元音产生过程中通过 MRI 测量的三维声道腔的声学特征。”
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
T.Niikawa: "Accurate measurement of three-dimensional shapes of vocal tract and dental crown using magnetic resonance imaging" Third Joint meeting of Acoustical society of America and Japan. 2pSC13. 867-872 (1996)
T.Niikawa:“利用磁共振成像精确测量声道和牙冠的三维形状”美国和日本声学学会第三次联席会议。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
T.Niikawa: "Acoustic Characteristics of three-dimensional vocal tract shapes measured by MRI during vowel production" Hokkaido Workshop on speech production. 1-2. 8-9 (1998)
T.Niikawa:“元音产生过程中通过 MRI 测量的三维声道形状的声学特征”北海道语音产生研讨会。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
Takuya Niikawa, Takashi Tachimura, Takeshi Wada, Masafumi Matsumura Hiroshi Umeo: "FEM analysis aspi-rated air flow in three-dimensional vocal tract dur-ing fricative consonant phonation." Trans.IEE of Japan. (To appear).
Takuy​​a Niikawa、Takashi Tachimura、Takeshi Wada、Masafumi Matsumura Hiroshi Umeo:“FEM 分析摩擦辅音发声期间三维声道中的吸气气流。”
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
新川 拓也: "有限要素法を用いた摩擦音発生時の声道内呼気流の推定" 電気学会論文誌C. 発表予定.
Takuy​​a Shinkawa:“使用有限元法估计摩擦音发生时声道中的呼气气流”,日本电气工程师学会会刊 C. 预定出版。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

MATSUMURA Masafumi其他文献

MATSUMURA Masafumi的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('MATSUMURA Masafumi', 18)}}的其他基金

A life rhythm analysis using unrestricted monitoring of the oral cavity function and vital signs for a lifestyle-diseases-prevention.
通过无限制地监测口腔功能和生命体征进行生活节律分析,以预防生活方式疾病。
  • 批准号:
    20500172
  • 财政年份:
    2008
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Non-restraining measurement of oral-cavity function and vital signs in the elderly to improve safety and to detect early signs of dementia.
对老年人口腔功能和生命体征进行无限制测量,以提高安全性并发现痴呆症的早期迹象。
  • 批准号:
    17500130
  • 财政年份:
    2005
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
An interactive health care system for homebound elderly, based on non-restraining measurement of conversation, deglutition, and behavior.
一种针对居家老年人的交互式医疗保健系统,基于对谈话、吞咽和行为的非限制性测量。
  • 批准号:
    14580463
  • 财政年份:
    2002
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Fusion of array signal processing and facial image proccessing for hands-free robust communication and authenticaion
阵列信号处理和面部图像处理的融合,实现免提稳健通信和身份验证
  • 批准号:
    11680426
  • 财政年份:
    1999
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)

相似海外基金

Array Signal Processing Techniques for Terahertz Communications and Sensing
用于太赫兹通信和传感的阵列信号处理技术
  • 批准号:
    RGPIN-2022-03678
  • 财政年份:
    2022
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Discovery Grants Program - Individual
Incoherent sensor array signal processing.
非相干传感器阵列信号处理。
  • 批准号:
    2284473
  • 财政年份:
    2019
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Studentship
Spectrum Sharing by Adaptive Array Signal Processing
通过自适应阵列信号处理实现频谱共享
  • 批准号:
    17H06562
  • 财政年份:
    2017
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
Deepening of Array Signal Processing Theory Expanded to Asynchronous Distributed Channels and Real-World Applications
阵列信号处理理论的深化扩展到异步分布式通道和实际应用
  • 批准号:
    16H01735
  • 财政年份:
    2016
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Scientific Research (A)
Microphone Array Signal Processing with Asynchronous Recording Devices
使用异步录音设备的麦克风阵列信号处理
  • 批准号:
    25280069
  • 财政年份:
    2013
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Research about source separation and source localization bynonlinear array signal processing based on quadratic kernel
基于二次核的非线性阵列信号处理的震源分离与震源定位研究
  • 批准号:
    23760325
  • 财政年份:
    2011
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Fundamental Study on Array Signal Processing Using Cristal-shaped Sensor Array
晶体状传感器阵列阵列信号处理的基础研究
  • 批准号:
    21760309
  • 财政年份:
    2009
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Applied sensor array signal processing
应用传感器阵列信号处理
  • 批准号:
    4049-2002
  • 财政年份:
    2004
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Discovery Grants Program - Individual
Applied sensor array signal processing
应用传感器阵列信号处理
  • 批准号:
    4049-2002
  • 财政年份:
    2003
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Discovery Grants Program - Individual
Applied sensor array signal processing
应用传感器阵列信号处理
  • 批准号:
    4049-2002
  • 财政年份:
    2002
  • 资助金额:
    $ 1.54万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了