EAGER: Lip Reading by Unobtrusive Multimodal Sensors and Machine Learning Algorithms

EAGER:通过不显眼的多模态传感器和机器学习算法进行唇读

基本信息

  • 批准号:
    2129673
  • 负责人:
  • 金额:
    $ 14.99万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2021
  • 资助国家:
    美国
  • 起止时间:
    2021-08-15 至 2023-01-31
  • 项目状态:
    已结题

项目摘要

The project aims to build an unobtrusive system to enable lip reading for patients with Amyotrophic Lateral Sclerosis (ALS, also known as Lou Gehrig's diseases) and individuals with speech and hearing disorders. Although there is rich literature on lip reading, the bulkiness, obtrusiveness, and/or immobility of these solutions impedes their applications in daily practice, especially for patients with neuromuscular disorders. There is an urgent need to develop novel lip-reading technologies to improve the communication capabilities of ALS patients with loved ones and healthcare providers. The proposed system can considerably improve on existing solutions for tracking and interpreting facial movements and more broadly, body movements, such as finger motions and body gestures. The ability to gather multimodal motion patterns from unobtrusive sensors and apply machine learning (ML) to interpret the acquired data would greatly facilitate diagnosis, treatment, and rehabilitation of motion-related disorders, such as stroke and Parkinson's disease. In addition, this work paves the way for the development of nonverbal communication interfaces enabled by facial/body gestures and opens new avenues for rehabilitation, robotics, and human-machine interfaces. This project presents an excellent opportunity for students to participate in cross-disciplinary research. Part of the research will be integrated into the PI's courses and capstone design projects. The PIs are committed to outreach activities and increasing the diversity through local minority organizations and the Vertically Integrated Program at Stony Brook University. The overarching goal of this project is to build an unobtrusive hardware-software platform for ALS patients that can capture speech-relevant lip gestures and decode lip movements for speech. First, a skin-like multimodal strain and electromyography (EMG) sensing system will be designed to track both skin deformations and muscle activities associated with lip movements. Self-assembled structures will be introduced to render the sensors ultrathin, breathable, and semi-transparent. Second, the feasibility of converting the sensed lip signals to corresponding spoken words will be demonstrated. Modern ML methods, and in particular, ensemble Gaussian processes (GPs) will be exploited for speech recognition. In the proposed scheme, each GP serves as a classifier and the final decision is made by fusing the results of all the GPs by making use of methods within the Bayesian framework. The potential contributions of the proposed work include: 1) Design of skin-like strain and EMG sensors with high sensitivity and good skin compatibility through a scalable self-assembly process. 2) Integration of multimodal sensors for comprehensive in-vivo quantification of lip movements associated with speech. 3) Development of ML algorithms that precisely convert lip movements to speech. 4) Laying the grounds for developing a truly natural and unobtrusive hardware-software system for lip reading. Our proposed work can fill the gaps in the existing solutions by an intuitive and unobtrusive technology for lip reading.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
该项目旨在建立一个不显眼的系统,使唇阅读的患者肌萎缩侧索硬化症(ALS,也称为卢格里克病)和个人的语言和听力障碍。虽然有丰富的文献唇阅读,笨重,突兀,和/或固定的这些解决方案阻碍了他们的应用在日常实践中,特别是对患者的神经肌肉疾病。迫切需要开发新的唇读技术,以提高ALS患者与亲人和医疗保健提供者的沟通能力。所提出的系统可以大大改善现有的解决方案,用于跟踪和解释面部运动,更广泛地说,身体运动,如手指运动和身体姿势。从不显眼的传感器收集多模态运动模式并应用机器学习(ML)来解释所获取的数据的能力将极大地促进中风和帕金森病等运动相关疾病的诊断、治疗和康复。此外,这项工作为通过面部/身体手势实现的非语言通信接口的开发铺平了道路,并为康复,机器人和人机接口开辟了新的途径。该项目为学生提供了一个参与跨学科研究的绝佳机会。部分研究将被整合到PI的课程和顶点设计项目中。PI致力于通过当地少数民族组织和斯托尼布鲁克大学的纵向综合方案开展外联活动和增加多样性。该项目的总体目标是为ALS患者构建一个不引人注目的硬件软件平台,该平台可以捕获与语音相关的嘴唇手势并解码语音的嘴唇运动。首先,一个类似皮肤的多模态应变和肌电图(EMG)传感系统将被设计为跟踪皮肤变形和肌肉活动与嘴唇运动。将引入自组装结构,使传感器超薄、透气和半透明。其次,将证明将感测到的唇信号转换为对应的口语单词的可行性。现代ML方法,特别是集成高斯过程(GP)将用于语音识别。在所提出的方案中,每个GP作为一个分类器和最终的决定是通过融合所有的GP的结果,利用贝叶斯框架内的方法。本论文的潜在贡献包括:1)通过可扩展的自组装工艺,设计具有高灵敏度和良好皮肤相容性的类皮肤应变和肌电信号传感器。2)多模态传感器的集成,用于与语音相关的嘴唇运动的全面体内量化。3)开发ML算法,精确地将嘴唇运动转换为语音。4)为开发一个真正自然和不引人注目的唇阅读硬件软件系统奠定了基础。我们提出的工作可以填补现有解决方案的空白,通过一个直观和不显眼的技术唇阅读。这个奖项反映了NSF的法定使命,并已被认为是值得通过评估使用基金会的智力价值和更广泛的影响审查标准的支持。

项目成果

期刊论文数量(5)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Decoding silent speech commands from articulatory movements through soft magnetic skin and machine learning
通过软磁皮肤和机器学习从发音运动解码无声语音命令
  • DOI:
    10.1039/d3mh01062g
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    13.3
  • 作者:
    Dong, Penghao;Li, Yizong;Chen, Si;Grafstein, Justin T.;Khan, Irfaan;Yao, Shanshan
  • 通讯作者:
    Yao, Shanshan
A multi-tasking model of speaker-keyword classification for keeping human in the loop of drone-assisted inspection
说话者关键词分类的多任务模型,使人类能够参与无人机辅助检查的循环
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Shanshan Yao其他文献

Investigation of the Movement Characteristics of West Guangdong Longshore Ocean Current System, China
粤西沿岸洋流系统运动特征研究
  • DOI:
    10.2112/si73-064.1
  • 发表时间:
    2015-03
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Shanshan Yao;Wendan Li;Hongbo Zhao;Zhiliang Gao
  • 通讯作者:
    Zhiliang Gao
Corporate Social Responsibility Regulatory System Based on Sustainable Corporation Law Pathway
  • DOI:
    10.3390/su15021638
  • 发表时间:
    2023-01
  • 期刊:
  • 影响因子:
    3.9
  • 作者:
    Shanshan Yao
  • 通讯作者:
    Shanshan Yao
HapTag: A Compact Actuator for Rendering Push-Button Tactility on Soft Surfaces
HapTag:用于在软表面上渲染按钮触感的紧凑型执行器
Effect of binders on the microstructural and electrochemical performance of high-sulphur-loading electrodes in lithium-sulphur batteries
粘合剂对锂硫电池高硫负载电极微观结构和电化学性能的影响
  • DOI:
    10.1002/er.8532
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    4.6
  • 作者:
    Shanshan Yao;Heli Yu;Mingzhu Bi;Cuijuan Zhang;Tianjie Zhang;Xiaoning Zhang;Hongtao Liu;Xiangqian Shen;Jun Xiang
  • 通讯作者:
    Jun Xiang
Integration of cloud-based molecular networking and docking for enhanced umami peptide screening from Pixian douban.
  • DOI:
    10.1016/j.fochx.2023.101098
  • 发表时间:
    2024-03-30
  • 期刊:
  • 影响因子:
    6.1
  • 作者:
    Sen Mei;Shanshan Yao;Jingjing Mo;Yi Wang;Jie Tang;Weili Li;Tao Wu
  • 通讯作者:
    Tao Wu

Shanshan Yao的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Shanshan Yao', 18)}}的其他基金

CAREER: Closing the Loop of Human-Machine Interactions via Skin-Like Multimodal Haptic Interfaces
职业:通过类肤多模态触觉界面闭合人机交互循环
  • 批准号:
    2238363
  • 财政年份:
    2023
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Continuing Grant

相似国自然基金

新型卷枝毛霉脂肪酶Lip10的双重活性及其动态调控机制
  • 批准号:
    32302009
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
在体肾组织CETSA-MS结合Lip-MS技术解析百令胶囊保护顺铂诱导肾毒性的靶点机制及入肾活性成分
  • 批准号:
    LHDMZ23H280001
  • 批准年份:
    2023
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
基于HIF-1α-NCOA4-FTH1信号轴调控肝星状细胞铁自噬和LIP紊乱探讨莪术醇抗肝纤维化的作用机制
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
LIP-PFH相变纳米粒介导内质网应激改善乳腺癌免疫抑制微环境的研究
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
基于LiP/iTRAQ靶蛋白鉴定技术解析HSYA生物合成途径羟化酶的功能机制
  • 批准号:
  • 批准年份:
    2021
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
LiP8噬菌体侵染单增李斯特菌过程中互作机制的研究
  • 批准号:
  • 批准年份:
    2020
  • 资助金额:
    34 万元
  • 项目类别:
    地区科学基金项目
基于ICG-mtAuNRs-Lip探针的光热光动力协同作用效率研究
  • 批准号:
    61905066
  • 批准年份:
    2019
  • 资助金额:
    24.0 万元
  • 项目类别:
    青年科学基金项目
紫色红曲霉来源脂肪酶LIP05催化合成己酸乙酯的分子识别机制
  • 批准号:
    31801467
  • 批准年份:
    2018
  • 资助金额:
    25.0 万元
  • 项目类别:
    青年科学基金项目
七鳃鳗免疫蛋白LIP的基因表达调控及其对病原体防御作用的分子机制
  • 批准号:
    31772884
  • 批准年份:
    2017
  • 资助金额:
    60.0 万元
  • 项目类别:
    面上项目
七鳃鳗一种新型模式识别蛋白LIP识别病原菌激活VLRB+类淋巴细胞的分子机制研究
  • 批准号:
    31601865
  • 批准年份:
    2016
  • 资助金额:
    20.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

CAREER: Inclusive, Private Mobile Input and Interaction Using Lip Reading
职业:使用唇读进行包容性、私密的移动输入和交互
  • 批准号:
    2239633
  • 财政年份:
    2023
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Continuing Grant
Lip reading Using Facial Expression Analysis Software
使用面部表情分析软件进行唇读
  • 批准号:
    20K18439
  • 财政年份:
    2020
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Use of automated lip reading to communicate with Tracheostomised Covid-19 patients
使用自动唇读与气管造口的 Covid-19 患者进行交流
  • 批准号:
    61152
  • 财政年份:
    2020
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Feasibility Studies
Development of Gaze and Head Direction Detection and Lip-Reading Technology Using Pupil and Nostril Positions for Small Devices
开发利用小型设备的瞳孔和鼻孔位置的注视和头部方向检测以及唇读技术
  • 批准号:
    19K04293
  • 财政年份:
    2019
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Japanese Machine Lip-reading System Based on Human Lip-reading Method
日本基于人类唇读方法的机器唇读系统
  • 批准号:
    23700672
  • 财政年份:
    2011
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Development of speech recognition interface using lip reading for hearing-impaired person' s communication support
开发使用唇读的语音识别接口,为听障人士提供沟通支持
  • 批准号:
    21700582
  • 财政年份:
    2009
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Development of a Pronunciation Practice CAI System Based on Lip Reading Techniques for Deaf Children Using Computer Graphics Animated Mouth Movement
利用计算机图形动画嘴部运动开发基于聋儿唇​​读技术的发音练习CAI系统
  • 批准号:
    20500860
  • 财政年份:
    2008
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
LILiR2 - Language Independent Lip Reading
LILiR2 - 独立于语言的唇读
  • 批准号:
    EP/E028047/1
  • 财政年份:
    2007
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Research Grant
Research on running assistance of electric wheelchair by using lip-reading system
利用唇读系统辅助电动轮椅行走的研究
  • 批准号:
    19500476
  • 财政年份:
    2007
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
LILiR2 - Language Independent Lip Reading
LILiR2 - 独立于语言的唇读
  • 批准号:
    EP/E027946/1
  • 财政年份:
    2007
  • 资助金额:
    $ 14.99万
  • 项目类别:
    Research Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了