Establishment of Speech Communication under Very Heavy Environmental Noise
极重环境噪声下语音通信的建立
基本信息
- 批准号:15500137
- 负责人:
- 金额:$ 1.98万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (C)
- 财政年份:2003
- 资助国家:日本
- 起止时间:2003 至 2006
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
In general, a bone conduction microphone, which eliminates surrounding noise, is often used in extremely noisy environments such as engine rooms in ships or runways at airports. It detects the vibration of bones such as jaws, and it converts the vibration to voice. Unfortunately, the quality of this voice converted by this microphone is bad for a smooth communication. Therefore, the aim of this research is to develop an algorithm of voice conversion from a bone conduction voice to an air conduction voice, in order to supply a smooth communication method by voice in the extremely noisy environments. The results of this research are the following.1. Voice Conversion by the Proposed TW-SOMA new type of self-organizing map with twin units (TW-SOM), which can describe a nonlinear input-output relation with high accuracy, was proposed, and was applied to voice conversion. Concretely, TW-SOM learns a nonlinear relation between the bone and the air conduction voices by the twin units. After its learning, the bone conduction voice applied to TW-SOM is converted to the corresponding air conduction voice.2. Verification of the Effectiveness of the Proposed Method and Application to Actual Ship-Handling WordsThe effectiveness of the proposed voice conversion method was verified for actual ship-handling words by comparing with the conventional SOM and other competing neural network methods. It was also confirmed that the proposed method is more suitable for a hardware implementation than the other conventional methods.3. Examination of Applicability to Other FieldsThe key idea of the codebook used in the proposed method was successfully applied to an image expansion to get a clear image. TW-SOM is a general method to describe precisely the various nonlinear mappings including voice conversion. We would like then to examine its applicability to other fields as future studies.
通常,消除周围噪声的骨传导麦克风通常用于极其嘈杂的环境,例如船舶的机舱或机场的跑道。它能检测到骨骼的振动,比如下巴,然后把振动转换成声音。不幸的是,这个麦克风转换的声音质量对流畅的交流很不利。因此,本研究的目的是开发一种从骨导语音到空气传导语音的语音转换算法,以便在极端噪声环境中提供一种平滑的语音通信方法。本研究的结果如下.提出了一种能够高精度描述非线性输入输出关系的双元自组织映射(TW-SOM),并将其应用于语音转换。具体地说,TW-SOM通过孪生单元学习骨导语音和气导语音之间的非线性关系。经过学习后,将应用于TW-SOM的骨导语音转换为相应的气导语音.验证所提出的方法的有效性和实际的船舶操作词的应用所提出的语音转换方法的有效性进行了验证,实际的船舶操作词通过比较与传统的SOM和其他竞争的神经网络方法。还证实了所提出的方法比其他传统方法更适合硬件实现。3.该方法的关键思想是将码书的思想成功地应用到图像扩展中,得到了清晰的图像。TW-SOM是精确描述包括语音转换在内的各种非线性映射的通用方法。然后,我们想研究它的适用性,以其他领域作为未来的研究。
项目成果
期刊论文数量(56)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Knowledge-Based Intelligent Information and Engineering Systems, Part III, Lecture Notes in Artificial Intelligence
基于知识的智能信息与工程系统,第三部分,人工智能讲义
- DOI:
- 发表时间:2004
- 期刊:
- 影响因子:0
- 作者:Masahiro YUKAWA;Isao YAMADA;Eiji Uchino
- 通讯作者:Eiji Uchino
A self-organizing map with twin units capable of describing a nonlinear input-output relation applied to speech code vector mapping
- DOI:10.1016/j.ins.2007.05.028
- 发表时间:2007-11
- 期刊:
- 影响因子:0
- 作者:E. Uchino;K. Yano;T. Azetsu
- 通讯作者:E. Uchino;K. Yano;T. Azetsu
Blind Separation and Sound Localization of Delayed Sources by Using ICA-based Estimates of Mixing Parameters
使用基于 ICA 的混合参数估计来实现延迟声源的盲分离和声音定位
- DOI:
- 发表时间:2007
- 期刊:
- 影响因子:0
- 作者:Isao YAMADA;Nobuhiko OGURA;Tadahiro Azetsu
- 通讯作者:Tadahiro Azetsu
自己組織化マップ(Self-Organizing Maps by T.Kohonen, Springer-Verlag, 2001訳書)
自组织映射(由 T.Kohonen 翻译,Springer-Verlag,2001 年)
- DOI:
- 发表时间:2005
- 期刊:
- 影响因子:0
- 作者:Renato CAVALCANTE;Isao YAMADA;Kohichi SAKANIWA;徳高平蔵
- 通讯作者:徳高平蔵
High performance hybrid-ICA to increase convergence speed and accuracy with use of RBF network
- DOI:10.3233/kes-2006-10505
- 发表时间:2006
- 期刊:
- 影响因子:0
- 作者:E. Uchino;T. Azetsu;N. Suetake
- 通讯作者:E. Uchino;T. Azetsu;N. Suetake
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
UCHINO Eiji其他文献
UCHINO Eiji的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('UCHINO Eiji', 18)}}的其他基金
Screening System for Early Discovery of Cerebrovascular Accident by Analyzing Fundus Video
通过眼底视频分析早期发现脑血管意外的筛查系统
- 批准号:
15K12108 - 财政年份:2015
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Eye Fundus Image Analysis System for Early Detection of Cerebrovascular Disorder
用于早期发现脑血管疾病的眼底图像分析系统
- 批准号:
24650121 - 财政年份:2012
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Realization of High Performance Real Time Arteriosclerosis Diagnosis System by Soft Computing
软计算实现高性能实时动脉硬化诊断系统
- 批准号:
23300086 - 财政年份:2011
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Precise Molecular Model of Human Cochlea System and Its Application to Speech Recognition
人类耳蜗系统精密分子模型及其在语音识别中的应用
- 批准号:
21650039 - 财政年份:2009
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Establishment of Vocal Communication System under Heavy Environmental Noise for Handicapped People with Speech Impediment
重环境噪声下言语障碍残疾人语音通讯系统的建立
- 批准号:
19300078 - 财政年份:2007
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Automatic Analysis of Cephalogram for Orthodontics
正畸头影自动分析
- 批准号:
07680948 - 财政年份:1995
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
相似海外基金
Augmented speech communication using multi-modal signals with real-time, low-latency voice conversion
使用具有实时、低延迟语音转换的多模信号的增强语音通信
- 批准号:
22KJ1519 - 财政年份:2023
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Perceptual Methods for Speech Communication
言语交流的感知方法
- 批准号:
RGPIN-2016-04412 - 财政年份:2022
- 资助金额:
$ 1.98万 - 项目类别:
Discovery Grants Program - Individual
Foundation of speech communication support based on auditory perception models for everyone including elderly persons with hearing impairment
为包括听力障碍老年人在内的所有人提供基于听觉模型的语音交流支持
- 批准号:
21H03468 - 财政年份:2021
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Perceptual Methods for Speech Communication
言语交流的感知方法
- 批准号:
RGPIN-2016-04412 - 财政年份:2021
- 资助金额:
$ 1.98万 - 项目类别:
Discovery Grants Program - Individual
VOICE 2.0: towards augmentation of enriched speech communication
VOICE 2.0:增强丰富的语音通信
- 批准号:
20KK0233 - 财政年份:2020
- 资助金额:
$ 1.98万 - 项目类别:
Fund for the Promotion of Joint International Research (Fostering Joint International Research (B))
Developing L2 automatic pronunciation evaluation and pronunciation learning-support systems for effective speech communication
开发L2自动发音评估和发音学习支持系统以实现有效的语音交流
- 批准号:
19K21638 - 财政年份:2019
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Challenging Research (Exploratory)
Pause-internal phonetic particles in speech communication
语音交流中的停顿-内部语音粒子
- 批准号:
418659027 - 财政年份:2019
- 资助金额:
$ 1.98万 - 项目类别:
Research Grants
Faculty of speech communication and typical and atypical neurocognitive development
言语交流与典型和非典型神经认知发展学院
- 批准号:
19H00632 - 财政年份:2019
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Perceptual Methods for Speech Communication
言语交流的感知方法
- 批准号:
RGPIN-2016-04412 - 财政年份:2019
- 资助金额:
$ 1.98万 - 项目类别:
Discovery Grants Program - Individual
Development of speech communication and its correlates of brain, cognition and motor system: A longitudinal cohort study of typically and atypically developing infants
言语交流的发展及其与大脑、认知和运动系统的相关性:典型和非典型发育婴儿的纵向队列研究
- 批准号:
19H05594 - 财政年份:2019
- 资助金额:
$ 1.98万 - 项目类别:
Grant-in-Aid for Scientific Research (S)