Mathematical modeling of the temporal acpects of speech based on human perceptual and cognitive mechanisms

基于人类感知和认知机制的语音时间方面的数学建模

基本信息

项目摘要

The purpose of this project was to find clues by which humans retrieve the temporal structure of speech, to understand their usage, and to establish a quantitative method to evaluate the temporal adequateness or naturalness of a given speech sound that can replicate the performance of human judgment. For this purpose, three fundamental investigative tasks were implemented a study at the psychophysical level, a study at the linguistic level, and the construction of an evaluation model. One distinguishing feature of this project is that it emphasized the psychophysical aspects nearly as much as the linguistic, even though its primary object was spoken language.Since a person can easily recognize fast speech, even in the case of a foreign language where the meaning is unknown, it is assumed that the processing of the temporal aspects of speech undoubtedly involve non-linguistic and therefore language-independent activities. By concentrating on such processing that is independent of a give … More n language, we developed the basic technology toward a system with small overhead for language processing as well as simple extensibility to multiple languages. The major results follow.(I) Psychophysical level : An algorithm was developed to predict temporal reference points in a given speech by replicating the function of human auditory processing. This algorithm's most important benefit is its applicability to virtually unlimited language variations since it doesn't require any linguistic knowledge.(II) Linguistic level : An empirical study levealed that factors, which affect the perception of prosodic units, vary depending on the particular language's choice of units. This finding provides practical implications concerning how much weight should be placed on prosodic factors when designing effective foreign-language training methods.(III) Modeling : By integrating auditory functions derived from investigation at the psychophysical level, a mathematical model was implemented to automatically evaluate the naturalness of the speech of the English learners. The model's performance closely approximated the subjective evaluation of a native-speaking English instructor. This result not only suggests the importance of psychophysical factors in the adequateness or naturalness evaluation of speech but it also implies potential extensibility of the proposed model to multiple languages. Less
这个项目的目的是寻找线索,人类通过这些线索来检索语音的时间结构,了解它们的用法,并建立一种量化方法来评估给定语音的时间充分性或自然性,以复制人类判断的表现。为此,实施了三项基本调查任务,一项是心理物理层面的研究,一项是语言层面的研究,以及评估模型的构建。这个项目的一个显著特点是,它几乎和语言一样强调心理物理方面,尽管它的主要对象是口语。由于一个人可以很容易地识别快速语音,即使在意义未知的外语中,我们假设语音的时间方面的处理无疑涉及非语言的、因此与语言无关的活动。通过专注于这种独立于给定…的处理更多的语言,我们将基本技术发展到一个语言处理开销很小的系统,以及对多语言的简单扩展。主要结果如下:(I)心理物理水平:通过复制人类听觉处理的功能,开发了一种算法来预测给定语音中的时间参考点。这个算法最大的好处是它对几乎无限的语言变化的适用性,因为它不需要任何语言知识。(Ii)语言水平:一项实证研究表明,影响韵律单位感知的因素因特定语言对单位的选择而异。这一发现为在设计有效的外语训练方法时应该在多大程度上重视韵律因素提供了实践启示。(Iii)建模:通过在心理物理水平上整合调查得出的听觉功能,实现了一个数学模型来自动评估英语学习者的言语自然度。该模型的表现非常接近母语为英语的教师的主观评价。这一结果不仅表明了心理物理因素在言语充分性或自然性评价中的重要性,而且也暗示了所提出的模型对多种语言的潜在可扩展性。较少

项目成果

期刊论文数量(340)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
第二言語の音声学習-知覚と生成および処理階層間の相互作用-(招待講演)
第二语言语音学习——感知、产生和处理层之间的交互——(特邀演讲)
Judgment of onset asynchrony of two tone components and its relation to the cochlear delay.
两个音调成分起病不同步的判断及其与耳蜗延迟的关系。
Prosody generation for communicative speech synthesis.
用于交际语音合成的韵律生成。
音楽知覚研究用ツールSTRAIGHT&aimmatの機能
音乐感知研究工具STRAIGHT&aimmat的特点
Effects of auditory feedback in the przctice phase of imitating a piano performance
钢琴演奏模仿阶段听觉反馈的影响
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

KATO Hiroaki其他文献

KATO Hiroaki的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('KATO Hiroaki', 18)}}的其他基金

Recognition of pathogen-derived sphingolipid in plants.
植物中病原体衍生鞘脂的识别。
  • 批准号:
    20K15528
  • 财政年份:
    2020
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Structural basis for discrimination between multi-drug exporters and lipid floppies
多种药物出口商和脂质软盘之间歧视的结构基础
  • 批准号:
    19K22495
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
Structural basis for optimization of molecular probe using P-glycoprotein in vivo imaging
P-糖蛋白体内成像优化分子探针的结构基础
  • 批准号:
    24659018
  • 财政年份:
    2012
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Protein structural data mining based on the Neighborhood Fragment Spectra representation
基于邻域片段谱表示的蛋白质结构数据挖掘
  • 批准号:
    22500130
  • 财政年份:
    2010
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Studies on stability of fission yeast heterochromatin and its regulators
裂殖酵母异染色质及其调控因子的稳定性研究
  • 批准号:
    21870024
  • 财政年份:
    2009
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
High frequency and high field magnetic resonance in panoscopic- assembled rare earth magnets
全景组装稀土磁体中的高频高场磁共振
  • 批准号:
    20900111
  • 财政年份:
    2008
  • 资助金额:
    $ 31.62万
  • 项目类别:
A Modeling of Prosody Perception for Second Language Learning
第二语言学习的韵律感知建模
  • 批准号:
    20300069
  • 财政年份:
    2008
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Three-dimensional structural similarity search of proteins based on Geometrical Fragment Spectra
基于几何碎片谱的蛋白质三维结构相似性搜索
  • 批准号:
    19700139
  • 财政年份:
    2007
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Struchural biology of membrane protein trabsporters
膜蛋白转运蛋白的结构生物学
  • 批准号:
    17380066
  • 财政年份:
    2005
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Basic Study on Decentralized Society : role of State Governments in the USA and Provincial governments in Canada
去中心化社会的基础研究:美国州政府和加拿大省政府的作用
  • 批准号:
    16310166
  • 财政年份:
    2004
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)

相似海外基金

Determining the mechanisms of spoken language processing delay for children with cochlear implants
确定人工耳蜗植入儿童口语处理延迟的机制
  • 批准号:
    10537470
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
Determining the mechanisms of spoken language processing delay for children with cochlear implants
确定人工耳蜗植入儿童口语处理延迟的机制
  • 批准号:
    10669599
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
Doctoral Dissertation Research: Determining the mechanisms of spoken language processing delay for children with cochlear implants
博士论文研究:确定人工耳蜗儿童口语处理延迟的机制
  • 批准号:
    2141399
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Standard Grant
Doctoral Dissertation Research: Phonological Prediction in Spoken Language Processing
博士论文研究:口语处理中的语音预测
  • 批准号:
    2017696
  • 财政年份:
    2020
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Standard Grant
Integrative mechanisms in real-time spoken language processing
实时口语处理中的整合机制
  • 批准号:
    RGPIN-2015-06595
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Discovery Grants Program - Individual
Spoken Language Processing as an Early Marker of Language Impairment in Bilingual Children
口语处理是双语儿童语言障碍的早期标志
  • 批准号:
    10456519
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
Spoken Language Processing as an Early Marker of Language Impairment in Bilingual Children
口语处理是双语儿童语言障碍的早期标志
  • 批准号:
    10307519
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
Spoken Language Processing as an Early Marker of Language Impairment in Bilingual Children
口语处理是双语儿童语言障碍的早期标志
  • 批准号:
    10542851
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
Spoken Language Processing as an Early Marker of Language Impairment in Bilingual Children
口语处理是双语儿童语言障碍的早期标志
  • 批准号:
    10064624
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
Integrative mechanisms in real-time spoken language processing
实时口语处理中的整合机制
  • 批准号:
    RGPIN-2015-06595
  • 财政年份:
    2018
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了