TalkPrinting: New Features and Models for Automatic Speaker Recognition

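Basic information

  • Grant number:
    0544682
  • Principal investigator:
  • Amount:
    --
  • Host institution:
  • Host institution country:
    United States
  • Project category:
    Standard Grant
  • Fiscal year:
    2005
  • Funding country:
    United States
  • Project period:
    2005-09-15 to 2010-02-28
  • Project status:
    Completed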

Project abstract

Automatic speaker recognition is critical for many applications, ranging from secure access and intelligence gathering to archiving and understanding conversations. Current speaker recognition systems model specific speaker characteristics, but a vast range of habitual and stylistic differences has only begun to be explored. These include patterns of intonation, energy, and duration, as well as habitual word and phrase usage. Exploiting information in these heterogeneous modes of variation presents challenges in feature selection, modeling, and information combination. Feature discovery and selection efforts will consider the large variety of stylistic features that may be available. The feature space transformation and modeling phase of the work will explore the feature space using dimensionality reduction and clustering. The resulting features will be modeled to focus on specific classes of features. Further system combination research will study how individual systems for specific feature types can best be combined to optimize recognition performance. The new features and modeling approaches will be evaluated in the annual Speaker Recognition Evaluation.

The proposed work will lead to the identification of new extractable features characterizing individual speaker behavior. It explores more sophisticated models to better capture complex behavior and relationships. The project has impact on intelligence, law enforcement, security, and other applications by enhancing recognition performance. Because the new features are based on performance behavior rather than simply on vocal tract physiology, they can also be used for tasks such as emotion recognition or conversation detection. The systems will be freely available, and the project will engage under-represented graduate students.

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Elizabeth Shriberg

Bootstrapping Domain Detection Using Query Click Logs for New Domains
  • DOI:
    10.21437/interspeech.2011-276
  • Year published:
    2011
  • Journal:
  • Impact factor:
    0
  • Authors:
    Dilek Z. Hakkani; Gökhan Tür; Larry Heck; Elizabeth Shriberg
  • Corresponding author:
    Elizabeth Shriberg
Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech
  • DOI:
  • Year published:
    2003
  • Journal:
  • Impact factor:
    0
  • Authors:
    Elizabeth Shriberg; A. Stolcke; D. Baron
  • Corresponding author:
    D. Baron
Spontaneous speech: how people really talk and why engineers should care
  • DOI:
    10.21437/interspeech.2005-3
  • Year published:
    2005
  • Journal:
  • Impact factor:
    0
  • Authors:
    Elizabeth Shriberg
  • Corresponding author:
    Elizabeth Shriberg
Confidence Estimation for Speech Emotion Recognition Based on the Relationship Between Emotion Categories and Primitives
Prosody Modeling for Automatic Speech Understanding: An Overview of Recent Research at SRI
  • DOI:
  • Year published:
    2008
  • Journal:
  • Impact factor:
    0
  • Authors:
    Elizabeth Shriberg; A. Stolcke
  • Corresponding author:
    A. Stolcke

Other grants held by Elizabeth Shriberg

EAGER: A Corpus of Aligned Speech and ANS Sensor Data
  • Grant number:
    1449202
  • Fiscal year:
    2014
  • Funding amount:
    --
  • Project category:
    Standard Grant
STIMULATE: Modeling and Automatic Labeling of Hidden Word-Level Events in Spontaneous Speech
  • Grant number:
    9619921
  • Fiscal year:
    1997
  • Funding amount:
    --
  • Project category:
    Continuing Grant
Modeling Disfluencies in Spontaneous Speech
  • Grant number:
    9314967
  • Fiscal year:
    1994
  • Funding amount:
    --
  • Project category:
    Continuing Grant
NSF-NATO Postdoctoral Fellowships
  • Grant number:
    9353732
  • Fiscal year:
    1993
  • Funding amount:
    --
  • Project category:
    Fellowship Award

Similar overseas grants

A new approach to reconstruct tsunami history: assessments based on identification and numerical modeling of erosional features
  • Grant number:
    23H01252
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Grant-in-Aid for Scientific Research (B)
Echo - A New Set of High-level Audio Features for Computational Sound Design Systems
  • Grant number:
    RGPIN-2021-02893
  • Fiscal year:
    2022
  • Funding amount:
    --
  • Project category:
    Discovery Grants Program - Individual
GeneMatcher, VariantMatcher and PhenoDB, implementation of new features and connections
  • Grant number:
    10332123
  • Fiscal year:
    2022
  • Funding amount:
    --
  • Project category:
GeneMatcher, VariantMatcher and PhenoDB, implementation of new features and connections
  • Grant number:
    10605159
  • Fiscal year:
    2022
  • Funding amount:
    --
  • Project category:
Refined elastic membrane models with new small-scale features
  • Grant number:
    RGPIN-2016-03636
  • Fiscal year:
    2021
  • Funding amount:
    --
  • Project category:
    Discovery Grants Program - Individual
New assay method for pinpointing structural features in amyloid oligomer formation
  • Grant number:
    10214240
  • Fiscal year:
    2021
  • Funding amount:
    --
  • Project category:
Doctoral Dissertation Research: Production, Perception, and Acquisition of New Dialect Features by Speakers Moving Between Two Regions
  • Grant number:
    2041126
  • Fiscal year:
    2021
  • Funding amount:
    --
  • Project category:
    Standard Grant
Building on the Success of Project 105617; New Features, Components and Material Construction Towards Commercialising Novel Cell Concentration Devices
  • Grant number:
    10005367
  • Fiscal year:
    2021
  • Funding amount:
    --
  • Project category:
    Collaborative R&D
PFI-TT: An Analysis Tool Supporting the Safe Deployment of New Features in Evolving Software Systems
  • Grant number:
    2122689
  • Fiscal year:
    2021
  • Funding amount:
    --
  • Project category:
    Standard Grant
Echo - A New Set of High-level Audio Features for Computational Sound Design Systems
  • Grant number:
    DGECR-2021-00050
  • Fiscal year:
    2021
  • Funding amount:
    --
  • Project category:
    Discovery Launch Supplement