RI Medium: Audio Diarization - Towards Comprehensive Description of Audio Events

RI Medium:音频二值化 - 全面描述音频事件

基本信息

项目摘要

?Perceptual salience? is a term used by psychologists of vision to describe the power of an object to draw viewer attention; for example, it has been demonstrated that eye movements target salient objects sooner than less-salient objects, and that salient objects are detected more quickly than less-salient objects. The first sub-goal of this research is to develop automatic measurements of perceptual salience for auditory events, defined here to be a center-surround contrast in terms of amplitude, spectrum, or temporal features such as zero-crossing rate and periodicity. The second sub-goal of this research is to test salience measurements in an audio event detection paradigm, using the 2007 University of Illinois CLEAR evaluation system (Classification and Labeling of Events, Activities and Relationships). The third sub-goal of this research is to compare audio event transcriptions generated by human labelers viewing an audiovisual record of a meeting vs. transcriptions generated by labelers who listen to the audio without watching any accompanying video; the experimental hypothesis states that auditory salience predicts audio-only labels better than it predicts audiovisual labels. This research is designed as a collaboration between experts in computer vision and audio signal processing. If successful, the proposed methods will help to add an audio channel to the video security monitoring systems currently installed in many hospitals, nursing homes, government buildings and industrial sites.
?知觉突显?是视觉心理学家用来描述物体吸引观众注意力的能力的术语;例如,已经证明眼睛运动比不太突出的物体更快地瞄准突出物体,并且突出物体比不太突出的物体更快地被检测到。 本研究的第一个子目标是开发自动测量听觉事件的感知显著性,这里定义为幅度,频谱或时间特征(如过零率和周期性)的中心-环绕对比度。 本研究的第二个子目标是测试的音频事件检测范例中的显着性测量,使用2007年伊利诺伊大学的CLEAR评估系统(事件,活动和关系的分类和标签)。 本研究的第三个子目标是比较音频事件产生的标签观看会议的视听记录与产生的标签谁听音频,而不看任何伴随的视频;实验假设指出,听觉显着性预测音频标签比它预测视听标签。 这项研究是计算机视觉和音频信号处理专家之间的合作。 如果成功,所提出的方法将有助于为目前安装在许多医院、疗养院、政府大楼和工业场所的视频安全监控系统增加一个音频通道。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Mark Hasegawa-Johnson其他文献

Mark Hasegawa-Johnson的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Mark Hasegawa-Johnson', 18)}}的其他基金

FAI: A New Paradigm for the Evaluation and Training of Inclusive Automatic Speech Recognition
FAI:包容性自动语音识别评估和训练的新范式
  • 批准号:
    2147350
  • 财政年份:
    2022
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
RI: Small: Collaborative Research: Automatic Creation of New Speech Sound Inventories
RI:小型:协作研究:自动创建新语音库存
  • 批准号:
    1910319
  • 财政年份:
    2019
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
EAGER: Matching Non-Native Transcribers to the Distinctive Features of the Language Transcribed
EAGER:将非母语转录者与转录语言的独特特征相匹配
  • 批准号:
    1550145
  • 财政年份:
    2015
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
FODAVA-Partner: Visualizing Audio for Anomaly Detection
FODAVA-合作伙伴:可视化音频以进行异常检测
  • 批准号:
    0807329
  • 财政年份:
    2008
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Continuing Grant
Audiovisual Distinctive-Feature-Based Recognition of Dysarthric Speech
基于视听特征的构音障碍语音识别
  • 批准号:
    0534106
  • 财政年份:
    2005
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Continuing Grant
Prosodic, Intonational, and Voice Quality Correlates of Disfluency
韵律、语调和语音质量与不流畅的相关性
  • 批准号:
    0414117
  • 财政年份:
    2004
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Continuing Grant
CAREER: Landmark-Based Speech Recognition in Music and Speech Backgrounds
职业:音乐和语音背景中基于地标的语音识别
  • 批准号:
    0132900
  • 财政年份:
    2002
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Continuing Grant

相似海外基金

Collaborative Research: CyberTraining: Implementation: Medium: Training Users, Developers, and Instructors at the Chemistry/Physics/Materials Science Interface
协作研究:网络培训:实施:媒介:在化学/物理/材料科学界面培训用户、开发人员和讲师
  • 批准号:
    2321102
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
RII Track-4:@NASA: Bluer and Hotter: From Ultraviolet to X-ray Diagnostics of the Circumgalactic Medium
RII Track-4:@NASA:更蓝更热:从紫外到 X 射线对环绕银河系介质的诊断
  • 批准号:
    2327438
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
Collaborative Research: Topological Defects and Dynamic Motion of Symmetry-breaking Tadpole Particles in Liquid Crystal Medium
合作研究:液晶介质中对称破缺蝌蚪粒子的拓扑缺陷与动态运动
  • 批准号:
    2344489
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
Collaborative Research: AF: Medium: The Communication Cost of Distributed Computation
合作研究:AF:媒介:分布式计算的通信成本
  • 批准号:
    2402836
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Continuing Grant
Collaborative Research: AF: Medium: Foundations of Oblivious Reconfigurable Networks
合作研究:AF:媒介:遗忘可重构网络的基础
  • 批准号:
    2402851
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Continuing Grant
Collaborative Research: CIF: Medium: Snapshot Computational Imaging with Metaoptics
合作研究:CIF:Medium:Metaoptics 快照计算成像
  • 批准号:
    2403122
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Differentiable Hardware Synthesis
合作研究:SHF:媒介:可微分硬件合成
  • 批准号:
    2403134
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的图形处理单元性能仿真
  • 批准号:
    2402804
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
Collaborative Research: CIF-Medium: Privacy-preserving Machine Learning on Graphs
合作研究:CIF-Medium:图上的隐私保护机器学习
  • 批准号:
    2402815
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Tiny Chiplets for Big AI: A Reconfigurable-On-Package System
合作研究:SHF:中:用于大人工智能的微型芯片:可重新配置的封装系统
  • 批准号:
    2403408
  • 财政年份:
    2024
  • 资助金额:
    $ 24.99万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了