Informed Sound Activity Detection in Music and Audio Signals

音乐和音频信号中的明智声音活动检测

基本信息

项目摘要

In music information retrieval (MIR), the development of computational methods for analyzing, segmenting, and classifying music signals is of fundamental importance. In this project's first phase (initial proposal), we explored fundamental techniques for detecting characteristic sound events present in a given music recording. Here, our focus was on informed approaches that exploit musical knowledge in the form of score information, instrument samples, or musically salient sections. We considered concrete tasks such as locating audio sections with a specific timbre or instrument, identifying monophonic themes in complex polyphonic music recordings, and classifying music genres or playing styles based on melodic contours. We tested our approaches within complex music scenarios, including instrumental Western classical music, jazz, and opera recordings. In the second phase of the project (renewal proposal), our goals will be significantly extended. First, we want to go beyond the music scenario by considering environmental sounds as a second challenging audio domain. As a central methodology, we plan to explore and combine the benefits of model-based and data-driven techniques to learn task-specific sound event representations. Furthermore, we will investigate hierarchical approaches to simultaneously incorporate, exploit, learn, and capture sound events that manifest on different temporal scales and belong to hierarchically ordered categories. An overarching goal of the project's second phase is to develop explainable deep learning models that provide a better understanding of the structural and acoustic properties of sound events.
在音乐信息检索(MIR)中,开发用于分析、分割和分类音乐信号的计算方法是至关重要的。在这个项目的第一阶段(最初的提议),我们探索了检测给定音乐录音中存在的特征声音事件的基本技术。在这里,我们的重点是以乐谱信息、乐器样本或音乐突出部分的形式利用音乐知识的知情方法。我们考虑了具体的任务,比如用特定的音色或乐器定位音频部分,在复杂的复调音乐录音中识别单音主题,以及根据旋律轮廓对音乐流派或演奏风格进行分类。我们在复杂的音乐场景中测试了我们的方法,包括器乐西方古典音乐、爵士乐和歌剧录音。在项目的第二阶段(续签提案),我们的目标将显著延长。首先,我们希望超越音乐场景,将环境声音视为第二个具有挑战性的音频领域。作为一种中心方法,我们计划探索并结合基于模型和数据驱动的技术的优点,以学习特定于任务的声音事件表示。此外,我们将研究分层方法,以同时合并、利用、学习和捕获在不同时间尺度上表现的、属于分层有序类别的声音事件。该项目第二阶段的首要目标是开发可解释的深度学习模型,以便更好地了解声音事件的结构和声学特性。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Dr.-Ing. Jakob Abeßer其他文献

Dr.-Ing. Jakob Abeßer的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

相似海外基金

Emplacing sound perception - Soundscape appraisal, physical activity, and well-being in UK and Australian street verges
加强声音感知——英国和澳大利亚街道边缘的声景评估、体育活动和福祉
  • 批准号:
    2433961
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Studentship
RI: Small: Enabling Sound-based Human Activity Monitoring for Home Service Robots
RI:小型:为家庭服务机器人提供基于声音的人体活动监控
  • 批准号:
    1910993
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Study on Detection Method of Abnormal Situation in Daily Life by Machine Learning of Indoor Activity Sound
室内活动声音机器学习检测日常生活异常情况的方法研究
  • 批准号:
    18K02236
  • 财政年份:
    2018
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Long term memory for sound prepares neural activity for perception
对声音的长期记忆为感知做好神经活动的准备
  • 批准号:
    489991-2016
  • 财政年份:
    2017
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Establishment of rehabilitation that improve the muscle activity during sports movement using sound stimulation feedback
建立利用声音刺激反馈改善运动过程中肌肉活动的康复方法
  • 批准号:
    16K16562
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Study on Detection of Abnormality in Daily Life by Frequency Analysis of Indoor Activity Sound
室内活动声音频率分析检测日常生活异常的研究
  • 批准号:
    15K00746
  • 财政年份:
    2015
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development of activity sound visualization method for personal evaluation of PBL
开发用于个人评估 PBL 的活动声音可视化方法
  • 批准号:
    15K01069
  • 财政年份:
    2015
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Effect on parasympathetic nerve activity for "the favorite sound stimulation" in the persistent vegetative patient
“喜爱的声音刺激”对持续性植物人副交感神经活动的影响
  • 批准号:
    25463365
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
A study of "Handiwork" and "Environment Arts" learning for Sound Material-Cycle Society Based on the activity of "tree" "straw" "snow" as the local material
基于“树”“稻草”“雪”当地材料活动的健全物质循环社会的“手工”和“环境艺术”学习研究
  • 批准号:
    19530793
  • 财政年份:
    2007
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Voice activity detection and estimation of sound source direction using a single microphone
使用单个麦克风进行语音活动检测和声源方向估计
  • 批准号:
    19700170
  • 财政年份:
    2007
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了