
International: An Analysis of Speaker Diarization Systems Errors


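Basic Information

Award number:
1135365
Principal investigator:
Nelson Morgan
Amount:
$15,000
Host institution:
International Computer Science Institute
Host institution country:
United States
Award type:
Standard Grant
Fiscal year:
2011
Funding country:
United States
Start and end dates:
2011-08-01 to 2012-07-31
Project status:
Completed

Source: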
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1135365&HistoricalAwards=false
Keywords:
International Analysis Speaker Diarization Systems

Project Abstract

This project will support the three-month visit of a US PhD student to the Idiap Research Institute in Switzerland, a leading international laboratory that has developed a state-of-the-art diarization system. Speaker diarization is the task of determining "who spoke when" without a priori knowledge of the number of speakers or speaker identities. The focus of the current effort is to perform error analysis in audio-only speaker diarization for the meeting domain. There are two main areas of interest. The first is to build a framework to analyze speaker diarization performance on specific types of segments (e.g., speaker changes, interruption, overlapped speech, short utterances, long utterances, etc.). By analyzing where speaker diarization systems perform poorly, speaker diarization researchers can focus on improving performance during those problematic types of segments. The second area is to compare speaker diarization performance across systems.

The project has substantial broader impacts. Speaker diarization is a useful step in meeting analysis. Considering the time people spend in meetings, improved speaker diarization could be useful for a broad portion of the population. While the goal is to characterize current speaker diarization errors, the knowledge gained from this work will be useful for improving future speaker diarization systems. In particular, by comparing where errors occur across multiple systems, the speaker diarization community can gain insight into the strengths and weaknesses of the various systems which could lead to a more novel way of combining systems to improve speaker diarization performance. In addition, the project will support the development of an international network of collaborators for a US graduate student.


Project Outcomes

Journal articles (0)

Monographs (0)

Research awards (0)

Conference papers (0)

Patents (0)


Other publications by Nelson Morgan

Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education]


DOI:
10.1109/msp.2009.932707
Year published:
2009
Journal:
IEEE Signal Process. Mag.
Impact factor:
0
Authors:
J. Baker; Li Deng; S. Khudanpur; Chin; James R. Glass; Nelson Morgan; Douglas D. O'Shaughnessy
Corresponding author:
Douglas D. O'Shaughnessy

Updated MINDS Report on Speech Recognition and Understanding


DOI:
10.1016/s1567-4231(09)70205-9
Year published:
2009
Journal:
IEEE Signal Processing Magazine
Impact factor:
14.9
Authors:
J. Baker; Li Deng; S. Khudanpur; Chin; James R. Glass; Nelson Morgan
Corresponding author:
Nelson Morgan

Writing programs that scale with increasing numbers of cores should be as easy as writing programs for sequential computers


DOI:
Year published:
2018
Journal:
Impact factor:
0
Authors:
K. Asanović; Rastislav Bodík; James Demmel; T. Keaveny; K. Keutzer; J. Kubiatowicz; Nelson Morgan; David A. Patterson; Koushik Sen; J. Wawrzynek; David Wessel; K. Yelick
Corresponding author:
K. Yelick