权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Content Retrieval against large-scale spoken documents based on the integration of speech and language processing

基于语音和语言处理集成的大规模语音文档内容检索

基本信息

批准号：
22500090
负责人：
AKIBA Tomoyosi
金额：
$ 2.83万
依托单位：
Toyohashi University of Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (C)
财政年份：
2010
资助国家：
日本
起止时间：
2010 至 2012
项目状态：
已结题

来源：
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-22500090/
关键词：
情報検索音声情報処理自然言語処理音声ドキュメント処理音声ドキュメント検索音声中の検索語検出音声内容検索音声ドキュメント検索音声検索語検出索引付けパッセージ検索音声認識クエリ拡張認識誤り対策 Spoken Term Detection 適合性モデル

项目摘要

We conducted the research and the development of spoken content retrieval targeting large-scale spoken documents. Firstly, for the spoken term detection (STD) task, which aimed to detect the position in a spoken document that a given term appeared at, we developed the method that did not require any detection threshold but, instead, outputted the candidates in increasing order of their plausibility. Finally, we achieved about 70 times faster detection at the almost same detection performance than the baseline continuous DP matching. Next, for the spoken content retrieval (SCR) task, which aimed to find the segment in a spoken document that was relevant to a given query topic represented in natural language, we developed the method robust for recognition errors and out-of-vocabularies (OOVs) that made use of STD as its preprocessing. We found that the proposed method was effective for the query including OOVs and worked complementally with the conventional SCR method, which made use of the large vocabulary continuous speech recognition (LVCSR), and that the combination of them improved the retrieval performance.

我们针对大规模的口语文档进行了口语内容检索的研究和开发。首先，对于口语术语检测（STD）任务，其目的是检测一个给定的术语出现在口语文档中的位置，我们开发的方法，不需要任何检测阈值，而是，输出的候选人在增加他们的可扩展性的顺序。最后，我们在几乎相同的检测性能下实现了比基线连续DP匹配快约70倍的检测。接下来，对于口语内容检索（SCR）任务，其目的是找到段在一个口语文档中，是相关的一个给定的查询主题表示在自然语言中，我们开发的方法鲁棒的识别错误和词汇表外（OOVs），利用STD作为其预处理。我们发现，所提出的方法是有效的查询，包括OOVs和工作的补充与传统的SCR方法，它利用了大词汇量连续语音识别（LVCSR），和它们的组合提高了检索性能。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

音声言語処理と自然言語処理

口语处理和自然语言处理

DOI：
发表时间：
2013
期刊：
影响因子：
0
作者：
K. Bouyarmane;A. Kheddar;中川聖一,小林聡,峯松信明,宇津呂武仁,秋葉友良,北岡教英,山本幹雄,甲斐充彦,山本一公,土屋雅稔
通讯作者：
中川聖一,小林聡,峯松信明,宇津呂武仁,秋葉友良,北岡教英,山本幹雄,甲斐充彦,山本一公,土屋雅稔

対訳のある教科書を用いた講義音声の統計的機械翻訳

使用双语翻译教材对讲座音频进行统计机器翻译

DOI：
发表时间：
2012
期刊：
影响因子：
0
作者：
福島太喜;秋葉友良
通讯作者：
秋葉友良

Language Modeling Approach for Retrieving Passages in Lecture Audio Data

用于检索讲座音频数据中的段落的语言建模方法

DOI：
发表时间：
2010
期刊：
Proceedings of International Conference on Language Resources and Evaluation
影响因子：
0
作者：
Koichiro Honda;Tomoyoshi Akiba
通讯作者：
Tomoyoshi Akiba

音声ドキュメント検索のための自発クエリの収録と検索性能評価

记录自发查询并评估音频文档检索的搜索性能