权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

音響信号記号変換に基づいたセマンティックインタラクション

基于声学信号符号变换的语义交互

基本信息

批准号：
19024042
负责人：
奥乃博
金额：
$ 8.96万
依托单位：
Kyoto University
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research on Priority Areas
财政年份：
2007
资助国家：
日本
起止时间：
2007 至 2008
项目状态：
已结题

项目摘要

(1)「音を聞き分ける」音の量的爆発促進技術 :使用環境, 設置条件に関する事前知識量を極力減らした実時間ロボット聴覚ソフトウエア「HARK」を5月から公開を始め, 11月に京都大学で, 12月には韓国KISTで無料講習会を開催した. 今年度開発したHARKの新機能は, 従来の2値マスクから連続値のソフトマスクによるミッシングフィーチャマスク自動生成法であり, 音声認識率が10%程度向上した. また, ロボットの自己生成音を抑制するICAによるセミブラインド分離法も開発し, 音楽ロボットに応用した. すなわち, ロボットが自分の出す歌声やハミングの影響を抑制し, 音楽だけを聞いて実時間でビート認識する音楽ロボットを開発した. IEEE/RSJ IROS-2008発表の2本の論文が, Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist 4件中の2件に選ばれた.(2)「音を見せる」音の質的複雑さ軽減技術 :HARKと3D viewerを組み合わせた音環境可視化システムに, 既開発の俯瞰モードでの機能拡充,及び, 没入感モードでの「音アウエアネス」提示機能を開発した. 前者は, 音環境の早送り提示, 音声認識結果のカラオケ風表示, 及び, 機能の洗練化である. 後者に対しては, 音の気付きという音アウエアネスを向上させるには, 単なる高忠実音場再生ではなく, 分離合成というプロセスが不可欠であるという観点から取り組んだ. 人の動きは, ディスプレイ上に設置したステレオカメラで取得した画像データから色情報を用いた最近傍探索で認識している. 音の没入モードでの提示に分離合成というアプローチは他に例がなく, 今後の展開の可能性が示唆された.

(1)The explosion promotion technology of sound quantity: the use environment, the setting conditions, the amount of prior knowledge, the maximum reduction of the time, the opening of the HARK in May, the Kyoto University in November, and the KIST in Korea in December. HARK's new function, which was launched this year, is to increase the voice recognition rate by 10%. For example, if the sound is generated, the ICA will be able to separate the sound from the sound. The sound of the song is heard, and the sound of the song is heard. IEEE/RSJ IROS-2008 Publication of 2 of these papers, Award for Entertainment Robots and Systems (NTF Award) Nomination Finalist 2 out of 4 selected. (2)Sound quality reduction technology:HARK and 3D viewer are combined to visualize the sound environment. The function of overlooking the sound environment is developed, and the function of "sound loss" is developed. The former, sound environment early send prompt, sound recognition result The latter is opposite to the former, and the former is opposite to the latter, and the latter is opposite to the former, and the latter is opposite to the former. People move, they move. The sound of the sound into the bottom of the prompt separation synthesis and other examples, the possibility of future development is shown.

项目成果

期刊论文数量（82）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

マルチドメイン音声対話システムにおける対話履歴を利用したドメイン選択

多域口语对话系统中使用对话历史的域选择

DOI：
发表时间：
2007
期刊：
情報処理学会論文誌 48・5
影响因子：
0
作者：
Haruki Nagata;Shin'ichi Toda;Hiroshi Itsumura;Kenji Koyama;Yasunori Saito;Masanori Suzuki;Noboru Takahashi;神田直之他
通讯作者：
神田直之他

独立成分分析に基づく適応フィルタのロボット聴覚への応用

基于独立分量分析的自适应滤波器在机器人听觉中的应用

DOI：
发表时间：
2008
期刊：
日本ロボット学会誌 Vol.26, No.6
影响因子：
0
作者：
武田龍;中臺一博;駒谷和範;尾形哲也;奥乃博
通讯作者：
奥乃博

楽譜情報を援用した多重奏音楽音響信号の音源分離と調波・非調波統合モデルの制約付パラメータ推定の同時実現

利用乐谱信息同时实现多个音乐声信号的源分离和谐波/非谐波综合模型的约束参数估计

DOI：
发表时间：
2008
期刊：
情報処理学会論文誌 Vol.49, No.3
影响因子：
0
作者：
糸山克寿;後藤真孝;駒谷和範;尾形哲也、奥乃博
通讯作者：
尾形哲也、奥乃博

移動型および静止型マイクロホンアレイ統合による複数移動音源追跡

通过移动和固定麦克风阵列集成进行多移动声源跟踪

DOI：
发表时间：
2007
期刊：
日本ロボット学会誌 Vol.25, No.6
影响因子：
0
作者：
中臺一博;中島弘史;村瀬昌満;奥乃博;長谷川雄二;辻野広司
通讯作者：
辻野広司

Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR

使用 ICA BSS 和基于 MTF 的 ASR 进行两个同时连续语音识别的评估

DOI：
发表时间：
2007
期刊：
Lecture Notes in Artificial Intelligence 4570
影响因子：
0
作者：
Ryu Takeda;Shun' ichi Yamamoto;Kazunori Komatani;Tetsuya Ogata;Hiroshi G. Okuno
通讯作者：
Hiroshi G. Okuno

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

奥乃博其他文献

ロボット聴覚技術を用いた鳥類の歌行動分析の試み - 複数のマイクロホンアレイを用いた二次元リアルタイム歌定位 -

尝试利用机器人听觉技术分析鸟类的歌唱行为 - 使用多个麦克风阵列进行二维实时歌曲定位 -

DOI：
发表时间：
2017
期刊：
影响因子：
0
作者：
鈴木麗璽;炭谷晋司;中臺一博;奥乃博
通讯作者：
奥乃博

複数時期のデータを用いたNAMセグメントによる個人認証

使用多个时期的数据使用 NAM 分段进行个人身份验证

DOI：
发表时间：
2007
期刊：
情報とセキュリティシンポジウム (SCIS2007) 4F2-4
影响因子：
0
作者：
Sarker;B.K.;Yoshiyuki Nakatani;Yoshiaki Yasumura;Tetsuro Kitahara;奥乃博;Hiroshi G.Okuno;清水敬太;服部佑哉;田口明裕;Tetsuya Ogata;Yuya Hattori;人工知能学会(奥乃博);小島摩里子
通讯作者：
小島摩里子

Study on non-audible murmur speaker verification using multiple session data

基于多会话数据的非可闻杂音说话人验证研究

DOI：
发表时间：
2006
期刊：
ASA/ASJ Joint Meeting
影响因子：
0
作者：
Sarker;B.K.;Yoshiyuki Nakatani;Yoshiaki Yasumura;Tetsuro Kitahara;奥乃博;Hiroshi G.Okuno;清水敬太;服部佑哉;田口明裕;Tetsuya Ogata;Yuya Hattori;人工知能学会(奥乃博);小島摩里子;小島摩里子;Mariko Kojima;Mariko Kojima
通讯作者：
Mariko Kojima