权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Development of emotion recognition system by transfer learning for various speeches

通过各种语音的迁移学习开发情感识别系统

基本信息

批准号：
22K12087
负责人：
小坂哲夫
金额：
$ 2.58万
依托单位：
Yamagata University
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (C)
财政年份：
2022
资助国家：
日本
起止时间：
2022-04-01 至 2025-03-31
项目状态：
未结题

项目摘要

本年度は音声感情認識に関し以下の２点について検討した．1.言語的特徴および音響的特徴による感情認識の結果統合2.OGVC(オンラインゲームチャットコーパス)を対象とした音声認識1.について，従来我々は音声認識結果を深層学習モデルの一種であるBERTに入力し感情を認識する言語特徴を用いた方法と，音響特徴から時系列や統計量を用いて認識する２種類の出力を重み付き統合する方法を検討してきた．今回は２種類の特徴をディープニューラルネットワークで統合する方法を検討し，より高い性能を得ることができた．システムの概要としては，言語的特徴抽出のため，まず感情音声の音声認識を行い得られた誤りを含む音声認識結果を用いBERTで感情認識を行い4種類の感情に対する事後確率を得る．一方音響的特徴については，発話全体から各種特徴の統計量を得て認識する手法と，LSTMやGRUなどの時系列を表現できる深層学習モデルを用いて感情認識を行い，同様に事後確率を得る．その両者を統合してDNNに入力し最終的な認識結果を得る．日本語感情コーパスJTESを対象に評価を行った結果，4感情の識別タスクにおいて従来法では80.25%であったが提案法では82.25%を得ることができた．2.についてOGVCを対象に音声認識の検討を行い言語モデル適応が有効であることを示した．音響モデルにはJTESで適応したモデルを使用し，言語モデルはツイート文に適応したモデル，OGVCに適応したモデル，更にはツイート適応モデルを更にOGVCで適応したモデルの３種類を比較した．この結果いずれの方法も性能向上が得られることが分かったが，特にツイート適応が有効であることが分かった．

This year, the following two points are related to the understanding of sound and emotion. 1. The characteristics of speech and the characteristics of sound. 2. OGVC A method of integrating the two kinds of efforts is discussed. The method of integrating the two kinds of efforts is used to study the speech characteristics of the two kinds of speech characteristics. This is the first time I've ever seen such a thing. A summary of speech characteristics is extracted from speech, and the results of speech recognition are obtained by using BERT. The characteristics of a party sound are related to each other, and the whole speech is related to the statistical quantity of various characteristics. The method of understanding, the LSTM and GRU time series performance, the deep learning, the application, the emotional recognition, the same post-validation rate are obtained. The final result of the study was obtained. Japanese emotion JTES target evaluation line results, 4 emotion recognition line 80.25% The sound of the sound is not only suitable for the use of the sound, but also suitable for the use of the sound. The result is that the performance of the method is upward.

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

深層学習モデルを用いた言語特徴と音響特徴の後期融合による音声感情認識

使用深度学习模型通过后期融合语言特征和声学特征进行语音情感识别

DOI：
发表时间：
2023
期刊：
影响因子：
0
作者：
Daiki Akiyama;Tomio Goto;杉尾達也，小篠裕子;岡田純京，小篠裕子;城所悠太，新田直子，中村和晃，馬場口登;佐藤清秀，岸恵太，小坂哲夫
通讯作者：
佐藤清秀，岸恵太，小坂哲夫

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

小坂哲夫其他文献

Unsupervised Cross Adaptation Using Deep Neural Networks in Speech Recognition Systems

在语音识别系统中使用深度神经网络的无监督交叉适应

DOI：
10.14923/transinfj.2017jdp7076
发表时间：
2018
期刊：
電子情報通信学会論文誌D 情報・システム
影响因子：
0
作者：
冨田健斗;高木瑛;加藤正治;小坂哲夫
通讯作者：
小坂哲夫

Business Application for Sales Transaction Data by Using Genome Analysis Technology

利用基因组分析技术进行销售交易数据的商业应用

DOI：
发表时间：
2003
期刊：
Proc.of The 6th International Conference on Discovery Science 2003 LNAI 2843
影响因子：
0
作者：
佐藤裕亮;米田完;Kazuya Negishi;小坂哲夫;N.Katoh
通讯作者：
N.Katoh

Noisy speech recognition with discrete-mixture HMMs based on MAP estimation

基于 MAP 估计的离散混合 HMM 噪声语音识别

DOI：
发表时间：
2004
期刊：
Proc. of The 18th International Congress on Acoustics 2
影响因子：
0
作者：
T.Kosaka;M.Katoh;M.Kohda;小坂哲夫;阿部拓也;小坂哲夫;阿部拓也;小坂哲夫;加藤正治;阿部拓也;松本和樹;小坂哲夫;小坂哲夫;T.Kosaka
通讯作者：
T.Kosaka