权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

ベイズ的大規模深層学習のための自然勾配近似法

贝叶斯大规模深度学习的自然梯度逼近方法

基本信息

批准号：
19J13477
负责人：
大沢和樹
金额：
$ 1.09万
依托单位：
Tokyo Institute of Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for JSPS Fellows
财政年份：
2019
资助国家：
日本
起止时间：
2019-04-25 至 2021-03-31
项目状态：
已结题

项目摘要

本研究の一年目に取り組んだ、「自然勾配法による大規模 DNN 学習の高速化」の研究成果を発展させた。まず始めに、自然勾配法で計算のボトルネックとなっていた、フィッシャー情報行列及びその逆行列の計算コストを軽減させる研究に取り組んだ。フィッシャー情報行列の性質を利用し、本研究では、フィッシャー情報行列及びその逆行列の計算頻度を自動調整する手法を提案した。学習に必要になる計算時間を約50%にまで削減することに成功した。この研究成果は機械学習分野最難関のジャーナルである IEEE TPAMI に採択された。次に、大規模計算機上における自然勾配法に伴う計算時間の詳細な解析とその計算パフォーマンスの最適化を行った。具体的には、フィッシャー情報行列の逆行列計算の分散化、フィッシャー情報行列の対称性を利用した計算の高速化、低精度演算による行列計算の高速化、計算と通信のオーバーラップによる通信コストの隠蔽、などの最適化を行った。大規模画像データセット ImageNetの分類タスクにおいてDNNの学習を2分で完了させることに成功した。この研究成果はデータマイニング分野最難関の国際会議 KDD2020 の本会議で発表を行った(採択率 16.9% 216/1279 件)。ここまでの本研究で用いてきた自然勾配法では、フィッシャー情報行列を1) DNNの層ごとのブロック対角行列に近似し、2) 各対角ブロックのクロネッカー因子分解近似を行っている。本研究では、この二つの近似について数理的な解析を行った。ニューロンの数を無限に持つ「無限幅DNN」に議論を限定することで、こうした近似を伴う自然勾配法は、正確な自然勾配法と同様の収束を実現できることを初めて証明した。この研究成果は機械学習分野最難関の NeurIPS2020 の本会議で口頭発表を行った(採択率 1.1% 105/9454 件)。

The research results of "natural matching method for speeding up large-scale DNN learning" are presented. In this paper, we study the natural matching method for calculating the information matrix and the inverse matrix. This study proposes a method for automatically adjusting the frequency of calculation of the information matrix. The time required for learning is reduced by approximately 50%. This research achievement is the most difficult part of mechanical learning. The natural matching method is used to optimize the calculation time on the secondary and large-scale computers Specifically, the decentralization of retrograde column calculation of information matrix, the symmetry of information matrix, the utilization of high-speed calculation, the high-speed calculation of low-precision calculation, the shielding of communication matrix, and the optimization of communication matrix are discussed. Large-scale image classification is completed in 2 minutes. The research results of this conference were released at the most difficult international conference KDD2020 (acquisition rate 16.9% 216/1279). In this study, the natural matching method was used to approximate the information array of DNN layers, and the factorization method was used to approximate the information array of DNN layers. This paper studies the mathematical analysis of the two approximate methods. The number of DNN is infinite. The argument is limited. The natural matching method is correct. The natural matching method is correct. The same bundle is realized. This research achievement is the most difficult part of the mechanical learning division, NeurIPS2020, and the oral presentation of this conference is carried out (the acquisition rate is 1.1% 105/9454).

项目成果

期刊论文数量（4）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Understanding approximate Fisher information for fast convergence of natural gradient descent in wide neural networks*

了解宽神经网络中自然梯度下降快速收敛的近似 Fisher 信息*

DOI：
10.1088/1742-5468/ac3ae3
发表时间：
2021
期刊：
Journal of Statistical Mechanics: Theory and Experiment
影响因子：
0
作者：
Karakida Ryo;Osawa Kazuki
通讯作者：
Osawa Kazuki

Rich Information is Affordable: A Systematic Performance Analysis of Second-order Optimization Using K-FAC

DOI：
10.1145/3394486.3403265
发表时间：
2020-07
期刊：
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
影响因子：
0
作者：
Yuichiro Ueno;Kazuki Osawa;Yohei Tsuji;Akira Naruse;Rio Yokota
通讯作者：
Yuichiro Ueno;Kazuki Osawa;Yohei Tsuji;Akira Naruse;Rio Yokota

Practical Deep Learning with Bayesian Principle

贝叶斯原理的实用深度学习