Study on fast and accurate classifier learning method from unlabeled big data

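Basic information

Grant number:
20K21815
Principal investigator:
鷲尾隆
Amount:
$40,800
Host institution:
Osaka University
Host institution country:
Japan
Project category:
Grant-in-Aid for Challenging Research (Exploratory)
Fiscal year:
2020
Funding country:
Japan
Project period:
2020-07-30 to 2024-03-31
Project status:
Completed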

Project abstract

In recent years, the need to learn classifiers and regression models from big data has grown, but constraints and costs in data collection often make it difficult to obtain target variable values as supervisory signals. To address this, methods have recently been proposed for settings in which only sets of instances without target values, together with information on the distribution of those target values, are given. One is the UUC method, which learns a classifier from two sets of instances with different proportions of positive and negative examples. Another is the uncoupled regression method, which learns a regression model from a set of instances for which only the pairwise ordering of target values is known, together with a set of instances without target values. Both assume that the distribution of target values, such as the proportion of positive and negative examples in each set, is known in advance. In real big data, however, this distribution is rarely known, which is an obstacle to applying these methods in practice. A further problem is how to evaluate the accuracy and uncertainty of a learned classifier or regression model when no true target values are available at all.

Through fiscal year 2021 (Reiwa 3), this project therefore worked on (1) methods that estimate the distribution of target values from data for which it is unknown and then learn classifiers and regression models, and (2) methods that evaluate the accuracy and uncertainty of classifiers and regression models learned without supervisory signals. Under the COVID-19 pandemic, however, neither item progressed sufficiently. For (1), the development of a UUC method that does not use the target value distribution was left unfinished; for (2), the construction of a method for estimating the posterior distributions of the model parameters and target values was left unfinished.

In fiscal year 2022 (Reiwa 4), for (1) the project explored the conditions under which a classifier can be learned even when the target value distribution is unavailable, and developed a UUC classifier learning principle and learning algorithm based on those conditions. For (2), it developed a principle and algorithm that can estimate the posterior distributions of model parameters and target values by using, as auxiliary information, a model that reflects prior knowledge of the target problem.
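The baseline setting that the abstract attributes to existing UUC methods (two unlabeled sets whose positive proportions differ and are known in advance) can be illustrated with a small sketch. It is not the project's algorithm, which targets the case where those proportions are unknown. The function names, the one-dimensional Gaussian data, and the values `theta1`, `theta2`, and `pi` are all assumptions made for illustration. The idea is that each unlabeled set is a mixture of the positive and negative class distributions, so the two mixture equations can be inverted to estimate a classifier's error rates without any labels.

```python
import numpy as np


def uu_zero_one_risk(scores_u1, scores_u2, theta1, theta2, pi):
    """Estimate the 0-1 risk of sign(score) from two unlabeled sets.

    theta1, theta2: positive-class proportions of the two sets (assumed known).
    pi: positive-class proportion at test time (assumed known).
    """
    if theta1 == theta2:
        raise ValueError("the two sets must have different class proportions")
    d = theta1 - theta2
    # Fraction of each unlabeled set predicted negative / positive.
    neg1, neg2 = np.mean(scores_u1 <= 0), np.mean(scores_u2 <= 0)
    pos1, pos2 = 1.0 - neg1, 1.0 - neg2
    # Invert p_i = theta_i * p_pos + (1 - theta_i) * p_neg to recover
    # P(predict negative | positive) and P(predict positive | negative).
    false_neg = ((1 - theta2) * neg1 - (1 - theta1) * neg2) / d
    false_pos = (theta1 * pos2 - theta2 * pos1) / d
    return pi * false_neg + (1 - pi) * false_pos


def sample_unlabeled(rng, n, theta):
    """Draw n unlabeled points; a fraction theta comes from the positive class."""
    is_pos = rng.random(n) < theta
    return np.where(is_pos, rng.normal(1.5, 1.0, n), rng.normal(-1.5, 1.0, n))


# Hypothetical toy problem: positives ~ N(1.5, 1), negatives ~ N(-1.5, 1).
rng = np.random.default_rng(0)
theta1, theta2, pi = 0.8, 0.3, 0.5
u1 = sample_unlabeled(rng, 20000, theta1)
u2 = sample_unlabeled(rng, 20000, theta2)

# Choose the threshold classifier sign(x - t) with the lowest estimated risk.
thresholds = np.linspace(-3.0, 3.0, 121)
risks = [uu_zero_one_risk(u1 - t, u2 - t, theta1, theta2, pi) for t in thresholds]
best_t = float(thresholds[int(np.argmin(risks))])
best_risk = float(min(risks))
print(f"selected threshold: {best_t:.2f}, estimated risk: {best_risk:.3f}")
```

For this toy problem the best threshold is 0 by symmetry, so the selected threshold should land near it. The sketch divides by `theta1 - theta2`, which shows why the estimate degrades when the two proportions are close and why it cannot be computed at all when they are unknown, the obstacle the abstract describes.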

Project outputs

Journal papers (25)

Monographs (0)

Research awards (0)

Conference papers (0)

Patents (0)

Isolation Distributional Kernel: A New Tool for Kernel based Anomaly Detection

DOI:
10.1145/3394486.3403062
Publication date:
2020-07
Journal:
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
Impact factor:
0
Authors:
K. Ting;Bi-Cun Xu;T. Washio;Zhi-Hua Zhou
Corresponding author:
K. Ting;Bi-Cun Xu;T. Washio;Zhi-Hua Zhou

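アンサンブル最近傍距離を用いたラベル無しデータからの分類器学習

Classifier learning from unlabeled data using ensemble nearest-neighbor distances

DOI:
Publication date:
2020
Journal:
Impact factor:
0
Authors:
松本瑞季;鷲尾隆
Corresponding author:
鷲尾隆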

Class Prior Probability Estimation Using Density Ratio from Unlabeled and Contaminated Positive Datasets

DOI:
Publication date:
2021
Journal:
Impact factor:
0
Authors:
Takemoto Ayumi;Iwaki Sunao;Duo Zhoumao;Yasumuro Shinobu;Kumada Takatsune;伊東亮太，小川奈美，鳴海拓志，廣瀬通孝;関本大勢・本吉勇;Takeshi Yoshida and Eitaro Shinya
Corresponding author:
Takeshi Yoshida and Eitaro Shinya

Isolation kernel: the X factor in efficient and effective large scale online kernel learning

DOI:
10.1007/s10618-021-00785-1
Publication date:
2019-07
Journal:
Data Mining and Knowledge Discovery
Impact factor:
4.8
Authors:
K. Ting;Jonathan R. Wells;T. Washio
Corresponding author:
K. Ting;Jonathan R. Wells;T. Washio

Nanjing University (China)

DOI:
Publication date:
Journal:
Impact factor:
0
Authors:
Corresponding author:

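Other publications by 鷲尾隆

因果関係モデリングにおけるデータマイニング・グラフマイニング技術の活用

Use of data mining and graph mining techniques in causal relationship modeling

DOI:
Publication date:
2007
Journal:
日本化学会情報化学部会誌 Vol. 25, No. 3
Impact factor:
0
Authors:
Y. Higuchi;A. Foronda;C. Ohta;M. Yoshimoto;Y. Okada;Masato Tsukada;Kouki Miyoshi;西尾佳祐,岩井儀雄,長原一,谷内田正彦;鷲尾隆
Corresponding author:
鷲尾隆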