权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

少量の実データに基づく画像内文字認識及びその応用

基于少量真实数据的图像字符识别及其应用

基本信息

批准号：
22KJ0905
负责人：
白定勳
金额：
$ 1.47万
依托单位：
The University of Tokyo
依托单位国家：
日本
项目类别：
Grant-in-Aid for JSPS Fellows
财政年份：
2023
资助国家：
日本
起止时间：
2023-03-08 至 2024-03-31
项目状态：
已结题

项目摘要

目的「少量の実データに基づく画像内文字認識及びその応用」に合う研究を順調に進めた。まず、計画通りに、合成データで作り難い、難しいデータの例として「漫画内のオノマトペテキスト」に注目し、それらを集めたデータセットを作成して公開した。オノマトペテキストは、合成で作り難い分、少量の実データを上手く活用して認識する必要があり、それに役立ついくつかの手法を適用して、精度を改善した。その内容を、7月には国内の最大級画像処理学会MIRUにて発表し、MIRUインタラクティブ発表賞を頂いた。また、10月に画像処理系のトップ国際学会ECCVでも発表した。その後、研究課題の目的「少量の実データの有効活用」に繋がる別の研究として、少量の文字画像(character image)を有効活用する研究を行っている。具体的には、複数の文字画像を組み合わせて、一つの疑似単語画像(word image)を作る研究を勧めている。文字画像が多ければ多いほど、文字画像を組み合わせるパターンは膨大な数になるため、文字画像を組み合わせることで、膨大な量の「疑似単語画像」を得られる。我々はこの組み合わせで作った「疑似単語画像」が、少量の単語画像を補うのに有効的であることを示した。今現在トップ国際会議ICCVに提出して、結果を待っている。今後は、この研究の改善や拡張を行う予定である。

Objective: To study the relationship between Chinese characters recognition and Chinese characters use in basic images. In addition, it is difficult to create a "comic book" and "comic book". To improve the accuracy of the method, we must first understand the necessity of using the method. In July, MIRU, the largest image processing society in China, launched its first exhibition. In October, the International Society for Image Processing (ECCV) launched its report. The purpose of this research project is to conduct research on the effective use of a small number of characters. Specific words, plural words and images are grouped together, and a suspected word image is studied. Text portrait is composed of multiple characters, text portrait is composed of multiple characters. I am a member of the group and I am a member of the group. The ICCV is now presented and the results are awaited. Future research and improvement