权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

画像中の重要領域の抽出と高能率符号化への応用

图像中重要区域的提取及其在高效编码中的应用

基本信息

批准号：
11750313
负责人：
長井隆行
金额：
$ 1.41万
依托单位：
The University of Electro-Communications
依托单位国家：
日本
项目类别：
Grant-in-Aid for Encouragement of Young Scientists (A)
财政年份：
1999
资助国家：
日本
起止时间：
1999 至 2000
项目状态：
已结题

项目摘要

本研究は、画像中の重要な領域を自動抽出し、その領域をより詳細に符号化することで、より効率よく画像情報を圧縮する手法の実現を目的として行った。本研究の成果は以下の通りである。1.ベースとなる画像符号化の性能向上フィルタバンク(ウェーヴレット)をベースとした画像符号化の性能向上のために、周波数帯域によって異なる基底長を持つ新しいフィルタバンクの構造と設計法を提案した。これにより、復号画像の品質向上を図った。また、提案するフィルタバンクが、seismic dataの圧縮に有効であることも明らかにした。2.重要領域を考慮した画像符号化方式上記のフィルタバンクをSPIHT符号化に適用し、さらに抽出した重要領域を重み付けすることにより重要領域を考慮した画像符号化を実現した。実際の画像を用いて主観評価実験を行い、有効性を確かめた。3.重要領域の定義と抽出手法画像中の重要領域のひとつとして、人間の顔を定義し、その抽出手法を検討した。具体的には、カラー静止画像の場合は、固有空間と色(肌の色)を用いて抽出を行い、動画像の場合は、抽出した顔領域を色情報により高速にトラッキングする手法を実現した。また、画像中の重要領域の二つ目として文字領域を定義し、その抽出手法を検討した。画像中の文字領域は、ウェーヴレット変換、独立成分分析、特徴空間からの距離の3つを組み合わせて特徴とし、ニューラルネットワークによって大量のデータから学習することで高い抽出精度を実現した。4.音声を併用した重要領域の抽出入力信号として、多チャンネルの音声が得られる時、これを用いて話者位置を推定し、その結果を画像符号化に反映させることを検討した。このためにまず、複数のマイクを2次元的に配置し、話者の位置を推定する手法を提案した。これにより、画像中のどの話者が現在発話しているかを知ることができ、動画像符号化に反映させることが可能となる。5.重要領域抽出と画像符号化手法の統合上述の顔領域抽出手法と画像符号化方式を統合し、PC上に実装した。

This study aims at automatically extracting important areas from images, symbolizing them in detail, and reducing image information. The results of this study are as follows. 1. The performance of image symbolization is upward, the frequency range is different, the base length is new, and the structural design method is proposed.これにより、复号画像の品质向上を図った。The proposal was made in the form of a proposal to reduce seismic data. 2. Important areas are considered to be represented by SPIHT symbolization. The portrait of the real world is in the middle of the evaluation process, and there is a certain quality. 3. The definition of important areas and the extraction of important areas in the portrait are discussed. In particular, in the case of a static image, the method of extracting color information from a natural space and color information from a moving image is realized at a high speed. The definition and extraction method of the important field in the portrait are discussed. The text field in the portrait is transformed, the independent component analysis, the feature space is separated, the distance is divided into three groups, the feature is combined, the character is separated, and the high extraction accuracy is realized. 4. Sound and sound are used together to extract input signals from important areas. When sound and sound are obtained, the position of the speaker is estimated, and the result is symbolized. The method of estimating the position of the speaker is proposed The words in the portrait are now transmitted, the symbolization of animation is reflected, and the words are possible. 5. Integration of important field extraction and image symbolization methods The integration of the above color field extraction methods and image symbolization methods is carried out on PC.