
ベクトル量子化による状態・行動地図の不可逆圧縮

Lossy Compression of State-Action Maps by Vector Quantization

Basic information

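Grant number:
17760199
Principal investigator:
Ryuichi Ueda (上田隆一)
Amount:
$22,400
Host institution:
The University of Tokyo
Host institution country:
Japan
Project category:
Grant-in-Aid for Young Scientists (B)
Fiscal year:
2005
Funding country:
Japan
Project period:
2005 to 2006
Project status:
Completed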

Source:
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-17760199/
Keywords:
state-action map; dynamic programming; vector quantization; autonomous mobile robots; the Acrobot

Project abstract

This project compresses the memory footprint of a "state-action map", the solution obtained by solving an optimal control problem with dynamic programming, using vector quantization, a representative lossy compression technique. This fiscal year, the research centered on applying and evaluating the applicants' previously published algorithm. The results were presented at domestic and international conferences, and a variety of data suitable for future journal papers was obtained.

The developed algorithm was applied to large-scale problems and evaluated. As one such problem, the method was applied to a multi-agent task in robot soccer (passing). The state-action map had roughly 600 million elements. Computing it for 10 days on a machine with a 3.0 GHz CPU and 3.0 GB of RAM yielded a state-action map that exhibits cooperative behaviors such as passing and mutual collision avoidance. The map was then compressed in one day on the same machine, producing an 8.2 MB vector-quantized map, which is below the memory capacity (16 MB) of the robot on which it is to be implemented. Although only in simulation, the robots were shown to cooperate and work efficiently with the compressed map as well, demonstrating that the proposed method can compress maps for complex tasks to a small size without breaking them.

Furthermore, the method was compared with competing methods on three tasks: the swing-up task of the Acrobot, a highly nonlinear control problem; the robot soccer task described above; and the puddle world task, a standard problem in artificial intelligence. Compression ratio and performance degradation due to compression were measured as evaluation metrics. The results showed that, regardless of the type of task, the vector-quantized maps obtained with this method store a decision-making policy with low memory consumption and small performance degradation more stably than the other methods.
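
The compression idea in the abstract can be illustrated with a toy sketch. The Python below is a generic vector quantization of a small action table, not the project's algorithm: the block length, the codebook size, the Hamming distortion, the k-means-style training loop, and all function names are assumptions made for illustration, and this record does not describe how the actual method measures distortion.

```python
import random
from collections import Counter


def split_blocks(actions, block_len):
    """Cut a flat action table into fixed-length blocks (the 'vectors')."""
    if len(actions) % block_len != 0:
        raise ValueError("table length must be a multiple of block_len")
    return [tuple(actions[i:i + block_len])
            for i in range(0, len(actions), block_len)]


def hamming(a, b):
    """Assumed distortion: number of states whose action differs."""
    return sum(x != y for x, y in zip(a, b))


def nearest(block, codebook):
    """Index of the codeword with the smallest distortion to this block."""
    return min(range(len(codebook)), key=lambda k: hamming(block, codebook[k]))


def train_codebook(blocks, codebook_size, iterations=10, seed=0):
    """k-means-style training: assign blocks, then take a per-state majority vote."""
    rng = random.Random(seed)
    distinct = sorted(set(blocks))
    codebook = rng.sample(distinct, min(codebook_size, len(distinct)))
    for _ in range(iterations):
        clusters = [[] for _ in codebook]
        for block in blocks:
            clusters[nearest(block, codebook)].append(block)
        # The majority action per position minimizes Hamming distortion in a cluster.
        codebook = [
            tuple(Counter(column).most_common(1)[0][0] for column in zip(*members))
            if members else code
            for code, members in zip(codebook, clusters)
        ]
    return codebook


def encode(blocks, codebook):
    """Replace every block with the index of its nearest codeword."""
    return [nearest(block, codebook) for block in blocks]


def decode(indices, codebook):
    """Rebuild a flat (possibly lossy) action table from codeword indices."""
    table = []
    for index in indices:
        table.extend(codebook[index])
    return table


# Hypothetical toy map: 64 states, 3 actions, one action per run of 8 states.
toy_map = [(state // 8) % 3 for state in range(64)]
toy_blocks = split_blocks(toy_map, block_len=8)
toy_codebook = train_codebook(toy_blocks, codebook_size=3)
toy_indices = encode(toy_blocks, toy_codebook)
restored = decode(toy_indices, toy_codebook)
mismatches = sum(a != b for a, b in zip(toy_map, restored))
```

In this toy case the 64-entry table is stored as 3 codewords of 8 entries plus 8 indices. With three codewords the table is reproduced exactly. With fewer codewords, some states receive a different action, which is the lossy trade-off whose effect on task performance the project measured.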