权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

同型性に基づく抽象化プランニングのロボットの行動学習への応用

基于同构的抽象规划在机器人行为学习中的应用

基本信息

批准号：
07750460
负责人：
山口智浩
金额：
$ 0.64万
依托单位：
Osaka University
依托单位国家：
日本
项目类别：
Grant-in-Aid for Encouragement of Young Scientists (A)
财政年份：
1995
资助国家：
日本
起止时间：
1995 至无数据
项目状态：
已结题

项目摘要

本年度は、前年度の成果として得られた、状態の同型性を抽象化に利用する“同型性に基づく抽象化問題解決"を拡張し、状態空間の階層的な同型性を利用して、効率的に抽象化問題解決する方法を研究すると共に、一般的な分野への応用として、同型な機能、構造を持つロボットの行動学習として、同型性に基づく抽象化強化学習法を考案し、以下の研究を行った。(1)状態空間の階層的な同型性の解析による、階層化抽象空間の生成同型性に基づく抽象化だけでは不十分な場合、抽象空間の階層的な同型性を利用すると、階層的な抽象空間を段階的に生成して、より小さな抽象空間を求め、解析の計算コストを削減することができることを示した。(2)効率的な抽象化プランニングと詳細化生成した階層的な抽象空間中に、初期状態と目標状態とを写像し、抽象空間における、初期状態と目標状態とを結ぶ状態遷移をプランニングにより求めて、抽象プランを効率よく探索できることを示した。(3)ロボットの行動学習システムの構築現有の計算機と通信しながら学習するロボットの行動学習システムを構築した。シミュレーション学習と実環境での実ロボットとのハイブリッド強化学習システムを作成し、両者の学習システムを共通化することにより、仮想個体、実ロボット間での学習結果の交換を可能とした。学習法として、経験強化型のClassifier Systemを元にして、高速化の拡張を行い、従来困難だった実ロボットでの実時間強化学習を実現した。(4)同型性に基づく強化学習法による、ロボットの多様な行動の獲得構築したロボットの行動学習システムを用いて、まずあるタスクで強化学習を行い、得た学習結果に対し、行為の同型性を利用した置換を組み合わせ的に施して同型な学習結果を生成し、学習結果のバリエーションの探索を行う。その結果、学習したタスクを達成する、同型な挙動や、学習タスクに似た、類似挙動など、従来の強化学習法では、得られない多様な行動を、効率的に獲得することができた。

In this year and the previous year, the results of this year and the previous year have been successful, the status of homomorphism has been abstracted, the abstraction of homomorphism has been used to solve the problem of abstraction of homomorphism, and the homomorphism of spatial distribution has been used to solve the problem of abstraction and efficiency. The methods of solving problems of abstraction have been used to study the problem of homomorphism, the same type of equipment, the In order to strengthen the examination plan of the chemical method, and the following research program, we should carry out the training program of behavioral science, the basis of homology and the abstract of the chemical examination plan. The main results are as follows: (1) the analysis of the homomorphism of the state space, the generation of the same type of the abstract space, the analysis of the homomorphism of the abstract space, the analysis of the homomorphism of the state space, the analysis of the homomorphism of the state space, the analysis and calculation of the abstract space, the abstract space segment, the abstract space segment, the abstract space segment and the abstract space segment. (2) the abstraction of the operating rate, the generation of the abstract space, the initial status header, the initial status target, the initial status target, the initial (3) there is an existing calculation machine, communication system, computer science and communication system. (3) there is an existing calculation machine. You may want to learn more about the environment, and the environment. Study the method to improve the performance of Classifier System, enhance the performance of high-speed equipment, and improve the performance of chemical analysis when it is necessary to improve the performance of the system. (4) the homotypic basic method strengthens the chemical experiment and the multi-test behavior of the chemical medicine. The results of the chemical experiments, the results of the The result of the study is that you will learn how to explore and explore. The result of the experiment, the result of the experiment, the result of the experiment,