权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Nonlinear stochastic and dynamic decision processes by invariantAnd imbedding methods

采用不变和嵌入方法的非线性随机和动态决策过程

基本信息

批准号：
21540132
负责人：
OHTSUBO Yoshio
金额：
$ 2.91万
依托单位：
Kochi University
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (C)
财政年份：
2009
资助国家：
日本
起止时间：
2009 至 2012
项目状态：
已结题

项目摘要

We consider undiscounted semi-Markov decision process with a target set and our main concern is a problem minimizing threshold probability. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and there exists a stationary optimal policy. Also several value iteration methods and a policy improvement method are given in our model. Furthermore, we investigate a relationship between threshold probabilities and expectations for total rewards.

考虑具有目标集的未折现半马尔可夫决策过程，主要关注阈值概率最小化问题。我们将问题表述为具有循环类的无限视界情况。我们证明了最优值函数是最优性方程的唯一解，并且存在平稳最优策略。同时给出了几种数值迭代方法和一种策略改进方法。此外，我们研究了总奖励的阈值概率和期望之间的关系。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

ラグランジュ関数のフィボナッチ鞍点

拉格朗日函数的斐波那契鞍点

DOI：
发表时间：
2012
期刊：
京大数理研講究録「不確実・不確定環境下における数理的意思決定とその周辺」、
影响因子：
0
作者：
岩本誠一;木村寛
通讯作者：
木村寛

Threshold Probability and Expectation Criteria for Additive Reward System

加性奖励系统的阈值概率和期望标准

DOI：
发表时间：
2011
期刊：
影响因子：
0
作者：
M. Sakaguchi;Y.Ohtsubo
通讯作者：
Y.Ohtsubo

Weighted Quasi-Arithmetic Means and Domain Translations

加权准算术平均值和域翻译

DOI：
发表时间：
2012
期刊：
Journal of Advanced Computational Intelligence and Intelligent Informatics
影响因子：
0.7
作者：
Toshio Sakata;Kazumitsu Maehara;Takeshi Sasaki;Toshio Sumi;Mitsuhiro Miyazaki;Yoshitaka Watanabe;and Makoto Tagami;瀬野裕美;林正美・税所康正;桑野一成;K. Yagasaki;Yoshida Yuji
通讯作者：
Yoshida Yuji

負のマルコフ決定過程における二つの閾値確率最適化の方法,数理解析研究所講究録

负马尔可夫决策过程的两种阈值概率优化方法，数学研究所 Kokyuroku

DOI：
发表时间：
2011
期刊：
最適化モデルとアルゴリズムの新展開
影响因子：
0
作者：
岩本誠一;木村寛;矢ヶ崎一幸;瀬野裕美;渡部善隆;阪口昌彦,大坪義夫
通讯作者：
阪口昌彦,大坪義夫

Autocountinuity from below of set functions and convergence in measure

集合函数自下而上的自计数性和测度收敛性

DOI：
10.1007/978-3-642-22833-9_9
发表时间：
2011
期刊：
Nonlinear Maths.for Uncertainty and its Appli., Advances in Intel.and Soft Computing, Springer
影响因子：
0
作者：
Jun Li;Masami Yasuda;Ling Zhou
通讯作者：
Ling Zhou