Studios on theory of optimization with utility in stochastic model
随机模型效用优化理论工作室
基本信息
- 批准号:14540125
- 负责人:
- 金额:$ 2.18万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (C)
- 财政年份:2002
- 资助国家:日本
- 起止时间:2002 至 2004
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The summary of research results is as follows.1.We consider risk minimizing problems in undiscounted Markov decisions processes with a target set. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and there exists an stationary optimal policy. Also we give several value iteration methods and a policy improvement method. We also consider eight problems in which we maximize or minimize threshold probabilities in discounted Markov decision processes with bounded reward set. We show that such problems are classified to two equivalence classes and give a relationship between optimal values and optimal policies of problems in each equivalence class. We also give two sufficient conditions for the existence of an optimal policy. Finally we give a relationship of optimal values between first and second equivalence classes.2.We solves a finite horizon stochastic optimization problem with forward recursive criterion through dynamic programming. The basic idea is to apply invariant imbedding method for stochastic programming.3.We show that weakly null-additive fuzzy measures on metric spaces posses regularity Lusin's theorem is generalized to fuzzy measure space by using the regularity and weakly null-additivity4.We introduce an idea of finite intersection family into a topological space, characterize several concepts in a topological space by mean of finite intersection family and illustrate some applications of finite intersection family.5.EM-algorithm users believe that the conditions of Wu(1983) assure the convergence of GEM sequence, but this paper gives a brief counter example which satisfies Wu's conditions but not converge to MILE or any optimal solutions. It also gives a correction of his proof for the convergence of EM sequence.
主要研究结果如下:1.考虑了目标集为非折扣马氏决策过程的风险最小化问题。我们制定的问题作为一个无限的地平线的情况下,经常性的类。我们证明了最优值函数是最优性方程的唯一解,并且存在平稳最优策略。给出了几种数值迭代方法和一种策略改进方法。我们还考虑了八个问题,其中我们最大化或最小化的阈值概率折扣马尔可夫决策过程有界的回报集。我们证明了此类问题可分为两个等价类,并给出了每个等价类中问题的最优值和最优策略之间的关系。我们还给出了最优策略存在的两个充分条件。最后给出了第一等价类和第二等价类之间的最优值关系。2.利用动态规划方法求解了一个具有前向递归准则的有限时间随机优化问题。3.证明了度量空间上的弱零可加模糊测度的正则性,利用正则性和弱零可加性将Lusin定理推广到模糊测度空间。4.在拓扑空间中引入有限交族的概念,利用有限交族刻画了拓扑空间中的几个概念,并举例说明了有限交族的一些应用。本文给出了一个简单的反例,它满足吴的条件,但不收敛于MILE或任何最优解。对他关于EM序列收敛性的证明进行了修正。
项目成果
期刊论文数量(28)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
K.Nomakuchi: "Does GEM converge to MLE under Wu's conditions-A counter example-"Bulletin of Informatics and Cybernetics. (印刷中). 1-7 (2004)
K.Nomakuchi:“在 Wu 的条件下,GEM 是否收敛于 MLE - 一个反例 -”《信息学和控制论通报》(出版中)1-7(2004 年)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Does GEM converge to MLE under Wu' s condition -A counter example-
在吴的条件下GEM是否收敛到MLE-反例-
- DOI:
- 发表时间:2005
- 期刊:
- 影响因子:0
- 作者:K.Nomakuchi;K.Nomakuchi
- 通讯作者:K.Nomakuchi
M.Yasuda: "Au approach to stopping problems of a dymanic fuzzy system (with M.Kurano, J.Nakagami and Y.Yoshida"Fuzzy Sets and Systems. 131. 225-233 (2002)
M.Yasuda:“Au 方法来阻止动态模糊系统的问题(与 M.Kurano、J.Nakagami 和 Y.Yoshida 合作)”Fuzzy Sets and Systems. 131. 225-233 (2002)
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
S.Iwamoto: "Optimal stopping in fuzzy environment"Proceedings of the 9-th Bellman Continuum, Series of Information & Management Sciences. 2. 264-269 (2002)
S.Iwamoto:“模糊环境中的最优停止”第 9 届贝尔曼连续体论文集,信息系列
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Equivalence classes for optimization risk models in Markov decision processes
马尔可夫决策过程中优化风险模型的等价类
- DOI:
- 发表时间:2004
- 期刊:
- 影响因子:0
- 作者:Y.Ohtsubo;K.Toyonaga
- 通讯作者:K.Toyonaga
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
OHTSUBO Yoshio其他文献
OHTSUBO Yoshio的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('OHTSUBO Yoshio', 18)}}的其他基金
Nonlinear stochastic and dynamic decision processes by invariantAnd imbedding methods
采用不变和嵌入方法的非线性随机和动态决策过程
- 批准号:
21540132 - 财政年份:2009
- 资助金额:
$ 2.18万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Nonlinear dynamic optimization theory on stochastic model and its application to mathematical finance
随机模型的非线性动态优化理论及其在数理金融中的应用
- 批准号:
17540121 - 财政年份:2005
- 资助金额:
$ 2.18万 - 项目类别:
Grant-in-Aid for Scientific Research (C)