Nonlinear dynamic optimization theory on stochastic model and its application to mathematical finance
随机模型的非线性动态优化理论及其在数理金融中的应用
基本信息
- 批准号:17540121
- 负责人:
- 金额:$ 2.37万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (C)
- 财政年份:2005
- 资助国家:日本
- 起止时间:2005 至 2007
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The summary of research results is as follows.1. We consider multistage decision processes where a criterion function is an expectation of minimum function and formulate it as Markov decision processes with imbedded parameters. The policy depends upon a history including past imbedded parameters and the rewards at each stage is random and depends upon a current state, a current action and a next state. We give an optimality equation by using operators and show that there exist a right continuous deterministic Markov policy which depend upon a current state and an imbedded parameter.2. We consider Markov decisions processes with a target set, where criterion function is an expectation of minimum function. We formulate the problem as an infinite horizon case with a recurrent class. We show under some conditions that an optimal value function is a unique solution to an optimality equation and there exists an stationary optimal policy. Also we give a policy improvement method.3. We conside … More r a stochastic shortest path problem with associative criteria in which for each node of a graph we choose a probability distribution over the set of successor nodes so as to reach a given target node optimally. We formulate such a problem as an associative Markov decision processes. We show that an optimal value function is a unique solution to an optimality equation and find an optimal stationary policy. Also we give a value iteration method and a policy improvement method.4. We consider utility-constrained Markov decision processes. The expected utility of the total discounted reward is maximized subject to multiple expected utility constraints. By introducing a corresponding Lagrange function, saddle-point theorem of the utility constrained optimization is derived. The existence of a constrained optimal policy is characterized by optimal action sets specified with a parametric utility.5. We consider an inequality condition where one side is greater than or equal to a multiple of the other side and an equality holds if and only if one value is a multiple of the other variable. We show a cross-duality between four pairs of Golden inequalities for one-variable functions. Less
研究结果总结如下:1。考虑准则函数为最小函数期望的多阶段决策过程,并将其表述为带嵌入参数的马尔可夫决策过程。策略取决于历史,包括过去嵌入的参数,每个阶段的奖励是随机的,取决于当前状态、当前行动和下一个状态。利用算子给出了一个最优性方程,并证明了存在一个依赖于当前状态和嵌入参数的右连续确定性马尔可夫策略。我们考虑具有目标集的马尔可夫决策过程,其中准则函数是最小函数的期望。我们将问题表述为具有循环类的无限视界情况。在一定条件下,证明了最优值函数是最优性方程的唯一解,且存在平稳最优策略。并提出了政策改进的方法。我们考虑一个具有关联准则的随机最短路径问题,其中对于图的每个节点,我们在后继节点集合上选择一个概率分布,以便最优到达给定的目标节点。我们将这个问题表述为一个关联马尔可夫决策过程。我们证明了最优值函数是最优性方程的唯一解,并找到了最优平稳策略。给出了一种值迭代法和一种策略改进法。我们考虑效用约束的马尔可夫决策过程。在多重期望效用约束下,总贴现奖励的期望效用最大化。通过引入相应的拉格朗日函数,导出了效用约束优化的鞍点定理。约束最优策略的存在性以参数效用指定的最优行为集为特征。我们考虑一个不等式条件,其中一边大于或等于另一边的倍数,等式成立当且仅当一个值是另一个变量的倍数。我们证明了单变量函数的四对金不等式之间的交叉对偶性。少
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Stopping game problem for dynamic fuzzy systems.
动态模糊系统的停止博弈问题。
- DOI:
- 发表时间:2005
- 期刊:
- 影响因子:0
- 作者:Y.Yoshida;M.Yasuda;J.Nakagami;M.Kurano
- 通讯作者:M.Kurano
A New Evaluation of Mean Value for Fuzzy Numbers and its Application to American Put Option under Uncertainty
模糊数均值的新评价及其在不确定性下美式看跌期权中的应用
- DOI:
- 发表时间:2006
- 期刊:
- 影响因子:0
- 作者:Y.Yoshida;M.Yasuda;J.Nakagami;M.Kurano
- 通讯作者:M.Kurano
A Fuzzy Approach to Markov Decision Processes with Uncertain Transition Probabilities
具有不确定转移概率的马尔可夫决策过程的模糊方法
- DOI:
- 发表时间:2006
- 期刊:
- 影响因子:0
- 作者:M.Kurano;M.Yasuda;J.Nakagami;Y.Yoshida
- 通讯作者:Y.Yoshida
Optimal threshold probability in semi-Markov decision processes with a target set
具有目标集的半马尔可夫决策过程中的最优阈值概率
- DOI:
- 发表时间:2008
- 期刊:
- 影响因子:0
- 作者:M. Sakaguchi;Y. Ohtsubo
- 通讯作者:Y. Ohtsubo
Golden optimal policy on dynamic optimization
动态优化的黄金最优策略
- DOI:
- 发表时间:2007
- 期刊:
- 影响因子:0
- 作者:T.Hida;Si Si;S. Iwamoto
- 通讯作者:S. Iwamoto
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
OHTSUBO Yoshio其他文献
OHTSUBO Yoshio的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('OHTSUBO Yoshio', 18)}}的其他基金
Nonlinear stochastic and dynamic decision processes by invariantAnd imbedding methods
采用不变和嵌入方法的非线性随机和动态决策过程
- 批准号:
21540132 - 财政年份:2009
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Studios on theory of optimization with utility in stochastic model
随机模型效用优化理论工作室
- 批准号:
14540125 - 财政年份:2002
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
相似海外基金
CAREER: Structured Minimax Optimization: Theory, Algorithms, and Applications in Robust Learning
职业:结构化极小极大优化:稳健学习中的理论、算法和应用
- 批准号:
2338846 - 财政年份:2024
- 资助金额:
$ 2.37万 - 项目类别:
Continuing Grant
Development of Riemannian constrained optimization theory and applications
黎曼约束优化理论及应用的发展
- 批准号:
22KJ0563 - 财政年份:2023
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Graph Algorithms and Optimization: Theory and Scalable Algorithms
图算法和优化:理论和可扩展算法
- 批准号:
22H05001 - 财政年份:2022
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (S)
Linear Optimization: Theory and Applications
线性优化:理论与应用
- 批准号:
RGPIN-2020-06846 - 财政年份:2022
- 资助金额:
$ 2.37万 - 项目类别:
Discovery Grants Program - Individual
Mathematical Optimization: Theory and Algorithms
数学优化:理论与算法
- 批准号:
RGPIN-2020-04324 - 财政年份:2022
- 资助金额:
$ 2.37万 - 项目类别:
Discovery Grants Program - Individual
Floquet Theory for Stochastic Temporal Networks and Optimization Theory for the Design of Schedules for COVID-19
随机时间网络的 Floquet 理论和 COVID-19 时间表设计的优化理论
- 批准号:
2052720 - 财政年份:2021
- 资助金额:
$ 2.37万 - 项目类别:
Continuing Grant
Mathematical Optimization: Theory and Algorithms
数学优化:理论与算法
- 批准号:
RGPIN-2020-04324 - 财政年份:2021
- 资助金额:
$ 2.37万 - 项目类别:
Discovery Grants Program - Individual
Linear Optimization: Theory and Applications
线性优化:理论与应用
- 批准号:
RGPIN-2020-06846 - 财政年份:2021
- 资助金额:
$ 2.37万 - 项目类别:
Discovery Grants Program - Individual
Security analysis and development of post-quantum cryptography using optimization theory
利用优化理论进行后量子密码学的安全分析和发展
- 批准号:
20K03741 - 财政年份:2020
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Development of nonlinear semidefinite optimization theory and application to machine learning
非线性半定优化理论的发展及其在机器学习中的应用
- 批准号:
20K19748 - 财政年份:2020
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Early-Career Scientists