权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Overcoming the curse of dimensionality in dynamic programming by tensor decompositions

通过张量分解克服动态规划中的维数灾难

基本信息

批准号：
EP/V04771X/1
负责人：
Sergey Dolgov
金额：
$ 25.79万
依托单位：
University of Bath
依托单位国家：
英国
项目类别：
Research Grant
财政年份：
2021
资助国家：
英国
起止时间：
2021 至无数据
项目状态：
已结题

来源：
https://gtr.ukri.org/projects?ref=EP%2FV04771X%2F1
关键词：
Overcoming curse dimensionality dynamic programming

项目摘要

Almost 60 years ago Richard Bellman coined the expression ``curse of dimensionality'' when referring to the overwhelming computational complexity associated with the solution of multi-stage decision processes through dynamic programming (DP), leading to the well-known Bellman equation.Nowadays the curse of dimensionality has become an ubiquitous expression shared in different fields such as numerical analysis, compressed sensing and machine learning. However, it is in the computation of optimal feedback laws for the control of dynamical systems where its meaning continues to be most evident.Consider a simple pendulum, which is characterised by two variables, the position of the mass, and its velocity.A classical demonstration is the stabilisation of the pendulum in the unstable upwards position by moving the base adaptively.To synthesize the control signal for the actuator of the base that would be robust to stochastic fluctuations (e.g. from the air),one needs to compute a feedback map as a two-dimensional function of the position and velocity.This feedback map satisfies the two-dimensional partial differential equation (PDE), which has no analytic solution in general.Solving it numerically requires a discretisation of both the position and velocity with some n points.However, the total number of unknowns in the feedback map is n^2, corresponding to all combinations of the position and velocity points,and therefore the computations take longer.For d variables, the complexity of the straightforward numerical solutions grows exponentially as n^d.Even a simple quadrocopter model is described by 12 variables, and modern applications in particle physics or opinion dynamics require optimal control of dynamical systems, which are in turn PDEs, discretised with hundreds or thousands of variables.However, such systems are often very special in the sense that each variable is driven effectively by only neighbouring variables in the dynamics.In this case, the sought feedback map admits a tensor approximation with (almost) separated variables.This allows us to reduce the computational complexity dramatically, to a number of operations growing only polynomially with the number of variables, albeit at a price of introducing some error.This error can be further corrected by using methods from reinforcement learning, similar to those that made the winning artificial Go player.

近60年前，理查德·贝尔曼(Richard Bellman)在提到通过动态规划(DP)求解多阶段决策过程所带来的极大计算复杂性时，创造了“维度诅咒”一词，导致了著名的贝尔曼方程。如今，维度诅咒已成为数值分析、压缩传感和机器学习等不同领域中普遍存在的表达方式。然而，在动力学系统控制的最优反馈律的计算中，它的意义仍然是最明显的。考虑一个简单的摆，它由质量的位置和它的速度两个变量来表征。经典的演示是通过自适应地移动基座来使摆处于不稳定的向上位置。为了合成对随机波动(例如从空中)具有鲁棒性的基座执行机构的控制信号，需要计算一个反馈映射作为位置和速度的二维函数。这个反馈映射满足二维偏微分方程(PDE)，它一般没有解析解。数值求解需要将位置和速度同时离散化到n个点。然而，反馈映射中的未知数总数为n^2，对应于位置和速度点的所有组合，因此计算时间较长。对于d个变量，直接的数值解的复杂性以n^指数增长。即使一个简单的四轴飞行器模型由12个变量描述，现代粒子物理或观点动力学中的应用也需要动力系统的最优控制，而动力系统反过来又是由数百或数千个变量离散的偏微分方程。然而，对于d个变量，直接的数值解的复杂性是指数增长的这样的系统通常非常特殊，因为每个变量都只由动力学中的相邻变量有效地驱动。在这种情况下，寻求的反馈图允许(几乎)分离变量的张量近似。这允许我们大幅降低计算复杂性，使一些操作仅随着变量的数量以多项式增长，尽管代价是引入一些错误。这个错误可以通过使用强化学习的方法进一步纠正，类似于使获胜的人工围棋棋手的方法。

项目成果

期刊论文数量（10）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Data-Driven Tensor Train Gradient Cross Approximation for Hamilton-Jacobi-Bellman Equations

Hamilton-Jacobi-Bellman 方程的数据驱动张量训练梯度交叉逼近

DOI：
10.1137/22m1498401
发表时间：
2023
期刊：
SIAM Journal on Scientific Computing
影响因子：
3.1
作者：
Dolgov S
通讯作者：
Dolgov S

Data-driven initialization of deep learning solvers for Hamilton-Jacobi-Bellman PDEs

Hamilton-Jacobi-Bellman PDE 深度学习求解器的数据驱动初始化

DOI：
10.1016/j.ifacol.2022.11.047
发表时间：
2022
期刊：
IFAC-PapersOnLine
影响因子：
0
作者：
Borovykh A
通讯作者：
Borovykh A

Minimising emissions from flights through realistic wind fields with varying aircraft weights

DOI：
10.1016/j.trd.2023.103660
发表时间：
2023-04
期刊：
Transportation Research Part D: Transport and Environment
影响因子：
0
作者：
Cathie A. Wells;P. Williams;N. Nichols;D. Kalise;I. Poll
通讯作者：
Cathie A. Wells;P. Williams;N. Nichols;D. Kalise;I. Poll

Tensor Decomposition Methods for High-dimensional Hamilton--Jacobi--Bellman Equations

高维Hamilton--Jacobi--Bellman方程的张量分解方法

DOI：
10.1137/19m1305136
发表时间：
2021
期刊：
SIAM Journal on Scientific Computing
影响因子：
3.1
作者：
Dolgov S
通讯作者：
Dolgov S

State-dependent Riccati equation feedback stabilization for nonlinear PDEs

DOI：
10.1007/s10444-022-09998-4
发表时间：
2021-06
期刊：
Advances in Computational Mathematics
影响因子：
1.7
作者：
A. Alla;D. Kalise;V. Simoncini
通讯作者：
A. Alla;D. Kalise;V. Simoncini

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Sergey Dolgov其他文献

Somatic embryogenesis and Agrobacterium-mediated transformation in a number of grape cultivars

DOI：
10.1007/s11240-025-02997-5
发表时间：
2025-03-04
期刊：
PLANT CELL TISSUE AND ORGAN CULTURE
影响因子：
2.400
作者：
Galina Maletich;Igor Gavrilenko;Alexander Pushin;Svetlana Chelombit;Tatyana Khmelnitskaya;Yuri Plugatar;Sergey Dolgov;Pavel Khvatkov
通讯作者：
Pavel Khvatkov

Transgenic tomato plants as supersweet protein thaumatin II producers

转基因番茄植物作为超甜蛋白索马甜 II 生产者

DOI：
10.1134/s0003683812090025
发表时间：
2012
期刊：
Applied Biochemistry and Microbiology
影响因子：
0.8
作者：
A. Firsov;A. Firsov;A. Pushin;A. Pushin;I. Korneeva;Sergey Dolgov;Sergey Dolgov
通讯作者：
Sergey Dolgov

Statistical Proper Orthogonal Decomposition for model reduction in feedback control

反馈控制中模型简化的统计本征正交分解

DOI：
10.48550/arxiv.2311.16332
发表时间：
2023
期刊：
ArXiv
影响因子：
0
作者：
Sergey Dolgov;D. Kalise;Luca Saluzzi
通讯作者：
Luca Saluzzi

A reciprocal preconditioner for structured matrices arising from elliptic problems with jumping coefficients

DOI：
10.1016/j.laa.2011.09.010
发表时间：
2012-05-01
期刊：
Research article
影响因子：
作者：
Sergey Dolgov;Boris N. Khoromskij;Ivan Oseledets;Eugene Tyrtyshnikov
通讯作者：
Eugene Tyrtyshnikov

Correction to: Somatic embryogenesis and Agrobacterium-mediated transformation in a number of grape cultivars

DOI：
10.1007/s11240-025-03038-x
发表时间：
2025-03-25
期刊：
PLANT CELL TISSUE AND ORGAN CULTURE
影响因子：
2.400
作者：
Galina Maletich;Igor Gavrilenko;Alexander Pushin;Svetlana Chelombit;Tatyana Khmelnitskaya;Yuri Plugatar;Sergey Dolgov;Pavel Khvatkov
通讯作者：
Pavel Khvatkov