Bandits and beyond: Index heuristics for dynamic resource allocation

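Basic information

  • Grant number:
    EP/E049265/1
  • Principal investigator:
  • Amount:
    $327,800
  • Host institution:
  • Host institution country:
    United Kingdom
  • Project type:
    Research Grant
  • Fiscal year:
    2007
  • Funding country:
    United Kingdom
  • Duration:
    2007 to (no data)
  • Project status:
    Completed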

Project abstract

The research programme is concerned with developing simple yet effective methods for determining how a key resource should be distributed over time among a collection of entities that require it. To fix ideas, consider the following scenario, referred to in the literature as the 'stochastic multiproduct batch dispatch problem'. Different types of products arrive at a holding station according to some random process. They are kept there until dispatched onward (to a retailer, say) by one of a fleet of vehicles. Items kept in storage at the station incur holding costs, whose nature may differ markedly between product types. Products may also differ in the space they require for transportation. At the beginning of each time period (the start of each day, say), a decision has to be made about how many vehicles to dispatch and what the composition of their loads should be. Such decisions may be based in part on the number of units of each product type at the station at the time. The goal is to take these decisions in a way that minimises the overall costs incurred in holding the products and in dispatching the vehicles.
In mathematical terms, the decision-maker is looking for a dynamic policy for allocating her key resource (i.e., a rule for deploying the fleet of vehicles) among a set of entities (here, the product types) in a situation that is both stochastic (it evolves randomly) and complex. Developing such a policy is extremely difficult, not least because decisions must be assessed not only in terms of their immediate impact on costs but also with regard to their influence on the costs that will be incurred in the future. The conventional approach to such problems, stochastic dynamic programming, is unlikely to cope well with problems of realistic size.
What would be invaluable to the decision-maker is some effective, yet reasonably simple and computationally tractable, way of calibrating the value to each product type of (differing amounts of) space on the vehicles. Such calibrations could then be used to inform decision-making. Simple index-based solutions of this kind do exist (and are very effective), but mainly for bandit-type problems, which impose serious limitations on how the key resource may be distributed among the entities. The goal of the research project is to extend the scope of such simple index-based approaches to more general allocation problems that are free of these restrictions. A rich variety of practical situations share the broad features of the above example. The methodologies developed during the research programme will yield simple and effective index-based approaches to resource allocation in many such settings.
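
To make the idea of an index-based dispatch rule concrete, the following is a minimal Python sketch of a one-period, myopic heuristic for the scenario above. It is an illustration only and not the method developed in the project: the index used here (holding cost per unit of vehicle space), the rule for sending a vehicle, and all names (`Product`, `plan_dispatch`, `dispatch_cost`, `fleet_size`) are assumptions introduced for this example. In particular, it ignores the future costs that the abstract identifies as the hard part of the problem.

```python
from dataclasses import dataclass


@dataclass
class Product:
    name: str
    units: int           # units currently waiting at the station
    holding_cost: float  # cost per unit per period while held
    space: int           # vehicle space taken by one unit


def plan_dispatch(products, capacity, dispatch_cost, fleet_size):
    """Myopic index rule (assumed for illustration): load by cost per unit space.

    Returns a list of loads, one dict {product name: units} per vehicle sent.
    """
    remaining = {p.name: p.units for p in products}
    # Hypothetical index: one-period holding cost avoided per unit of space.
    ranked = sorted(products, key=lambda p: p.holding_cost / p.space, reverse=True)
    loads = []
    for _ in range(fleet_size):
        load, free, saved = {}, capacity, 0.0
        for p in ranked:
            take = min(remaining[p.name], free // p.space)
            if take > 0:
                load[p.name] = take
                free -= take * p.space
                saved += take * p.holding_cost
        # Send the vehicle only if the holding cost it avoids covers its cost.
        if saved < dispatch_cost:
            break
        for name, take in load.items():
            remaining[name] -= take
        loads.append(load)
    return loads
```

Calling `plan_dispatch` with a list of `Product` records, a vehicle capacity, a per-vehicle dispatch cost and a fleet size returns the proposed loads for the current period. A rule of this kind is cheap to compute because each product type is scored separately, which is the property that makes index heuristics attractive when full stochastic dynamic programming is intractable.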

Project outputs

Journal papers (7)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements
General notions of indexability for queueing control and asset management
Multi-armed Bandit Allocation Indices
Index Policies for the Admission Control and Routing of Impatient Customers to Heterogeneous Service Stations
  • DOI:
    10.1287/opre.1080.0632
  • Published:
    2009
  • Journal:
  • Impact factor:
    2.7
  • Authors:
    Glazebrook K
  • Corresponding author:
    Glazebrook K
On the Asymptotic Optimality of Greedy Index Heuristics for Multi-Action Restless Bandits

Other publications by Kevin Glazebrook

Enhanced lateral transshipments in a multi-location inventory system
  • DOI:
    10.1016/j.ejor.2012.03.005
  • Published:
    2012-09-01
  • Journal:
  • Impact factor:
  • Authors:
    Colin Paterson; Ruud Teunter; Kevin Glazebrook
  • Corresponding author:
    Kevin Glazebrook
Resource allocation in congested queueing systems with time-varying demand: An application to airport operations
  • DOI:
    10.1016/j.ejor.2019.01.024
  • Published:
    2019-07-16
  • Journal:
  • Impact factor:
  • Authors:
    Rob Shone; Kevin Glazebrook; Konstantinos G. Zografos
  • Corresponding author:
    Konstantinos G. Zografos

Other grants held by Kevin Glazebrook

A National Taught Course Centre in Operational Research (NATCOR)
  • Grant number:
    EP/E502067/1
  • Fiscal year:
    2006
  • Funding amount:
    $327,800
  • Project type:
    Training Grant

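Similar grants from the National Natural Science Foundation of China

Differentiable ergodic theory and applications of some methods of Liao Shantao
  • Grant number:
    10671006
  • Approval year:
    2006
  • Funding amount:
    210,000 CNY
  • Project type:
    General Program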

Similar overseas grants

Amalgamating Evidence About Causes: Medicine, the Medical Sciences, and Beyond
  • Grant number:
    AH/Y007654/1
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Research Grant
Democratizing HIV science beyond community-based research
  • Grant number:
    502555
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
LSS_BeyondAverage: Probing cosmic large-scale structure beyond the average
  • Grant number:
    EP/Y027906/1
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Research Grant
Collaborative Research: Beyond the Single-Atom Paradigm: A Priori Design of Dual-Atom Alloy Active Sites for Efficient and Selective Chemical Conversions
  • Grant number:
    2334970
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Standard Grant
Collaborative Research: Research Infrastructure: MorphoCloud: A Cloud Powered, Open-Source Platform For Research, Teaching And Collaboration In 3d Digital Morphology And Beyond
  • Grant number:
    2301410
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Standard Grant
Beyond thiols, beyond gold: Novel NHC-stabilized nanoclusters in catalysis
  • Grant number:
    23K21120
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Grant-in-Aid for Scientific Research (B)
Droughts Beyond Hydro-climatological Extremes
  • Grant number:
    24K17352
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Grant-in-Aid for Early-Career Scientists
BeyondSNO: Signalling beyond protein S-nitrosylation - determining the roles of nitroxyl and hydroxylamine
  • Grant number:
    EP/Y027698/1
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Research Grant
Twistors and Quantum Field Theory: Strong fields, holography and beyond
  • Grant number:
    EP/Z000157/1
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Research Grant
Exploitation of High Voltage CMOS sensors for tracking applications in physics experiments and beyond
  • Grant number:
    MR/X023834/1
  • Fiscal year:
    2024
  • Funding amount:
    $327,800
  • Project type:
    Fellowship