Multi-armed Bandit Problems with Covariates

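Basic Information

  • Award number:
    1106576
  • Principal investigator:
  • Amount:
    $250,000
  • Host institution:
  • Host institution country:
    United States
  • Project type:
    Continuing Grant
  • Fiscal year:
    2011
  • Funding country:
    United States
  • Duration:
    2011-06-01 to 2014-10-31
  • Project status:
    Completed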

Project Abstract

Multi-armed bandit (MAB) refers to a class of sequential decision-making problems in which, at each step, one must choose a population from which a random reward will be generated. The goal is to maximize the total accumulated reward. The literature on MAB, with few exceptions, ignores available covariates. In this project, the PI will study MAB with covariates in general frameworks and develop methodologies as well as theories for various applications. The project will 1) provide methods for selecting key covariates; 2) establish consistency in variable selection; 3) establish consistency of the allocation rule in terms of the accumulated reward; 4) derive the rate of convergence of the accumulated reward relative to the oracle choices. In addition, nonparametric estimation of the mean reward functions and model combinations will be utilized for achieving higher expected reward. Strategies that simultaneously achieve high expected reward and also provide sufficient information for identifying the best arm (with high probability) will be sought.
In medical practice, treatments previously shown in clinical trials to be the best at the population level are given to new patients with minimal consideration of their personal characteristics, such as genetic profile. If practically feasible, there is every reason for a patient to be treated in a way that takes into account the outcomes of all previous treatments of patients with the same disease, so that the most promising individualized treatment is selected based on genetic information, clinical assessments, and all the accumulated trial and treatment results. The proposed research will set up statistical frameworks and build theories and methodologies for individualized medicine using the statistical machinery of sequential allocation with covariates.
Besides medicine, sequential allocation has applications in operations research, industrial engineering, economics, and other fields. Because the exponential growth of modern technology has made information easy to obtain and process, and with new research enabling effective use of key predictors, applications of sequential allocation with covariates will make a real impact: saving lives, improving health, promoting business, and reducing operating costs for society.
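As a rough illustration of the setting the abstract describes, the sketch below simulates a two-armed bandit in which a covariate is observed before each choice, the mean reward of each arm is estimated nonparametrically, and the accumulated reward is compared with that of the oracle choices. It is not the PI's method: the epsilon-greedy rule, the k-nearest-neighbor estimator, the one-dimensional uniform covariate, and every name in the code (`knn_mean`, `choose_arm`, `simulate`) are assumptions made only for this example.

```python
import random


def knn_mean(history, x, k):
    """Average reward of the k past observations whose covariate is nearest to x."""
    nearest = sorted(history, key=lambda obs: abs(obs[0] - x))[:k]
    return sum(reward for _, reward in nearest) / len(nearest)


def choose_arm(histories, x, k, epsilon, rng):
    """Epsilon-greedy choice among arms, given the current covariate x."""
    for arm, history in enumerate(histories):
        if not history:          # try every arm once before estimating
            return arm
    if rng.random() < epsilon:   # explore: pick an arm uniformly at random
        return rng.randrange(len(histories))
    estimates = [knn_mean(h, x, k) for h in histories]
    return max(range(len(histories)), key=lambda arm: estimates[arm])


def simulate(mean_fns, n_steps, k=10, epsilon=0.1, noise=0.1, seed=0):
    """Return (expected reward of chosen arms, expected reward of oracle choices)."""
    rng = random.Random(seed)
    histories = [[] for _ in mean_fns]
    chosen_total = 0.0
    oracle_total = 0.0
    for _ in range(n_steps):
        x = rng.random()                       # covariate observed before choosing
        arm = choose_arm(histories, x, k, epsilon, rng)
        reward = mean_fns[arm](x) + rng.gauss(0.0, noise)
        histories[arm].append((x, reward))     # only the chosen arm is observed
        chosen_total += mean_fns[arm](x)
        oracle_total += max(f(x) for f in mean_fns)
    return chosen_total, oracle_total


if __name__ == "__main__":
    # Hypothetical mean reward functions: arm 0 is better when x > 0.5.
    arms = [lambda x: x, lambda x: 1.0 - x]
    chosen, oracle = simulate(arms, n_steps=2000)
    print(f"reward relative to oracle: {chosen / oracle:.3f}")
```

The ratio printed at the end is a simple stand-in for the "accumulated reward relative to the oracle choices" mentioned in the abstract; a rule that ignored the covariate and always played one arm could not approach 1 here, because the better arm changes with x.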

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Yuhong Yang

Breeding of cabbage (Brassica oleracea L. var. capitata) with Fusarium wilt resistance based on microspore culture and biomarker selection
  • DOI:
  • Published:
    2014
  • Journal:
  • Impact factor:
    1.9
  • Authors:
    Zhi-yuan Fang;Yuhong Yang;Bingyan Xie;Xiaowu Wang
  • Corresponding author:
    Xiaowu Wang
Asymmetric addition of benzothiazole to N-tert-butanesulfinyl imine for the synthesis of chiral α-branched heteroaryl amines
  • DOI:
    10.1016/j.tetlet.2012.09.131
  • Published:
    2012-12
  • Journal:
  • Impact factor:
    1.8
  • Authors:
    Yuhong Yang;Mei Wang;Li Lin;Rui Wang
  • Corresponding author:
    Rui Wang
Asymmetric addition of benzothiazole to N-tert-butanesulfinyl imine for the synthesis of chiral α-branched heteroaryl amines
  • DOI:
  • Published:
    2012
  • Journal:
  • Impact factor:
    1.8
  • Authors:
    Jinlong Zhang;Yuhong Yang;Mei Wang;Li Lin;Rui Wang
  • Corresponding author:
    Rui Wang
Combining regression quantile estimators
  • DOI:
  • Published:
    2009
  • Journal:
  • Impact factor:
    0
  • Authors:
    Kejia Shan;Yuhong Yang
  • Corresponding author:
    Yuhong Yang
How Powerful Can Any Regression Learning Procedure Be?
  • DOI:
  • Published:
    2007
  • Journal:
  • Impact factor:
    0
  • Authors:
    Yuhong Yang
  • Corresponding author:
    Yuhong Yang

Other grants by Yuhong Yang

Model Selection Diagnostics and Localized Model Selection/Combination
  • Award number:
    0706850
  • Fiscal year:
    2007
  • Award amount:
    $250,000
  • Project type:
    Standard Grant
Adaptive Regression for Dependent Data by Combining Different Procedures
  • Award number:
    0515990
  • Fiscal year:
    2004
  • Award amount:
    $250,000
  • Project type:
    Continuing Grant
Adaptive Regression for Dependent Data by Combining Different Procedures
  • Award number:
    0094323
  • Fiscal year:
    2001
  • Award amount:
    $250,000
  • Project type:
    Continuing Grant

Similar international grants

Highlight: Identifying barriers to mental healthcare for civilians affected by protracted armed conflict in Colombia
  • Award number:
    ES/X012808/1
  • Fiscal year:
    2024
  • Award amount:
    $250,000
  • Project type:
    Research Grant
CRII: CIF: Sequential Decision-Making Algorithms for Efficient Subset Selection in Multi-Armed Bandits and Optimization of Black-Box Functions
  • Award number:
    2246187
  • Fiscal year:
    2023
  • Award amount:
    $250,000
  • Project type:
    Standard Grant
An Empirical Study of the Determinants and Consequences of Citizens' Confidence in the Armed Forces
  • Award number:
    23KJ2013
  • Fiscal year:
    2023
  • Award amount:
    $250,000
  • Project type:
    Grant-in-Aid for JSPS Fellows
Counterterrorism policies of Pakistan; How the war generation survives and resists the armed conflict in the Tribal districts?
  • Award number:
    2888672
  • Fiscal year:
    2023
  • Award amount:
    $250,000
  • Project type:
    Studentship
The protection of cultural property in the event of armed conflict - the policies and measures that contain strategies and frameworks for the second world war period
  • Award number:
    23K00957
  • Fiscal year:
    2023
  • Award amount:
    $250,000
  • Project type:
    Grant-in-Aid for Scientific Research (C)
Exposure to armed conflict, climate shocks, and the nutritional status of women and children
  • Award number:
    10740395
  • Fiscal year:
    2023
  • Award amount:
    $250,000
  • Project type:
Mobilisation of the local armed groups by the Chinese Communist Party in Manchuria and the Soviet-Japanese confrontation
  • Award number:
    22KJ0930
  • Fiscal year:
    2023
  • Award amount:
    $250,000
  • Project type:
    Grant-in-Aid for JSPS Fellows
Understanding and managing the relationship between soldier burden, mobility and susceptibility to enemy fire in the Canadian Armed Forces
  • Award number:
    567175-2021
  • Fiscal year:
    2022
  • Award amount:
    $250,000
  • Project type:
    Alliance Grants
International Labor Migration, Armed Conflict and Dementia Risk in Nepal: A Population Study
  • Award number:
    10535019
  • Fiscal year:
    2022
  • Award amount:
    $250,000
  • Project type:
Statistical inference in the big data era: using hierarchical models to estimate the socio-economic situation of Colombia's armed conflict victims wit
  • Award number:
    2750472
  • Fiscal year:
    2022
  • Award amount:
    $250,000
  • Project type:
    Studentship