What recommendation algorithms are optimal in (MAB) settings when they compete, and evaluates the intertemporal welfare effect of competition

哪些推荐算法在竞争时在(MAB)设置中是最优的,并评估竞争的跨期福利效应

基本信息

  • 批准号:
    2570600
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Studentship
  • 财政年份:
    2021
  • 资助国家:
    英国
  • 起止时间:
    2021 至 无数据
  • 项目状态:
    未结题

项目摘要

Which consumers gain when recommendation services like Trivago compete? What recommendation strategy should an entrant pick? This research project explores what recommendation algorithms are optimal in multi-armed bandit (MAB) settings when they compete, and evaluates the intertemporal welfare effect of competition.MABs are a popular tool to model explore-exploit trade-offs - from medical trials to the internet economy. A recommender tries to gather information on payoff distributions of different products by persuading consumers to choose the desired good. When a recommender acts in isolation, for example, Google's ad-revenue-maximization algorithm, the problem is well studied. However, little attention has been given to situations where agents are faced with recommendations from multiple algorithms. In other words, the existing literature has explored the MAB problem when customers choose between products rather than product-recommender pairs.When algorithms compete for customers, two effects impact consumer utility, (i) for a given set of algorithms, information acquisition is slower as each algorithm observes a strict subset of the available information, and (ii) different algorithms may be chosen in case of no competition. Smarter algorithms will reduce cumulative regret from persistent suboptimal choices but have a lower expected utility for earlier customers due to information acquisition. They are also computationally costlier. All of the above raises interesting questions about the intertemporal distribution of consumer utility. My research question relates directly to two recent papers. In 'Competing Bandits: Learning Under Competition', Mansour et al. (2018) derive some analytical solutions under very restrictive ad hoc assumptions. Aridor et al. (2019) consider a more natural setting but use simulations to approximate the optimal algorithm in 'The Perils of Exploration under Competition: A Computational Modeling Approach'. A common feature of their models is that customers only receive one recommendation. To model setups like the internet economy, where checking multiple websites is effectively costless, I instead assume that each consumer observes a recommendation from each algorithm and chooses a product thereafter. Competition arises because the consumer only reveals their experience to one algorithm.I plan to solve for the optimal algorithm analytically. Which algorithm is optimal will depend heavily on what customers know. In the Mansour et al. (2018) setup, the basic greedy algorithm wins, but in Aridor et al. (2019), for long enough horizons, more sophisticated algorithms beat the greedy one. Similarly, assumptions on the information principals can observe are extremely important. The more principals can observe, the greedier I expect the optimal algorithm to be in any non-cooperative equilibrium. I plan to supplement the theoretical findings with simulations where appropriate. The research has important implications for policymakers and regulators of online markets.
当Trivago等推荐服务竞争时,消费者会获得哪些好处?参赛者应该选择什么样的推荐策略?本研究项目探讨了在多臂强盗(MAB)环境中,当它们竞争时,什么样的推荐算法是最优的,并评估竞争的跨期福利效应。MAB是一种流行的工具,用于模拟探索-利用权衡-从医学试验到互联网经济。推荐系统试图通过说服消费者选择所需的商品来收集不同产品的收益分布信息。当一个推荐者单独行动时,例如,谷歌的广告收入最大化算法,这个问题得到了很好的研究。然而,很少有人注意到代理人面临来自多个算法的建议的情况。换句话说,现有的文献已经探讨了MAB问题,当客户之间选择的产品,而不是产品推荐pairs.When算法竞争的客户,两个影响消费者效用,(i)对于一组给定的算法,信息获取是缓慢的,因为每个算法观察到一个严格的子集的可用信息,(ii)不同的算法可能会选择在没有竞争的情况下。更聪明的算法将减少持续次优选择的累积遗憾,但由于信息获取,对早期客户的预期效用较低。它们的计算成本也更高。所有这些都提出了关于消费者效用跨期分布的有趣问题。我的研究问题与最近的两篇论文直接相关。在“竞争的强盗:在竞争中学习”中,Mansour et al.(2018)在非常严格的特设假设下推导出一些分析解。Aridor等人(2019)考虑了一种更自然的设置,但使用模拟来近似“竞争下探索的危险:计算建模方法”中的最佳算法。他们的模型的一个共同特点是,客户只收到一个建议。为了模拟像互联网经济这样的设置,检查多个网站实际上是没有成本的,我假设每个消费者都观察每个算法的推荐,然后选择一个产品。竞争的出现是因为消费者只向一种算法透露他们的经验。我计划用分析的方法来解决最优算法。哪种算法是最佳的将在很大程度上取决于客户知道什么。在Mansour et al.(2018)的设置中,基本的贪婪算法获胜,但在Aridor et al.(2019)中,对于足够长的时间,更复杂的算法击败了贪婪算法。同样,对当事人可以观察到的信息的假设也是极其重要的。可以观察到的主体越多,我期望最优算法在任何非合作均衡中越贪婪。我计划在适当的情况下用模拟来补充理论研究结果。这项研究对在线市场的政策制定者和监管者具有重要意义。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

其他文献

Internet-administered, low-intensity cognitive behavioral therapy for parents of children treated for cancer: A feasibility trial (ENGAGE).
针对癌症儿童父母的互联网管理、低强度认知行为疗法:可行性试验 (ENGAGE)。
  • DOI:
    10.1002/cam4.5377
  • 发表时间:
    2023-03
  • 期刊:
  • 影响因子:
    4
  • 作者:
  • 通讯作者:
Differences in child and adolescent exposure to unhealthy food and beverage advertising on television in a self-regulatory environment.
在自我监管的环境中,儿童和青少年在电视上接触不健康食品和饮料广告的情况存在差异。
  • DOI:
    10.1186/s12889-023-15027-w
  • 发表时间:
    2023-03-23
  • 期刊:
  • 影响因子:
    4.5
  • 作者:
  • 通讯作者:
The association between rheumatoid arthritis and reduced estimated cardiorespiratory fitness is mediated by physical symptoms and negative emotions: a cross-sectional study.
类风湿性关节炎与估计心肺健康降低之间的关联是由身体症状和负面情绪介导的:一项横断面研究。
  • DOI:
    10.1007/s10067-023-06584-x
  • 发表时间:
    2023-07
  • 期刊:
  • 影响因子:
    3.4
  • 作者:
  • 通讯作者:
ElasticBLAST: accelerating sequence search via cloud computing.
ElasticBLAST:通过云计算加速序列搜索。
  • DOI:
    10.1186/s12859-023-05245-9
  • 发表时间:
    2023-03-26
  • 期刊:
  • 影响因子:
    3
  • 作者:
  • 通讯作者:
Amplified EQCM-D detection of extracellular vesicles using 2D gold nanostructured arrays fabricated by block copolymer self-assembly.
使用通过嵌段共聚物自组装制造的 2D 金纳米结构阵列放大 EQCM-D 检测细胞外囊泡。
  • DOI:
    10.1039/d2nh00424k
  • 发表时间:
    2023-03-27
  • 期刊:
  • 影响因子:
    9.7
  • 作者:
  • 通讯作者:

的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('', 18)}}的其他基金

An implantable biosensor microsystem for real-time measurement of circulating biomarkers
用于实时测量循环生物标志物的植入式生物传感器微系统
  • 批准号:
    2901954
  • 财政年份:
    2028
  • 资助金额:
    --
  • 项目类别:
    Studentship
Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions
利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案
  • 批准号:
    2896097
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
A Robot that Swims Through Granular Materials
可以在颗粒材料中游动的机器人
  • 批准号:
    2780268
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.
严重空间天气事件对核电和保障监督的恢复力的可能性和影响。
  • 批准号:
    2908918
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface
质子、α 和 γ 辐照辅助应力腐蚀开裂:了解燃料-不锈钢界面
  • 批准号:
    2908693
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Field Assisted Sintering of Nuclear Fuel Simulants
核燃料模拟物的现场辅助烧结
  • 批准号:
    2908917
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Assessment of new fatigue capable titanium alloys for aerospace applications
评估用于航空航天应用的新型抗疲劳钛合金
  • 批准号:
    2879438
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in
使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型,以分析白细胞介素 17 抑制剂的细胞和表观遗传效应
  • 批准号:
    2890513
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
CDT year 1 so TBC in Oct 2024
CDT 第 1 年,预计 2024 年 10 月
  • 批准号:
    2879865
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds
了解野生鸟类肠道微生物组、行为和城市化之间的相互作用
  • 批准号:
    2876993
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship

相似国自然基金

Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    万元
  • 项目类别:
    外国青年学者研究基金项目

相似海外基金

Customizable Artificial Intelligence for the Biomedical Masses: Development of a User-Friendly Automated Machine Learning Platform for Biology Image Analysis.
面向生物医学大众的可定制人工智能:开发用于生物图像分析的用户友好的自动化机器学习平台。
  • 批准号:
    10699828
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Previvors Recharge: A Resilience Program for Cancer Previvors
癌症预防者恢复活力计划:癌症预防者恢复力计划
  • 批准号:
    10698965
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Traumatic Brain Injury Anti-Seizure Prophylaxis in the Medicare Program
医疗保险计划中的创伤性脑损伤抗癫痫预防
  • 批准号:
    10715238
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Enhanced Medication Management to Control ADRD Risk Factors Among African Americans and Latinos
加强药物管理以控制非裔美国人和拉丁裔的 ADRD 风险因素
  • 批准号:
    10610975
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Early Detection of Right Ventricular Dysfunction and Emerging Pulmonary Hypertension in Systemic Sclerosis
系统性硬化症患者右心室功能障碍和肺动脉高压的早期发现
  • 批准号:
    10585312
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Objective quantification of vitreous inflammation using optical coherence tomography
使用光学相干断层扫描客观量化玻璃体炎症
  • 批准号:
    10574348
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Development of a Novel Virtual Reality Treatment for Emerging Adults with ADHD
开发一种针对患有多动症的新兴成人的新型虚拟现实治疗方法
  • 批准号:
    10721084
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Mobile Health and Oral Testing to Optimize Tuberculosis Contact Tracing in Colombia
移动健康和口腔测试可优化哥伦比亚的结核病接触者追踪
  • 批准号:
    10667885
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Move and Snooze: Adding insomnia treatment to an exercise program to improve pain outcomes in older adults with knee osteoarthritis
活动和小睡:在锻炼计划中添加失眠治疗,以改善患有膝骨关节炎的老年人的疼痛结果
  • 批准号:
    10797056
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Marketing Monitoring Core (MMC)
营销监控核心 (MMC)
  • 批准号:
    10666074
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了