Partial recovery of missing responses - a toolbox for efficient design and analysis when data may be missing not at random
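Basic information
- Grant number: EP/V00641X/1
- Principal investigator:
- Amount: $358,400
- Host institution:
- Host institution country: United Kingdom
- Project type: Research Grant
- Fiscal year: 2021
- Funding country: United Kingdom
- Period: 2021 to (no data)
- Project status: Completed
- Source:
- Keywords:
Project abstract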
Missing data are a common problem in many application areas. The presence of missing values complicates analyses and, if not dealt with properly, can result in incorrect conclusions being drawn from the data. It is often helpful to assume there is a process that produces the missing values, typically called a missing data mechanism. A particularly problematic scenario is when this mechanism is in part determined by some other unknown variables, such as the missing values themselves. This is known as a missing not at random (MNAR) mechanism.
If missing values arise due to an MNAR mechanism, then conclusions drawn from the data will typically be biased. Importantly, it is also not possible to know from the observed data alone whether this problem occurs. This is the challenging problem area that this proposal seeks to address, namely developing procedures that can best test whether or not MNAR occurs in the data.
The proposal will consider scenarios where it is possible to estimate some of the missing values through a follow-up sample. The main purpose of this is to learn about the missing data mechanism and, specifically, to test whether the MNAR assumption is valid. Further, the recovered data will also help to correct for the effect the missing data have on conclusions. The proposal makes use of optimal design techniques to decide which missing values to follow up. Essentially, certain missing values might yield more information about the type of missing data mechanism than others; in addition, some values might be more likely than others to be recovered. In this way we would ensure that maximum information is obtained from the recovered data. This will allow data analysts to determine whether the presence of MNAR is likely and take appropriate action.
We will collaborate with our project partners, the Office for National Statistics and NHS Blood and Transplant, in the development of these methods. Our project partners will provide relevant data for us to consider realistic scenarios, and we will discuss interim results with them to ensure our methods are as useful as possible for practitioners. We will also present the work as part of a missing data course at the African Institute of Mathematical Sciences (AIMS) to maximise the global benefit of the work.
The methods developed in this proposal will be disseminated through papers and presentations. In addition, we will create a free-to-use R package that will implement the methods to allow easy uptake by users. We will provide training in using this R package as part of a two-day workshop where we will describe our methods to users. A dedicated website will be updated throughout the project to describe developments and facilitate engagement with interested parties.
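As a rough illustration of the idea in the abstract, the following Python sketch simulates responses whose chance of going missing depends on their own value (an MNAR mechanism), then recovers a small follow-up sample of the missing values and compares it with the observed responses. Everything here is an assumption made for illustration: the function names `simulate_mnar` and `recovery_sample_gap`, the logistic missingness model, the simple random choice of which values to follow up, and the comparison of means are not taken from the project, which proposes optimal designs for choosing the follow-up sample.

```python
import math
import random
import statistics


def simulate_mnar(n=5000, seed=0):
    """Simulate responses whose missingness depends on the response itself."""
    rng = random.Random(seed)
    y = [rng.gauss(0.0, 1.0) for _ in range(n)]
    # Assumed MNAR model: larger responses are more likely to be missing.
    missing = [rng.random() < 1.0 / (1.0 + math.exp(-1.5 * v)) for v in y]
    return y, missing


def recovery_sample_gap(y, missing, m=200, seed=1):
    """Recover m missing responses at random and compare them with the observed ones."""
    rng = random.Random(seed)
    observed = [v for v, miss in zip(y, missing) if not miss]
    missing_idx = [i for i, miss in enumerate(missing) if miss]
    # Simple random follow-up sample (not an optimal design).
    recovered = [y[i] for i in rng.sample(missing_idx, m)]
    # Two-sample z statistic with unequal variances for the difference in means.
    se = math.sqrt(
        statistics.variance(observed) / len(observed)
        + statistics.variance(recovered) / len(recovered)
    )
    z = (statistics.mean(recovered) - statistics.mean(observed)) / se
    return statistics.mean(observed), statistics.mean(recovered), z


y, missing = simulate_mnar()
obs_mean, rec_mean, z = recovery_sample_gap(y, missing)
print(f"true mean      = {statistics.mean(y):.3f}")
print(f"observed mean  = {obs_mean:.3f}")
print(f"recovered mean = {rec_mean:.3f}")
print(f"z statistic    = {z:.2f}")
```

In this simulated setting the observed mean understates the true mean, and a large gap between the recovered and observed responses is evidence that missingness depends on the response. With no covariates in the model, a gap close to zero would instead be consistent with values missing completely at random.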
Project outputs
Journal articles (3)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Comparing recovery sample designs to test for the presence of MNAR
- DOI:
- Publication date: 2022
- Journal:
- Impact factor: 0
- Authors: Adediran A
- Corresponding author: Adediran A
An integrated approach to test for missing not at random
- DOI:
- Publication date: 2022-08
- Journal:
- Impact factor: 0
- Authors: J. Noonan; A. A. Adediran-A.; R. Mitra; Stefanie Biedermann
- Corresponding authors: J. Noonan; A. A. Adediran-A.; R. Mitra; Stefanie Biedermann
Other publications by Robin Mitra
Optimized conditions for RAPD analysis in Pinus radiata
- DOI: 10.1007/bf02822712
- Publication date: 1998-07-01
- Journal:
- Impact factor: 1.900
- Authors: Ewa Ostrowska; Morley Muralitharan; Stephen Chandler; Peter Volker; Sandra Hetherington; Robin Mitra; Frank Dunshea
- Corresponding author: Frank Dunshea
Personalized uncertainty quantification in artificial intelligence
- DOI: 10.1038/s42256-025-01024-8
- Publication date: 2025-04-23
- Journal:
- Impact factor: 23.900
- Authors: Tapabrata Chakraborti; Christopher R. S. Banerji; Ariane Marandon; Vicky Hellon; Robin Mitra; Brieuc Lehmann; Leandra Bräuninger; Sarah McGough; Cagatay Turkay; Alejandro F. Frangi; Ginestra Bianconi; Weizi Li; Owen Rackham; Deepak Parashar; Chris Harbron; Ben MacArthur
- Corresponding author: Ben MacArthur
Other grants held by Robin Mitra
Partial recovery of missing responses - a toolbox for efficient design and analysis when data may be missing not at random
- Grant number: EP/V00641X/2
- Fiscal year: 2022
- Funding amount: $358,400
- Project type: Research Grant
Similar overseas grants
Understanding the Association between Sublingual Buprenorphine and Oral Health Outcomes
- Grant number: 10765299
- Fiscal year: 2023
- Funding amount: $358,400
- Project type:
Partial recovery of missing responses - a toolbox for efficient design and analysis when data may be missing not at random
- Grant number: EP/V00641X/2
- Fiscal year: 2022
- Funding amount: $358,400
- Project type: Research Grant
Recovery of Missing Data Modalities for Land Use and Land Cover Classification
- Grant number: 554117-2020
- Fiscal year: 2020
- Funding amount: $358,400
- Project type: Alexander Graham Bell Canada Graduate Scholarships - Master's
Differentially Culturable Tubercle Bacteria: The missing link in TB Transmission?
- Grant number: 10472540
- Fiscal year: 2019
- Funding amount: $358,400
- Project type:
Differentially Culturable Tubercle Bacteria: The missing link in TB Transmission?
- Grant number: 10238034
- Fiscal year: 2019
- Funding amount: $358,400
- Project type:
CAREER: Fire impacts on forest carbon recovery in a warming world: training the next generation of Earth analysts by exploring a missing scale of observations
- Grant number: 1846384
- Fiscal year: 2019
- Funding amount: $358,400
- Project type: Continuing Grant
Differentially Culturable Tubercle Bacteria: The missing link in TB Transmission?
- Grant number: 10688273
- Fiscal year: 2019
- Funding amount: $358,400
- Project type:
Differentially Culturable Tubercle Bacteria: The missing link in TB Transmission?
- Grant number: 10005114
- Fiscal year: 2019
- Funding amount: $358,400
- Project type:
Aging of Mesenchymal Stem Cells Missing Link in IPF
- Grant number: 9298707
- Fiscal year: 2015
- Funding amount: $358,400
- Project type:
Aging of Mesenchymal Stem Cells Missing Link in IPF
- Grant number: 8962475
- Fiscal year: 2015
- Funding amount: $358,400
- Project type: