Novel statistical methods for biased sampling problems
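Basic information
- Grant number: RGPIN-2020-04964
- Principal investigator:
- Amount: $27,000
- Host institution:
- Host institution country: Canada
- Program: Discovery Grants Program - Individual
- Fiscal year: 2020
- Funding country: Canada
- Period: 2020-01-01 to 2021-12-31
- Status: Completed
- Source:
- Keywords:
Project abstract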
Biased sampling arises in many scientific disciplines, such as biology, ecology, fishery studies, and the social sciences. It occurs when the distribution of the collected data differs from that of the target population. For example, in capture-recapture experiments, larger animals are more likely to be captured, so the observed data are a biased sample of the target population. In non-ignorable missing-data problems, the response probability depends on the study variable that is subject to missingness; consequently, the distribution of this variable for the completely observed data differs from that of the population, even conditional on covariates. Because of the intrinsic nature of biased sampling problems, valid and effective statistical inference is challenging. Through an extensive literature review, we have seen that existing methods either rely on strong parametric assumptions or suffer from unstable algorithms and a loss of efficiency in parameter estimation. This research proposal aims to develop novel methods for biased sampling problems, focusing on inference from capture-recapture data and non-ignorable missing data.
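To make the size-bias example concrete, the following minimal simulation sketches how a capture probability that increases with body size distorts the observed mean, and how inverse-probability weighting corrects it when the capture probability is known. The uniform size distribution, the function names, and the linear capture probability are all illustrative assumptions, not part of the proposal.

```python
import random


def capture_probability(size):
    """Assumed capture probability, increasing in body size (illustrative only)."""
    return 0.1 * size


def simulate_size_biased_sample(n_population=20000, seed=0):
    """Return (population mean, naive captured mean, weighted captured mean)."""
    rng = random.Random(seed)
    # Hypothetical population: body sizes uniform on [1, 3].
    sizes = [rng.uniform(1.0, 3.0) for _ in range(n_population)]
    # Each animal is captured independently with a size-dependent probability.
    captured = [s for s in sizes if rng.random() < capture_probability(s)]

    population_mean = sum(sizes) / len(sizes)
    # The naive mean of the captured animals over-represents large animals.
    naive_mean = sum(captured) / len(captured)
    # Inverse-probability weighting (ratio form) undoes the bias,
    # assuming the capture probability is known exactly.
    weights = [1.0 / capture_probability(s) for s in captured]
    weighted_mean = sum(w * s for w, s in zip(weights, captured)) / sum(weights)
    return population_mean, naive_mean, weighted_mean


if __name__ == "__main__":
    pop, naive, weighted = simulate_size_biased_sample()
    print(f"population mean: {pop:.3f}")
    print(f"naive mean of captured animals: {naive:.3f}")
    print(f"inverse-probability-weighted mean: {weighted:.3f}")
```

In practice the capture probability is unknown and must be estimated, which is where the modelling difficulties described above come in.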
The first theme will consider the estimation of abundance in a closed population with capture-recapture data. First, we propose to model the capture probabilities via the generalized additive model with shape constraints such as monotonicity for each additive component. Second, we will develop a penalized empirical likelihood (EL) method to find the point estimate and confidence interval (CI) of the abundance; this is an effective method for preventing spuriously large estimates of the abundance. Third, we will design an effective and fast algorithm for calculating the aforementioned point estimate and CI.
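For orientation only, the sketch below shows the classical two-occasion Lincoln-Petersen estimator and Chapman's bias-corrected variant. These are textbook baselines that assume every animal has the same capture probability; they are not the penalized EL method proposed here. The function names and the example counts are hypothetical.

```python
def _check_counts(n_first, n_second, n_recaptured):
    """Validate capture counts from a two-occasion experiment."""
    if min(n_first, n_second, n_recaptured) < 0:
        raise ValueError("counts must be non-negative")
    if n_recaptured > min(n_first, n_second):
        raise ValueError("recaptures cannot exceed either sample size")


def lincoln_petersen(n_first, n_second, n_recaptured):
    """Lincoln-Petersen abundance estimate: n1 * n2 / m."""
    _check_counts(n_first, n_second, n_recaptured)
    if n_recaptured == 0:
        raise ValueError("estimate is undefined with zero recaptures")
    return n_first * n_second / n_recaptured


def chapman(n_first, n_second, n_recaptured):
    """Chapman's bias-corrected estimate: (n1 + 1)(n2 + 1) / (m + 1) - 1."""
    _check_counts(n_first, n_second, n_recaptured)
    return (n_first + 1) * (n_second + 1) / (n_recaptured + 1) - 1


if __name__ == "__main__":
    # Hypothetical counts: 100 marked, 80 caught later, 20 of them marked.
    print(lincoln_petersen(100, 80, 20))
    print(chapman(100, 80, 20))
    # Few recaptures inflate the estimate sharply.
    print(lincoln_petersen(100, 80, 2))
```

The last call illustrates why very large estimates are a practical concern: with only a handful of recaptures the estimate grows without bound, and heterogeneous capture probabilities add further bias on top of that instability.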
The second theme will consider non-ignorable missing-data problems. We will model the study variable conditional on the covariates for the completely observed data via a semiparametric Box-Cox transformation model. We will proceed in two steps. In the first step, we will develop a maximum binomial likelihood method to analyze the semiparametric Box-Cox transformation model. In the second step, we will combine this method with the EL method to develop valid estimators for the mean of the study variable and the unknown parameters in the response probability model.
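As a reminder of the transformation involved, here is a minimal sketch of the standard Box-Cox transform and its inverse for a positive response. It covers only the transformation itself, not the semiparametric model or the maximum binomial likelihood estimation described above; the function names are illustrative.

```python
import math


def box_cox(y, lam):
    """Box-Cox transform: (y**lam - 1) / lam, or log(y) when lam == 0."""
    if y <= 0:
        raise ValueError("Box-Cox transform requires a positive response")
    if lam == 0:
        return math.log(y)
    return (y ** lam - 1.0) / lam


def inverse_box_cox(z, lam):
    """Inverse transform, mapping a transformed value back to the original scale."""
    if lam == 0:
        return math.exp(z)
    base = lam * z + 1.0
    if base <= 0:
        raise ValueError("value is outside the range of the transform")
    return base ** (1.0 / lam)


if __name__ == "__main__":
    z = box_cox(4.0, 0.5)           # (sqrt(4) - 1) / 0.5
    print(z)
    print(inverse_box_cox(z, 0.5))  # recovers the original value
```

The transformation parameter `lam` is treated as given here; in a transformation model it is an unknown parameter to be estimated along with the rest of the model.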
The projects in this proposal will develop novel tuning-parameter-free statistical methods to solve important and challenging problems in abundance estimation with capture-recapture data and non-ignorable missing-data problems. These methods will be theoretically solid and implemented in publicly accessible R packages. This research program will in turn benefit theoretical and methodological research into nonparametric likelihood methods and shape-constrained inference. The outcomes will be applicable to a range of scientific problems in Canada arising in health science, economics, wildlife management, and social sciences.
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Li, Pengfei
Phosphine-mediated enantioselective [1 + 4]-annulation of Morita-Baylis-Hillman carbonates with 2-enoylpyridines.
- DOI: 10.1039/c8ra09453e
- Published: 2018-12-07
- Journal:
- Impact factor: 3.9
- Authors: Wang, Tao; Zhang, Pengfei; Li, Wenjun; Li, Pengfei
- Corresponding author: Li, Pengfei
Hospitalizations of Chronic Dialysis Patients: A National Study in China.
- DOI: 10.1159/000530069
- Published: 2023-08
- Journal:
- Impact factor: 3.7
- Authors: Chu, Hong; Yang, Chao; Lin, Yu; Wu, Jingyi; Kong, Guilan; Li, Pengfei; Zhang, Luxia; Zhao, Minghui
- Corresponding author: Zhao, Minghui
Systematic Parameterization of Monovalent Ions Employing the Nonbonded Model
- DOI: 10.1021/ct500918t
- Published: 2015-04-01
- Journal:
- Impact factor: 5.5
- Authors: Li, Pengfei; Song, Lin Frank; Merz, Kenneth M., Jr.
- Corresponding author: Merz, Kenneth M., Jr.
Static composting of cow manure and corn stalk covered with a membrane in cold regions.
- DOI: 10.3389/fbioe.2022.969137
- Published: 2022
- Journal:
- Impact factor: 5.7
- Authors: Shi, Fengmei; Xu, Chengjiao; Liu, Jie; Sun, Fang; Yu, Hongjiu; Wang, Su; Li, Pengfei; Yu, Qiuyue; Li, Dan; Zuo, Xin; Liu, Li; Pei, Zhanjiang
- Corresponding author: Pei, Zhanjiang
Inter-domain communication in SARS-CoV-2 spike proteins controls protease-triggered cell entry.
- DOI: 10.1016/j.celrep.2022.110786
- Published: 2022-05-03
- Journal:
- Impact factor: 8.8
- Authors: Qing, Enya; Li, Pengfei; Cooper, Laura; Schulz, Sebastian; Jaeck, Hans-Martin; Rong, Lijun; Perlman, Stanley; Gallagher, Tom
- Corresponding author: Gallagher, Tom
Other grants held by Li, Pengfei
Novel statistical methods for biased sampling problems
- Grant number: RGPIN-2020-04964
- Fiscal year: 2022
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Novel statistical methods for biased sampling problems
- Grant number: RGPIN-2020-04964
- Fiscal year: 2021
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Empirical likelihood, smoothed likelihood, and their applications
- Grant number: RGPIN-2015-06592
- Fiscal year: 2019
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Empirical likelihood, smoothed likelihood, and their applications
- Grant number: RGPIN-2015-06592
- Fiscal year: 2018
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Empirical likelihood, smoothed likelihood, and their applications
- Grant number: RGPIN-2015-06592
- Fiscal year: 2017
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Empirical likelihood, smoothed likelihood, and their applications
- Grant number: RGPIN-2015-06592
- Fiscal year: 2016
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Empirical likelihood, smoothed likelihood, and their applications
- Grant number: RGPIN-2015-06592
- Fiscal year: 2015
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Finite mixture models and their applications
- Grant number: 371502-2009
- Fiscal year: 2013
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Finite mixture models and their applications
- Grant number: 371502-2009
- Fiscal year: 2012
- Amount: $27,000
- Program: Discovery Grants Program - Individual
Finite mixture models and their applications
- Grant number: 371502-2009
- Fiscal year: 2011
- Amount: $27,000
- Program: Discovery Grants Program - Individual
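Similar grants from the National Natural Science Foundation of China
Research on wireless opportunistic scheduling algorithms based on stochastic network calculus
- Grant number: 60702009
- Year approved: 2007
- Amount: 240,000 CNY
- Program: Young Scientists Fund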
Similar overseas grants
Development of a novel visualization, labeling, communication and tracking engine for human anatomy.
- Grant number: 10761060
- Fiscal year: 2023
- Amount: $27,000
- Program:
Novel Computational Methods for Microbiome Data Analysis in Longitudinal Study
- Grant number: 10660234
- Fiscal year: 2023
- Amount: $27,000
- Program:
A novel approach to pinpoint predisposed recombination regions in HIV for a global profile of HIV recombinants' occurrence and evolution
- Grant number: 10762779
- Fiscal year: 2023
- Amount: $27,000
- Program:
Novel methods to improve the utility of genomics summary statistics
- Grant number: 10646125
- Fiscal year: 2023
- Amount: $27,000
- Program:
Novel risk stratification score for patients presenting with acute Cerebral Venous Sinus Thrombosis
- Grant number: 10592974
- Fiscal year: 2023
- Amount: $27,000
- Program:
Liver Fibrosis: Leveraging Novel Statistical Methods to Determine Optimal Screening Strategy for People Living with Type 2 Diabetes
- Grant number: 488421
- Fiscal year: 2023
- Amount: $27,000
- Program: Operating Grants
Reducing health disparities in foregut cancers by using modifiable barriers to predict risk for inequitable care: a novel implementation science-based approach
- Grant number: 10633373
- Fiscal year: 2023
- Amount: $27,000
- Program:
What works, for whom? Applying novel precision medicine methods to people with mental illness in the justice system.
- Grant number: 10723566
- Fiscal year: 2023
- Amount: $27,000
- Program:
A novel instrument for continuous blood pressure monitoring
- Grant number: 10696510
- Fiscal year: 2023
- Amount: $27,000
- Program:
SCH: Novel and Interpretable Statistical Learning for Brain Images in AD/ADRDs
- Grant number: 10816764
- Fiscal year: 2023
- Amount: $27,000
- Program: