Development of a general classification framework under the Neyman-Pearson Paradigm, with biomedical and social applications
在内曼-皮尔逊范式下开发通用分类框架,并具有生物医学和社会应用
基本信息
- 批准号:1613338
- 负责人:
- 金额:$ 12万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2016
- 资助国家:美国
- 起止时间:2016-08-15 至 2019-10-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Classification has broad applications in various fields, including biological sciences, medicine, engineering, finance, and social sciences. The aim of classification is to accurately predict class labels for new observations based on labeled training data. For example, an email service provider needs to decide whether an incoming email is spam. Among different types of classification problems, binary classification is the most basic and important type for theoretical, methodological and algorithmic development. An important question in binary classification is how to control a prioritized type of error, either the type I error (the chance of misclassifying a class 0 data point as class 1) or the type II error (the chance of misclassifying a class 1 data point as class 0). The Neyman-Pearson (NP) classification paradigm is a theoretic framework aiming to control the type I (or type II) error with theoretic guarantee. Yet how to implement the NP paradigm with practical classification algorithms remains a great challenge. In this research, the PIs will tackle this challenge by developing new statistical theory, methods, algorithms, and a novel evaluation metric under the NP paradigm. Results from this proposal will have broad potential applications, such as reducing false positive rates in disease diagnosis and improving prediction accuracy of social events from social media data. The PIs will supervise graduate and undergraduate students of diverse background in the proposed project, and the project outcomes will be taught in graduate-level seminar courses. To aid statistical and interdisciplinary research, the PIs will distribute methods developed in this project as open-source software packages.The PIs will develop new statistical theory, methods, algorithms and applications to control asymmetric classification errors under the Neyman-Pearson (NP) paradigm. The NP paradigm addresses cases where users insist on a specific bound on type I error while keeping type II error to a minimum. Although the NP paradigm has a century-long history in hypothesis testing, until recently it did not receive much attention in the classification area, and its theory and methodologies are as yet incomplete. With the following four aims, the PIs will develop a general NP classification framework and show how it can be applied in the biomedical and social sciences. Under Aim I, the PIs will develop new NP classification theory and methods by exploring feature dependency and interactions for different data structures and sample sizes. Under Aim II, the PIs will design an umbrella algorithm to adapt popular classification methods to the NP paradigm. Under Aim III, the PIs will construct an NP version of Receiver Operating Characteristic (ROC) curves: "NP-ROC", a new evaluation metric based on the NP classification theory and methodologies. Under Aim IV, the PIs will apply the novel NP classification methodologies developed in Aims I-III to large-scale biomedical and social applications.
分类在生物科学、医学、工程、金融、社会科学等领域有着广泛的应用。分类的目的是根据标注的训练数据准确地预测新观测的类别标签。例如,电子邮件服务提供商需要确定传入的电子邮件是否为垃圾邮件。在不同类型的分类问题中,二分类是理论、方法和算法发展中最基本、最重要的类型。二进制分类中的一个重要问题是如何控制区分优先级的错误类型,要么是I类错误(将0类数据点错误分类为1类的可能性),要么是II类错误(将1类数据点错误分类为0类的可能性)。Neyman-Pearson(NP)分类范式是一种理论框架,旨在为控制第一类(或第二类)错误提供理论保障。然而,如何用实用的分类算法来实现NP范型仍然是一个巨大的挑战。在这项研究中,PI将通过发展新的统计理论、方法、算法和NP范式下的新评估指标来应对这一挑战。这一建议的结果将具有广泛的潜在应用,如降低疾病诊断中的假阳性率,提高从社交媒体数据预测社交事件的准确性。在拟议的项目中,项目督导将指导不同背景的研究生和本科生,项目成果将在研究生级别的研讨会课程中教授。为协助统计和跨学科研究,统计研究所将以开放源码软件的形式分发在这项计划中开发的方法,并将发展新的统计理论、方法、算法和应用程序,以控制奈曼-皮尔逊(NP)模式下的不对称分类错误。NP范式解决了这样的情况:用户坚持类型I错误的特定界限,同时将类型II错误保持在最低限度。尽管NP范式在假设检验方面已经有一个世纪的历史,但直到最近才在分类领域受到足够的重视,其理论和方法也仍然不完善。在以下四个目标下,个人信息系统将开发一个通用的NP分类框架,并展示如何将其应用于生物医学和社会科学。在目标I下,个人信息系统将通过探索不同数据结构和样本大小的特征依赖和相互作用来开发新的NP分类理论和方法。在AIM II下,个人信息系统将设计一个伞形算法,使流行的分类方法适应NP范式。在AIM III下,私营部门将构建接收器工作特性(ROC)曲线的NP版本:“NP-ROC”,这是一种基于NP分类理论和方法的新评估指标。在AIM IV下,PIS将把AIMS I-III中开发的新NP分类方法应用于大规模生物医学和社会应用。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Xin Tong其他文献
Image-based acquisition and modeling of polarimetric reflectance
基于图像的偏振反射率采集和建模
- DOI:
10.1145/3386569.3392387 - 发表时间:
2020 - 期刊:
- 影响因子:0
- 作者:
Seung;Tizian Zeltner;Hyunjin Ku;In;Xin Tong;Wenzel Jakob;Min H. Kim - 通讯作者:
Min H. Kim
Acupuncture for the Treatment of Hiccups following Stroke: A Systematic Review and Meta-Analysis
针灸治疗中风后打嗝:系统评价和荟萃分析
- DOI:
10.1136/acupmed-2015-011024 - 发表时间:
2017 - 期刊:
- 影响因子:2.5
- 作者:
J. Yue;Ming Liu;Jun Li;Yuming Wang;E. Hung;Xin Tong;Zhong;Qin;B. Golianu - 通讯作者:
B. Golianu
The Innovative Talents Training Mode of the Generic Architecture Discipline--Take the An Introduction to Soundscape Design As Example
通用建筑学科创新人才培养模式--以《声景设计导论》为例
- DOI:
- 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Zhengzheng Tong;Zhenkun Han;Xin Tong - 通讯作者:
Xin Tong
QTrace: An interface for customizable full system instrumentation
QTrace:可定制的完整系统仪器的界面
- DOI:
- 发表时间:
2013 - 期刊:
- 影响因子:0
- 作者:
Xin Tong;Jack Luo;Andreas Moshovos - 通讯作者:
Andreas Moshovos
Development of Time-of-Flight Polarized Neutron Imaging at the China Spallation Neutron Source
中国散裂中子源飞行时间偏振中子成像技术的发展
- DOI:
10.1088/0256-307x/39/6/062901 - 发表时间:
2022-06 - 期刊:
- 影响因子:3.5
- 作者:
Ahmed Salman;Jianrong Zhou;Jianqing Yang;Junpei Zhang;Chuyi Huang;Fan Ye;Zecong Qin;Xingfen Jiang;Syed Mohd Amir;Wolfgang Kreuzpaintner;Zhijia Sun;Tianhao Wang;Xin Tong - 通讯作者:
Xin Tong
Xin Tong的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Xin Tong', 18)}}的其他基金
Collaborative Research: Development of Classification Theory and Methods for Objective Asymmetry, Sample Size Limitation, Labeling Ambiguity, and Feature Importance
合作研究:针对客观不对称性、样本量限制、标签歧义和特征重要性的分类理论和方法的发展
- 批准号:
2113500 - 财政年份:2021
- 资助金额:
$ 12万 - 项目类别:
Standard Grant
Collaborative Research: Transfer Learning for Large-Scale Inference: General Framework and Data-Driven Algorithms
协作研究:大规模推理的迁移学习:通用框架和数据驱动算法
- 批准号:
2015339 - 财政年份:2020
- 资助金额:
$ 12万 - 项目类别:
Standard Grant
Robust and Interpretable Bayesian Quantile Longitudinal Analysis in Social and Behavioral Sciences
社会和行为科学中稳健且可解释的贝叶斯分位数纵向分析
- 批准号:
1951038 - 财政年份:2020
- 资助金额:
$ 12万 - 项目类别:
Standard Grant
相似国自然基金
Toward a general theory of intermittent aeolian and fluvial nonsuspended sediment transport
- 批准号:
- 批准年份:2022
- 资助金额:55 万元
- 项目类别:
一类新的连分数动力系统的研究
- 批准号:11361025
- 批准年份:2013
- 资助金额:33.0 万元
- 项目类别:地区科学基金项目
全身麻醉药作用于生殖系统GABAA受体对男性生殖功能的影响及机制研究
- 批准号:30901390
- 批准年份:2009
- 资助金额:20.0 万元
- 项目类别:青年科学基金项目
图的一般染色数与博弈染色数
- 批准号:10771035
- 批准年份:2007
- 资助金额:18.0 万元
- 项目类别:面上项目
全麻药作用脑内G蛋白相关基因表达谱和调控网络的研究
- 批准号:30371375
- 批准年份:2003
- 资助金额:20.0 万元
- 项目类别:面上项目
相似海外基金
Development of Opportunities for Research (DOOR) in Dental Schools: Future Academic Interdisciplinary Workforce and Collaborators for the National Dental Practice-Based Research Network (PBRN)
牙科学校研究机会 (DOOR) 的发展:国家牙科实践研究网络 (PBRN) 的未来学术跨学科劳动力和合作者
- 批准号:
10755060 - 财政年份:2023
- 资助金额:
$ 12万 - 项目类别:
Impact of germline mutations on the development of breast-implant associated anaplastic large cell lymphoma (BIA-ALCL) in women with textured breast implants
种系突变对有纹理乳房植入物的女性发生乳房植入物相关间变性大细胞淋巴瘤 (BIA-ALCL) 的影响
- 批准号:
10685505 - 财政年份:2022
- 资助金额:
$ 12万 - 项目类别:
Impact of germline mutations on the development of breast-implant associated anaplastic large cell lymphoma (BIA-ALCL) in women with textured breast implants
种系突变对有纹理乳房植入物的女性发生乳房植入物相关间变性大细胞淋巴瘤 (BIA-ALCL) 的影响
- 批准号:
10512644 - 财政年份:2022
- 资助金额:
$ 12万 - 项目类别:
The Development of a Smart Telehealth ECG and Human Activity Monitoring System to Improve Cardiovascular health of Older Adults
开发智能远程医疗心电图和人体活动监测系统以改善老年人的心血管健康
- 批准号:
10439299 - 财政年份:2022
- 资助金额:
$ 12万 - 项目类别:
I-Corps: Optical design and the development of high accuracy automated tick classification using computer vision
I-Corps:使用计算机视觉进行光学设计和高精度自动蜱分类的开发
- 批准号:
10561399 - 财政年份:2022
- 资助金额:
$ 12万 - 项目类别:
The Behavioral Determinants of Metabolic Syndrome Risk Development in Young Adults
年轻人代谢综合征风险发展的行为决定因素
- 批准号:
10289913 - 财政年份:2021
- 资助金额:
$ 12万 - 项目类别:
Development of quantitative tools to predict patients with difficult intubation to minimize treatment related complications
开发定量工具来预测插管困难的患者,以尽量减少治疗相关的并发症
- 批准号:
10543840 - 财政年份:2021
- 资助金额:
$ 12万 - 项目类别:
Bringing real-time stress detection to scale: Development of a biosensor driven, stress detection classifier for smartwatches
大规模实现实时压力检测:为智能手表开发生物传感器驱动的压力检测分类器
- 批准号:
9891764 - 财政年份:2020
- 资助金额:
$ 12万 - 项目类别:
Development of a minimally invasive biomarker assay to detect delayed radiation injury
开发微创生物标志物检测来检测迟发性辐射损伤
- 批准号:
10336587 - 财政年份:2020
- 资助金额:
$ 12万 - 项目类别:
Development of a minimally invasive biomarker assay to detect delayed radiation injury
开发微创生物标志物检测来检测迟发性辐射损伤
- 批准号:
10546448 - 财政年份:2020
- 资助金额:
$ 12万 - 项目类别: