BIGDATA: IA: Collaborative Research: Asynchronous Distributed Machine Learning Framework for Multi-Site Collaborative Brain Big Data Mining
BIGDATA:IA:协作研究:用于多站点协作大脑大数据挖掘的异步分布式机器学习框架
基本信息
- 批准号:1837964
- 负责人:
- 金额:$ 30万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-01-01 至 2023-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Recent advances in multimodal brain imaging and high throughput genotyping and sequencing techniques provide exciting new opportunities to ultimately improve our understanding of brain structure and neural dynamics, their genetic architecture, and their influences on cognition and behavior. However, data privacy and security issues have inhibited data sharing across institutes. Emerging multi-site collaborative data analysis can address these issues and facilitate data and computing resource sharing. In collaborative data analysis, the participating institutes keep their own data, which are analyzed and computed locally, and only share the computed results by communicating with a server. The server communicates with all institutes and updates the local models such that the trained machine learning models indirectly use all data and are shared with all institutes. Although some distributed/parallel computation techniques were recently proposed to address big data mining problems, most of them are synchronous models. Asynchronous distributed learning methods are much more efficient, because they allow the server to update the model with information from only one worker node without waiting for slow worker nodes in each round. However, the convergence analysis for the asynchronous distributed algorithms is much more difficult due to the inconsistent variables update across nodes. Thus, it is challenging to design efficient distributed machine learning algorithms for collaborative big data analysis. The research objective of this project is to address the computational challenges in the emerging multi-site collaborative data mining for brain big data. This project seeks to harness the opportunities of designing new efficient asynchronous distributed machine learning algorithms with rigorous theoretical foundations for multi-site collaborative brain big data mining, creating large-scale computational strategies and effective software tools to reveal sophisticated relationships among heterogeneous brain data. This project designs the asynchronous distributed machine learning and principled big data mining models to conduct the comprehensive study of brain imaging genomics and connectomics. Specifically, the principal investigators investigate: 1) collaborative genotype and phenotype association study using new asynchronous doubly stochastic proximal gradient algorithms; 2) communication-efficient multi-site collaborative data integration models to integrate imaging genomics data for predicting outcomes of interest; 3) collaborative deep learning algorithm speedup by the asynchronous distributed algorithms with applications in temporal cognitive change prediction; and 4) new graph convolutional deep learning models for brain network mining. It is innovative to integrate new distributed machine learning and data-intensive computing with brain imaging genomics and connectomics that hold great promise for a systems biology of the brain. The developed methods and tools impact other neuroimaging, genomics, and neuroscience research, and enable investigators working on brain science to effectively test their scientific hypotheses. This project will also facilitate the development of novel educational tools.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
多模式脑成像和高通量基因分型和测序技术的最新进展为最终提高我们对大脑结构和神经动力学、它们的遗传结构以及它们对认知和行为的影响提供了令人兴奋的新机会。然而,数据隐私和安全问题阻碍了各机构之间的数据共享。新兴的多站点协同数据分析可以解决这些问题,促进数据和计算资源的共享。在协同数据分析中,参与机构保留自己的数据,这些数据在本地进行分析和计算,并且只通过与服务器通信来共享计算结果。服务器与所有机构通信并更新本地模型,以便训练的机器学习模型间接使用所有数据并与所有机构共享。虽然最近提出了一些分布式/并行计算技术来解决大数据挖掘问题,但它们大多是同步模型。异步分布式学习方法效率高得多,因为它们允许服务器使用仅来自一个工作节点的信息来更新模型,而无需等待每一轮速度较慢的工作节点。然而,由于节点间变量更新不一致,使得异步分布式算法的收敛分析变得更加困难。因此,为协作大数据分析设计高效的分布式机器学习算法是具有挑战性的。本项目的研究目标是解决新兴的脑部大数据多站点协同数据挖掘中的计算挑战。该项目旨在利用设计新的高效异步分布式机器学习算法的机会,为多站点协作脑大数据挖掘提供严密的理论基础,创建大规模计算策略和有效的软件工具,以揭示异质脑数据之间的复杂关系。本项目设计了异步分布式机器学习和原则性大数据挖掘模型,进行脑成像基因组学和连接学的综合研究。具体地说,主要研究人员研究了:1)使用新的异步双随机近端梯度算法进行基因和表型关联的协作研究;2)通信高效的多站点协作数据集成模型,用于集成成像基因组数据以预测感兴趣的结果;3)通过异步分布式算法加速协作深度学习算法,并将其应用于时间认知变化预测;以及4)新的用于脑网络挖掘的图卷积深度学习模型。将新的分布式机器学习和数据密集型计算与脑成像、基因组学和连接学相结合是一种创新,它们为大脑的系统生物学带来了巨大的希望。开发的方法和工具影响了其他神经成像、基因组学和神经科学研究,并使从事脑科学工作的研究人员能够有效地检验他们的科学假设。该项目还将促进新教育工具的开发。该奖项反映了NSF的法定使命,并通过使用基金会的智力优势和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(26)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Improved Prediction of Cognitive Outcomes via Globally Aligned Imaging Biomarker Enrichments over Progressions
通过全局对齐的成像生物标志物丰富进展来改进认知结果的预测
- DOI:10.1007/978-3-030-32251-9_16
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Elbeleidy, S;Baker, L;Wang, H;Huang, H;Shen, L
- 通讯作者:Shen, L
A Dirty Multi-task Learning Method for Multi-modal Brain Imaging Genetics
- DOI:10.1007/978-3-030-32251-9_49
- 发表时间:2019-10
- 期刊:
- 影响因子:0
- 作者:Lei Du;Fang Liu;Kefei Liu;Xiaohui Yao;S. Risacher;Junwei Han;Lei Guo;A. Saykin;Li Shen
- 通讯作者:Lei Du;Fang Liu;Kefei Liu;Xiaohui Yao;S. Risacher;Junwei Han;Lei Guo;A. Saykin;Li Shen
Multi-task learning based structured sparse canonical correlation analysis for brain imaging genetics.
- DOI:10.1016/j.media.2021.102297
- 发表时间:2022-03
- 期刊:
- 影响因子:10.9
- 作者:Kim M;Min EJ;Liu K;Yan J;Saykin AJ;Moore JH;Long Q;Shen L
- 通讯作者:Shen L
Deep multiview learning to identify imaging-driven subtypes in mild cognitive impairment.
- DOI:10.1186/s12859-022-04946-x
- 发表时间:2022-09-29
- 期刊:
- 影响因子:3
- 作者:Feng, Yixue;Kim, Mansu;Yao, Xiaohui;Liu, Kefei;Long, Qi;Shen, Li
- 通讯作者:Shen, Li
A structural enriched functional network: An application to predict brain cognitive performance.
- DOI:10.1016/j.media.2021.102026
- 发表时间:2021-07
- 期刊:
- 影响因子:10.9
- 作者:Kim M;Bao J;Liu K;Park BY;Park H;Baik JY;Shen L
- 通讯作者:Shen L
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Li Shen其他文献
High-performance silicon 2 × 2 thermo-optic switch for the 2-μm wavelength band
适用于 2μm 波长带的高性能硅 2 × 2 热光开关
- DOI:
- 发表时间:
2019 - 期刊:
- 影响因子:2.4
- 作者:
Li Shen;Meng Huang;Shuang Zheng;Lesi Yang;Xiangfeng Peng;Xiaoping Cao;Shuhui Li;Jian Wang - 通讯作者:
Jian Wang
Generating structured light with phase helix and intensity helix using reflection-enhanced plasmonic metasurface at 2 lm
使用反射增强等离子体超表面在 2 lm 下生成具有相位螺旋和强度螺旋的结构光
- DOI:
- 发表时间:
2018 - 期刊:
- 影响因子:4
- 作者:
Yifan Zhao;Jing Du;Jinrun Zhang;Li Shen;Jian Wang - 通讯作者:
Jian Wang
[Expression of costimulatory molecules on peripheral blood lymphocytes of patients with idiopathic thrombocytopenic purpura].
特发性血小板减少性紫癜患者外周血淋巴细胞共刺激分子的表达
- DOI:
- 发表时间:
2003 - 期刊:
- 影响因子:0
- 作者:
Yan;Qing;You Cheng;Xu;Lun;Nian Yang;Qiong;Li Shen - 通讯作者:
Li Shen
In Vitro and In Vivo Metabolism of a Novel Antimitochondrial Cancer Metabolism Agent, CPI-613, in Rat and Human
新型抗线粒体癌症代谢剂 CPI-613 在大鼠和人类体内的体外和体内代谢
- DOI:
10.1124/dmd.121.000726 - 发表时间:
2022 - 期刊:
- 影响因子:3.9
- 作者:
Vijay Reddy;L. Boteju;Asela Boteju;Li Shen;K. Kassahun;Nageshwar Reddy;A. Sheldon;S. Luther;Ke Hu - 通讯作者:
Ke Hu
Phase I study of high-dose interleukin 2, aldesleukin, in combination with the histone deacetylase inhibitor, entinostat, in patients with metastatic renal cell carcinoma: Safety and tolerability results.
高剂量白介素 2(阿地白介素)联合组蛋白脱乙酰酶抑制剂恩替司他治疗转移性肾细胞癌患者的 I 期研究:安全性和耐受性结果。
- DOI:
10.1200/jco.2013.31.6_suppl.369 - 发表时间:
2013 - 期刊:
- 影响因子:0
- 作者:
Li Shen;S. George;H. Hammers;A. Sandecki;C. Collins;M. Carducci - 通讯作者:
M. Carducci
Li Shen的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Li Shen', 18)}}的其他基金
Intrinsic Instabilities at Impure Interfaces
不纯界面的内在不稳定性
- 批准号:
EP/V005073/1 - 财政年份:2021
- 资助金额:
$ 30万 - 项目类别:
Fellowship
SCH: INT: Mining Drug-Drug Interaction Induced Adverse Effects from Health Record Databases
SCH:INT:从健康记录数据库中挖掘药物相互作用引起的不良反应
- 批准号:
1827472 - 财政年份:2018
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
SCH: INT: Mining Drug-Drug Interaction Induced Adverse Effects from Health Record Databases
SCH:INT:从健康记录数据库中挖掘药物相互作用引起的不良反应
- 批准号:
1622526 - 财政年份:2016
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
III: Small: Collaborative Research: A Large-Scale Data Mining Framework for Genome-Wide Mapping of Multi-Modal Phenotypic Biomarkers and Outcome Prediction
III:小型:协作研究:用于多模式表型生物标志物全基因组图谱和结果预测的大规模数据挖掘框架
- 批准号:
1117335 - 财政年份:2011
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
相似国自然基金
多任务深度学习融合多模态数据术前精准预测IA期非小细胞肺癌亚肺叶切除术复发风险
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
Ia型超新星多波段实测特性及其机理研究
- 批准号:JCZRYB202500270
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
Ia型超新星及相关特殊天体研究
- 批准号:12333008
- 批准年份:2023
- 资助金额:239.00 万元
- 项目类别:重点项目
南方根结线虫Mi-UNP与Bt-Cry1Ia36互作研究及其功能分析
- 批准号:2023JJ30355
- 批准年份:2023
- 资助金额:0.0 万元
- 项目类别:省市级项目
胞苷脱氨酶调控南方根结线虫响应Bt-Cry1Ia 胁迫的机制研究
- 批准号:2022JJ40235
- 批准年份:2022
- 资助金额:0.0 万元
- 项目类别:省市级项目
甘蓝型油菜BnaA01.IA调控花序结构的分子机制解析
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
年轻Ia型超新星遗迹在湍动背景场中的数值模拟研究
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
Ia型超新星抛射物元素丰度与时域观测特征相关性研究
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
miR-23a~27a簇介导DNMT调控PD-L1和HLA-Ia表达促进早期肺腺癌复发的机制研究
- 批准号:
- 批准年份:2021
- 资助金额:0.0 万元
- 项目类别:省市级项目
大豆GmCPSF73-Ia调控侧根发育的分子机制
- 批准号:
- 批准年份:2021
- 资助金额:58 万元
- 项目类别:面上项目
相似海外基金
BIGDATA: IA: Collaborative Research: Asynchronous Distributed Machine Learning Framework for Multi-Site Collaborative Brain Big Data Mining
BIGDATA:IA:协作研究:用于多站点协作大脑大数据挖掘的异步分布式机器学习框架
- 批准号:
2348159 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Intelligent Solutions for Navigating Big Data from the Arctic and Antarctic
BIGDATA:IA:协作研究:导航北极和南极大数据的智能解决方案
- 批准号:
2308649 - 财政年份:2022
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Protecting Yourself from Wildfire Smoke: Big Data-Driven Adaptive Air Quality Prediction Methodologies
大数据:IA:协作研究:保护自己免受野火烟雾的侵害:大数据驱动的自适应空气质量预测方法
- 批准号:
1838022 - 财政年份:2019
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Intelligent Solutions for Navigating Big Data from the Arctic and Antarctic
BIGDATA:IA:协作研究:导航北极和南极大数据的智能解决方案
- 批准号:
1947584 - 财政年份:2019
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Asynchronous Distributed Machine Learning Framework for Multi-Site Collaborative Brain Big Data Mining
BIGDATA:IA:协作研究:用于多站点协作大脑大数据挖掘的异步分布式机器学习框架
- 批准号:
1837956 - 财政年份:2019
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Protecting Yourself from Wildfire Smoke: Big Data Driven Adaptive Air Quality Prediction Methodologies
大数据:IA:协作研究:保护自己免受野火烟雾的侵害:大数据驱动的自适应空气质量预测方法
- 批准号:
1838024 - 财政年份:2019
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Asynchronous Distributed Machine Learning Framework for Multi-Site Collaborative Brain Big Data Mining
BIGDATA:IA:协作研究:用于多站点协作大脑大数据挖掘的异步分布式机器学习框架
- 批准号:
1837999 - 财政年份:2019
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Domain Adaptation Approaches for Classifying Crisis Related Data on Social Media
大数据:IA:协作研究:社交媒体上危机相关数据分类的领域适应方法
- 批准号:
1741370 - 财政年份:2018
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Data-Driven, Multi-Scale Design of Liquid Crystals for Wearable Sensors for Monitoring Human Exposure and Air Quality
大数据:IA:协作研究:用于监测人体暴露和空气质量的可穿戴传感器的数据驱动、多尺度液晶设计
- 批准号:
1837821 - 财政年份:2018
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Understanding the Financial Market Ecosystem
BIGDATA:IA:协作研究:了解金融市场生态系统
- 批准号:
1838183 - 财政年份:2018
- 资助金额:
$ 30万 - 项目类别:
Standard Grant