EAGER: Preliminary Study of Hashing Algorithms for Large-Scale Learning

EAGER:大规模学习的哈希算法初步研究

基本信息

  • 批准号:
    1249316
  • 负责人:
  • 金额:
    $ 10万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2012
  • 资助国家:
    美国
  • 起止时间:
    2012-09-01 至 2014-08-31
  • 项目状态:
    已结题

项目摘要

Many emerging applications of data mining call for techniques that can deal with data instances with millions, if not billions of dimensions. Hence, there is a need for effective approaches to dealing with extremely high dimensional data sets. This project focuses on a class of novel theoretically well-founded hashing algorithms that allow high dimensional data to be encoded in a form that can be efficiently processed by standard machine learning algorithms. Specifically, it explores: One-permutation hashing, to dramatically reduce the computational and energy cost of hashing; Sparsity-preserving hashing, to take advantage of data sparsity for efficient data storage and improved generalization; Application of the new hashing techniques with standard algorithms for learning "linear" separators in high dimensional spaces. The success of this EAGER project could lay the foundations of a longer-term research agenda by the PI and other investigators focused on developing effective methods for building predictive models from extremely high dimensional data using "standard" machine learning algorithms. Broader Impacts: Effective approaches to building predictive models from extremely high dimensional data can impact many areas of science that rely on machine learning as the primary methodology for knowledge acquisition from data. The PI's education and outreach efforts aim to broaden the participation of women and underrepresented groups. The publications, software, and datasets resulting from the project will be freely disseminated to the larger scientific community.
许多新兴的数据挖掘应用需要能够处理数百万甚至数十亿维数据实例的技术。因此,需要有效的方法来处理极高维数据集。该项目专注于一类新的理论上有充分依据的哈希算法,允许高维数据以标准机器学习算法可以有效处理的形式进行编码。具体而言,它探讨了:单排列散列,以显着降低散列的计算和能源成本;稀疏性保持散列,利用数据稀疏性进行有效的数据存储和改进的泛化;新的散列技术与标准算法的应用,用于学习高维空间中的“线性”分隔符。这个EAGER项目的成功可以为PI和其他研究人员的长期研究议程奠定基础,这些研究人员专注于开发使用“标准”机器学习算法从极高维数据构建预测模型的有效方法。更广泛的影响:从极高维数据中构建预测模型的有效方法可以影响许多依赖机器学习作为从数据中获取知识的主要方法的科学领域。PI的教育和外联工作旨在扩大妇女和代表性不足群体的参与。该项目产生的出版物、软件和数据集将免费传播给更广泛的科学界。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Ping Li其他文献

Effect of CaO/Na2O on slag viscosity behavior under entrained flow gasification conditions
气流床气化条件下CaO/Na2O对炉渣粘度行为的影响
  • DOI:
    10.1016/j.fuproc.2018.10.002
  • 发表时间:
    2018
  • 期刊:
  • 影响因子:
    7.5
  • 作者:
    Zefeng Ge;Lingxue Kong;Jin Bai;Xiaodong Chen;Chong He;Huaizhu Li;Zongqing Bai;Ping Li;Wen Li
  • 通讯作者:
    Wen Li
The psychological results of 438 patients with persisting GERD symptoms by Symptom Checklist 90-Revised (SCL-90-R) questionnaire
根据症状检查表 90 修订版 (SCL-90-R) 问卷对 438 名持续性 GERD 症状患者的心理结果
  • DOI:
  • 发表时间:
    2018
  • 期刊:
  • 影响因子:
    1.6
  • 作者:
    Ping Li;Fei Wang;Guo;Lin Miao;Sihong You;Xia Chen
  • 通讯作者:
    Xia Chen
BMI-adjusted prognosis of signet ring cell carcinoma in patients undergoing radical gastrectomy for gastric adenocarcinoma
接受根治性胃切除术治疗胃腺癌的印戒细胞癌的BMI调整预后
  • DOI:
    10.1016/j.asjsur.2020.03.023
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    3.5
  • 作者:
    Jia-Bin Wang;Man-Qiang Lin;Jian-Wei Xie;Jian-Xian Lin;Jun Lu;Qi-Yue Chen;Long-Long Cao;Mi Lin;Ru-Hong Tu;Ping Li;Chao-Hui Zheng;Chang-Ming Huang
  • 通讯作者:
    Chang-Ming Huang
Translational epidemiology: The powerful tool for precision cancer medicine
转化流行病学:精准癌症医学的强大工具
Effect of Sairei‐to on irreversible glomerular sclerotic lesions in rats
Sairei-to 对大鼠不可逆性肾小球硬化病变的影响
  • DOI:
  • 发表时间:
    1998
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Ping Li;H. Kawachi;M. Orikasa;Zhen Sheng Shi;F. Shimizu
  • 通讯作者:
    F. Shimizu

Ping Li的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Ping Li', 18)}}的其他基金

Collaborative Research: Study of A- and B-class dye-decolorizing peroxidases (DyPs): From molecular mechanisms to applications in dye removal and lignin degradation
合作研究:A 类和 B 类染料脱色过氧化物酶 (DyPs) 的研究:从分子机制到在染料去除和木质素降解中的应用
  • 批准号:
    1807532
  • 财政年份:
    2018
  • 资助金额:
    $ 10万
  • 项目类别:
    Standard Grant
Efficient Data Reduction and Summarization
高效的数据缩减和汇总
  • 批准号:
    1444124
  • 财政年份:
    2014
  • 资助金额:
    $ 10万
  • 项目类别:
    Continuing Grant
Neurocognitive Mechanisms of Second Language Learning: Role of Learning Context and Cognitive Functions
第二语言学习的神经认知机制:学习情境和认知功能的作用
  • 批准号:
    1338946
  • 财政年份:
    2013
  • 资助金额:
    $ 10万
  • 项目类别:
    Standard Grant
III: Small: Probabilistic Hashing for Efficient Search Learning
III:小:用于高效搜索学习的概率哈希
  • 批准号:
    1360971
  • 财政年份:
    2013
  • 资助金额:
    $ 10万
  • 项目类别:
    Continuing Grant
BIGDATA: Small: DA: A Random Projection Approach
大数据:小:DA:随机投影方法
  • 批准号:
    1419210
  • 财政年份:
    2013
  • 资助金额:
    $ 10万
  • 项目类别:
    Standard Grant
III: Small: Probabilistic Hashing for Efficient Search Learning
III:小:用于高效搜索学习的概率哈希
  • 批准号:
    1319830
  • 财政年份:
    2013
  • 资助金额:
    $ 10万
  • 项目类别:
    Continuing Grant
BIGDATA: Small: DA: A Random Projection Approach
大数据:小:DA:随机投影方法
  • 批准号:
    1250914
  • 财政年份:
    2013
  • 资助金额:
    $ 10万
  • 项目类别:
    Standard Grant
Collaborative Research: Cross-Language Lexical Interaction
合作研究:跨语言词汇交互
  • 批准号:
    1057877
  • 财政年份:
    2011
  • 资助金额:
    $ 10万
  • 项目类别:
    Standard Grant
Efficient Data Reduction and Summarization
高效的数据缩减和汇总
  • 批准号:
    0808864
  • 财政年份:
    2008
  • 资助金额:
    $ 10万
  • 项目类别:
    Continuing Grant
RUI: Self-organization and the Acquisition, Representation, and Processing of Language
RUI:自组织和语言的习得、表示和处理
  • 批准号:
    0131829
  • 财政年份:
    2003
  • 资助金额:
    $ 10万
  • 项目类别:
    Continuing Grant

相似海外基金

"Ethical Review to Support Responsible AI in Policing - A Preliminary Study of West Midlands Police's Specialist Data Ethics Review Committee "
“支持警务中负责任的人工智能的道德审查——西米德兰兹郡警察专家数据道德审查委员会的初步研究”
  • 批准号:
    AH/Z505626/1
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Research Grant
Novel 'extended labour induction' balloon to improve safety of labour induction: Prototype development and preliminary clinical study
新型“延长引产”球囊可提高引产安全性:原型开发和初步临床研究
  • 批准号:
    MR/Y503423/1
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Research Grant
Preliminary Study to Establish Heavy Ion Ablation Therapy for Lethal Ventricular Arrhythmia
重离子消融治疗致死性室性心律失常的初步研究
  • 批准号:
    23K14885
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
A Preliminary Study for Constructing International Network of Image Archives on Afghan Cultural Heritages
构建阿富汗文化遗产国际图像档案网络的初步研究
  • 批准号:
    23K00915
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Anti-Müllerian Hormone and Cardiovascular Risk in Males with Chronic Kidney Disease: Preliminary Findings From a Cross-Sectional Study
男性慢性肾病患者的抗苗勒氏管激素和心血管风险:横断面研究的初步结果
  • 批准号:
    493131
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
Chronic/latent viral infection prevalence and estimated all-cause mortality risk among women living with HIV and HIV-negative women participating in the British Columbia CARMA-CHIWOS Collaboration (BCC3) study: preliminary findings
参与不列颠哥伦比亚省 CARMA-CHIWOS 合作 (BCC3) 研究的艾滋病毒感染者和艾滋病毒阴性女性的慢性/潜伏性病毒感染患病率和估计全因死亡风险:初步结果
  • 批准号:
    467895
  • 财政年份:
    2022
  • 资助金额:
    $ 10万
  • 项目类别:
Chronic/latent viral infection prevalence and estimated all-cause mortality risk among women living with HIV and HIV-negative women participating in the British Columbia CARMA-CHIWOS Collaboration (BCC3) study: preliminary findings
参与不列颠哥伦比亚省 CARMA-CHIWOS 合作 (BCC3) 研究的艾滋病毒感染者和艾滋病毒阴性女性的慢性/潜伏性病毒感染患病率和估计全因死亡风险:初步结果
  • 批准号:
    467881
  • 财政年份:
    2022
  • 资助金额:
    $ 10万
  • 项目类别:
Exploring the concept of dyadic health in Thai couples coping with breast cancer: A preliminary study of a partnered approach to physical activity in breast cancer survivors and spouse care partners
探索泰国夫妇应对乳腺癌的二元健康概念:乳腺癌幸存者和配偶护理伙伴合作体育活动方法的初步研究
  • 批准号:
    10426561
  • 财政年份:
    2021
  • 资助金额:
    $ 10万
  • 项目类别:
A preliminary study on the preceramic and early Maya civilization
前陶瓷及早期玛雅文明的初步研究
  • 批准号:
    20K20712
  • 财政年份:
    2020
  • 资助金额:
    $ 10万
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
A preliminary study for magnitude estimates and quantitative comparisons of the extreme space weather events based on the early modern analog observational records
基于现代早期模拟观测记录的极端空间天气事件震级估算与定量比较初步研究
  • 批准号:
    20K22367
  • 财政年份:
    2020
  • 资助金额:
    $ 10万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了