USING CORPUS STATISTICS TO REMOVE REDUNDANT WORDS IN TEXT CATEGORIZATION

利用语料库统计去除文本分类中的冗余单词

基本信息

  • 批准号:
    2578635
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
  • 资助国家:
    美国
  • 起止时间:
  • 项目状态:
    未结题

项目摘要

In this collaborative project with Yiming Yang at Mayo Clinic, we use the term strength I have defined and use in the current Bayesian retrieval system for the Entrez neighbors, to determine thresholds for term removal. This allows a large number of terms to be identified as relatively useless. When these are removed the problem of text categorization based on the terms appearing in the text is greatly simplified. For the linear least squares fitting method developed and used by Dr. Yang, we find a time savings of 70 to 90% which comes from the removal of 80% or more of the terms. Dr. Yang has also developed what she terms an expert network method of text classification. It is based on finding the nearest neighbors to a text and using their classifications to predict the best classification for the text. The term removal methods provide significant time and space savings for this approach as well.
在这个与梅奥的杨一鸣合作的项目中

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

W WILBUR其他文献

W WILBUR的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('W WILBUR', 18)}}的其他基金

ALGORITHMIC COMPLEXITY AND PRACTICAL PROBLEMS
算法复杂性和实际问题
  • 批准号:
    2578637
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
THEORECTICAL INVESTIGATION OF THE LIMITS OF AI AND KNOWLEDGE REPRESENTATION
人工智能和知识表示的局限性的理论研究
  • 批准号:
    5203634
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
THEORETICAL INVESTIGATION OF THE LIMITS OF AI AND KNOWLEDGE REPRESENTATION
人工智能和知识表示的局限性的理论研究
  • 批准号:
    2578636
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
ALGORITHMIC COMPLEXITY AND PRACTICAL PROBLEMS
算法复杂性和实际问题
  • 批准号:
    5203635
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
USING CORPUS STATISTICS TO REMOVE REDUNDANT WORDS IN TEXT CATEGORIZATION
利用语料库统计去除文本分类中的冗余单词
  • 批准号:
    5203633
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:

相似海外基金

TRUST2 - Improving TRUST in artificial intelligence and machine learning for critical building management
TRUST2 - 提高关键建筑管理的人工智能和机器学习的信任度
  • 批准号:
    10093095
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Collaborative R&D
QUANTUM-TOX - Revolutionizing Computational Toxicology with Electronic Structure Descriptors and Artificial Intelligence
QUANTUM-TOX - 利用电子结构描述符和人工智能彻底改变计算毒理学
  • 批准号:
    10106704
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    EU-Funded
Artificial intelligence in education: Democratising policy
教育中的人工智能:政策民主化
  • 批准号:
    DP240100602
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Discovery Projects
Application of artificial intelligence to predict biologic systemic therapy clinical response, effectiveness and adverse events in psoriasis
应用人工智能预测生物系统治疗银屑病的临床反应、有效性和不良事件
  • 批准号:
    MR/Y009657/1
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Fellowship
REU Site: CyberAI: Cybersecurity Solutions Leveraging Artificial Intelligence for Smart Systems
REU 网站:Cyber​​AI:利用人工智能实现智能系统的网络安全解决方案
  • 批准号:
    2349104
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
EAGER: Artificial Intelligence to Understand Engineering Cultural Norms
EAGER:人工智能理解工程文化规范
  • 批准号:
    2342384
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Reversible Computing and Reservoir Computing with Magnetic Skyrmions for Energy-Efficient Boolean Logic and Artificial Intelligence Hardware
用于节能布尔逻辑和人工智能硬件的磁斯格明子可逆计算和储层计算
  • 批准号:
    2343607
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
I-Corps: Translation Potential of a Secure Data Platform Empowering Artificial Intelligence Assisted Digital Pathology
I-Corps:安全数据平台的翻译潜力,赋能人工智能辅助数字病理学
  • 批准号:
    2409130
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Planning: Artificial Intelligence Assisted High-Performance Parallel Computing for Power System Optimization
规划:人工智能辅助高性能并行计算电力系统优化
  • 批准号:
    2414141
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Reassessing the Appropriateness of currently-available Data-set Protection Levers in the era of Artificial Intelligence
重新评估人工智能时代现有数据集保护手段的适用性
  • 批准号:
    23K22068
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了