Methods and Tools to Advance the Retrieval of Mathematical Knowledge from Digital Libraries for Search-, Recommendation- and Assistance-Systems

促进从数字图书馆检索数学知识以用于搜索、推荐和辅助系统的方法和工具

基本信息

项目摘要

The goal of our project is to investigate fundamental methods and tools for making mathematical knowledge accessible to information retrieval tools. Achieving this goal requires methods to reliably extract mathematical knowledge from documents. In the domain of natural language processing (NLP), a number of well-established, general purpose text processing methods and tools exist that are applied to a text to enable domain specific extraction tasks. Similar to state-of-the-art text processing tools, such as the Stanford NLP toolkit, our research will determine how similar tools for processing mathematical language can be realized.Our approach is to expand upon the concept of Mathematical Language Processing (MLP), a concept for which we have already demonstrated its feasibility when we presented it at the ACM SIGIR 2016. In the context of this project, we will expand upon our preliminary research to make the approach more effective and applicable for real world mathematical information retrieval applications. Specifically, the project has the following objectives:1. Identify mathematical formulae and expressions in documents, and reliably differentiate them from similar or neighboring structures.2. Perform type detection and tokenization of mathematical expressions.3. Extract the corresponding mathematical concepts from the tokenized mathematical formulae and expressions.Our goal is to enable other scientists to use our methods and tools for mathematical language processing to tackle their own novel problems. We hope that MLP will continue to improve in this process, as was once the case for early NLP approaches.A wide variety of applications would benefit from advancements to mathematical information retrieval. In the STEM disciplines, improvements could be made to academic literature search, literature recommendation, and even plagiarism prevention. Additionally, expert search or applications in pure mathematics, such as theorem search or definition lookup, would significantly benefit from our developments. Applications beyond STEM fields include the improvement of tutoring assistance tools, as well as patent search and enterprise search, which could become more valuable to companies if they integrate math-aware information retrieval methods.
我们的项目的目标是调查的基本方法和工具,使数学知识的信息检索工具。实现这一目标需要从文档中可靠地提取数学知识的方法。在自然语言处理(NLP)领域中,存在许多成熟的通用文本处理方法和工具,其应用于文本以实现领域特定的提取任务。类似于最先进的文本处理工具,如斯坦福大学NLP工具包,我们的研究将确定如何实现类似的数学语言处理工具。我们的方法是扩展数学语言处理(MLP)的概念,我们已经在ACM SIGIR 2016上展示了这一概念的可行性。在这个项目的背景下,我们将扩大我们的初步研究,使该方法更有效和适用于真实的世界的数学信息检索应用。具体而言,该项目有以下目标:1.识别文档中的数学公式和表达式,并可靠地将它们与相似或相邻的结构区分开来。2.对数学表达式进行类型检测和标记化.从标记化的数学公式和表达式中提取相应的数学概念。我们的目标是使其他科学家能够使用我们的数学语言处理方法和工具来解决他们自己的新问题。我们希望MLP在这个过程中继续改进,就像早期的NLP方法一样。各种各样的应用将从数学信息检索的进步中受益。在STEM学科中,可以改进学术文献搜索,文献推荐,甚至防止剽窃。此外,专家搜索或纯数学中的应用程序,如定理搜索或定义查找,将大大受益于我们的发展。STEM领域以外的应用包括改进辅导辅助工具,以及专利搜索和企业搜索,如果他们整合数学感知信息检索方法,这些应用对公司来说可能会变得更有价值。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Professor Dr.-Ing. Bela Gipp其他文献

Professor Dr.-Ing. Bela Gipp的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Professor Dr.-Ing. Bela Gipp', 18)}}的其他基金

Analyzing Mathematics to Detect Disguised Academic Plagiarism
分析数学以检测伪装的学术抄袭
  • 批准号:
    437179652
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
    Research Grants

相似海外基金

Developing and implementing tools to advance precision psychiatry using electronic health records
开发和实施工具,利用电子健康记录推进精准精神病学
  • 批准号:
    2886557
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Studentship
New tools to advance biophysical study and inhibition of peripheral membrane proteins.
推进生物物理研究和外周膜蛋白抑制的新工具。
  • 批准号:
    10795403
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
NetEthics: Building Tools & Training to Advance Responsible Conduct in Complex Research Networks Pioneering Novel Technologies
NetEthics:构建工具
  • 批准号:
    2220611
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Establishing Essential Research Tools to Advance the Knowledge of Interrelated Sex Differences in Adipose and Immune Functions in Health and Disease (The SDAI Core Models) and Exploring the Mechanisms Involved in the Sex-dimorphic Role of Prohibitin Above
建立必要的研究工具,以增进对健康和疾病中脂肪和免疫功能相关性别差异的了解(SDAI核心模型),并探索上述抑制素的性别二态性作用所涉及的机制
  • 批准号:
    438069
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Operating Grants
Adapting Advance Care Planning Tools for Permanent Supportive Housing Residents
为永久支持性住房居民调整预先护理计划工具
  • 批准号:
    10112805
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
EDGE CT: Tools to advance functional genomic studies in sea urchins
EDGE CT:推进海胆功能基因组研究的工具
  • 批准号:
    1923445
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Data Visualization Literacy: Research and Tools that Advance Public Understanding of Scientific Data
数据可视化素养:促进公众对科学数据理解的研究和工具
  • 批准号:
    1713567
  • 财政年份:
    2017
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Innovation in Medical Evidence Development and Surveillance(IMEDS)serves to advance the science and tools necessary to support post-market evidence generation on regulated products.
医学证据开发和监测创新 (IMEDS) 致力于推进支持受监管产品上市后证据生成所需的科学和工具。
  • 批准号:
    9074987
  • 财政年份:
    2015
  • 资助金额:
    --
  • 项目类别:
Advance planning tools for responsive supply chains
用于响应式供应链的高级规划工具
  • 批准号:
    355566-2008
  • 财政年份:
    2012
  • 资助金额:
    --
  • 项目类别:
    Discovery Grants Program - Individual
Arbitrary Pulse Shaping to Advance Electron Paramagnetic Resonance Tools for Biom
任意脉冲整形促进 Biom 电子顺磁共振工具的发展
  • 批准号:
    8465247
  • 财政年份:
    2011
  • 资助金额:
    --
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了