Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
使用基于笔的界面搜索包含文本和数学内容的文档
基本信息
- 批准号:539433-2019
- 负责人:
- 金额:$ 8.32万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Collaborative Research and Development Grants
- 财政年份:2020
- 资助国家:加拿大
- 起止时间:2020-01-01 至 2021-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Scientific and engineering documents typically contain both text and mathematics.  However, when it comes to searching such documents, current commercial technology is able to match text only and so is not well-suited for tasks such as finding mathematical expressions; this must be extended to become math-aware. In this proposal we aim to search text and mathematical content together in order to get the best search results. 
It has been shown that search engines become more effective when they are able to infer some semantics for terms appearing in documents since, in such instances, terms with similar meanings can be matched.  As a result we wish to ensure that math-aware search can infer semantics for mathematical formulas. In this way retrieval based on both text and mathematical formulas can become more effective. 
 
In order for a math-aware search engine to have broad impact, it is important that a user be able to enter their mathematical formulas in a natural handwritten manner rather than to use one-dimensional LaTeX-like commands. This can be natural on today's pen-based devices and so an important part of our project is to investigate how a pen-based interface can be used to retrieve documents that are rich in mathematical content. 
Converting handwritten mathematical expressions to be used in a math-aware search system has considerable difficulties. Challenges include recognition of individual characters, combining handwritten characters into math expressions, distinguishing among similar looking expressions, use of gestures, and  determining wild cards. We plan to investigate data-driven methods to ensure good recognition of handwritten math. 
The implementation and evaluation of our math-aware retrieval system plays a central role in our work.
科学和工程文档通常既包含文本又包含数学。然而,在搜索这类文档时,当前的商业技术只能匹配文本,因此不太适合查找数学表达式等任务;必须扩展这一功能以使其具有数学意识。在本方案中,我们的目标是同时搜索文本和数学内容,以获得最佳搜索结果。
已经表明,当搜索引擎能够为文档中出现的术语推断某些语义时,它们变得更有效,因为在这种情况下,具有相似含义的术语可以被匹配。因此,我们希望确保数学感知搜索能够推断出数学公式的语义。通过这种方式,基于文本和数学公式的检索可以变得更有效。
为了让一个有数学意识的搜索引擎产生广泛的影响,重要的是用户能够以自然的手写方式输入他们的数学公式,而不是使用一维LaTeX之类的命令。这在当今基于笔的设备上是很自然的,因此我们项目的一个重要部分是研究如何使用基于笔的界面来检索包含丰富数学内容的文档。
将手写数学表达式转换为在数学感知搜索系统中使用是相当困难的。挑战包括识别单个字符、将手写字符组合成数学表达式、区分相似的表情、使用手势和确定通配符。我们计划研究数据驱动的方法,以确保对手写数学的良好识别。
我们的数学意识检索系统的实施和评估在我们的工作中起着核心作用。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
                item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ patent.updateTime }}
Labahn, George其他文献
Labahn, George的其他文献
{{
              item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
{{ truncateString('Labahn, George', 18)}}的其他基金
Exact linear algebra,  polynomial systems and applications of computer algebra
精确线性代数、多项式系统及计算机代数应用
- 批准号:RGPIN-2020-04276 
- 财政年份:2022
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Exact linear algebra,  polynomial systems and applications of computer algebra
精确线性代数、多项式系统及计算机代数应用
- 批准号:RGPIN-2020-04276 
- 财政年份:2021
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
使用基于笔的界面搜索包含文本和数学内容的文档
- 批准号:539433-2019 
- 财政年份:2021
- 资助金额:$ 8.32万 
- 项目类别:Collaborative Research and Development Grants 
Exact linear algebra,  polynomial systems and applications of computer algebra
精确线性代数、多项式系统及计算机代数应用
- 批准号:RGPIN-2020-04276 
- 财政年份:2020
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
使用基于笔的界面搜索包含文本和数学内容的文档
- 批准号:539433-2019 
- 财政年份:2019
- 资助金额:$ 8.32万 
- 项目类别:Collaborative Research and Development Grants 
Symbolic linear algebra, symbolic-numeric computation and applications
符号线性代数、符号数值计算及应用
- 批准号:RGPIN-2015-04168 
- 财政年份:2019
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Symbolic linear algebra, symbolic-numeric computation and applications
符号线性代数、符号数值计算及应用
- 批准号:RGPIN-2015-04168 
- 财政年份:2018
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Symbolic linear algebra, symbolic-numeric computation and applications
符号线性代数、符号数值计算及应用
- 批准号:RGPIN-2015-04168 
- 财政年份:2017
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Symbolic linear algebra, symbolic-numeric computation and applications
符号线性代数、符号数值计算及应用
- 批准号:RGPIN-2015-04168 
- 财政年份:2016
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
Symbolic linear algebra, symbolic-numeric computation and applications
符号线性代数、符号数值计算及应用
- 批准号:RGPIN-2015-04168 
- 财政年份:2015
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 
相似海外基金
CAREER: Mining Hints from Text Documents to Guide Automated Database Performance Tuning
职业:从文本文档中挖掘提示来指导自动数据库性能调优
- 批准号:2239326 
- 财政年份:2023
- 资助金额:$ 8.32万 
- 项目类别:Continuing Grant 
Digital text archiving for cursive writing documents by using the reading voice of which the experts read aloud.
通过专家朗读的朗读声音,对草书文档进行数字文本归档。
- 批准号:21K18372 
- 财政年份:2021
- 资助金额:$ 8.32万 
- 项目类别:Grant-in-Aid for Challenging Research (Exploratory) 
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
使用基于笔的界面搜索包含文本和数学内容的文档
- 批准号:539433-2019 
- 财政年份:2021
- 资助金额:$ 8.32万 
- 项目类别:Collaborative Research and Development Grants 
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
使用基于笔的界面搜索包含文本和数学内容的文档
- 批准号:539433-2019 
- 财政年份:2019
- 资助金额:$ 8.32万 
- 项目类别:Collaborative Research and Development Grants 
Text Recognition of Historical Japanese Documents
日本历史文献的文本识别
- 批准号:18K19800 
- 财政年份:2018
- 资助金额:$ 8.32万 
- 项目类别:Grant-in-Aid for Challenging Research (Exploratory) 
Semantics based text matching for unstructured text documents
非结构化文本文档的基于语义的文本匹配
- 批准号:508398-2017 
- 财政年份:2017
- 资助金额:$ 8.32万 
- 项目类别:Engage Plus Grants Program 
SBIR Phase I: Software application for conversion of text-heavy documents into interactive diagram-based documents
SBIR 第一阶段:将文本文档转换为基于图表的交互式文档的软件应用程序
- 批准号:1548308 
- 财政年份:2016
- 资助金额:$ 8.32万 
- 项目类别:Standard Grant 
Semantics based text matching for unstructured text documents
非结构化文本文档的基于语义的文本匹配
- 批准号:500745-2016 
- 财政年份:2016
- 资助金额:$ 8.32万 
- 项目类别:Engage Grants Program 
Text mining and content analysis of recovery process in autobiographical documents by people with schizophrenia.
精神分裂症患者自传体文档恢复过程的文本挖掘和内容分析。
- 批准号:15K11827 
- 财政年份:2015
- 资助金额:$ 8.32万 
- 项目类别:Grant-in-Aid for Scientific Research (C) 
Inferencing and synthesizing information from multiple documents using text summarization and question answering models
使用文本摘要和问答模型从多个文档中推断和合成信息
- 批准号:228139-2011 
- 财政年份:2015
- 资助金额:$ 8.32万 
- 项目类别:Discovery Grants Program - Individual 

 刷新
              刷新
            
















 {{item.name}}会员
              {{item.name}}会员
            



