Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
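Basic Information
- Grant number: 539433-2019
- Principal investigator:
- Amount: $83,200
- Host institution:
- Host institution country: Canada
- Program: Collaborative Research and Development Grants
- Fiscal year: 2020
- Funding country: Canada
- Project period: 2020-01-01 to 2021-12-31
- Project status: Completed
- Source:
- Keywords: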
Project Abstract
Scientific and engineering documents typically contain both text and mathematics. However, current commercial search technology matches text only, so it is poorly suited to tasks such as finding mathematical expressions; it must be extended to become math-aware. In this proposal we aim to search text and mathematical content together in order to obtain the best search results.
It has been shown that search engines become more effective when they can infer some semantics for the terms appearing in documents, since terms with similar meanings can then be matched. We therefore wish to ensure that math-aware search can also infer semantics for mathematical formulas, so that retrieval based on both text and mathematical formulas becomes more effective.
For a math-aware search engine to have broad impact, it is important that users be able to enter mathematical formulas in a natural handwritten manner rather than with one-dimensional LaTeX-like commands. Handwritten entry is natural on today's pen-based devices, so an important part of our project is to investigate how a pen-based interface can be used to retrieve documents that are rich in mathematical content.
Converting handwritten mathematical expressions into queries for a math-aware search system poses considerable difficulties. Challenges include recognizing individual characters, combining handwritten characters into mathematical expressions, distinguishing among similar-looking expressions, supporting the use of gestures, and determining wild cards. We plan to investigate data-driven methods to ensure good recognition of handwritten mathematics.
The implementation and evaluation of our math-aware retrieval system play a central role in our work.
Project Outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants held by Labahn, George
Exact linear algebra, polynomial systems and applications of computer algebra
- Grant number: RGPIN-2020-04276
- Fiscal year: 2022
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Exact linear algebra, polynomial systems and applications of computer algebra
- Grant number: RGPIN-2020-04276
- Fiscal year: 2021
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
- Grant number: 539433-2019
- Fiscal year: 2021
- Funding amount: $83,200
- Program: Collaborative Research and Development Grants
Exact linear algebra, polynomial systems and applications of computer algebra
- Grant number: RGPIN-2020-04276
- Fiscal year: 2020
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
- Grant number: 539433-2019
- Fiscal year: 2019
- Funding amount: $83,200
- Program: Collaborative Research and Development Grants
Symbolic linear algebra, symbolic-numeric computation and applications
- Grant number: RGPIN-2015-04168
- Fiscal year: 2019
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Symbolic linear algebra, symbolic-numeric computation and applications
- Grant number: RGPIN-2015-04168
- Fiscal year: 2018
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Symbolic linear algebra, symbolic-numeric computation and applications
- Grant number: RGPIN-2015-04168
- Fiscal year: 2017
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Symbolic linear algebra, symbolic-numeric computation and applications
- Grant number: RGPIN-2015-04168
- Fiscal year: 2016
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Symbolic linear algebra, symbolic-numeric computation and applications
- Grant number: RGPIN-2015-04168
- Fiscal year: 2015
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual
Similar overseas grants
CAREER: Mining Hints from Text Documents to Guide Automated Database Performance Tuning
- Grant number: 2239326
- Fiscal year: 2023
- Funding amount: $83,200
- Program: Continuing Grant
Digital text archiving for cursive writing documents by using the reading voice of which the experts read aloud.
- Grant number: 21K18372
- Fiscal year: 2021
- Funding amount: $83,200
- Program: Grant-in-Aid for Challenging Research (Exploratory)
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
- Grant number: 539433-2019
- Fiscal year: 2021
- Funding amount: $83,200
- Program: Collaborative Research and Development Grants
Searching Documents with Text and Mathematical Content Using a Pen-Based Interface
- Grant number: 539433-2019
- Fiscal year: 2019
- Funding amount: $83,200
- Program: Collaborative Research and Development Grants
Text Recognition of Historical Japanese Documents
- Grant number: 18K19800
- Fiscal year: 2018
- Funding amount: $83,200
- Program: Grant-in-Aid for Challenging Research (Exploratory)
Semantics based text matching for unstructured text documents
- Grant number: 508398-2017
- Fiscal year: 2017
- Funding amount: $83,200
- Program: Engage Plus Grants Program
SBIR Phase I: Software application for conversion of text-heavy documents into interactive diagram-based documents
- Grant number: 1548308
- Fiscal year: 2016
- Funding amount: $83,200
- Program: Standard Grant
Semantics based text matching for unstructured text documents
- Grant number: 500745-2016
- Fiscal year: 2016
- Funding amount: $83,200
- Program: Engage Grants Program
Text mining and content analysis of recovery process in autobiographical documents by people with schizophrenia.
- Grant number: 15K11827
- Fiscal year: 2015
- Funding amount: $83,200
- Program: Grant-in-Aid for Scientific Research (C)
Inferencing and synthesizing information from multiple documents using text summarization and question answering models
- Grant number: 228139-2011
- Fiscal year: 2015
- Funding amount: $83,200
- Program: Discovery Grants Program - Individual