Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
基本信息
- 批准号:RGPIN-2016-03659
- 负责人:
- 金额:$ 1.89万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2018
- 资助国家:加拿大
- 起止时间:2018-01-01 至 2019-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Applications relying on text must be built upon solid foundations of data organization similar to those that underlie conventional database systems. Thus the central long-term objective of the research program is to support document storage and management by applying sound database principles to this domain of information. The challenge is to discover how the complexity of text, with its intricate structure and diversity of expression, can be efficiently and effectively accessed and managed. In the short term, this objective will be addressed through the design and implementation of improved search engines.***The research program will include three related investigations. I will explore problems of general-purpose search engines, including aspects specific to reference texts, where nested structure is central to locating data. For example, I will investigate how the mechanism of database views can be best applied to describe, design, and implement an index that provides efficient search, including support for phrasal queries. I will also examine aspects specific to providing optometry researchers with access to Electronic Medical Records, where free-text fields include many spelling errors and where ensuring patient privacy is mandatory. In the third thrust, I will concentrate on aspects specific to mathematics information retrieval, where the relative positioning of symbols is at least as important as the particular symbols used. The inadequacy of existing solutions leads to the design of improved algorithms, which I then analyze, implement, and evaluate. Standard benchmarks provide the basis for comparative evaluations, but often additional test collections must be developed to examine specific aspects of the problems from unexplored perspectives. Graduate students will participate in all aspects of this research, resulting in HQP well-prepared to contribute to this area. ***The centrality of documents in recording and preserving information suggests that this research remains timely and crucial to economic growth in Canada. The integration of information retrieval and database management is well-recognized as being important to Canadian business and could be equally important in promoting Canadian culture. More specifically, the reliance on text search has become a universal need, with search engines providing critical roles in gathering information internet-wide, from across one or more enterprises, or from one's own personal data collections. In addition, other text applications are found throughout business and government. Publishers, data providers (e.g., via the World Wide Web), and organizations that rely on any form of text-dominated knowledge base for conducting their internal and external business will benefit from specific tools that arise from this research as well as from the theory, which will provide a framework for designing their text management systems.**
依赖文本的应用程序必须建立在坚实的数据组织基础上,类似于传统数据库系统的基础。因此,该研究计划的中心长期目标是通过将良好的数据库原则应用于这一信息领域来支持文档存储和管理。挑战在于发现如何有效地访问和管理具有复杂结构和表达多样性的文本的复杂性。在短期内,将通过设计和实施改进的搜索引擎来实现这一目标。该研究计划将包括三个相关的调查。我将探讨通用搜索引擎的问题,包括特定于参考文本的方面,其中嵌套结构是定位数据的核心。例如,我将研究如何最好地应用数据库视图的机制来描述、设计和实现一个提供有效搜索的索引,包括对短语查询的支持。我还将研究特定于提供验光研究人员访问电子病历的方面,其中自由文本字段包括许多拼写错误,并且必须确保患者隐私。在第三个重点中,我将集中讨论数学信息检索的特定方面,其中符号的相对位置至少与所使用的特定符号一样重要。现有的解决方案的不足之处导致改进算法的设计,然后我分析,实施和评估。标准基准为比较评估提供了基础,但通常必须开发额外的测试集,以从未探索的角度检查问题的特定方面。研究生将参与这项研究的各个方面,从而使HQP做好充分准备,为这一领域做出贡献。* 文件在记录和保存信息方面的中心地位表明,这项研究对加拿大的经济增长仍然是及时和至关重要的。信息检索和数据库管理的整合被公认为对加拿大企业很重要,在促进加拿大文化方面也同样重要。更具体地说,对文本搜索的依赖已经成为一种普遍的需求,搜索引擎在从一个或多个企业或从自己的个人数据集合收集互联网范围内的信息方面发挥着关键作用。此外,在整个企业和政府中也可以找到其他文本应用程序。发布者、数据提供者(例如,通过万维网),以及依赖于任何形式的文本主导的知识库进行内部和外部业务的组织将受益于从本研究以及理论中产生的特定工具,这将为设计其文本管理系统提供一个框架。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Tompa, Frank其他文献
Tompa, Frank的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Tompa, Frank', 18)}}的其他基金
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2021
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2020
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2019
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2017
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2016
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2015
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2013
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2012
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2011
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2010
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2021
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2020
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2019
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2017
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
灵活高效的文本存储、管理和检索
- 批准号:
RGPIN-2016-03659 - 财政年份:2016
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2015
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2013
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2012
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2011
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual
Flexible, efficient text databases
灵活、高效的文本数据库
- 批准号:
9292-2010 - 财政年份:2010
- 资助金额:
$ 1.89万 - 项目类别:
Discovery Grants Program - Individual