Improving Retrieval of Unstructured Information using existing Information Structures
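Basic information
- Grant number: RGPIN-2014-06292
- Principal investigator:
- Amount: $14,200
- Host institution:
- Host institution country: Canada
- Project category: Discovery Grants Program - Individual
- Fiscal year: 2019
- Funding country: Canada
- Period: 2019-01-01 to 2020-12-31
- Project status: Completed
- Source:
- Keywords:
Project abstract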
Unstructured information generally refers to text-heavy information that is not organized in a pre-defined manner, such as what is found on the Web or in collections of texts. Most developments in unstructured information searching have addressed the technical algorithms for retrieving billions of Web pages; meanwhile, interfaces used for Web searching have not evolved at the same pace. To successfully use current Web searching tools, users must generally possess or acquire the vocabulary used by the authors of the relevant documents, and searchers inevitably encounter information needs that they cannot adequately express using one or two vague and broad keywords that often have multiple meanings. This is the vocabulary mismatch problem that plagues information retrieval systems, especially when searching unfamiliar knowledge domains or subject areas.
This research program seeks to develop novel online information searching tools that bridge the gap between organized information (e.g., scientific, library, business or personal information collections) and Web searching. It assumes that searchers seek information to meet their needs, regardless of whether the information is structured (e.g., scientific, library or business information) or unstructured (e.g., Web searching or text). For example, structured information can suggest new keywords to better describe users' needs. This research program capitalizes on existing information organization investments to complement unstructured information retrieval technologies. It will ensure that the tools are useful and appreciated by recording how test participants use the tools over a period of at least 3 months. Published results will include open-source online search tool prototypes, a testing engine that could be used by other interface designers and researchers, and results of the usability testing over time.
This research program is innovative by virtue of the novel search systems it will design and test over time. Firstly, it will integrate existing information organization investments with the ubiquitous keyword searching and ranking in order to improve information discovery. For example, users could search a library catalogue and the Web using one integrated tool instead of the two different tools they must currently use. Secondly, the tools will be tested over time to ensure that they meet searchers' expectations and require little or no training. This type of testing over time is very rare and highly appropriate when the objective is to ensure users can truly use and appreciate a tool beyond its initial novelty.
This research aims to support students who are interested in improving information exploration and searching technologies: 88% of the budget is given directly to students (i.e., salaries, travel expenses, and computers). The supported PhD and master's students will be part of an existing research group where they will have the opportunity to collaborate with other research groups from the School of Information Studies, McGill and the University of Montreal. They will acquire skills in research, software design and development, testing, and oral/written communication, which are valuable in academic and industrial settings. Taken as a whole, this research program has the potential to improve the tools Canadian citizens use to search for all kinds of information by suggesting new keywords, grouping similar information together, and tearing down the artificial boundary between organized information collections (e.g., library catalogues or business taxonomies) and Web searching.
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants by Julien, CharlesAntoine
Improving Retrieval of Unstructured Information using existing Information Structures
- Grant number: RGPIN-2014-06292
- Fiscal year: 2018
- Funding amount: $14,200
- Project category: Discovery Grants Program - Individual
Improving Retrieval of Unstructured Information using existing Information Structures
- Grant number: RGPIN-2014-06292
- Fiscal year: 2017
- Funding amount: $14,200
- Project category: Discovery Grants Program - Individual
Improving Retrieval of Unstructured Information using existing Information Structures
- Grant number: RGPIN-2014-06292
- Fiscal year: 2016
- Funding amount: $14,200
- Project category: Discovery Grants Program - Individual
Improving Retrieval of Unstructured Information using existing Information Structures
- Grant number: RGPIN-2014-06292
- Fiscal year: 2015
- Funding amount: $14,200
- Project category: Discovery Grants Program - Individual
Improving Retrieval of Unstructured Information using existing Information Structures
- Grant number: RGPIN-2014-06292
- Fiscal year: 2014
- Funding amount: $14,200
- Project category: Discovery Grants Program - Individual
Similar overseas grants
Developing and Visualising a Retrieval-Augmented Deep Learning Model for Population Health Management
- Grant number: 2905946
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Studentship
EAGER/Collaborative Research: An LLM-Powered Framework for G-Code Comprehension and Retrieval
- Grant number: 2347624
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Standard Grant
Travel: Student Support for the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)
- Grant number: 2409649
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Standard Grant
III: Small: Query-By-Sketch: Simplifying Video Clip Retrieval Through A Visual Query Paradigm
- Grant number: 2335881
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Standard Grant
FlexNIR-PD: A resource efficient UK-based production process for patented flexible Near Infrared Sensors for LIDAR, Facial recognition and high-speed data retrieval
- Grant number: 10098113
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Collaborative R&D
EAGER/Collaborative Research: An LLM-Powered Framework for G-Code Comprehension and Retrieval
- Grant number: 2347623
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Standard Grant
CAREER: Explanation-based Optimization of Diversified Information Retrieval to Enhance AI Systems
- Grant number: 2339932
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Continuing Grant
Ritual, Rubbish and Retrieval: new approaches to Roman river finds
- Grant number: AH/Y007514/1
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Research Grant
SBIR Phase I: Knowledge Graph-powered Information Retrieval and Causal Inference
- Grant number: 2335357
- Fiscal year: 2024
- Funding amount: $14,200
- Project category: Standard Grant
P.E.A.R.L. (Project - Enterprise Asset Retrieval & Location)
- Grant number: 83001498
- Fiscal year: 2023
- Funding amount: $14,200
- Project category: Innovation Loans