Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
基本信息
- 批准号:RGPIN-2017-06487
- 负责人:
- 金额:$ 1.46万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2019
- 资助国家:加拿大
- 起止时间:2019-01-01 至 2020-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The massive amount of publicly available data is an amazing opportunity for artificial intelligence to play a key role in life sciences. Automatic approaches have proven to be effective in supporting life sciences research, yet mining complex and unstructured data is still a major challenge. In this context, the objective of my research program is to contribute to knowledge discovery in life sciences by easing access to existing knowledge, and supporting its exploration. I propose to reach this objective by creating algorithms to jointly retrieve and mine textual and non-textual data. Life scientists looking for existing knowledge face critical challenges such as discovering entities in documents, retrieving documents and data relevant to specific topics, or analyze data according to their contribution to experiments. ***Over the next five years, my research will hence focus on two objectives:***O1. The investigation of new models and algorithms to jointly retrieve various types of documents from natural language (NL) queries. ***The retrieval of documents is a critical step for life sciences since the retrieved results can be used as input for a variety of tasks, such as curation, triage, or biological network modeling. There is a twofold challenge in understanding NL queries, and retrieving heterogeneous types of documents. The objective is to investigate the best way of analyzing NL queries to expand them in directions that trigger the retrieval of articles, gene or protein sequences, related database entries, experimental data, etc.***O2. The exploration of new algorithms to discover bio-entities in documents, and link them to relevant knowledge bases. Though much work has been done toward entity discovery and linking (EDL) in social media and news, many challenges still remain in life sciences. As automatically annotated documents support researchers in building computational models of biological processes, further work on the bio-entity discovery and linking task is necessary.***EDL is very challenging in genomics because bio-entities are often highly ambiguous, and little context is usually available for disambiguation. The objective is to investigate how generic approaches for solving the EDL task can be adapted to the genomics field, and how several reference databases can be used together to support linking and disambiguation of bio-entities.***This research program is cross-disciplinary. In the computer science domain, the program combines natural language processing, information retrieval, machine learning, and big data mining. My collaboration with genomics researchers provides a challenging environment involving real users. ***Involved Highly Qualified Personal will get advanced training in natural language processing, applied machine learning, and text/data mining. ***The released work will be open-source in order to be easily reused by the community, and transferred to the industry.
大量的公开数据是人工智能在生命科学中发挥关键作用的绝佳机会。事实证明,自动方法可以有效支持生命科学研究,但挖掘复杂和非结构化数据仍然是一个重大挑战。在这种背景下,我的研究计划的目标是通过简化现有知识的获取并支持其探索,为生命科学领域的知识发现做出贡献。我建议通过创建联合检索和挖掘文本和非文本数据的算法来实现这一目标。寻找现有知识的生命科学家面临着严峻的挑战,例如发现文档中的实体、检索与特定主题相关的文档和数据,或根据数据对实验的贡献来分析数据。 ***在接下来的五年里,我的研究将集中在两个目标上:***O1。研究新模型和算法,以从自然语言 (NL) 查询中联合检索各种类型的文档。 ***文档检索是生命科学的关键步骤,因为检索结果可用作各种任务的输入,例如管理、分类或生物网络建模。理解 NL 查询和检索异构类型的文档存在双重挑战。目标是研究分析 NL 查询的最佳方法,以将其扩展到触发文章、基因或蛋白质序列、相关数据库条目、实验数据等检索的方向。***O2。探索新算法来发现文档中的生物实体,并将其链接到相关知识库。尽管在社交媒体和新闻中的实体发现和链接(EDL)方面已经做了很多工作,但生命科学领域仍然存在许多挑战。由于自动注释文档支持研究人员构建生物过程的计算模型,因此有必要对生物实体发现和链接任务进行进一步的工作。***EDL 在基因组学中非常具有挑战性,因为生物实体通常高度模糊,并且通常很少有上下文可用于消歧。目的是研究解决 EDL 任务的通用方法如何适应基因组学领域,以及如何一起使用多个参考数据库来支持生物实体的链接和消除歧义。***该研究项目是跨学科的。在计算机科学领域,该程序结合了自然语言处理、信息检索、机器学习和大数据挖掘。我与基因组学研究人员的合作提供了一个涉及真实用户的具有挑战性的环境。 ***参与的高素质人员将获得自然语言处理、应用机器学习和文本/数据挖掘方面的高级培训。 ***发布的作品将开源,以便于社区重用,并转移到业界。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Meurs, MarieJean其他文献
Meurs, MarieJean的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Meurs, MarieJean', 18)}}的其他基金
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2022
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2021
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2020
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2018
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2017
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2022
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2021
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2020
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2018
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Supporting Knowledge Discovery in Life Sciences
支持生命科学领域的知识发现
- 批准号:
RGPIN-2017-06487 - 财政年份:2017
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
CI-EN: Collaborative Research: Enhancement of Foldit, a Community Infrastructure Supporting Research on Knowledge Discovery Via Crowdsourcing in Computational Biology
CI-EN:协作研究:Foldit 的增强,Foldit 是一个支持计算生物学中通过众包进行知识发现研究的社区基础设施
- 批准号:
1629811 - 财政年份:2016
- 资助金额:
$ 1.46万 - 项目类别:
Standard Grant
CI-EN: Collaborative Research: Enhancement of Foldit, a Community Infrastructure Supporting Research on Knowledge Discovery Via Crowdsourcing in Computational Biology
CI-EN:协作研究:Foldit 的增强,Foldit 是一个支持计算生物学中通过众包进行知识发现研究的社区基础设施
- 批准号:
1629879 - 财政年份:2016
- 资助金额:
$ 1.46万 - 项目类别:
Standard Grant
CI-EN: Collaborative Research: Enhancement of Foldit, a Community Infrastructure Supporting Research on Knowledge Discovery Via Crowdsourcing in Computational Biology
CI-EN:协作研究:Foldit 的增强,Foldit 是一个支持计算生物学中通过众包进行知识发现研究的社区基础设施
- 批准号:
1627539 - 财政年份:2016
- 资助金额:
$ 1.46万 - 项目类别:
Standard Grant
CI-EN: Collaborative Research: Enhancement of Foldit, a Community Infrastructure Supporting Research on Knowledge Discovery Via Crowdsourcing in Computational Biology
CI-EN:协作研究:Foldit 的增强,Foldit 是一个支持计算生物学中通过众包进行知识发现研究的社区基础设施
- 批准号:
1625811 - 财政年份:2016
- 资助金额:
$ 1.46万 - 项目类别:
Standard Grant
GV: Small: Collaborative Research: Supporting Knowledge Discovery through a Scientific Visualization Language
GV:小型:协作研究:通过科学可视化语言支持知识发现
- 批准号:
1302755 - 财政年份:2012
- 资助金额:
$ 1.46万 - 项目类别:
Standard Grant














{{item.name}}会员




