DeFacto: Acquiring, Curating, and Using a Bilingual Domain Aware Commonsense Knowledge Base
DeFacto:获取、整理和使用双语领域感知常识知识库
基本信息
- 批准号:RGPIN-2017-05068
- 负责人:
- 金额:$ 1.68万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2019
- 资助国家:加拿大
- 起止时间:2019-01-01 至 2020-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Automatically extracting knowledge from a large set of mostly unstructured documents (such as the Web) and organizing it into a knowledge base (KB) is a key challenge in artificial intelligence. Intuitively, such KBs should directly impact the quality of many NLP applications such as question answering, information retrieval or Text Analytics. Open information extraction, the task of extracting knowledge from texts without much supervision (especially not a prescription of the kind of information to mine), has brought new hope for such an endeavour. ******Despite a number of well-designed components are nowadays widespread and readily available for extracting facts and relations (so-called tuples) from texts, tapping information in large collections of texts still raises a number of issues. The technology embedded in a typical knowledge extraction pipeline is fraught with shortcomings: coreference resolution, named-entity resolution and parsing errors are collapsing so that many tuples (if not the vast majority) are simply useless. Also, most works are targeting very frequent entities and relations, which exclude a large quantity of information on domain specific texts that are pervasive over the Web. ******Our long term objective consists in developing the necessary expertise in populating, curating, maintaining and using a KB. Our proposal departs from several existing initiatives by a number of key factors. First, since specific domains are prevalent over the Web, we want our technology to be domain aware. Second, since today's world is multi-lingual and because not everything is written in English, we further want our technology to be multi-lingual in nature. Last, most works are devoted to develop fully automatic technology for assisting humans. In our proposal, we are interested in measuring how much gaming with a purpose can make humans assist the computer. ******In order to succeed, we target in this proposal the development of deFacto, a multi-domain, bilingual KB (French -- English) acquired iteratively from texts mined over the web, with the help of feedback collected from users via serious gaming.
从大量非结构化文档(如Web)中自动提取知识并将其组织成知识库(KB)是人工智能中的一个关键挑战。直观地说,这样的KBs应该直接影响许多NLP应用程序的质量,如问答、信息检索或文本分析。开放信息提取,即在没有太多监督的情况下从文本中提取知识的任务(特别是没有对信息进行挖掘的规定),为这种努力带来了新的希望。******尽管现在有许多设计良好的组件广泛使用,并且可以很容易地从文本中提取事实和关系(所谓的元组),但是在大量文本集合中挖掘信息仍然会引起许多问题。嵌入在典型知识提取管道中的技术充满了缺点:共同引用解析、命名实体解析和解析错误正在崩溃,因此许多元组(如果不是绝大多数的话)根本没用。此外,大多数作品针对的是非常频繁的实体和关系,这就排除了大量在网络上普遍存在的特定领域文本的信息。******我们的长期目标包括发展必要的专业知识,在人口,策划,维护和使用知识库。我们的建议在若干关键因素上有别于若干现有倡议。首先,由于特定的域在Web上普遍存在,我们希望我们的技术具有域感知能力。第二,由于今天的世界是多语言的,而且不是所有的东西都是用英语写的,我们进一步希望我们的技术在本质上是多语言的。最后,大部分工作致力于开发全自动技术来帮助人类。在我们的提议中,我们感兴趣的是衡量有目的的游戏能在多大程度上使人类帮助计算机。******为了取得成功,我们的目标是开发deFacto,这是一种多领域的双语知识库(法语-英语),通过从用户通过严肃游戏收集的反馈,从网络上挖掘的文本中迭代获取。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Langlais, Philippe其他文献
Connected speech markers of amyloid burden in primary progressive aphasia.
- DOI:
10.1016/j.cortex.2021.09.010 - 发表时间:
2021-12 - 期刊:
- 影响因子:3.6
- 作者:
Slegers, Antoine;Chafouleas, Genevieve;Montembeault, Maxime;Bedetti, Christophe;Welch, Ariane E.;Rabinovici, Gil D.;Langlais, Philippe;Gorno-Tempini, Maria L.;Brambati, Simona M. - 通讯作者:
Brambati, Simona M.
Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition
- DOI:
10.1162/tacl_a_00386 - 发表时间:
2021-01-01 - 期刊:
- 影响因子:10.9
- 作者:
Ghaddar, Abbas;Langlais, Philippe;Rezagholizadeh, Mehdi - 通讯作者:
Rezagholizadeh, Mehdi
Langlais, Philippe的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Langlais, Philippe', 18)}}的其他基金
DeFacto: Acquiring, Curating, and Using a Bilingual Domain Aware Commonsense Knowledge Base
DeFacto:获取、整理和使用双语领域感知常识知识库
- 批准号:
RGPIN-2017-05068 - 财政年份:2021
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
DeFacto: Acquiring, Curating, and Using a Bilingual Domain Aware Commonsense Knowledge Base
DeFacto:获取、整理和使用双语领域感知常识知识库
- 批准号:
RGPIN-2017-05068 - 财政年份:2020
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Extraction automatique d'informations structurées depuis les pièces jointes de courriels échangés sur la plateforme TIGER
TIGER 平台上各部件接头自动提取信息结构
- 批准号:
534554-2018 - 财政年份:2018
- 资助金额:
$ 1.68万 - 项目类别:
Engage Plus Grants Program
DeFacto: Acquiring, Curating, and Using a Bilingual Domain Aware Commonsense Knowledge Base
DeFacto:获取、整理和使用双语领域感知常识知识库
- 批准号:
RGPIN-2017-05068 - 财政年份:2018
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Normalisation des rapports de maintenance traités par Orora
Orora 维护特性关系正常化
- 批准号:
532039-2018 - 财政年份:2018
- 资助金额:
$ 1.68万 - 项目类别:
Engage Grants Program
DeFacto: Acquiring, Curating, and Using a Bilingual Domain Aware Commonsense Knowledge Base
DeFacto:获取、整理和使用双语领域感知常识知识库
- 批准号:
RGPIN-2017-05068 - 财政年份:2017
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Extraction automatique d'informations structurées depuis les courriels échangés sur la plateforme TIGER
TIGER 平台上的自动信息提取结构
- 批准号:
521814-2017 - 财政年份:2017
- 资助金额:
$ 1.68万 - 项目类别:
Engage Grants Program
Analogical Learning for Natural Language Processing
自然语言处理的类比学习
- 批准号:
249630-2012 - 财政年份:2015
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Analogical Learning for Natural Language Processing
自然语言处理的类比学习
- 批准号:
249630-2012 - 财政年份:2014
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Analogical Learning for Natural Language Processing
自然语言处理的类比学习
- 批准号:
249630-2012 - 财政年份:2013
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
Graduate Yearlong Experience and Residency for Acquiring STEM Teaching Competencies
研究生为期一年的经验和住院实习以获得 STEM 教学能力
- 批准号:
2344779 - 财政年份:2024
- 资助金额:
$ 1.68万 - 项目类别:
Continuing Grant
The ecological shift by acquiring nuchal glands' toxin in Rhabdophis tigrinus and its adaptation significance on Sado Island
虎纹雉颈腺毒素获取的生态转变及其对佐渡岛的适应意义
- 批准号:
22KJ0402 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Development of VR Educational Materials for Acquiring Self-Regulation Skills in Novice Home health Nurses to Manage Perceived Difficulty
开发 VR 教材,帮助新手家庭保健护士获得自我调节技能,以应对感知到的困难
- 批准号:
23K19820 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Research Activity Start-up
Development of a sustainable plant production support system for protected horticulture by acquiring massflow information from environmental control systems
通过从环境控制系统获取质量流量信息,开发保护园艺的可持续植物生产支持系统
- 批准号:
23K05473 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Acquiring cognitive maps: how brains learn hidden structure
获取认知图:大脑如何学习隐藏结构
- 批准号:
10739622 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Elucidation of new mechanism of acquiring resistance to Immune checkpoint inhibitor
阐明获得免疫检查点抑制剂耐药性的新机制
- 批准号:
23KJ1108 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Acquiring rich longitudinal passive sleep data across childhood and adolescence (8-18yrs)-the AMBIENT sleep study
获取童年和青春期(8-18 岁)丰富的纵向被动睡眠数据 - AMBIENT 睡眠研究
- 批准号:
MR/X028917/1 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Research Grant
Support for acquiring chunking skills by reading source code using refactoring principles
支持通过使用重构原则阅读源代码来获得分块技能
- 批准号:
23K02697 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Acquiring chemical intuition into the catalytic properties of UiO-type monolithic frameworks using machine learning techniques
使用机器学习技术获得对 UiO 型整体框架催化特性的化学直觉
- 批准号:
EP/Y023447/1 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Fellowship
Development of ultrasensitive methods for acquiring differential spectra using quantum light
开发利用量子光获取微分光谱的超灵敏方法
- 批准号:
23K04684 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)