Parsing and Grammatical Induction as Constraint Solving
解析和语法归纳作为约束求解
基本信息
- 批准号:RGPIN-2018-06736
- 负责人:
- 金额:$ 1.68万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2019
- 资助国家:加拿大
- 起止时间:2019-01-01 至 2020-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Motivation: Roughly 7,000 languages are spoken in the world, including about 60 distinct indigenous languages in Canada alone. Many of them are under-studied and poorly supported by computational and computerized***systems: the number of linguists who can study them is insufficient and most of their efforts pour into mainstream languages. Efficient methods that automatically induce grammars that can be incorporated into modern technological and educational tools would greatly impact the Grammar Induction field and related Natural Language Processing (NLP) areas, while significantly increasing survival chances for endangered languages, with consequent socio-economic potential.******State-of-the-art: Most grammar induction models involve probabilities (so that learning a grammar amounts to selecting a model from a pre-specified model family) or statistical models of machine learning. With respect to precision, reliability and explanatory power, such models are inferior to rule and inference-based models, but they can often overcome the inflexibilities of classical NLP methods through extensive search on voluminous data sets, and can achieve impressive results when what matters is only to simulate linguistic understanding in the Turing test sense. A few models that use linguistic information from one language for the task of describing another language rather than addressing induction in full, are usually restricted to specific tasks, e.g. disambiguating the other language and may require parallel corpora.******Position of the research within the state-of-the-art: In contrast, with support from my previous NSERC grant, my students and I developed a novel model for inducing unknown grammars from known ones, which works for more than just specific tasks and needs neither a pre-specified model family, nor parallel corpora, nor any of the typical models of machine learning. Our model makes it possible to generate an under-studied language's grammar using representative and correct input sentences in that language, together with its lexicon relevant to that input, all of which is parsed with respect to the correct grammar of a well-studied language. We call our model the Womb Grammar Model (WGM) because it can generate new grammars given appropriate input, much as human wombs can generate all races.******Proposed research: The present proposal focuses on solidifying our experimental WGM results into a full-blown and executable computational linguistic theory of parsing and of grammatical inference (long term goal), with particular focus on semantic and pragmatic interpretation (medium-term). We shall fine-tune this theory by interleaving its development with test-bed short-term concrete applications to grammar induction (in particular, around linguistic diversity preservation in Canada), to parsing per se, and to linguistic experimentation (as testing alternative linguistic constraints).
动机:世界上大约有7000种语言,其中仅在加拿大就有大约60种不同的土著语言。其中许多语言研究不足,没有得到计算和计算机化*系统的支持:能够研究这些语言的语言学家人数不足,他们的大部分努力都涌入主流语言。自动归纳语法的有效方法将极大地影响语法归纳领域和相关的自然语言处理(NLP)领域,同时显著增加濒危语言的生存机会,从而带来社会经济潜力。最新水平:大多数语法归纳模型涉及概率(因此,学习语法相当于从预先指定的模型家族中选择一个模型)或机器学习的统计模型。在精度、可靠性和解释力方面,这类模型不如基于规则和推理的模型,但它们往往可以通过在海量数据集上的广泛搜索来克服经典自然语言处理方法的不灵活性,并且当重要的是模拟图灵测试意义上的语言理解时,可以取得令人印象深刻的结果。一些使用一种语言的语言信息来描述另一种语言的模型,而不是完全解决归纳的任务,通常局限于特定的任务,例如消除另一种语言的歧义,可能需要平行的语料库。*这项研究在最新的研究中的地位:相反,在我之前的NSERC资助下,我的学生和我开发了一个从已知语法中归纳未知语法的新模型,该模型不仅适用于特定的任务,而且不需要预先指定的模型家族,也不需要平行的语料库,也不需要任何典型的机器学习模型。我们的模型可以使用该语言中具有代表性的正确输入句子以及与该输入相关的词典来生成该语言的语法,所有这些都是相对于一种研究充分的语言的正确语法进行分析的。我们称我们的模型为子宫语法模型(WGM),因为它可以在适当的输入下生成新的语法,就像人类子宫可以生成所有种族一样。*建议的研究:当前的建议侧重于将我们的实验WGM结果巩固为一个成熟的、可执行的语法分析和语法推理的计算语言学理论(长期目标),特别关注语义和语用解释(中期)。我们将通过将这一理论的发展与试验台短期具体应用交织在一起,对语法归纳(特别是在加拿大保持语言多样性)、分析本身和语言实验(作为测试替代语言约束)进行微调。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Dahl, Veronica其他文献
Dahl, Veronica的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Dahl, Veronica', 18)}}的其他基金
Parsing and Grammatical Induction as Constraint Solving
解析和语法归纳作为约束求解
- 批准号:
RGPIN-2018-06736 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Parsing and Grammatical Induction as Constraint Solving
解析和语法归纳作为约束求解
- 批准号:
RGPIN-2018-06736 - 财政年份:2021
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Parsing and Grammatical Induction as Constraint Solving
解析和语法归纳作为约束求解
- 批准号:
RGPIN-2018-06736 - 财政年份:2020
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Parsing and Grammatical Induction as Constraint Solving
解析和语法归纳作为约束求解
- 批准号:
RGPIN-2018-06736 - 财政年份:2018
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Constraint Solving for Language Processing and Bioinformatics
语言处理和生物信息学的约束求解
- 批准号:
2436-2012 - 财政年份:2016
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Constraint Solving for Language Processing and Bioinformatics
语言处理和生物信息学的约束求解
- 批准号:
2436-2012 - 财政年份:2015
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Constraint Solving for Language Processing and Bioinformatics
语言处理和生物信息学的约束求解
- 批准号:
2436-2012 - 财政年份:2014
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Constraint Solving for Language Processing and Bioinformatics
语言处理和生物信息学的约束求解
- 批准号:
2436-2012 - 财政年份:2013
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Constraint Solving for Language Processing and Bioinformatics
语言处理和生物信息学的约束求解
- 批准号:
2436-2012 - 财政年份:2012
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Constraint based and hypothetical reasoning for human and molecular biology languages
人类和分子生物学语言的基于约束和假设的推理
- 批准号:
2436-2006 - 财政年份:2010
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
Investigation on sentential inference bridging between lexical/grammatical knowledge and text comprehension
词汇/语法知识与文本理解之间的句子推理桥接研究
- 批准号:
23K00628 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Postdoctoral Fellowship: SPRF: Neurolinguistic Mechanisms of Grammatical Processing across Different Dialects
博士后奖学金:SPRF:不同方言语法处理的神经语言学机制
- 批准号:
2313956 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Fellowship Award
The role of length of residency in first language grammatical attrition
居住时间长短对母语语法损耗的影响
- 批准号:
2890794 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Studentship
Representing and learning stress: Grammatical constraints and neural networks
表示和学习压力:语法约束和神经网络
- 批准号:
2140826 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Standard Grant
CAREER: Grammatical change and reconstruction
职业:语法变化和重建
- 批准号:
2048220 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Continuing Grant
NSF Postdoctoral Fellowship in Biology FY 2021: Deciphering grammatical rules of notochord enhancers and their role in notochord development in Ciona intestinalis
2021 财年 NSF 生物学博士后奖学金:破译脊索增强子的语法规则及其在海鞘脊索发育中的作用
- 批准号:
2109907 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Fellowship Award
A cross-linguistic approach to constraints on lexical representation and grammatical realization of the concepts of "possession", "participation", and "experience"
一种跨语言方法来限制“占有”、“参与”和“体验”概念的词汇表示和语法实现
- 批准号:
22K00555 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
What effects do online classes have on the acquisition, development and consolidation of grammatical knowledge?
在线课程对语法知识的获取、发展和巩固有何影响?
- 批准号:
22K18476 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Challenging Research (Exploratory)
Parsing and Grammatical Induction as Constraint Solving
解析和语法归纳作为约束求解
- 批准号:
RGPIN-2018-06736 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Effects of language grammatical structure on consumers' cognitive style and advertising evaluation
语言语法结构对消费者认知风格及广告评价的影响
- 批准号:
22K01753 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)