SGER: Beyond the Core: A Pilot Project on Cataloguing Grammatical Constructions and Multiword Expressions in English.
SGER:超越核心:英语语法结构和多词表达编目试点项目。
基本信息
- 批准号:0739426
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2007
- 资助国家:美国
- 起止时间:2007-09-15 至 2009-02-28
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Most word-centered linguistic annotations of texts proceed by identifying keywords and labeling the phrases around them that show their roles in the meaning structures evoked by the keywords. This procedure misses most idioms (took a turn for the worse) and irregular grammatical patterns (only then would she agree to it). The "Beyond the Core" project is exploring ways of augmenting such annotations with layered representations of multiword units and "non-core" grammatical constructions present in such texts. Toward this end, using FrameNet annotation tools, researchers are finding non-core structures in texts and labeling the phrases in a way that shows how they satisfy formal and semantic constraints dictated by the individual constructions. The "Constructicon", where such information is archived, links each construction with annotated sentences that exemplify it.Although there is a strong interest in non-core structures in the Computational Linguistics community, researchers don't know how many there are, how important they are in NLP applications, how frequent they are in texts of different kinds, or whether the skills that enable trained linguists to recognize them can be reliably communicated to time-pressured annotators. This empirical study is providing that missing information.The Constructicon and the full body of annotations will be made available to researchers via the FrameNet website, in both human-browsable and machine-readable form. The data will provide rich material for research on parsing, language understanding, and compositional semantics, and may possibly serve as a training corpus for machine-learning methods of detecting known non-core constructions in raw text.
大多数以词为中心的文本语言注释都是通过识别关键字并标记它们周围的短语来进行的,这些短语表明了它们在由关键字引起的意义结构中的角色。这个过程遗漏了大多数习语(变坏了)和不规则的语法模式(只有这样她才会同意)。“超越核心”项目正在探索用这种文本中存在的多个单词单元和“非核心”语法结构的分层表示来扩充这种注释的方法。为此,使用FrameNet注释工具,研究人员正在寻找文本中的非核心结构,并以一种显示它们如何满足个别结构所规定的形式和语义约束的方式对短语进行标记。尽管计算语言学界对非核心结构很感兴趣,但研究人员不知道非核心结构有多少,它们在自然语言处理应用中有多重要,它们在不同类型的文本中出现的频率有多高,也不知道让训练有素的语言学家识别它们的技能是否能可靠地传达给时间紧迫的注释者。这项实证研究提供了缺失的信息。结构和全文注释将通过FrameNet网站以人类可浏览和机器可读的形式提供给研究人员。这些数据将为句法分析、语言理解和成分语义的研究提供丰富的材料,并可能作为机器学习方法的训练语料库,以检测原始文本中已知的非核心结构。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Charles Fillmore其他文献
Charles Fillmore的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Charles Fillmore', 18)}}的其他基金
ITR: Framenet++: An On-Line Lexical Semantic Resource and its Application to Speech and Language Understanding
ITR:框架网:在线词汇语义资源及其在语音和语言理解中的应用
- 批准号:
0086132 - 财政年份:2000
- 资助金额:
-- - 项目类别:
Continuing Grant
STIMULATE: Tools for Lexicon Building
STIMULATE:词典构建工具
- 批准号:
9618838 - 财政年份:1997
- 资助金额:
-- - 项目类别:
Continuing Grant
相似海外基金
Collaborative Research: NSF-AoF: CNS Core: Small: Towards Scalable and Al-based Solutions for Beyond-5G Radio Access Networks
合作研究:NSF-AoF:CNS 核心:小型:面向超 5G 无线接入网络的可扩展和基于人工智能的解决方案
- 批准号:
2225578 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant
Exploration of physics beyond the Standard Model through core-collapse supernovae
通过核心塌陷超新星探索标准模型之外的物理学
- 批准号:
23K13097 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Early-Career Scientists
Collaborative Research: SaTC: CORE: Medium: Beyond App-centric Privacy: Investigating Privacy Ecosystems among Vulnerable Populations
协作研究:SaTC:核心:媒介:超越以应用程序为中心的隐私:调查弱势群体的隐私生态系统
- 批准号:
2309275 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant
Collaborative Research: SaTC: CORE: Medium: Beyond App-centric Privacy: Investigating Privacy Ecosystems among Vulnerable Populations
协作研究:SaTC:核心:媒介:超越以应用程序为中心的隐私:调查弱势群体的隐私生态系统
- 批准号:
2309278 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant
Beyond 1D Structure of Earth's Core - Reconciling Inferences from Seismic and Geomagnetic Observations
超越地核的一维结构 - 协调地震和地磁观测的推论
- 批准号:
NE/W005247/1 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Research Grant
Fibrosis Beyond the Core: A New Application of MRI to Noninvasively Quantify Whole Kidney Fibrosis
超越核心的纤维化:MRI 无创量化全肾纤维化的新应用
- 批准号:
10796499 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Collaborative Research: SaTC: CORE: Medium: Beyond App-centric Privacy: Investigating Privacy Ecosystems among Vulnerable Populations
协作研究:SaTC:核心:媒介:超越以应用程序为中心的隐私:调查弱势群体的隐私生态系统
- 批准号:
2309277 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant
Collaborative Research: SaTC: CORE: Medium: Beyond App-centric Privacy: Investigating Privacy Ecosystems among Vulnerable Populations
协作研究:SaTC:核心:媒介:超越以应用程序为中心的隐私:调查弱势群体的隐私生态系统
- 批准号:
2309276 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant
CNS Core: Small: Seamless Coexistence of Positioning and Communication in 5G and Beyond Wireless Systems
CNS 核心:小型:5G 及其他无线系统中定位和通信的无缝共存
- 批准号:
2208761 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant
Collaborative Research: NSF-AoF: CNS Core: Small: Towards Scalable and Al-based Solutions for Beyond-5G Radio Access Networks
合作研究:NSF-AoF:CNS 核心:小型:面向超 5G 无线接入网络的可扩展和基于人工智能的解决方案
- 批准号:
2225577 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Standard Grant