CAREER: Metalinguistic Natural Language Understanding
职业:元语言自然语言理解
基本信息
- 批准号:2144881
- 负责人:
- 金额:$ 54.99万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-07-01 至 2027-06-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
How people use language is a topic of study in a variety of fields, including linguistics, literature, law, andlanguage education. Extensive descriptions about many languages already exist in natural language itself(e.g., grammar textbooks, writing advice, and linguistics articles). Descriptions in (for example) Englishmay include example sentences supplemented with technical terminology and formal notation. Thisproject will develop natural language processing (NLP) algorithms to process and mine these textualresources so that language analysts can better detect and synthesize patterns of interest. First, algorithmswill be developed to recognize where a piece of text is commenting on the meaning of a word, or givingan example of how it could be used. Second, algorithms for enriching text with technical descriptions willbe improved to report better estimates of their own strengths and weaknesses. Finally, capabilities forretrieving specific uses of an ambiguous word in a large text collection will be developed to aid analysts.The algorithmic contributions in this project are expected to have direct application to technologies invarious fields where close analysis of language in text collections is crucial, including law and linguistics.On a wider scale, these capabilities have the potential to be transformative for artificial intelligence (AI),allowing humans and machines to teach each other explicitly about how language works, to deftly accessscholarly work about language, and to give and interpret language advice (e.g., writing assistance).This project develops algorithms and tasks with an eye toward technologies that would enable humans tomore efficiently and accurately conduct metalinguistic inquiries about text. Key challenges to beaddressed are: (1) Detecting textual metalanguage: This project will formulate tasks and algorithms torecognize metalinguistic descriptions (such as the use/mention distinction, definitions, linguisticexamples) in text, focusing on three genres where they are abundant: law, language discussion forums,and linguistics. A new benchmark dataset and shared task will be developed to compare metalinguistictaggers. (2) Improving model confidence calibration, focusing on taggers with long-tail tagsets. Betterprobability estimates will enable analysts to make informed decisions about how to balance automatic andmanual processing and can anticipate rates of different types of errors. (3) Query-by-example algorithmswill be developed for retrieving specific usages of an ambiguous word or phrase from a large textcollection. Tools leveraging such algorithms would open the way to new kinds of corpus-basedinvestigations by linguists, lexicographers, language teachers, and literary scholars.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
人们如何使用语言是语言学、文学、法律和语言教育等多个领域研究的课题。关于许多语言的广泛描述已经存在于自然语言本身中(例如,语法教科书、写作建议和语言学文章)。(例如)英语中的描述可以包括补充有技术术语和正式符号的例句。该项目将开发自然语言处理(NLP)算法来处理和挖掘这些文本资源,以便语言分析师能够更好地检测和合成感兴趣的模式。首先,将开发算法来识别一段文字在哪里评论一个词的含义,或者给出一个如何使用它的例子。第二,用技术描述丰富文本的算法将得到改进,以更好地估计其自身的优势和劣势。最后,将开发在大型文本集合中检索模糊词的特定用途的能力,以帮助分析人员。该项目中的算法贡献预计将直接应用于对文本集合中的语言进行密切分析至关重要的各个领域的技术,包括法律和语言学。在更广泛的范围内,这些能力有可能成为人工智能(AI)的变革,允许人类和机器明确地相互教导语言如何工作,熟练地访问关于语言的学术著作,以及给出和解释语言建议(例如,这个项目开发的算法和任务着眼于使人类能够更有效、更准确地对文本进行元语言查询的技术。要解决的关键挑战是:(1)检测文本元语言:本项目将制定任务和算法来识别文本中的元语言描述(如使用/提及区分,定义,语言示例),重点关注三种类型,它们丰富:法律,语言论坛和语言学。将开发一个新的基准数据集和共享任务来比较元语言标签。(2)改进模型置信度校准,重点关注长尾标签集的标签器。更好的概率估计将使分析师能够就如何平衡自动和手动处理做出明智的决定,并可以预测不同类型错误的发生率。(3)通过实例查询算法将被开发用于从大型文本集合中检索模糊单词或短语的特定用法。利用这种算法的工具将为语言学家、词典编纂者、语言教师和文学学者开辟新的基于语料库的研究方式。该奖项反映了NSF的法定使命,并被认为值得通过使用基金会的智力价值和更广泛的影响审查标准进行评估来支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Nathan Schneider其他文献
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus
社交网络语料库中多词表达的综合注释
- DOI:
- 发表时间:
2014 - 期刊:
- 影响因子:0
- 作者:
Nathan Schneider;Spencer Onuffer;Nora Kazour;Emily Danchik;Michael T. Mordowanec;H. Conrad;Noah A. Smith - 通讯作者:
Noah A. Smith
BERT Has Uncommon Sense: Similarity Ranking for Word Sense BERTology
BERT 具有不寻常的意义:词义相似度排序 BERTology
- DOI:
- 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
Luke Gessler;Nathan Schneider - 通讯作者:
Nathan Schneider
Thank You, Anarchy: Notes from the Occupy Apocalypse
谢谢你,无政府状态:占领启示录的笔记
- DOI:
- 发表时间:
2013 - 期刊:
- 影响因子:0
- 作者:
Nathan Schneider;Rebecca Solnit - 通讯作者:
Rebecca Solnit
SOFIA/FORCAST OBSERVATIONS OF WARM DUST IN S106: A FRAGMENTED ENVIRONMENT
索菲亚/预测 S106 中温暖尘埃的观测:支离破碎的环境
- DOI:
10.1088/0004-637x/814/1/54 - 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
J. Adams;J. Adams;T. Herter;J. Hora;Nathan Schneider;R. Lau;J. Staguhn;J. Staguhn;R. Simon;Nathan Smith;R. Gehrz;Lori Allen;S. Bontemps;S. Carey;Giovanni G. Fazio;R. Gutermuth;A. Fernandez;M. Hankins;T. Hill;E. Keto;X. Koenig;K. Kraemer;S. Megeath;D. Mizuno;F. Motte;P. Myers;Howard A. Smith - 通讯作者:
Howard A. Smith
The IRAM M 33 CO(2–1) survey - A complete census of molecular gas out to 7 kpc
IRAM M 33 CO(2–1) 调查 - 分子气体的完整普查,直至 7 kpc
- DOI:
- 发表时间:
2014 - 期刊:
- 影响因子:0
- 作者:
C. Druard;J. Braine;K. Schuster;Nathan Schneider;P. Gratier;S. Bontemps;M. Boquien;F. Combes;E. Corbelli;Christian Henkel;Christian Henkel;F. Herpin;C. Kramer;F. V. D. Tak;F. V. D. Tak;P. Werf - 通讯作者:
P. Werf
Nathan Schneider的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Nathan Schneider', 18)}}的其他基金
Collaborative Research: DASS: Transitioning open-source software projects to accountable community governance
合作研究:DASS:将开源软件项目转变为负责任的社区治理
- 批准号:
2217654 - 财政年份:2022
- 资助金额:
$ 54.99万 - 项目类别:
Standard Grant
NSF-BSF: RI: Small: Collaborative Research: Modeling Crosslinguistic Influences Between Language Varieties
NSF-BSF:RI:小型:协作研究:模拟语言品种之间的跨语言影响
- 批准号:
1812778 - 财政年份:2018
- 资助金额:
$ 54.99万 - 项目类别:
Continuing Grant
相似海外基金
A quantitative analysis research on the correlation between the embodiment principles and the development of metalinguistic ability in foreign language acquisition
外语习得中体现原则与元语言能力发展相关性的定量分析研究
- 批准号:
23K00737 - 财政年份:2023
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
An Empirical Study on the Coordination of Native Language Education and English Language Education Based on the Development of Metalinguistic Abilities
基于元语言能力发展的母语教育与英语教育协调的实证研究
- 批准号:
22K00776 - 财政年份:2022
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A Metalinguistic Theory of Metafictional Discourse
元小说话语的元语言理论
- 批准号:
2606671 - 财政年份:2021
- 资助金额:
$ 54.99万 - 项目类别:
Studentship
A study of metalinguistic corrective feedback to improve grammatical accuracy
提高语法准确性的元语言纠正反馈研究
- 批准号:
21K19986 - 财政年份:2021
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Research Activity Start-up
Metalinguistic modelling of writing: re-framing classroom talk about writing
写作的元语言建模:重新构建关于写作的课堂讨论
- 批准号:
2096450 - 财政年份:2018
- 资助金额:
$ 54.99万 - 项目类别:
Studentship
Development of a language production-oriented TILT teaching method to cultivate metalinguistic abilities
开发以语言产出为导向的 TILT 教学法培养元语言能力
- 批准号:
18K00895 - 财政年份:2018
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A theoretical and empirical study of the relationship between the effectiveness of explicit instruction and learners' metalinguistic competence
显性教学有效性与学习者元语言能力关系的理论与实证研究
- 批准号:
17K13514 - 财政年份:2017
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
Comparative study on metalinguistic abilities between Japanese/Engish bilingual and Japanese monolingual kindergardeners: an fNIRS study
日语/英语双语和日语单语幼儿园儿童元语言能力的比较研究:一项 fNIRS 研究
- 批准号:
16K13225 - 财政年份:2016
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Metalinguistic Functions in Doctor-Patient Interactions
医患互动中的元语言功能
- 批准号:
16K02687 - 财政年份:2016
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Mapping of phonetic cues and phonological categories: the role of metalinguistic knowledge
语音线索和音系类别的映射:元语言知识的作用
- 批准号:
26370711 - 财政年份:2014
- 资助金额:
$ 54.99万 - 项目类别:
Grant-in-Aid for Scientific Research (C)