NNA: Collaborative Research: Integrating Language Documentation and Computational Tools for Yupik, an Alaska Native Language
NNA:协作研究:集成阿拉斯加母语 Yupik 的语言文档和计算工具
基本信息
- 批准号:1760977
- 负责人:
- 金额:$ 12.41万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2018
- 资助国家:美国
- 起止时间:2018-08-01 至 2024-01-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
One locus of crosslinguistic variation in how languages build words is whether meaning is encoded in free morphemes ('units of meaning') that stand alone as words, or whether those morphemes must combine with other morphemes to become words. While English has many free morphemes, the Alaska Native language St. Lawrence Island/Siberian Yupik uses the second strategy with very complex words, often sentence-sized. These properties are known as agglutination and polysynthesis. Researchers will document critical structures in the language, digitize existing Yupik materials, and build computational tools to help the community and other researchers. The data from Yupik are extremely important to language science, since many of the phenomena displayed in the language are rare and not well understood. Creating computational tools for languages with very complex words, like Yupik, is of additional benefit to computer scientists and language scientists in that it helps researchers improve computational tools for languages like English. The Native American Languages Act, passed by the U.S. Congress in 1990, enacted into policy the recognition of the unique status and importance of Native American languages. This project will build and improve tools like a morphological analyzer, a spellchecker, and a searchable dictionary, of value to the community in revitalizing their language. Graduate students will be trained in these methods, and researchers will hold outreach meetings with high school students in the language community to teach them important computer and coding skills that will enable them to build further tools. All data gathered will be permanently archived at the Alaska Native Language Archive.The investigators, a collaboration of language and computer scientists from the University of Illinois at Urbana-Champaign and George Mason University, will undertake this project. It involves three interconnected parts: digitization of existing materials on and in Yupik for use by community members and researchers; recording and analyzing the speech of Yupik speakers; and working with the community to build computer tools for Yupik and teaching students how to do so. A successful computational model of Yupik linguistic phenomena has implications for unsupervised and semi-supervised methods in morphology induction and grammar induction because the types of morphophonological change are pervasive, much more so than models used in other approaches to unsupervised morphology induction. This work is likely to have important implications regarding appropriate computational modeling of polysynthetic agglutinative morphosyntax. Accessing materials at several archives, the team will scan them, and clean and process the scans so they are accessible digitally and searchable. This will create a digital corpus of Yupik materials for use by the community and for linguistic investigations into grammatical mood, tense, and aspect to better understand these complex morphosemantic constructions. The data will also improve the computational tools being developed in this project, providing the Yupik community with access to modern tools like spellcheckers, electronically searchable dictionaries, and electronic books. Finally, in its tight integration of field work and the development of computational tools for the analysis of the language, this project will serve as a model for future collaborations of this kind.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
语言如何造词的跨语言差异之一是,意义是用独立的自由语素(“意义单位”)编码的,还是这些语素必须与其他语素结合才能成为单词。虽然英语有很多自由语素,但阿拉斯加土著语言圣劳伦斯岛/西伯利亚尤皮克语使用第二种策略来处理非常复杂的单词,通常是句子大小的。这些性质被称为凝集和多合成。研究人员将记录语言中的关键结构,将现有的尤皮克材料数字化,并建立计算工具以帮助社区和其他研究人员。来自Yupik的数据对语言科学非常重要,因为语言中显示的许多现象是罕见的,而且还没有被很好地理解。为像Yupik这样包含非常复杂单词的语言创建计算工具,对计算机科学家和语言科学家来说是额外的好处,因为它可以帮助研究人员改进像英语这样的语言的计算工具。美国国会于1990年通过了《美洲原住民语言法案》,将承认美洲原住民语言的独特地位和重要性制定为政策。这个项目将建立和改进一些工具,如词法分析器、拼写检查器和可搜索字典,这些工具对社区重振他们的语言很有价值。研究生将接受这些方法的培训,研究人员将与语言社区的高中生举行拓展会议,向他们传授重要的计算机和编码技能,使他们能够开发更多的工具。所有收集到的数据将永久保存在阿拉斯加本土语言档案馆。由伊利诺伊大学厄巴纳-香槟分校和乔治梅森大学的语言和计算机科学家组成的研究小组将承担这个项目。它涉及三个相互关联的部分:将Yupik上和Yupik中的现有材料数字化,供社区成员和研究人员使用;对尤皮克语使用者的语音进行记录和分析;并与社区合作为Yupik建立计算机工具,并教学生如何做到这一点。Yupik语言现象的成功计算模型对形态学归纳和语法归纳中的无监督和半监督方法具有启示意义,因为词音变化的类型普遍存在,比其他无监督形态学归纳方法中使用的模型要广泛得多。这项工作可能对适当的多合成粘合形态语法的计算建模具有重要意义。访问几个档案馆的材料,团队将对它们进行扫描,并对扫描进行清理和处理,以便以数字方式访问和搜索。这将创建一个尤皮克语材料的数字语料库,供社区使用,并用于语法语气、时态和语体的语言研究,以更好地理解这些复杂的语素结构。这些数据还将改进这个项目中正在开发的计算工具,为Yupik社区提供现代工具,如拼写检查器、电子搜索词典和电子书。最后,该项目将实地工作与语言分析计算工具的开发紧密结合起来,将成为未来此类合作的典范。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(8)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Semantic fieldwork from a distance with speakers of Akuzipik
与 Akuzipik 使用者进行远距离语义实地考察
- DOI:
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Schreiner, Sylvia L.R.;Hunt, Benjamin;Chen, Emily;Haas, Preston;Aningayou, Ukaall Crystal
- 通讯作者:Aningayou, Ukaall Crystal
Akuzipik/Yupik (St. Lawrence Island, Alaska, USA; Chukotka, Russia) - Language Snapshot
Akuzipik/Yupik(美国阿拉斯加州圣劳伦斯岛;俄罗斯楚科奇半岛)- 语言快照
- DOI:10.25895/ldd43
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Koonooka, Christopher Petuwaq;Schreiner, Sylvia L.R.;Soldati, Giulia Masella;Schwartz, Lane;Hunt, Benjamin;Haas, Preston;Chen, Emily;Park, Hyunji Hayley
- 通讯作者:Park, Hyunji Hayley
Bootstrapping a Neural Morphological Analyzer for St. Lawrence Island Yupik from a Finite-State Transducer
从有限状态传感器引导圣劳伦斯岛 Yupik 的神经形态分析器
- DOI:
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Schwartz, Lane;Chen, Emily;Hunt, Benjamin;Schreiner, Sylvia L.R.
- 通讯作者:Schreiner, Sylvia L.R.
A Digital Corpus of St. Lawrence Island Yupik
- DOI:10.33011/computel.v2i.985
- 发表时间:2021-01
- 期刊:
- 影响因子:0
- 作者:Lane Schwartz;Emily Chen;Hyunji Hayley Park;Edward Jahn;Sylvia L. R. Schreiner
- 通讯作者:Lane Schwartz;Emily Chen;Hyunji Hayley Park;Edward Jahn;Sylvia L. R. Schreiner
Multidirectional leveraging for computational morphology and language documentation and revitalization
计算形态学和语言文档及振兴的多向利用
- DOI:
- 发表时间:2020
- 期刊:
- 影响因子:1.8
- 作者:Schreiner, Sylvia L.R.;Schwartz, Lane;Hunt, Benjamin;Chen, Emily
- 通讯作者:Chen, Emily
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Sylvia Schreiner其他文献
Sylvia Schreiner的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Sylvia Schreiner', 18)}}的其他基金
CAREER: Documenting temporal contrasts in an endangered language via community linguistics
职业:通过社区语言学记录濒危语言的时间对比
- 批准号:
2142340 - 财政年份:2022
- 资助金额:
$ 12.41万 - 项目类别:
Continuing Grant
相似海外基金
Collaborative Research: NNA Research: Electric Vehicles in the Arctic (EVITA) - Interactions with Cold Weather, Microgrids, People, and Policy
合作研究:NNA 研究:北极电动汽车 (EVITA) - 与寒冷天气、微电网、人员和政策的相互作用
- 批准号:
2318385 - 财政年份:2024
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
Collaborative Research: NNA Research: Electric Vehicles in the Arctic (EVITA) - Interactions with Cold Weather, Microgrids, People, and Policy
合作研究:NNA 研究:北极电动汽车 (EVITA) - 与寒冷天气、微电网、人员和政策的相互作用
- 批准号:
2318384 - 财政年份:2024
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
NNA Incubator: Collaborative Research: Indigenous-led Strategies for Co-Productive and Convergent Arctic Research
NNA 孵化器:合作研究:土著主导的北极研究协同生产和融合策略
- 批准号:
2318276 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
NNA Collaboratory: Collaborative Research: ACTION - Alaska Coastal Cooperative for Co-producing Transformative Ideas and Opportunities in the North
NNA 合作实验室:合作研究:行动 - 阿拉斯加沿海合作社,共同在北部产生变革性的想法和机遇
- 批准号:
2318377 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Cooperative Agreement
NNA Collaboratory: Collaborative Research: ACTION - Alaska Coastal Cooperative for Co-producing Transformative Ideas and Opportunities in the North
NNA 合作实验室:合作研究:行动 - 阿拉斯加沿海合作社,共同在北部产生变革性的想法和机遇
- 批准号:
2318375 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Cooperative Agreement
NNA Research: Collaborative Research: Socio-Ecological Systems Transformation in River basins of the sub-Arctic under climate change (SESTRA)
NNA 研究:合作研究:气候变化下亚北极河流流域的社会生态系统转型 (SESTRA)
- 批准号:
2318383 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
NNA Research: Collaborative Research: Arctic, Climate, and Earthquakes (ACE): Seismic Resilience and Adaptation of Arctic Infrastructure and Social Systems amid Changing Climate
NNA 研究:合作研究:北极、气候和地震 (ACE):气候变化中北极基础设施和社会系统的抗震能力和适应
- 批准号:
2220221 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
NNA Research: Collaborative Research: Towards resilient water infrastructure in Alaska Native communities through knowledge co-production
NNA 研究:合作研究:通过知识共同生产为阿拉斯加原住民社区打造具有复原力的水基础设施
- 批准号:
2220518 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
NNA Research: Collaborative Research: Towards resilient water infrastructure in Alaska Native communities through knowledge co-production
NNA 研究:合作研究:通过知识共同生产为阿拉斯加原住民社区打造具有复原力的水基础设施
- 批准号:
2220516 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant
NNA Research: Collaborative Research: Towards resilient water infrastructure in Alaska Native communities through knowledge co-production
NNA 研究:合作研究:通过知识共同生产为阿拉斯加原住民社区打造具有复原力的水基础设施
- 批准号:
2220517 - 财政年份:2023
- 资助金额:
$ 12.41万 - 项目类别:
Standard Grant