Workshop: Developing collection management tools to create more robust and reliable linguistic data
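Basic information
- Award number: 1648984
- Principal investigator:
- Amount: $108,600
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2016
- Funding country: United States
- Project period: 2016-08-15 to 2023-07-31
- Project status: Completed
- Source:
- Keywords:
Project abstract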
The world's linguistic and cultural diversity is encoded in the approximately 7,000 distinct languages spoken across the world. With many of these languages currently endangered or threatened, the creation of an enduring record of these languages is of paramount importance. Endangered language documentation comprises many elements, including raw audio and video recordings, photographs, transcription files, databases, files containing linguistic analysis and other research details, responses to experimental stimuli, and field observations. Together these files make up a collection of interlinked data for a particular project. For example, recordings go along with their transcriptions, and data in the transcriptions is added to databases. Managing all these kinds of data is necessary before archiving and making the data widely accessible. Researchers in the language sciences manage a large amount of interlinked data prior to depositing it in an archive. However, there are no guidelines for best practices for this type of collection and there are no standard tools for managing the files. As a result, current practices are inefficient and create bottlenecks that delay archiving. This project will use workshops to bring together stakeholders in language documentation, including software developers, to develop standardized software tools to address the hold-ups that have the potential to prevent research products from being properly archived and thus publicly accessible. The workshop series proposed here addresses this obstacle by developing standardized tools for management of linguistic data collections. Such tools will facilitate a more robust and reproducible science of language by providing researchers with standard methods to manage data from the point of collection to the point of archive deposit. The aim is to eliminate the collection management bottleneck and to facilitate greater uptake of language archives.
The workshop series will bring together relevant stakeholders including: field linguists who collect data; theoretical linguists who make use of archival linguistic data; experts in data curation; and software developers. In order to encourage broad participation, the three workshops will be scheduled in conjunction with major gatherings of linguistic researchers, including the Linguistic Society of America annual meeting. The outcome of these workshops will be a sustainable plan for development of a cross-platform, open source collection management tool. By making data more accessible and better described, this tool will facilitate increased reproducibility and accessibility of linguistic research. This greater availability of primary language resources will transform not only various subfields of linguistics, but also related fields such as anthropology and social psychology, which rely on careful management of field data. Further, by taking a stakeholder-driven approach via a series of workshops, the project has the potential to encourage broad adoption of collection management tools by both the language documentation community and by linguists representative of other subdisciplines. In doing so, the project will decrease the barriers to proper description and archiving of a wide variety of linguistic data. Moreover, by improving the dialogue between language documenters, language archivists, linguists and developers, this project will serve as a model for the development of software in linguistics, as well as other social and behavioral sciences.
Project outcomes
Journal articles (1)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Developing collection management tools to create more robust and reliable linguistic data
- DOI:
- Published: 2017
- Journal:
- Impact factor: 0
- Authors: Holton, Gary; Hooshiar, Kavon; Thieberger, Nick
- Corresponding author: Thieberger, Nick
Other publications by Gary Holton
Public access to research data in language documentation: Challenges and possible strategies
- DOI:
- Published: 2019
- Journal:
- Impact factor: 0
- Authors: Mandana Seyfeddinipur; F. Ameka; Lissant M Bolton; J. Blumtritt; B. Carpenter; Hilaria Cruz; Sebastian Drude; Patience Epps; Vera Ferreira; Ana Vilacy Moreira Galúcio; Brigit Hellwig; Oliver Hinte; Gary Holton; Dagmar Jung; Irmgarda Kasinskaite Buddeberg; M. Krifka; S. Kung; Miyuki Monroig; A. N. Neba; S. Nordhoff; B. Pakendorf; Kilu von Prince; F. Rau; K. Rice; Michael Rießler; Vera Szoelloesi Brenig; N. Thieberger; Paul Trilsbeek; H. V. D. Voort; Tonya Woodbury
- Corresponding author: Tonya Woodbury
Indigenous Peoples, Ethics, and Linguistic Data
- DOI:
- Published: 2022
- Journal:
- Impact factor: 0
- Authors: Gary Holton
- Corresponding author: Gary Holton
SPLIT INTRANSITIVITY IN CANTONESE
- DOI:
- Published: 2021
- Journal:
- Impact factor: 0
- Authors: V. Anderson; Andrea L. Berez; R. Blust; K. Deen; K. Drager; Shelece Easterday; Shinichiroh Fukuda; Gary Holton; Bradley McDonnell; William O’Grady; A. Schafer; J. Woodward; Jennifer Sou
- Corresponding author: Jennifer Sou
The rise and fall of semantic alignment in North Halmahera, Indonesia
- DOI:
- Published: 2008
- Journal:
- Impact factor: 0
- Authors: Gary Holton
- Corresponding author: Gary Holton
Other grants held by Gary Holton
Doctoral Dissertation Research: A Multi-Modal Study of Gesture in a Spatial Language
- Award number: 2025315
- Fiscal year: 2020
- Amount: $108,600
- Award type: Standard Grant
Doctoral Dissertation Research: Syntactic Description of a Language with Unique Patterns of Symmetrical Voice Alternations
- Award number: 1926376
- Fiscal year: 2019
- Amount: $108,600
- Award type: Standard Grant
Conference on Minority Language Documentation for Community Language Practitioners
- Award number: 1761223
- Fiscal year: 2018
- Amount: $108,600
- Award type: Standard Grant
The 2019 International Conference on Language Documentation & Conservation: Connecting Languages, Communities, and Technology
- Award number: 1745711
- Fiscal year: 2017
- Amount: $108,600
- Award type: Standard Grant
Completion of Eyak (ISO 693-3 eya) Grammar, Dictionary, Texts
- Award number: 1642783
- Fiscal year: 2016
- Amount: $108,600
- Award type: Continuing Grant
DDRIG: Illiamna Yup'ik Geographic Knowledge and Sense of Place in Southwest Alaska
- Award number: 1640812
- Fiscal year: 2016
- Amount: $108,600
- Award type: Standard Grant
Collaborative Research: Linking Maps, Manuscripts, and Place Names Data to Improve Environmental Knowledge in Alaska
- Award number: 1624365
- Fiscal year: 2015
- Amount: $108,600
- Award type: Continuing Grant
Collaborative Research: Workshop on User-Centered Design of Language Archives
- Award number: 1543828
- Fiscal year: 2015
- Amount: $108,600
- Award type: Standard Grant
Collaborative Research: Linking Maps, Manuscripts, and Place Names Data to Improve Environmental Knowledge in Alaska
- Award number: 1415603
- Fiscal year: 2014
- Amount: $108,600
- Award type: Continuing Grant
Similar overseas grants
Developing Tools to Understand an Alternative Fate of Urate in Neurodegenerative Diseases
- Award number: 10668103
- Fiscal year: 2023
- Amount: $108,600
- Award type:
ACTS (AD Clinical Trial Simulation): Developing Advanced Informatics Approaches for an Alzheimer's Disease Clinical Trial Simulation System
- Award number: 10753675
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Does social motivation in adolescence differentially predict the impact of childhood threat exposure on developing suicidal thoughts and behaviors
- Award number: 10785373
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Home Alone: Developing a Home-Based Intervention for People with Cognitive Impairment Who Live Alone
- Award number: 10590347
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Developing a P4 Medicine Approach to Obstructive Sleep Apnea
- Award number: 10555805
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Developing genetically encodable probes for multimodal tracking of exosomal RNA cargo
- Award number: 10681827
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Futureproofing Health: Developing a Center for Resilient Health in Disasters
- Award number: 10835243
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Developing and testing a multi-component intervention to improve Perinatal Mental Health
- Award number: 10723985
- Fiscal year: 2023
- Amount: $108,600
- Award type:
Assessing inhibitor efficacy in vivo and developing a biomarker for use during early phase clinical trials
- Award number: 10747157
- Fiscal year: 2023
- Amount: $108,600
- Award type: