RR: EAGER: Data Science Literacy for All of Linguistics
RR:EAGER:所有语言学的数据科学素养
基本信息
- 批准号:1745249
- 负责人:
- 金额:$ 15.1万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2017
- 资助国家:美国
- 起止时间:2017-09-01 至 2022-02-28
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Like other social and behavioral sciences, linguistic science is inherently data-driven, and the proper care for that data is essential if linguistics is to be a robust, reliable, and reproducible endeavor well into the future. However, many linguists today are still unfamiliar with contemporary practices for handling digital data, and recent work has identified an urgent need for immediate education across most subfields of linguistics about standards and tools for collecting, structuring, archiving, sharing, citing and evaluating linguistic data sets. While some linguistics subfields (language documentation, computational linguistics) have developed strong methods for data handling, outreach to the rest of the discipline has been deficient. This project will radically and rapidly increase the literacy of linguistic scientists at all professional levels in the management of linguistic data, from undergraduate education, to graduate, early- and mid-career training and beyond. This project aims to foster sociological change across the entire discipline of linguistics, and to bring the value of data-handling skills to the forefront of linguistics education, enabling workforce development and potential employment opportunities. Broader impacts include research opportunities in the data sciences for a post-doctoral scholar, downloadable materials with guidelines and metrics for evaluating the scholarship of language data sets for hiring, tenure and promotion, and the development and delivery of formal and informal educational modules on linguistic data management. This project is designed to enable linguistics, as a data-driven social science in which inferences about human cognition and social structure are drawn from observations of behavior, is well positioned to benefit from principles of reproducible research. Currently, there is a disparity between how much digital data is produced by scientists and how much of that data has actually been deposited or made accessible through sustainable repositories or other means. The team will reduce this disparity by increasing knowledge and resources within the language sciences as part of efforts to change the discipline's culture by reducing structural and knowledge barriers. These efforts include increasing resources and rewards (such as jobs, tenure, and promotion) for accessible data management practices. Project activities include the rapid development of educational modules at several levels: a massive open online course (MOOC) aimed at undergraduate linguistics majors, and workshops for graduate students and faculty delivered at several widely-attended professional meetings over two years. This will provide a number of offerings and educational modules designed to foster best practices in data handling and data science for reproducible research in linguistics, targeting both junior scholars and mid-career faculty.The project will also disseminate training materials widely throughout the academic linguistic community, through an open-access handbook, online formats, and conference formats. This project takes seriously the contribution of data work as an intellectual achievement in its own right, and will promote methods for increasing the ability and willingness of linguists to effectively create, manage, preserve, curate, and share linguistic data.
像其他社会和行为科学一样,语言科学本质上是数据驱动的,如果语言学要成为一个强大的,可靠的和可复制的奋进,对这些数据的适当照顾是必不可少的。然而,今天的许多语言学家仍然不熟悉处理数字数据的当代实践,最近的工作已经确定了迫切需要立即在语言学的大多数子领域进行关于收集,结构化,归档,共享,引用和评估语言数据集的标准和工具的教育。虽然一些语言学子领域(语言文献,计算语言学)已经开发出强大的数据处理方法,但对其他学科的推广一直不足。该项目将从根本上迅速提高语言科学家在语言数据管理方面的所有专业水平的素养,从本科教育到研究生教育,早期和中期职业培训及以后。该项目旨在促进整个语言学学科的社会学变革,并将数据处理技能的价值带到语言学教育的最前沿,使劳动力发展和潜在的就业机会成为可能。更广泛的影响包括博士后学者在数据科学方面的研究机会,可下载的材料,用于评估语言数据集的奖学金,用于招聘,任期和晋升的指导方针和指标,以及语言数据管理的正式和非正式教育模块的开发和交付。 该项目旨在使语言学,作为一种数据驱动的社会科学,其中关于人类认知和社会结构的推断是从行为观察中得出的,能够很好地从可重复研究的原则中受益。 目前,科学家制作的数字数据与通过可持续存储库或其他手段实际存储或提供的数据之间存在差距。 该团队将通过增加语言科学的知识和资源来减少这种差距,作为通过减少结构和知识障碍来改变学科文化的努力的一部分。 这些努力包括为可访问的数据管理实践增加资源和奖励(例如工作、任期和晋升)。项目活动包括在几个层面上快速开发教育模块:针对本科语言学专业的大规模开放式在线课程(MOOC),以及两年多来在几次广泛参加的专业会议上为研究生和教师举办的研讨会。该项目将提供一系列课程和教育模块,旨在培养数据处理和数据科学方面的最佳实践,以促进语言学的可复制研究,针对初级学者和在职教师。该项目还将通过开放获取手册、在线格式和会议形式,在整个学术语言学界广泛传播培训材料。该项目重视数据工作本身作为智力成果的贡献,并将促进提高语言学家有效创建、管理、保存、策划和共享语言数据的能力和意愿的方法。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Andrea Berez-Kroeker其他文献
Andrea Berez-Kroeker的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Andrea Berez-Kroeker', 18)}}的其他基金
Doctoral Dissertation Research: Integration of Quantitative and Documentary Methodologies in the Analysis of a Segmentally-Rich Language
博士论文研究:定量和文献方法论在分析分段丰富语言中的整合
- 批准号:
1840668 - 财政年份:2019
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
Doctoral Dissertation Research: Child and Child-Directed Expression of Possession in a Polysynthetic Language
博士论文研究:多合成语言中儿童和儿童导向的占有表达
- 批准号:
1912062 - 财政年份:2019
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
Vital Voices: Linking Language and Wellbeing at the International Conference on Language Documentation and Conservation
重要声音:国际语言文献与保护会议上将语言与福祉联系起来
- 批准号:
1614134 - 财政年份:2016
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
WORKSHOP: Enriching Theory, Practice, and Application: Classes and Special Sessions at the 4th International Conference on Language Documentation & Conservation
研讨会:丰富理论、实践和应用:第四届国际语言文献会议的课程和特别会议
- 批准号:
1405434 - 财政年份:2014
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
Developing Standards for Data Citation and Attribution for Reproducible Research in Linguistics
为语言学研究的可重复性研究制定数据引用和归因标准
- 批准号:
1447886 - 财政年份:2014
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
Workshop Proposal: Master Class Series at the 3rd International Conference on Language Documentation and Conservation
研讨会提案:第三届国际语言文献与保护会议大师班系列
- 批准号:
1209489 - 财政年份:2012
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
相似海外基金
Collaborative Research: EAGER: IMPRESS-U: Groundwater Resilience Assessment through iNtegrated Data Exploration for Ukraine (GRANDE-U)
合作研究:EAGER:IMPRESS-U:通过乌克兰综合数据探索进行地下水恢复力评估 (GRANDE-U)
- 批准号:
2409395 - 财政年份:2024
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
EAGER: Integrating Pathological Image and Biomedical Text Data for Clinical Outcome Prediction
EAGER:整合病理图像和生物医学文本数据进行临床结果预测
- 批准号:
2412195 - 财政年份:2024
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
EAGER: Algorithms for Analyzing Faulty Data Using Domain Information
EAGER:使用域信息分析错误数据的算法
- 批准号:
2414736 - 财政年份:2024
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: IMPRESS-U: Groundwater Resilience Assessment through iNtegrated Data Exploration for Ukraine (GRANDE-U)
合作研究:EAGER:IMPRESS-U:通过乌克兰综合数据探索进行地下水恢复力评估 (GRANDE-U)
- 批准号:
2409396 - 财政年份:2024
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
EAGER: IMPRESS-U: Modeling and Forecasting of Infection Spread in War and Post War Settings Using Epidemiological, Behavioral and Genomic Surveillance Data
EAGER:IMPRESS-U:使用流行病学、行为和基因组监测数据对战争和战后环境中的感染传播进行建模和预测
- 批准号:
2412914 - 财政年份:2024
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
EAGER: Development of a Hybrid Knowledge- and Data-Driven Approach to Guide the Design of Immunotherapeutic Cells
EAGER:开发混合知识和数据驱动的方法来指导免疫治疗细胞的设计
- 批准号:
2324742 - 财政年份:2023
- 资助金额:
$ 15.1万 - 项目类别:
Continuing Grant
EAGER: Secure Research Impact Metric Data Exchange: Data Supply Chain and Vocabulary Development
EAGER:安全研究影响指标数据交换:数据供应链和词汇开发
- 批准号:
2335827 - 财政年份:2023
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
EAGER: SMART-DMSP: Streamlining Metadata, Automation, and Research Tracking for Data Management and Sharing Plans
EAGER:SMART-DMSP:简化数据管理和共享计划的元数据、自动化和研究跟踪
- 批准号:
2332353 - 财政年份:2023
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
EAGER: Building a Provable Differentially Private Real-time Data-blind ML Algorithm: A case study on Enhancing STEM Student Engagement in Online Learning
EAGER:构建可证明的差分隐私实时数据盲机器学习算法:关于增强 STEM 学生在线学习参与度的案例研究
- 批准号:
2329919 - 财政年份:2023
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant
COLLABORATIVE RESEARCH: EAGER: Towards Building a CyberInfrastructure for Facilitating the Assessment, Dissemination, Discovery, & Reuse of Software and Data Products
合作研究:渴望:建立网络基础设施以促进评估、传播、发现、
- 批准号:
2314202 - 财政年份:2023
- 资助金额:
$ 15.1万 - 项目类别:
Standard Grant