CI-P: Planning for Scalable Language Resource Creation through Novel Incentives and Crowdsourcing
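Basic Information
- Grant number: 1629923
- Principal investigator:
- Amount: $99,800
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2016
- Funding country: United States
- Duration: 2016-06-01 to 2018-04-30
- Project status: Completed
- Source:
- Keywords:
Project Abstract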
Advances in human language technologies enable systems that, for example, obey natural language commands and respond in kind, translate among many language pairs, and summarize multilingual news. However, the technology's potential remains largely untapped because the linguistic resources that fuel development still fall far short of need. This community infrastructure planning (CI-P) initiative begins the process of building infrastructure to continuously develop high-quality language resources by employing techniques proven to work in multiple scientific disciplines. Social media, crowdsourcing, games with a purpose, and citizen science show that human resources are effectively limitless for some activities. By offering human contributors appropriate opportunities and incentives, this project enhances language resource development well beyond what direct funding alone can produce. By removing constraints on participation and designing activities to appeal to multiple communities, the project creates educational opportunities for the public, including students and under-represented groups. The increase in scale and diversity of data also benefits those working in language-related research, education, and technology development. The availability of an ever-growing body of resources for an expanding range of languages will permit developers to supply technologies to a greater proportion of the world.
This project is the first step in the creation of infrastructure capable of high-volume, continuous collection of language data and judgments through ubiquity, perseverance, comprehensive annotation, automated training and certification, appropriate incentives, task engineering, and variants of crowdsourcing. Building upon the Linguistic Data Consortium's WebAnn framework, virtual front-end web servers provide multiple interfaces to incentivize and engineer linguistic data contributions from targeted groups: linguists, citizen scientists, game players, and students. Collection and annotation activities are analyzed into component tasks according to the skills they require and are assigned as appropriate to different workforces using different workflows. The combination of customized interfaces and novel incentive strategies enables ongoing, scalable data collection and annotation, resulting in diverse language resources available to the wider Computer and Information Science and Engineering research and education communities.
Project Outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants by Christopher Cieri
Workshop on Sociolinguistic Archive Preparation
- Grant number: 1144480
- Fiscal year: 2011
- Funding amount: $99,800
- Project type: Standard Grant
CRI:CRD Collaborative Research: General Techniques for Creating Treebanks with Multiple Representations: A Large-Scale Russian Application
- Grant number: 0708276
- Fiscal year: 2007
- Funding amount: $99,800
- Project type: Standard Grant
Similar Overseas Grants
Collaborative Research: Scalable Circuit theoretic Framework for Large Grid Simulations and Optimizations: from Combined T&D Planning to Electromagnetic Transients
- Grant number: 2330195
- Fiscal year: 2024
- Funding amount: $99,800
- Project type: Standard Grant
Collaborative Research: Scalable Circuit theoretic Framework for Large Grid Simulations and Optimizations: from Combined T&D Planning to Electromagnetic Transients
- Grant number: 2330196
- Fiscal year: 2024
- Funding amount: $99,800
- Project type: Standard Grant
Developing a Scalable FASD-Informed Person-Centered Planning Intervention
- Grant number: 10644186
- Fiscal year: 2023
- Funding amount: $99,800
- Project type:
PreSize Net medical device software for realistic surgery planning: next-generation scalable technology for selecting the best surgical scenario for every patient
- Grant number: 10055877
- Fiscal year: 2023
- Funding amount: $99,800
- Project type: Collaborative R&D
Collaborative Research: PPoSS: Planning: Software Stack for Scalable Heterogeneous NISQ Cluster
- Grant number: 2216923
- Fiscal year: 2022
- Funding amount: $99,800
- Project type: Standard Grant
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
- Grant number: 2217028
- Fiscal year: 2022
- Funding amount: $99,800
- Project type: Standard Grant
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
- Grant number: 2217086
- Fiscal year: 2022
- Funding amount: $99,800
- Project type: Standard Grant
mHealth for suicide prevention: Design, development, and feasibility of a scalable SMS-based safety planning intervention
- Grant number: 10654851
- Fiscal year: 2022
- Funding amount: $99,800
- Project type:
mHealth for suicide prevention: Design, development, and feasibility of a scalable SMS-based safety planning intervention
- Grant number: 10524928
- Fiscal year: 2022
- Funding amount: $99,800
- Project type:
Collaborative Research: PPoSS: Planning: Software Stack for Scalable Heterogeneous NISQ Cluster
- Grant number: 2217021
- Fiscal year: 2022
- Funding amount: $99,800
- Project type: Standard Grant