CAREER: Information Engineering and Synthesis for Resource-poor Languages
职业:资源匮乏语言的信息工程和综合
基本信息
- 批准号:0748919
- 负责人:
- 金额:$ 50万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2008
- 资助国家:美国
- 起止时间:2008-06-15 至 2017-05-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
For the majority of the world's languages, the amount of linguistic resources (e.g., annotated corpora and parallel data) is very limited. Consequently, supervised methods and many unsupervised methods cannot be applied directly, leaving these languages largely untouched and unnoticed. Another crucial issue, which has received little attention from the natural language processing (NLP) community, is that to date there have been very few studies that examine a large number of languages and incorporate cross-lingual information into NLP systems. As a result, languages are researched and processed in isolation rather than being looked at as part of a big language family.This proposed research has two intertwined goals. The first goal is to create a framework that allows the rapid development of resources for resource-poor languages. This goal will be accomplished by bootstrapping NLP tools with initial seeds created by projecting syntactic information from resource-rich languages to resource-poor ones. The second goal is to use the automatically created resources to perform cross-lingual study on a large number of languages to discover linguistic knowledge. The knowledge will not only deepen our understanding on languages, but also provide additional information that can be incorporated into the bootstrapping module to produce better NLP tools. The research explores two key ideas: The first idea is to take advantage of resource-rich languages by using them to create seeds for bootstrapping NLP tools. The second idea is to identify the relation between languages and use this information to help machine learning. Both ideas point to the same direction; that is, languages are related to one another and should be treated as such.
对于世界上大多数语言来说,语言资源(例如,注释语料库和平行数据)的数量非常有限。因此,监督方法和许多非监督方法不能直接应用,使得这些语言在很大程度上没有受到影响和注意。另一个关键问题是,到目前为止,很少有研究将大量语言和跨语言信息纳入自然语言处理系统中,这一问题几乎没有得到自然语言处理界的关注。因此,语言被孤立地研究和处理,而不是被视为一个大语言家族的一部分。这项拟议的研究有两个相互交织的目标。第一个目标是创建一个框架,允许为资源匮乏的语言快速开发资源。这一目标将通过用初始种子引导NLP工具来实现,所述初始种子是通过将句法信息从资源丰富的语言投射到资源贫乏的语言而创建的。第二个目标是利用自动创建的资源对大量语言进行跨语言研究,以发现语言知识。这些知识不仅将加深我们对语言的理解,而且还将提供额外的信息,这些信息可以被纳入引导模块,以产生更好的NLP工具。这项研究探索了两个关键想法:第一个想法是利用资源丰富的语言来为自举的NLP工具创建种子。第二个想法是识别语言之间的关系,并利用这些信息来帮助机器学习。这两种观点都指向同一个方向,即语言是相互关联的,应该这样对待。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Fei Xia其他文献
Two allergens from Scylla paramamosain share common epitopes showed different allergenic potential in Balb/c mice
拟青青蟹的两种过敏原具有共同的表位,在 Balb/c 小鼠中表现出不同的致敏潜力
- DOI:
10.1016/j.foodchem.2021.131132 - 发表时间:
2022 - 期刊:
- 影响因子:8.8
- 作者:
Yang Yang;Xin-Rong He;Shao-Gui He;Meng Liu;Yong-Xia Zhang;Fei Xia;Min-Jie Cao;Wen-Jin Su;Guang-Ming Liu - 通讯作者:
Guang-Ming Liu
Combined liquid hot water with sodium carbonate-oxygen pretreatment to improve enzymatic saccharification of reed.
液体热水与碳酸钠-氧气预处理相结合提高芦苇酶解糖化效果。
- DOI:
10.1016/j.biortech.2019.122498 - 发表时间:
2020 - 期刊:
- 影响因子:11.4
- 作者:
Fei Xia;Jingwei Gong;Jie Lu;Yi Cheng;Shangru Zhai;Qingda An;Haisong Wang - 通讯作者:
Haisong Wang
Enhanced antiviral immunity against Bombyx mori cytoplasmic polyhedrosis virus via overexpression of peptidoglycan recognition protein S2 in transgenic silkworms
通过在转基因蚕中过度表达肽聚糖识别蛋白S2增强对家蚕细胞质多角体病毒的抗病毒免疫力
- DOI:
10.1016/j.dci.2018.05.021 - 发表时间:
2018 - 期刊:
- 影响因子:2.9
- 作者:
Ping Zhao;Fei Xia;Liang Jiang;Huizhen Guo;Guowen Xu;Qiang Sun;Bingbing Wang;Yumei Wang;Zhongyan Lu;Qingyou Xia - 通讯作者:
Qingyou Xia
FPGA implementation of an exact dot product and its application in variable-precision floating-point arithmetic
精确点积的FPGA实现及其在变精度浮点运算中的应用
- DOI:
10.1007/s11227-012-0860-0 - 发表时间:
2013-05 - 期刊:
- 影响因子:3.3
- 作者:
Yong Dou;Yazhuo Dong;Jie Zhou;Fei Xia - 通讯作者:
Fei Xia
Cloning, Expression, and Epitope Identification of Myosin Light Chain 1: An Allergen in Mud Crab
青蟹过敏原肌球蛋白轻链 1 的克隆、表达和表位鉴定
- DOI:
10.1021/acs.jafc.9b04294 - 发表时间:
2019 - 期刊:
- 影响因子:6.1
- 作者:
Meng-Si Li;Fei Xia;Meng Liu;Xin-Rong He;Yi-Yu Chen;Tian-Liang Bai;Gui-Xia Chen;Li Wang;Min-Jie Cao;Guang-Ming Liu - 通讯作者:
Guang-Ming Liu
Fei Xia的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Fei Xia', 18)}}的其他基金
Workshop on NLP and Linguistics: finding the common ground
NLP 和语言学研讨会:寻找共同点
- 批准号:
1027289 - 财政年份:2010
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: CRI: CRD: A Multi-Representational and Multi-Layered Treebank for Hindi/Urdu
合作研究:CRI:CRD:印地语/乌尔都语的多表征和多层树库
- 批准号:
0751213 - 财政年份:2008
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
CRI:CRD Collaborative Research: General Techniques for Creating Treebanks with Multiple Representations: A Large-Scale Russian
CRI:CRD 协作研究:创建具有多重表示的树库的通用技术:大型俄罗斯树库
- 批准号:
0708719 - 财政年份:2007
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
相似国自然基金
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
Exploring the Intrinsic Mechanisms of CEO Turnover and Market Reaction: An Explanation Based on Information Asymmetry
- 批准号:W2433169
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国学者研究基金项目
SCIENCE CHINA Information Sciences
- 批准号:61224002
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
相似海外基金
2024 - 2025 National Science Foundation (NSF) Computer and Information Science and Engineering (CISE) Research Experiences for Undergraduates (REU) Principal Investigator Workshops
2024 - 2025 美国国家科学基金会 (NSF) 计算机与信息科学与工程 (CISE) 本科生研究经验 (REU) 首席研究员研讨会
- 批准号:
2407231 - 财政年份:2024
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
Collaborative Research: Education Landscape for Quantum Information Science and Engineering: Guiding Education Innovation to Support Quantum Career Paths
合作研究:量子信息科学与工程的教育格局:指导教育创新以支持量子职业道路
- 批准号:
2333073 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: Education Landscape for Quantum Information Science and Engineering: Guiding Education Innovation to Support Quantum Career Paths
合作研究:量子信息科学与工程的教育格局:指导教育创新以支持量子职业道路
- 批准号:
2333074 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
2023 National Science Foundation (NSF) Computer and Information Science and Engineering (CISE) Research Experiences for Undergraduates (REU) Principal Investigator (PI) Workshop
2023年美国国家科学基金会(NSF)计算机与信息科学与工程(CISE)本科生研究经验(REU)首席研究员(PI)研讨会
- 批准号:
2316050 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
NRT-QISE: a new interdisciplinary degree program for convergent research and graduate training in quantum information science and engineering
NRT-QISE:一个新的跨学科学位项目,用于量子信息科学与工程的融合研究和研究生培训
- 批准号:
2244045 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Computer and Information Science and Engineering Graduate Fellowships (CSGrad4US)
计算机与信息科学与工程研究生奖学金(CSGrad4US)
- 批准号:
2240199 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Fellowship Award
Catalyst Project: Broadening Participation in Quantum Information Science and Engineering through Culturally-Relevant Experiential Learning
催化剂项目:通过文化相关的体验式学习扩大对量子信息科学与工程的参与
- 批准号:
2205862 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: HSI Implementation and Evaluation Project: Transfer Students’ Success in Quantum Information Science and Engineering
合作研究:HSI 实施和评估项目:转学生 — 量子信息科学与工程的成功
- 批准号:
2150532 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Computer and Information Science and Engineering Graduate Fellowships (CSGrad4US)
计算机与信息科学与工程研究生奖学金(CSGrad4US)
- 批准号:
2243307 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Fellowship Award
Computer and Information Science and Engineering Graduate Fellowships (CSGrad4US)
计算机与信息科学与工程研究生奖学金(CSGrad4US)
- 批准号:
2240204 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Fellowship Award