SHF: Large: Collaborative Research: Exploiting the Naturalness of Software
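Basic information
- Award number: 1723215
- Principal investigator:
- Amount: $260,700
- Host institution:
- Host institution country: United States
- Award type: Continuing Grant
- Fiscal year: 2016
- Funding country: United States
- Period: 2016-07-01 to 2022-06-30
- Status: Completed
- Source:
- Keywords: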
Project abstract
This interdisciplinary project has its roots in Natural Language (NL) processing. Languages such as English allow intricate, lovely and complex constructions; yet everyday, "natural" speech and writing is simple, prosaic, and repetitive, and thus amenable to statistical modeling. Once large NL corpora became available, computational muscle and algorithmic insight led to rapid advances in the statistical modeling of natural utterances, and revolutionized tasks such as translation, speech recognition, and text summarization. While programming languages, like NL, are flexible and powerful, in theory allowing a great variety of complex programs to be written, we find that the "natural" programs that people actually write are regular, repetitive and predictable. This project will use statistical models to capture and exploit this regularity to create a new generation of software engineering tools to achieve transformative improvements in software quality and productivity. The project will exploit language modeling techniques to capture the regularity in natural programs at the lexical, syntactic, and semantic levels. Statistical modeling will also be used to capture alignment regularities in "bilingual" corpora such as code with comments or explanatory text (e.g., Stack Overflow), and in systems developed on two platforms such as Java and C#. These statistical models will help drive novel, data-driven approaches for applications such as code suggestion and completion, and assistive devices for programmers with movement or visual challenges. These models will also be exploited to correct simple errors in programs. Models of bilingual data will be used to build code summarization and code retrieval tools, as well as tools for porting across platforms.
Finally, this project will create a large, curated corpus of software, and code analysis products, as well as a corpus of alignments within software bilingual corpora, to help create and nurture a research community in this area.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Tien Nguyen
Research on Test Flakiness: from Unit to System Testing
- DOI:
- Year published: 2022
- Journal:
- Impact factor: 0
- Authors: Kiet Ngo; Vu; Tien Nguyen
- Corresponding author: Tien Nguyen
2004 Education for All Handicapped Children Act
- DOI: 10.1007/springerreference_179606
- Year published: 2011
- Journal:
- Impact factor: 7.4
- Authors: Lynna Lan; Tien Nguyen
- Corresponding author: Tien Nguyen
Applicability of CPT-based methods in predicting toe bearing capacities of driven piles in sand
- DOI:
- Year published: 2016
- Journal:
- Impact factor: 0
- Authors: Le Chi Hung; Tien Nguyen; Ju; Sung
- Corresponding author: Sung
QRS 2018 Program Committee
- DOI: 10.1109/qrs-c.2018.00010
- Year published: 2018
- Journal:
- Impact factor: 0
- Authors: Jun Ai; Doo; Xiaoying Bai; David Benavides; Arun Chakrapani; Junjie Chen; Frédéric Dadeau; Junhua Ding; T. Dohi; Mercedes G. Merayo; Osamu Mizuno; Tien Nguyen; Manuel Nuñez; H. Okamura; Hailong Sun; Linzhang Wang; Husnu Yenigun
- Corresponding author: Husnu Yenigun
Agreement between two versions of a CADx system: a simulation study
- DOI:
- Year published: 2011
- Journal:
- Impact factor: 0
- Authors: B. Sahiner; N. Petrick; S. Paquerault; Weijie Chen; Tien Nguyen
- Corresponding author: Tien Nguyen
Other grants by Tien Nguyen
Collaborative Research: CCRI: ENS: Boa 2.0: Enhancing Infrastructure for Studying Software and its Evolution at a Large Scale
- Award number: 2120386
- Fiscal year: 2021
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Small: Build Code Maintenance and Detecting, Testing, Locating Configuration and Build Errors
- Award number: 1723432
- Fiscal year: 2016
- Funding amount: $260,700
- Award type: Standard Grant
TWC: Small: Detection and Prevention of Prior Known Software Security Vulnerabilities
- Award number: 1723198
- Fiscal year: 2016
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Large: Collaborative Research: Exploiting the Naturalness of Software
- Award number: 1413927
- Fiscal year: 2014
- Funding amount: $260,700
- Award type: Continuing Grant
SHF: Small: Build Code Maintenance and Detecting, Testing, Locating Configuration and Build Errors
- Award number: 1320578
- Fiscal year: 2013
- Funding amount: $260,700
- Award type: Standard Grant
TWC: Small: Detection and Prevention of Prior Known Software Security Vulnerabilities
- Award number: 1223828
- Fiscal year: 2012
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Small: Find and Fix Similar Software Bugs
- Award number: 1018600
- Fiscal year: 2010
- Funding amount: $260,700
- Award type: Standard Grant
Improving Embedded System Education with Software Engineering Methodologies
- Award number: 0737029
- Fiscal year: 2008
- Funding amount: $260,700
- Award type: Standard Grant
Similar grants from the National Natural Science Foundation of China (NSFC)
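Dissecting the molecular genetic network of LARGE6, a key regulator of grain number per panicle in rice
- Award number:
- Approval year: 2022
- Funding amount: 300,000 CNY
- Award type: Young Scientists Fund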
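Properties of topological quasiparticles in quantum spin liquids: quantum Monte Carlo and a new large-N theory
- Award number:
- Approval year: 2020
- Funding amount: 620,000 CNY
- Award type: General Program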
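Molecular mechanism by which the Large Grain gene regulates seed weight in Brassica napus
- Award number: 31972875
- Approval year: 2019
- Funding amount: 580,000 CNY
- Award type: General Program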
Study of a retinal neovascularization model in Large PB/PB mice
- Award number: 30971650
- Approval year: 2009
- Funding amount: 80,000 CNY
- Award type: General Program
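Posterior localization of the discs large gene in Drosophila oocytes and its mechanism of action in body-axis polarity formation
- Award number: 30800648
- Approval year: 2008
- Funding amount: 200,000 CNY
- Award type: Young Scientists Fund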
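Molecular regulation by the LARGE gene of α-DG glycosylation and expression in oral cancer cells
- Award number: 30772435
- Approval year: 2007
- Funding amount: 290,000 CNY
- Award type: General Program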
Similar overseas grants
Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
- Award number: 2402804
- Fiscal year: 2024
- Funding amount: $260,700
- Award type: Standard Grant
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
- Award number: 2402806
- Fiscal year: 2024
- Funding amount: $260,700
- Award type: Standard Grant
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
- Award number: 2402805
- Fiscal year: 2024
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
- Award number: 1832985
- Fiscal year: 2018
- Funding amount: $260,700
- Award type: Continuing Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
- Award number: 1565336
- Fiscal year: 2016
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
- Award number: 1565414
- Fiscal year: 2016
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
- Award number: 1565431
- Fiscal year: 2016
- Funding amount: $260,700
- Award type: Standard Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
- Award number: 1518715
- Fiscal year: 2015
- Funding amount: $260,700
- Award type: Continuing Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
- Award number: 1518833
- Fiscal year: 2015
- Funding amount: $260,700
- Award type: Continuing Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
- Award number: 1518723
- Fiscal year: 2015
- Funding amount: $260,700
- Award type: Continuing Grant