SHF: Large: Collaborative Research: Exploiting the Naturalness of Software
SHF:大型:协作研究:利用软件的自然性
基本信息
- 批准号:1414030
- 负责人:
- 金额:$ 66.67万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2014
- 资助国家:美国
- 起止时间:2014-07-01 至 2018-06-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
This inter-disciplinary project has its roots in Natural Language (NL) processing. Languages such as English allow intricate, lovely and complex constructions; yet, everyday, ``natural? speech and writing is simple, prosaic, and repetitive, and thus amenable to statistical modeling. Once large NL corpora became available, computational muscle and algorithmic insight led to rapid advances in the statistical modeling of natural utterances, and revolutionized tasks such as translation, speech recognition, text summarization, etc.  While programming languages, like NL, are flexible and powerful, in theory allowing a great variety of complex programs to be written, we find that ``natural? programs that people actually write are regular, repetitive and predictable. This project will use statistical models to capture and exploit this regularity to create a new generation of software engineering tools to achieve transformative improvements in software quality and productivity.  The project will exploit language modeling techniques to capture the regularity in natural programs at the lexical, syntactic, and semantic levels. Statistical modeling will also be used to capture alignment regularities in ``bilingual? corpora such as code with comments, or explanatory text (e.g., Stackoverflow) and in systems developed on two platforms such as Java and C#.  These statistical models will help drive novel, data-driven approaches for applications such as code suggestion and completion, and assistive devices for programmers with movement or visual challenges. These models will also be exploited to correct simple errors in programs. Models of bilingual data will used to build code summarization and code retrieval tools, as well as tools for porting across platforms. Finally, this project will create a large, curated corpus of  software, and code analysis products, as well as a corpus of alignments within software bilingual corpora, to help create and nurture a research community in this area.
这个跨学科的项目源于自然语言(NL)处理。像英语这样的语言允许复杂的,可爱的和复杂的结构;然而,每天,“自然?”言语和写作是简单的、可预测的和重复的,并且因此服从于统计建模。一旦大型NL语料库可用,计算能力和算法洞察力导致自然话语统计建模的快速发展,并彻底改变了翻译,语音识别,文本摘要等任务。人们实际编写的程序是有规律的、重复的和可预测的。该项目将使用统计模型来捕获和利用这种规律性,以创建新一代软件工程工具,从而实现软件质量和生产力的变革性改进。  该项目将利用语言建模技术来捕捉自然程序在词汇,句法和语义层面上的规律性。统计建模也将被用来捕捉对齐在``双语?诸如带有注释的代码的语料库,或者解释性文本(例如,Stackoverflow)和在两个平台上开发的系统中,如Java和C#。  这些统计模型将有助于推动新的、数据驱动的应用方法,如代码建议和完成,以及为有运动或视觉挑战的程序员提供辅助设备。这些模型也将被用来纠正程序中的简单错误。双语数据模型将用于构建代码摘要和代码检索工具,以及跨平台移植工具。最后,该项目将创建一个大型的软件和代码分析产品的精选语料库,以及软件双语语料库中的对齐语料库,以帮助创建和培育这一领域的研究社区。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
                item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ patent.updateTime }}
William Cohen其他文献
Prevalence and underdiagnosis of diabetes mellitus in a food insecure population
- DOI:10.1038/s41598-025-97154-5 
- 发表时间:2025-04-10 
- 期刊:
- 影响因子:3.900
- 作者:Camden Richter;William Cohen;Ethan Belnap;Abigail McIntosh;Ishan Khosla;Daniel Luger 
- 通讯作者:Daniel Luger 
Impact of hospital acquired infections on post-transplant one year mortality in pediatric bone marrow transplant patients
- DOI:10.1016/j.ajic.2020.07.020 
- 发表时间:2021-02-01 
- 期刊:
- 影响因子:
- 作者:Benjamin R. Hanisch;William Cohen;David Jacobsohn;Xiaoyan Song 
- 通讯作者:Xiaoyan Song 
Expert systems tools for hubble space telescope observation scheduling
- DOI:10.1016/s0736-5853(87)80016-3 
- 发表时间:1987-01-01 
- 期刊:
- 影响因子:
- 作者:Glenn Miller;Don Rosenthal;William Cohen;Mark Johnston 
- 通讯作者:Mark Johnston 
Congenitally Abnormal Aortic Valve Causing Coronary Obstruction and Cardiac Arrest in Infancy
- DOI:10.1016/j.athoracsur.2020.07.070 
- 发表时间:2021-05-01 
- 期刊:
- 影响因子:
- 作者:Mubeena Abdulkarim;Julia Weld;William Cohen;Pamela Combs;Dhaval Patel;Chawki El-Zein;Michel Ilbawi;Luca Vricella;Narutoshi Hibino 
- 通讯作者:Narutoshi Hibino 
65.2 Professional Growth Through International Collaboration: The Experiences of Trainees and Faculty
- DOI:10.1016/j.jaac.2017.07.377 
- 发表时间:2017-10-01 
- 期刊:
- 影响因子:
- 作者:William Cohen;Beatriz Ortega 
- 通讯作者:Beatriz Ortega 
William Cohen的其他文献
{{
              item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
{{ truncateString('William Cohen', 18)}}的其他基金
BIGDATA: Small: Big Data for Everyone
大数据:小:每个人的大数据
- 批准号:1250956 
- 财政年份:2013
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
EAGER: Exploiting the "Naturalness" of Software
EAGER:利用软件的“自然性”
- 批准号:1247088 
- 财政年份:2012
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Collaborative Research: SoCS: Analysis of Social Media Driven By Theories of Political Psychology
合作研究:SoCS:政治心理学理论驱动的社交媒体分析
- 批准号:0968481 
- 财政年份:2010
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
III-CTX-Small: Adaptive Integration of Structured and Unstructured Data from Many Sources in a Biological Domain
III-CTX-Small:生物领域多源结构化和非结构化数据的自适应集成
- 批准号:0811562 
- 财政年份:2008
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Building Cognitive Tutors with Programming by Demonstration: When Simulated Students help Cognitive Modeling and Educational Studies
通过演示编程培养认知导师:当模拟学生帮助认知建模和教育研究时
- 批准号:0537198 
- 财政年份:2005
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Studies on Cytoskeletal Function in the Generation and Maintenance of Cell Shape
细胞骨架在细胞形状生成和维持中的功能研究
- 批准号:9808368 
- 财政年份:1998
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
CISE Research Instrumentation: Infrastructure to Support Accurate Performance Measurements of Multithreaded Programs
CISE 研究仪器:支持多线程程序准确性能测量的基础设施
- 批准号:9729889 
- 财政年份:1998
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Improving Cell Biology Laboratories and Research Access for Biology Majors
改善细胞生物学实验室和生物学专业的研究机会
- 批准号:9351203 
- 财政年份:1993
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Marginal Band Formation and Cellular Morphogenesis in Nucleated Erythrocytes
有核红细胞的边缘带形成和细胞形态发生
- 批准号:9118773 
- 财政年份:1992
- 资助金额:$ 66.67万 
- 项目类别:Continuing Grant 
Studies of Marginal Band Formation and Function in Erythrocytes
红细胞边缘带形成和功能的研究
- 批准号:8711810 
- 财政年份:1987
- 资助金额:$ 66.67万 
- 项目类别:Continuing Grant 
相似国自然基金
水稻穗粒数调控关键因子LARGE6的分子遗传网络解析
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
量子自旋液体中拓扑拟粒子的性质:量子蒙特卡罗和新的large-N理论
- 批准号:
- 批准年份:2020
- 资助金额:62 万元
- 项目类别:面上项目
甘蓝型油菜Large Grain基因调控粒重的分子机制研究
- 批准号:31972875
- 批准年份:2019
- 资助金额:58.0 万元
- 项目类别:面上项目
Large PB/PB小鼠 视网膜新生血管模型的研究
- 批准号:30971650
- 批准年份:2009
- 资助金额:8.0 万元
- 项目类别:面上项目
基因discs large在果蝇卵母细胞的后端定位及其体轴极性形成中的作用机制
- 批准号:30800648
- 批准年份:2008
- 资助金额:20.0 万元
- 项目类别:青年科学基金项目
LARGE基因对口腔癌细胞中α-DG糖基化及表达的分子调控
- 批准号:30772435
- 批准年份:2007
- 资助金额:29.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的图形处理单元性能仿真
- 批准号:2402804 
- 财政年份:2024
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的 GPU 性能仿真
- 批准号:2402806 
- 财政年份:2024
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的 GPU 性能仿真
- 批准号:2402805 
- 财政年份:2024
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
SHF: Large: Collaborative Research: Molecular computing for the real world
SHF:大型:协作研究:现实世界的分子计算
- 批准号:1832985 
- 财政年份:2018
- 资助金额:$ 66.67万 
- 项目类别:Continuing Grant 
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
SHF:大型:协作研究:利用新兴 HPC 系统的异构性、层次结构和并发性的下一代通信机制
- 批准号:1565336 
- 财政年份:2016
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
SHF:大型:协作研究:利用新兴 HPC 系统的异构性、层次结构和并发性的下一代通信机制
- 批准号:1565414 
- 财政年份:2016
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
SHF: Large: Collaborative Research: Exploiting the Naturalness of Software
SHF:大型:协作研究:利用软件的自然性
- 批准号:1723215 
- 财政年份:2016
- 资助金额:$ 66.67万 
- 项目类别:Continuing Grant 
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
SHF:大型:协作研究:利用新兴 HPC 系统的异构性、层次结构和并发性的下一代通信机制
- 批准号:1565431 
- 财政年份:2016
- 资助金额:$ 66.67万 
- 项目类别:Standard Grant 
SHF: Large: Collaborative Research: Molecular computing for the real world
SHF:大型:协作研究:现实世界的分子计算
- 批准号:1518715 
- 财政年份:2015
- 资助金额:$ 66.67万 
- 项目类别:Continuing Grant 
SHF: Large: Collaborative Research: Molecular computing for the real world
SHF:大型:协作研究:现实世界的分子计算
- 批准号:1518833 
- 财政年份:2015
- 资助金额:$ 66.67万 
- 项目类别:Continuing Grant 

 刷新
              刷新
            
















 {{item.name}}会员
              {{item.name}}会员
            



