I-Corps: Interactive and Automated Debugging for Big Data Analytics

I-Corps:大数据分析的交互式和自动调试

基本信息

  • 批准号:
    1842657
  • 负责人:
  • 金额:
    $ 5万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2018
  • 资助国家:
    美国
  • 起止时间:
    2018-09-15 至 2020-02-29
  • 项目状态:
    已结题

项目摘要

The broader impact of this I-Corps project is to investigate the challenges that data scientists face in debugging big data analytics today and to investigate the commercial potential of research work on interactive and automated debugging of big data analytics. Big data analytics is increasingly important in the 21st century, where our daily lives leave behind a detailed digital record. Decision-makers of all kinds, from companies to government agencies, would like to base their actions on data. If successful, this project will offer a unique opportunity to discover software development tooling needs for big data systems and to identify innovative software tooling products that sit across the software stack from the user-facing API all the way down to the systems infrastructure.This I-Corps project builds on early research work on real-time debugging primitives and tool-assisted fault-localization services for big data processing applications written in modern data intensive scalable computing (DISC) systems like Apache Spark. Designing debugging primitives for DISC requires re-thinking the traditional step-through debugging primitives as provided by tools such as gdb. For example, a breakpoint feature that simply pauses the entire computation would waste large amounts of computational resources and prevent correct tasks from completing, reducing overall throughput. Further, requiring the user to inspect the millions of intermediate records produced during execution is clearly infeasible. When a failure or incorrect result is generated (e.g., outlier), pinpointing the root cause is extremely time-consuming and expensive due to massive scale of data. In short, the intellectual merit of this I-Corps project is to investigate how users can benefit from expressive debugging primitives and automated fault localization services when they must leverage for data science and big data analytics capabilities.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
这个I-Corps项目的更广泛的影响是调查数据科学家在调试大数据分析时面临的挑战,并调查大数据分析的交互式和自动化调试研究工作的商业潜力。大数据分析在世纪越来越重要,我们的日常生活留下了详细的数字记录。从公司到政府机构,各种决策者都希望将他们的行动建立在数据的基础上。如果成功,该项目将提供一个独特的机会,以发现大数据系统的软件开发工具需求,并确定创新的软件工具产品,这些产品横跨软件堆栈,从面向用户的API一直到系统基础设施。该I-Corps项目建立在实时调试原语和工具辅助故障的早期研究工作基础上,为在现代数据密集型可扩展计算(DISC)系统(如Apache Spark)中编写的大数据处理应用程序提供本地化服务。为DISC设计调试原语需要重新考虑由gdb等工具提供的传统分步调试原语。例如,简单地暂停整个计算的断点功能将浪费大量的计算资源,并阻止正确的任务完成,从而降低整体吞吐量。此外,要求用户检查在执行期间产生的数百万个中间记录显然是不可行的。当产生失败或不正确的结果时(例如,异常值),由于数据规模庞大,查明根本原因非常耗时且昂贵。简而言之,这个I-Corps项目的智力价值是调查用户在必须利用数据科学和大数据分析能力时,如何从表达性调试原语和自动故障定位服务中受益。这个奖项反映了NSF的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Miryung Kim其他文献

Chapter 16 Recommending Program Transformations Automating Repetitive Software Changes
第 16 章建议程序转换自动化重复的软件更改
  • DOI:
  • 发表时间:
    2014
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Miryung Kim;Na Meng
  • 通讯作者:
    Na Meng
Equity and Access in Algorithms, Mechanisms, and Optimization
算法、机制、优化的公平与准入
NaturalFuzz: Natural Input Generation for Big Data Analytics
NaturalFuzz:大数据分析的自然输入生成
C p – C d ≠ ? Eclipse Refactoring APIs P ’ Pure Refactoring Version P ’ ≠
C p – C d ≠ Eclipse 重构 API P ’ 纯重构版本 P ’ ≠ ?
  • DOI:
  • 发表时间:
    2017
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Everton L. G. Alves;Myoungkyu Song;T. Massoni;Patricia D. L. Machado;Miryung Kim
  • 通讯作者:
    Miryung Kim
SE4ML - Software Engineering for AI-ML-based Systems (Dagstuhl Seminar 20091)
SE4ML - 基于 AI-ML 的系统的软件工程(Dagstuhl 研讨会 20091)
  • DOI:
    10.4230/dagrep.10.2.76
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    0
  • 作者:
    K. Kersting;Miryung Kim;Guy Van den Broeck;Thomas Zimmermann
  • 通讯作者:
    Thomas Zimmermann

Miryung Kim的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Miryung Kim', 18)}}的其他基金

Collaborative Research: SHF: Medium: Reinventing Fuzz Testing for Data and Compute Intensive Systems
协作研究:SHF:中:重新发明数据和计算密集型系统的模糊测试
  • 批准号:
    2106404
  • 财政年份:
    2021
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant
CHS: Medium: Collaborative Research: Code demography: Addressing information needs at scale for programming interface users and designers
CHS:媒介:协作研究:代码人口统计:大规模解决编程接口用户和设计者的信息需求
  • 批准号:
    1956322
  • 财政年份:
    2020
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant
SHF: Medium: Interactive Debegging for Big Data Analytics
SHF:中:大数据分析的交互式调试
  • 批准号:
    1764077
  • 财政年份:
    2018
  • 资助金额:
    $ 5万
  • 项目类别:
    Continuing Grant
SHF: Small: Analytical Support for Investigating Software Modifications in Collaborative Development Environment
SHF:小型:为研究协作开发环境中的软件修改提供分析支持
  • 批准号:
    1533791
  • 财政年份:
    2014
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant
CAREER: Analysis and Automation of Systematic Software Modifications
职业:系统软件修改的分析和自动化
  • 批准号:
    1460325
  • 财政年份:
    2014
  • 资助金额:
    $ 5万
  • 项目类别:
    Continuing Grant
CAREER: Analysis and Automation of Systematic Software Modifications
职业:系统软件修改的分析和自动化
  • 批准号:
    1149391
  • 财政年份:
    2012
  • 资助金额:
    $ 5万
  • 项目类别:
    Continuing Grant
SHF: Small: Analytical Support for Investigating Software Modifications in Collaborative Development Environment
SHF:小型:为研究协作开发环境中的软件修改提供分析支持
  • 批准号:
    1117902
  • 财政年份:
    2011
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant
Information Needs about Software Modification during Collaborative Development Tasks
协同开发任务期间软件修改的信息需求
  • 批准号:
    1043810
  • 财政年份:
    2010
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant

相似海外基金

Automated interactive definition of the clinical target volume in radiation oncology
放射肿瘤学中临床靶区的自动交互定义
  • 批准号:
    10547813
  • 财政年份:
    2022
  • 资助金额:
    $ 5万
  • 项目类别:
SHF: Small: Synergy between Automated Reasoning and Interactive Theorem Proving
SHF:小:自动推理和交互式定理证明之间的协同作用
  • 批准号:
    2229099
  • 财政年份:
    2022
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant
Automated interactive definition of the clinical target volume in radiation oncology
放射肿瘤学中临床靶区的自动交互定义
  • 批准号:
    10342574
  • 财政年份:
    2022
  • 资助金额:
    $ 5万
  • 项目类别:
Research and Development of an Automated, Interactive, and User-Configurable Conversational Agent for Always-Available Personalized Language Tutoring
研究和开发自动化、交互式和用户可配置的对话代理,以实现始终可用的个性化语言辅导
  • 批准号:
    21K17779
  • 财政年份:
    2021
  • 资助金额:
    $ 5万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Automated postcorrection of OCRed historical printings with integrated optional interactive postcorrection
通过集成的可选交互式后期校正对 ORed 历史打印进行自动后期校正
  • 批准号:
    393215159
  • 财政年份:
    2018
  • 资助金额:
    $ 5万
  • 项目类别:
    Research data and software (Scientific Library Services and Information Systems)
Search-Based and Interactive Environment for Semi-Automated Refactoring
用于半自动重构的基于搜索的交互式环境
  • 批准号:
    18K11238
  • 财政年份:
    2018
  • 资助金额:
    $ 5万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
I-Corps: Fast Deployment Service for Automated Business-to-Client Interactive Messaging
I-Corps:用于自动化企业对客户交互式消息传递的快速部署服务
  • 批准号:
    1736955
  • 财政年份:
    2017
  • 资助金额:
    $ 5万
  • 项目类别:
    Standard Grant
Automated Social Interactive Interface to Monitor and Update Intervention Plans for People with MCI, Alzheimer's and Other Dementias
自动社交互动界面,用于监测和更新 MCI、阿尔茨海默病和其他痴呆症患者的干预计划
  • 批准号:
    9409961
  • 财政年份:
    2017
  • 资助金额:
    $ 5万
  • 项目类别:
Automated Social Interactive Interface to Monitor and Update Intervention Plans for People with MCI, Alzheimer's and Other Dementias
自动社交互动界面,用于监测和更新 MCI、阿尔茨海默病和其他痴呆症患者的干预计划
  • 批准号:
    10002162
  • 财政年份:
    2017
  • 资助金额:
    $ 5万
  • 项目类别:
Interactive issues for highly automated driving
高度自动化驾驶的交互问题
  • 批准号:
    512143-2017
  • 财政年份:
    2017
  • 资助金额:
    $ 5万
  • 项目类别:
    University Undergraduate Student Research Awards
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了