EAGER: Online Processing of Data in Large Facilities using National Advanced CyberInfrastructure
EAGER:使用国家先进网络基础设施在线处理大型设施中的数据
基本信息
- 批准号:1745246
- 负责人:
- 金额:$ 29.24万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2017
- 资助国家:美国
- 起止时间:2017-09-01 至 2020-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Open, large-scale scientific facilities are an essential part of science and engineering enterprise. These facilities provide shared-use infrastructure, instrumentation, and data products that are openly accessible to a broad community of researchers and/or educators. Current facilities provide increasing volumes of data and data products that have the potential to deliver new insights in a wide range of science and engineering domains. However, while these facilities provide reliable and pervasive access to the data and data products, users typically must download the data of interest and process them using local resources. Consequently, transforming these data and data products into insights requires local access to powerful computing, storage, and networking resources. On the other hand, the NSF Advanced Cyberinfrastructure (ACI) is playing an increasingly important role as an open platform for computational and data-enabled science and engineering and can provide the necessary capabilities to allow a broad user community to effectively process the data in large facilities. However, despite clearly complementing each other, large scientific facilities and NSF ACI remain largely disconnected. As a result, users are forced to actively be part of the process that moves data from large facilities to local computational resources or NSF ACI. Therefore, this data-delivery mode becomes inefficient and limits the potential utility that the data would have if processed in an automatic manner. The outcome of this research can have a significant impact on the scientific and engineering community by improving the accessibility of data and the way scientists interact with both data sources and computational infrastructures. Bringing national ACI and large scientific facilities together will democratize access to science and improve the impact of the NSF-funded infrastructure. This is especially important for small public institutions that have limited resources and do not have high bandwidth Internet connection to the Academic/Research network. The development of human resources, including the training of students, researchers and software professionals, as well as the outreach to minorities and underrepresented groups, will be an integral aspect of this effort. The project uses an open repository to disseminate research papers, prototype implementations, and associated data products to the community.The goal of this project is to explore how NSF-funded ACI, such as the Extreme Science and Engineering Discovery Environment (XSEDE), can be integrated with large facilities generally, and the Ocean Observatories Initiative (OOI) specifically, in an automated manner to support end-to-end user workflows. Specifically, we propose to enable workflows that when triggered can seamlessly orchestrate the entire data-to-discovery pipeline. This involves executing queries on the OOI cyberinfrastructure (possibly based on the occurrence of events of interest), streaming data to appropriate ACI facilities using high bandwidth interconnects (such as Internet2) in order to stage this data close to computing/analytics resources (e.g., XSEDE JetStream), and then launching the modeling and analysis processes to transform such data into insights. In this way, the project will leverage high-performance networks that typically connect these facilities to support data movement, and process this data using state-of-the-art high-performance systems.
开放式的大型科学设施是科学和工程事业的重要组成部分。这些设施提供共享使用的基础设施,仪器和数据产品,可供广泛的研究人员和/或教育工作者社区开放访问。目前的设施提供了越来越多的数据和数据产品,这些数据和产品有可能在广泛的科学和工程领域提供新的见解。然而,虽然这些设施提供了对数据和数据产品的可靠和普遍的访问,但用户通常必须下载感兴趣的数据并使用本地资源处理它们。因此,将这些数据和数据产品转化为洞察力需要本地访问强大的计算、存储和网络资源。另一方面,NSF高级网络基础设施(ACI)作为计算和数据支持的科学和工程的开放平台,正在发挥越来越重要的作用,并可以提供必要的能力,使广泛的用户社区能够有效地处理大型设施中的数据。然而,尽管大型科学设施和NSF ACI之间存在明显的互补性,但它们在很大程度上仍然是脱节的。因此,用户被迫积极参与将数据从大型设施移动到本地计算资源或NSF ACI的过程。因此,这种数据传递模式变得低效,并且限制了如果以自动方式处理数据将具有的潜在效用。这项研究的成果可以通过改善数据的可访问性以及科学家与数据源和计算基础设施的交互方式,对科学和工程界产生重大影响。将国家ACI和大型科学设施结合在一起将使科学的获取民主化,并改善NSF资助的基础设施的影响。这对于资源有限且没有高带宽互联网连接到学术/研究网络的小型公共机构尤为重要。开发人力资源,包括培训学生、研究人员和软件专业人员,以及与少数群体和代表性不足的群体开展外联活动,将是这一努力的一个组成部分。该项目使用一个开放的知识库向社区传播研究论文、原型实现和相关数据产品,其目标是探索如何将NSF资助的ACI(如极限科学与工程发现环境(XSEDE))与大型设施(特别是海洋观测站倡议(OOI))以自动化的方式集成,以支持端到端用户工作流程。具体来说,我们建议启用工作流,当触发时可以无缝地编排整个数据到发现管道。这涉及在OOI网络基础设施上执行查询(可能基于感兴趣的事件的发生),使用高带宽互连(诸如因特网2)将数据流传输到适当的ACI设施,以便将该数据放置在计算/分析资源附近(例如,XSEDE JetStream),然后启动建模和分析流程,将这些数据转化为见解。通过这种方式,该项目将利用通常连接这些设施的高性能网络来支持数据移动,并使用最先进的高性能系统处理这些数据。
项目成果
期刊论文数量(11)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A Distributed Multi-Sensor Machine Learning Approach to Earthquake Early Warning
- DOI:10.1609/aaai.v34i01.5376
- 发表时间:2020-02
- 期刊:
- 影响因子:0
- 作者:Kevin Fauvel;Daniel Balouek-Thomert;D. Melgar;Pedro Silva;Anthony Simonet;Gabriel Antoniu;Alexandru Costan;Véronique Masson;M. Parashar;I. Rodero;A. Termier
- 通讯作者:Kevin Fauvel;Daniel Balouek-Thomert;D. Melgar;Pedro Silva;Anthony Simonet;Gabriel Antoniu;Alexandru Costan;Véronique Masson;M. Parashar;I. Rodero;A. Termier
Runtime Management of Data Quality for Scientific Observatories Using Edge and In-Transit Resources
使用边缘和传输中资源对科学观测站的数据质量进行运行时管理
- DOI:10.1109/sbac-pad.2018.00053
- 发表时间:2018
- 期刊:
- 影响因子:0
- 作者:Zamani, Ali Reza;Balouek-Thomert, Daniel;Villalobos, J. J.;Rodero, Ivan;Parashar, Manish
- 通讯作者:Parashar, Manish
Harnessing the Computing Continuum for Urgent Science
- DOI:10.1145/3439602.3439618
- 发表时间:2020-11
- 期刊:
- 影响因子:0
- 作者:Daniel Balouek-Thomert;I. Rodero;M. Parashar
- 通讯作者:Daniel Balouek-Thomert;I. Rodero;M. Parashar
Exploring the Potential of Elastic Computing Clusters in Geo-Distributed Data Centers with Fast Fabric Interconnection
通过快速结构互连探索地理分布式数据中心中弹性计算集群的潜力
- DOI:10.1109/hpcc/smartcity/dss.2019.00135
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Chen, Shouwei;Wang, Wensheng;Rodero, Ivan
- 通讯作者:Rodero, Ivan
Optimizing Performance and Computing Resource Management of In-memory Big Data Analytics with Disaggregated Persistent Memory
使用分解的持久内存优化内存大数据分析的性能和计算资源管理
- DOI:10.1109/ccgrid.2019.00012
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Chen, Shouwei;Wang, Wensheng;Wu, Xueyang;Fan, Zhen;Huang, Kunwu;Zhuang, Peiyu;Li, Yue;Rodero, Ivan;Parashar, Manish;Weng, Dennis
- 通讯作者:Weng, Dennis
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Ivan Rodero其他文献
Grid broker selection strategies using aggregated resource information
- DOI:
10.1016/j.future.2009.07.009 - 发表时间:
2010-01-01 - 期刊:
- 影响因子:
- 作者:
Ivan Rodero;Francesc Guim;Julita Corbalan;Liana Fong;S. Masoud Sadjadi - 通讯作者:
S. Masoud Sadjadi
In-situ feature-based objects tracking for data-intensive scientific and enterprise analytics workflows
- DOI:
10.1007/s10586-014-0396-6 - 发表时间:
2014-08-22 - 期刊:
- 影响因子:4.100
- 作者:
Solomon Lasluisa;Fan Zhang;Tong Jin;Ivan Rodero;Hoang Bui;Manish Parashar - 通讯作者:
Manish Parashar
Ivan Rodero的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Ivan Rodero', 18)}}的其他基金
CIF21 DIBBs: EI: Virtual Data Collaboratory: A Regional Cyberinfrastructure for Collaborative Data Intensive Science
CIF21 DIBB:EI:虚拟数据协作:协作数据密集型科学的区域网络基础设施
- 批准号:
2220826 - 财政年份:2021
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
Collaborative Research: Framework: Data: NSCI: HDR: GeoSCIFramework: Scalable Real-Time Streaming Analytics and Machine Learning for Geoscience and Hazards Research
协作研究:框架:数据:NSCI:HDR:GeoSCIFramework:用于地球科学和灾害研究的可扩展实时流分析和机器学习
- 批准号:
2219975 - 财政年份:2021
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
Collaborative Research: Framework: Data: NSCI: HDR: GeoSCIFramework: Scalable Real-Time Streaming Analytics and Machine Learning for Geoscience and Hazards Research
协作研究:框架:数据:NSCI:HDR:GeoSCIFramework:用于地球科学和灾害研究的可扩展实时流分析和机器学习
- 批准号:
1835692 - 财政年份:2019
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
NSF Large Facilities Cyberinfrastructure Workshop
NSF 大型设施网络基础设施研讨会
- 批准号:
1742969 - 财政年份:2017
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
SPX: Collaborative Research: Cross-layer Application-Aware Resilience at Extreme Scale (CAARES)
SPX:协作研究:超大规模跨层应用程序感知弹性 (CAARES)
- 批准号:
1725649 - 财政年份:2017
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
CIF21 DIBBs: EI: Virtual Data Collaboratory: A Regional Cyberinfrastructure for Collaborative Data Intensive Science
CIF21 DIBB:EI:虚拟数据协作:协作数据密集型科学的区域网络基础设施
- 批准号:
1640834 - 财政年份:2016
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
BIGDATA: Collaborative Research: IA: F: Fractured Subsurface Characterization using High Performance Computing and Guided by Big Data
BIGDATA:协作研究:IA:F:使用高性能计算和大数据指导的断裂地下表征
- 批准号:
1546145 - 财政年份:2016
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
CRII: CI: Exploring Advanced Cyber-Infrastructure Co-Design for Big Data Analytics
CRII:CI:探索大数据分析的高级网络基础设施协同设计
- 批准号:
1464317 - 财政年份:2015
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
online SPE/HPLC-ICP-MS多元素形态分析新方法研究荷塘中铬砷镉汞铅的迁移转化规律
- 批准号:21976048
- 批准年份:2019
- 资助金额:65.0 万元
- 项目类别:面上项目
双积分政策下基于Online Review的新能源汽车企业跨链决策优化研究
- 批准号:71964023
- 批准年份:2019
- 资助金额:27.5 万元
- 项目类别:地区科学基金项目
面向Online-to-Offline智能商务的大数据融合与应用
- 批准号:91646204
- 批准年份:2016
- 资助金额:201.0 万元
- 项目类别:重大研究计划
Online-to-Offline商务环境下"切客"一族生活模式挖掘研究
- 批准号:71172046
- 批准年份:2011
- 资助金额:41.0 万元
- 项目类别:面上项目
相似海外基金
Natural language processing for detecting toxic, abusive, and hateful language online
用于在线检测有毒、辱骂和仇恨语言的自然语言处理
- 批准号:
RGPIN-2022-04481 - 财政年份:2022
- 资助金额:
$ 29.24万 - 项目类别:
Discovery Grants Program - Individual
Development of the disc unit for yarn processing using non-contact type yarn form inspection system in online
开发用于纱线加工的圆盘装置,使用非接触式在线纱线形状检查系统
- 批准号:
21K02123 - 财政年份:2021
- 资助金额:
$ 29.24万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Doctoral Dissertation Research: Heritage speakers processing of the Spanish subjunctive during online comprehension.
博士论文研究:传统发言者在在线理解过程中对西班牙语虚拟语气的处理。
- 批准号:
1939903 - 财政年份:2020
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
CybercrimeNLP (CC-NLP): A natural language processing toolkit for the interdisciplinary analysis of underground online forums
CybercrimeNLP (CC-NLP):用于地下在线论坛跨学科分析的自然语言处理工具包
- 批准号:
ES/T008466/1 - 财政年份:2020
- 资助金额:
$ 29.24万 - 项目类别:
Research Grant
Natural Language Processing for Online Health Information Evaluation
在线健康信息评估的自然语言处理
- 批准号:
539582-2019 - 财政年份:2019
- 资助金额:
$ 29.24万 - 项目类别:
University Undergraduate Student Research Awards
An enhanced online graphics processing unit facility for Early Career Researchers
为早期职业研究人员提供的增强型在线图形处理单元设施
- 批准号:
EP/S017755/1 - 财政年份:2018
- 资助金额:
$ 29.24万 - 项目类别:
Research Grant
Online-Processing of Grammatical Gender in Spoken Language Comprehension: Differences between Children with and without Specific Language Impairment
口语理解中语法性别的在线处理:有特定语言障碍和没有特定语言障碍的儿童之间的差异
- 批准号:
394447853 - 财政年份:2018
- 资助金额:
$ 29.24万 - 项目类别:
Research Grants
Doctoral Dissertation Research: The Online Processing of Noun Phrase Ellipsis
博士论文研究:名词短语省略的在线处理
- 批准号:
1749580 - 财政年份:2018
- 资助金额:
$ 29.24万 - 项目类别:
Standard Grant
Refinement of signal processing procedures for an online flow accelerated corrosion detection system
在线流动加速腐蚀检测系统信号处理程序的细化
- 批准号:
493329-2016 - 财政年份:2016
- 资助金额:
$ 29.24万 - 项目类别:
University Undergraduate Student Research Awards
Setup of an internet platform: "Theatre and Music in Weimar. Digitization, registration, scientific processing and online presentation of the Weimar play bills from the season 1969/70 to the political turn 1989/90"
建立互联网平台:“魏玛的戏剧和音乐。从 1969/70 演出季到 1989/90 政治转折的魏玛戏剧账单的数字化、注册、科学处理和在线演示”
- 批准号:
269685553 - 财政年份:2016
- 资助金额:
$ 29.24万 - 项目类别:
Cataloguing and Digitisation (Scientific Library Services and Information Systems)














{{item.name}}会员




