CAREER: From Data to Knowledge: Extracting and Utilizing Concept Graphs in Online Environments
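Basic Information
- Award number: 1802358
- Principal investigator:
- Amount: $499,900
- Host institution:
- Host institution country: United States
- Award type: Continuing Grant
- Fiscal year: 2017
- Funding country: United States
- Period: 2017-09-13 to 2019-03-31
- Status: Completed
- Source:
- Keywords:
Abstract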
Knowledge bases today are central to the successful utilization of information available in the large and growing amounts of digital data on the Web. Such technologies have started to unleash a transformation of Web search from a keyword match to discovery, learning, and creativity, which are crucial to promoting the goal of knowledge discovery. Unfortunately, the search for information remains inherently difficult for significant portions of the Web such as the Scholarly Web, which contains many millions of scientific documents. For example, PubMed has over 20 million documents, whereas Google Scholar is estimated to have more than 100 million. Open-access digital libraries such as CiteSeerX, which acquire freely-available research articles from the Web, witness an increase in their document collections as well. Despite attractive advancements by scholarly search portals, semantic search technologies that "understand" complex concepts and their relations and can systematically satisfy users' intricate information needs have yet to be investigated on the Scholarly Web. The goal of this project is to design solutions to make information more accessible and comprehensible to Scholarly Web users in particular, and Web users in general, and to help them discover knowledge more effectively and efficiently. The approach taken will be to develop an integrated framework, focusing on the extraction and utilization of scholarly knowledge graphs in online scholarly environments. Educationally, this work will involve: training of graduate, undergraduate, and high-school students, particularly encouraging the participation of women and underrepresented groups in the research efforts; curriculum development and integration of research into courses taught by the PI; exposure of students to industry and international experiences; and education for the general public. 
The project will target the following research objectives: (1) explore the construction of scholarly knowledge graphs that combine evidence from multiple resources in an open information extraction framework; (2) design and develop novel algorithms for the detection and analysis of interesting and previously unknown connections between concepts, in order to enforce knowledge discovery on the Scholarly Web; and (3) investigate the utility of scholarly knowledge graphs in a question answering system. The results of this research will be integrated into the CiteSeerX digital library (http://citeseerx.ist.psu.edu). The software, tools, and benchmark datasets developed during the course of this project will be made publicly available. All findings will be shared with the research community through publications in academic journals and presentations at Information Retrieval, Text Mining, and Natural Language Processing conferences. For further information, see the project web page: http://www.cse.unt.edu/~ccaragea/skg.html.
Project Outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Cornelia Caragea
Scientific Keyphrase Identification and Classification by Pre-Trained Language Models Intermediate Task Transfer Learning
- DOI:
- Published: 2020
- Journal:
- Impact factor: 0
- Authors: Seoyeon Park;Cornelia Caragea
- Corresponding author: Cornelia Caragea
Metadata Repository
- DOI: 10.1007/978-0-387-39940-9_3058
- Published: 2009
- Journal:
- Impact factor: 0
- Authors: Cornelia Caragea;Vasant G Honavar;P. Boncz;P. Larson;S. Dietrich;Gonzalo Navarro;B. Thuraisingham;Yan Luo;Ouri E. Wolfson;S. Beitzel;Eric C. Jensen;O. Frieder;C. Jensen;N. Tradisauskas;E. Munson;A. Wun;K. Goda;Stephen E. Fienberg;Jiashun Jin;Guimei Liu;Nick Craswell;T. Pedersen;Cesare Pautasso;M. Moro;S. Manegold;B. Carminati;Marina Blanton;S. Bouchenak;Noël de Palma;Wei Tang;C. Quix;M. Jeusfeld;R. K. Pon;David J. Buttler;W. Meng;P. Zezula;Michal Batko;Vlastislav Dohnal;J. Domingo;Denilson Barbosa;I. Manolescu;Jeffrey Xu Yu;E. Cecchet;Vivien Quéma;Xifeng Yan;G. Santucci;D. Zeinalipour;Panos K. Chrysanthis;A. Deshpande;Carlos Guestrin;S. Madden;C. Leung;R. H. Güting;Amarnath Gupta;Heng Tao Shen;G. Weikum;Ramesh Jain;J. Yu;P. Ciaccia;K. Candan;M. Sapino;C. Meghini;F. Sebastiani;U. Straccia;F. Nack;V. S. Subrahmanian;Maria Vanina Martinez;D. Reforgiato;T. Westerveld;M. Sebillo;G. Vitiello;M. De Marsico;K. Voruganti;C. Parent;S. Spaccapietra;C. Vangenot;E. Zimányi;Prasan Roy;S. Sudarshan;E. Puppo;Peer Kröger;M. Renz;H. Schuldt;Solmaz Kolahi;A. Unwin;W. Cellary
- Corresponding author: W. Cellary
Semantic Tokenizer for Enhanced Natural Language Processing
- DOI: 10.48550/arxiv.2304.12404
- Published: 2023
- Journal:
- Impact factor: 0
- Authors: Sandeep Mehta;Darpan Shah;Ravindra Kulkarni;Cornelia Caragea
- Corresponding author: Cornelia Caragea
A Group-Based Personalized Model for Image Privacy Classification and Labeling
- DOI: 10.24963/ijcai.2017/552
- Published: 2017
- Journal:
- Impact factor: 0
- Authors: Haoti Zhong;A. Squicciarini;David J. Miller;Cornelia Caragea
- Corresponding author: Cornelia Caragea
MEDLINE/PubMed
- DOI: 10.1007/978-0-387-39940-9_3039
- Published: 2004
- Journal:
- Impact factor: 3.8
- Authors: Cornelia Caragea;V. Honavar;P. Boncz;P. Larson;S. Dietrich;Gonzalo Navarro;Bhavani Thuraisingham;Yan Luo;Ouri E. Wolfson;S. Beitzel;Eric C. Jensen;Ophir Frieder;Christian S. Jensen;N. Tradisauskas;Ethan V. Munson;A. Wun;K. Goda;Stephen E. Fienberg;Jiashun Jin;Guimei Liu;Nick Craswell;T. Pedersen;Cesare Pautasso;M. Moro;S. Manegold;B. Carminati;Marina Blanton;Sara Bouchenak;Noël de Palma;Wei Tang;Christoph Quix;M. Jeusfeld;R. K. Pon;David J. Buttler;W. Meng;P. Zezula;Michal Batko;Vlastislav Dohnal;J. Domingo;Denilson Barbosa;Ioana Manolescu;Jeffrey Xu Yu;Emmanuel Cecchet;Vivien Quéma;Xifeng Yan;G. Santucci;D. Zeinalipour;Panos K. Chrysanthis;Amol Deshpande;Carlos Guestrin;Samuel Madden;Carson Kai;R. H. Güting;Amarnath Gupta;Heng Tao Shen;G. Weikum;Ramesh Jain;Jeffrey Xu Yu;Paolo Ciaccia;K. Candan;M. Sapino;C. Meghini;F. Sebastiani;U. Straccia;F. Nack;V. S. Subrahmanian;Maria Vanina Martinez;D. Reforgiato;T. Westerveld;M. Sebillo;G. Vitiello;Maria De Marsico;K. Voruganti;C. Parent;S. Spaccapietra;Christelle Vangenot;Esteban Zimányi;Prasan Roy;S. Sudarshan;E. Puppo;Peer Kröger;Matthias Renz;H. Schuldt;Solmaz Kolahi;A. Unwin;W. Cellary
- Corresponding author: W. Cellary
Other grants by Cornelia Caragea
CHS: Small: Collaborative Research: Automating Relevance and Trust Detection in Social Media Data for Emergency Response
- Award number: 1903963
- Fiscal year: 2018
- Amount: $499,900
- Award type: Standard Grant
TWC: Small: Collaborative: Towards Privacy Preserving Online Image Sharing
- Award number: 1903714
- Fiscal year: 2018
- Amount: $499,900
- Award type: Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: CiteSeerX: Toward Sustainable Support of Scholarly Big Data
- Award number: 1853919
- Fiscal year: 2018
- Amount: $499,900
- Award type: Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: CiteSeerX: Toward Sustainable Support of Scholarly Big Data
- Award number: 1823292
- Fiscal year: 2018
- Amount: $499,900
- Award type: Standard Grant
BIGDATA: IA: Collaborative Research: Domain Adaptation Approaches for Classifying Crisis Related Data on Social Media
- Award number: 1741353
- Fiscal year: 2018
- Amount: $499,900
- Award type: Standard Grant
CAREER: From Data to Knowledge: Extracting and Utilizing Concept Graphs in Online Environments
- Award number: 1652674
- Fiscal year: 2017
- Amount: $499,900
- Award type: Continuing Grant
III: Small: Collaborative Research: Keyphrase Extraction in Document Networks
- Award number: 1813571
- Fiscal year: 2017
- Amount: $499,900
- Award type: Continuing Grant
TWC: Small: Collaborative: Towards Privacy Preserving Online Image Sharing
- Award number: 1814255
- Fiscal year: 2017
- Amount: $499,900
- Award type: Standard Grant
BIGDATA: IA: Collaborative Research: Domain Adaptation Approaches for Classifying Crisis Related Data on Social Media
- Award number: 1802284
- Fiscal year: 2017
- Amount: $499,900
- Award type: Standard Grant
CHS: Small: Collaborative Research: Automating Relevance and Trust Detection in Social Media Data for Emergency Response
- Award number: 1814271
- Fiscal year: 2017
- Amount: $499,900
- Award type: Standard Grant
Similar grants from the National Natural Science Foundation of China (NSFC)
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- Award number:
- Year approved: 2024
- Amount:
- Award type: Research Fund for International Young Scientists
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- Award number:
- Year approved: 2024
- Amount:
- Award type: Collaborative Innovation Research Team
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
- Award number:
- Year approved: 2020
- Amount: CNY 400,000
- Award type:
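Key Technologies for Semantic Interoperability of Web Services Based on Linked Open Data
- Award number: 61373035
- Year approved: 2013
- Amount: CNY 770,000
- Award type: General Program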
Molecular Interaction Reconstruction of Rheumatoid Arthritis Therapies Using Clinical Data
- Award number: 31070748
- Year approved: 2010
- Amount: CNY 340,000
- Award type: General Program
Functional Data Analysis Methods for High-Dimensional Data
- Award number: 11001084
- Year approved: 2010
- Amount: CNY 160,000
- Award type: Young Scientists Fund
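The Role of datA, a Negative Regulator of Chromosome Replication, in the Cell Cycle
- Award number: 31060015
- Year approved: 2010
- Amount: CNY 250,000
- Award type: Regional Science Fund Project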
Computational Methods for Analyzing Toponome Data
- Award number: 60601030
- Year approved: 2006
- Amount: CNY 170,000
- Award type: Young Scientists Fund
Similar international grants
CAREER: Statistically-Sound Knowledge Discovery from Data
- Award number: 2238693
- Fiscal year: 2023
- Amount: $499,900
- Award type: Continuing Grant
CAREER: Integrating a Data-Driven Approach with Technologies for Sharing Rural Knowledge and Values
- Award number: 2208631
- Fiscal year: 2021
- Amount: $499,900
- Award type: Continuing Grant
CAREER: Integrating a Data-Driven Approach with Technologies for Sharing Rural Knowledge and Values
- Award number: 1845964
- Fiscal year: 2019
- Amount: $499,900
- Award type: Continuing Grant
CAREER: From Data to Knowledge and Decisions for Global-Scale Ecological Sustainability
- Award number: 1749854
- Fiscal year: 2018
- Amount: $499,900
- Award type: Continuing Grant
CAREER: Scaling Up Knowledge Discovery in High-Dimensional Data Via Nonconvex Statistical Optimization
- Award number: 1906169
- Fiscal year: 2018
- Amount: $499,900
- Award type: Continuing Grant
CAREER: From Data to Knowledge: Extracting and Utilizing Concept Graphs in Online Environments
- Award number: 1914575
- Fiscal year: 2018
- Amount: $499,900
- Award type: Continuing Grant
CAREER: Scaling Up Knowledge Discovery in High-Dimensional Data Via Nonconvex Statistical Optimization
- Award number: 1652539
- Fiscal year: 2017
- Amount: $499,900
- Award type: Continuing Grant
CAREER: From Data to Knowledge: Extracting and Utilizing Concept Graphs in Online Environments
- Award number: 1652674
- Fiscal year: 2017
- Amount: $499,900
- Award type: Continuing Grant
CAREER: Cyber-Knowledge Infrastructure for Geospatial Data
- Award number: 1455349
- Fiscal year: 2015
- Amount: $499,900
- Award type: Continuing Grant
CAREER: Learning from Observational Data with Knowledge
- Award number: 1347119
- Fiscal year: 2014
- Amount: $499,900
- Award type: Continuing Grant