Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
基本信息
- 批准号:RGPIN-2017-04993
- 负责人:
- 金额:$ 1.46万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2020
- 资助国家:加拿大
- 起止时间:2020-01-01 至 2021-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Much of the world's high-quality enterprise and social data are stored as semi-structured and structured data. This includes enterprises' RDBMSs, knowledge graphs, and social networks. All these data collections either are defined already as graphs or can be re-modeled as graphs. Over the past decade, we have witnessed advances in storing graph-like databases, but we have not seen much progress in search over them. As Surajit Chaudhuri (a distinguished scientist at Microsoft Research) addressed in his keynote talk at ICDE in 2015, search over graph-like databases has fallen behind search over unstructured data. While scientists and business users look for exciting, actionable discoveries from their heterogeneous datasets, the need to provide effective search is profound.
In this proposed research, we focus on designing effective and efficient methods to explore graph databases. We address important problems, challenges and opportunities for improving knowledge exploration over graph-like databases. These issues arise due to the complexity, scale and massive heterogeneity of such data.
First, we tackle the problem of finding relevant answers to search over heterogeneous graphs using the keyword search paradigm. Real graphs (e.g., social networks) are heterogeneous and model various types of entities and relationships. In these graphs, each node is associated with an importance value corresponding to its semantics. Previous work ranks answers using a combination of structural and content-based metrics, and ignore the type and importance of nodes. By incorporating the importance of nodes into account, we propose efficient algorithms to find relevant answers for the given query. Second, we design new algorithms to answer distance queries (i.e., finding shortest distance between any pair of nodes) over weighted graphs based on a graph indexing method called 2-hop cover. We investigate how graph partitioning can be applied to build the index and how to efficiently update the index over a stream of graph data. Third, we investigate the problem of identifying a user's intention when searching over knowledge graphs. Most of the current work in this area focuses only on finding answers quickly rather than finding more meaningful answers. We investigate the problem of finding a keyword's role to improve search quality.
The results of this proposed research will be useful for Canadian and international businesses and government institutions. The proposed frameworks can be used by financial (e.g., TD Bank and stock market), healthcare, governmental institutions (e.g., Statistics Canada), and technological companies (e.g., IBM and Microsoft). Our program will train students in the databases and data mining area to place them in a strong position when applying for academic and industrial jobs. I expect up to twelve students (including undergraduate students) to be trained in this program.
世界上许多高质量的企业和社会数据都以半结构化和结构化数据的形式存储。这包括企业的RDBMS、知识图和社交网络。所有这些数据集合要么已经定义为图形,要么可以重新建模为图形。在过去的十年里,我们见证了存储类图数据库的进步,但我们还没有看到在搜索它们方面取得多大进展。正如Surajit Chaudhuri(微软研究院的杰出科学家)在2015年ICDE的主题演讲中所说的那样,对图形数据库的搜索已经落后于对非结构化数据的搜索。当科学家和商业用户从他们的异构数据集中寻找令人兴奋的、可操作的发现时,提供有效搜索的需求是深刻的。
在这项研究中,我们专注于设计有效和高效的方法来探索图形数据库。我们解决了重要的问题,挑战和机遇,以提高知识的探索图形数据库。这些问题的出现是由于这些数据的复杂性、规模和巨大的异质性。
首先,我们解决了使用关键字搜索范式在异构图上搜索相关答案的问题。真实的图(例如,社交网络)是异构的并且对各种类型的实体和关系进行建模。在这些图中,每个节点与对应于其语义的重要性值相关联。以前的工作使用结构和基于内容的指标相结合的答案排名,并忽略节点的类型和重要性。通过将节点的重要性考虑在内,我们提出了有效的算法来找到相关的答案为给定的查询。其次,我们设计了新的算法来回答距离查询(即,找到任意节点对之间的最短距离)。我们研究如何图分区可以应用于建立索引,以及如何有效地更新索引的图数据流。第三,我们研究了在搜索知识图时识别用户意图的问题。目前这方面的工作大多只注重快速找到答案,而不是找到更有意义的答案。我们调查的问题,找到一个关键字的作用,以提高搜索质量。
这项研究的结果将有助于加拿大和国际企业和政府机构。拟议的框架可供财务部门(例如,TD银行和股票市场)、医疗保健、政府机构(例如,加拿大统计局)和技术公司(例如,IBM和Microsoft)。我们的计划将培养学生在数据库和数据挖掘领域,使他们在申请学术和工业工作时处于有利地位。我希望多达12名学生(包括本科生)将在这个计划中接受培训。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Kargar, Mehdi其他文献
User community detection via embedding of social network structure and temporal content
- DOI:
10.1016/j.ipm.2019.102056 - 发表时间:
2020-03-01 - 期刊:
- 影响因子:8.6
- 作者:
Fani, Hossein;Jiang, Eric;Kargar, Mehdi - 通讯作者:
Kargar, Mehdi
Molecular detection of ESBLs production and antibiotic resistance patterns in Gram negative bacilli isolated from urinary tract infections
- DOI:
10.4103/0377-4929.134688 - 发表时间:
2014-04-01 - 期刊:
- 影响因子:1
- 作者:
Kargar, Mehdi;Kargar, Mohammad;Ghorbani-Dalini, Sadegh - 通讯作者:
Ghorbani-Dalini, Sadegh
Antimicrobial Surfaces Using Covalently Bound Polyallylamine
- DOI:
10.1021/bm401440h - 发表时间:
2014-01-01 - 期刊:
- 影响因子:6.2
- 作者:
Iarikov, Dmitri D.;Kargar, Mehdi;Ducker, William A. - 通讯作者:
Ducker, William A.
Socio-Economic Status and Clinical Breast Examination Screening Uptake: Findings from the First Cohort Study among Iranian Kurdish Women.
- DOI:
10.31557/apjcp.2022.23.5.1555 - 发表时间:
2022-05-01 - 期刊:
- 影响因子:0
- 作者:
Jalilian, Farzad;Jerome-D'Emilia, Bonnie;Najafi, Farid;Pasdar, Yahya;Karami Matin, Behzad;Amini, Mahin;Kargar, Mehdi;Moradinazar, Mehdi;Pirouzeh, Razieh;Karimi, Negar;Hosseini, Seyyed Nasrollah;Mirzaei-Alavijeh, Mehdi - 通讯作者:
Mirzaei-Alavijeh, Mehdi
The performances of the chi-square test and complexity measures for signal recognition in biological sequences
- DOI:
10.1016/j.jtbi.2007.11.021 - 发表时间:
2008-03-21 - 期刊:
- 影响因子:2
- 作者:
Pirhaji, Leila;Kargar, Mehdi;Eslahchi, Changiz - 通讯作者:
Eslahchi, Changiz
Kargar, Mehdi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Kargar, Mehdi', 18)}}的其他基金
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2022
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2021
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2019
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
A scalable search system over e-commerce databases
基于电子商务数据库的可扩展搜索系统
- 批准号:
533249-2018 - 财政年份:2018
- 资助金额:
$ 1.46万 - 项目类别:
Engage Grants Program
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2018
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2017
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Distributed Keyword Search over Graph Databases using IBM Analytics Platform
使用 IBM Analytics Platform 通过图数据库进行分布式关键字搜索
- 批准号:
514859-2017 - 财政年份:2017
- 资助金额:
$ 1.46万 - 项目类别:
Engage Grants Program
相似海外基金
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2022
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2021
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2019
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and effective location-aware search on social networks
社交网络上高效且有效的位置感知搜索
- 批准号:
DE190100663 - 财政年份:2019
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Early Career Researcher Award
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2018
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Effective Search over Graph-like Databases
对类图数据库进行高效且有效的搜索
- 批准号:
RGPIN-2017-04993 - 财政年份:2017
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Grants Program - Individual
Efficient and effective ad-hoc search using structured and unstructured geospatial information
使用结构化和非结构化地理空间信息进行高效且有效的临时搜索
- 批准号:
DP140101587 - 财政年份:2014
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Projects
Effective and efficient keyword search for relevant entities over Extensible Markup Language (XML) data
通过可扩展标记语言 (XML) 数据有效且高效地搜索相关实体的关键字
- 批准号:
DP110102407 - 财政年份:2011
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Projects
Effective and Efficient Video Search
有效且高效的视频搜索
- 批准号:
DP0987273 - 财政年份:2009
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Projects
Effective and Efficient Keyword Search in Relational Databases
关系数据库中有效且高效的关键字搜索
- 批准号:
DP0881779 - 财政年份:2008
- 资助金额:
$ 1.46万 - 项目类别:
Discovery Projects