Using Concept Lattices to Reconcile Semantic Heterogeneity in Data
使用概念格来协调数据中的语义异质性
基本信息
- 批准号:RGPIN-2015-03929
- 负责人:
- 金额:$ 1.31万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2016
- 资助国家:加拿大
- 起止时间:2016-01-01 至 2017-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
In modern information systems, data capture often cannot be closely controlled by traditional organizational mechanisms intended to enforce common meaning. Additionally, data are no longer used only for established and predetermined purposes. Indeed, one of the key promises of Big Data and Open Data is the opportunity to repurpose data for uses that were not anticipated when systems to collect and store the data were designed. This increasingly diverse information landscape creates significant challenges in integrating or combining information from various sources in ways that retain the semantics (meaning) of the underlying data. Integration difficulties arise from the independence of sources, resulting in a high level of semantic heterogeneity between sources. Achieving high-quality semantic integration is essential to take full advantage of the opportunities afforded by Big Data to support decision-making.
The proposed research introduces the notion of “concept lattice” as a mechanism to provide semantics for data that can be used to integrate heterogeneous independent sources. In a concept lattice, nodes are concepts, interpreted as predicates applied to phenomena (instances) (e.g., Male(John)). A directed arc indicates a “precedence” relationship between two nodes; that is, possessing one property implies possessing another (e.g., Voter(John) -> Adult(John)). In this framework, the semantics of a node is entirely determined by the pattern of (and relationships among) arcs entering and leaving it.
A concept lattice is a “lightweight” conceptual model of a domain, in contrast with traditional approaches that rely on “heavyweight” models (global or mediated schemas). Nodes may be considered either classes or properties, depending only on the pattern of incoming and outgoing arcs. This semantic relativism facilitates integration of concept lattices, where a concept may be a class in one lattice but a property in another. Moreover, two semantically equivalent nodes from heterogeneous sources can have different manifestations, indicating different semantic interpretations of the values of a property.
The proposed research program will build on my earlier work by focusing on four objectives, working with graduate students and academic colleagues. First, we will formally define the core elements of a concept lattice structure, and develop reasoning mechanisms to navigate lattices. Second, we will design and implement a prototype to store and process concept lattices. Third, we will use concept lattices as a foundation to integrate information across heterogeneous sources by defining “merge nodes” – nodes from independent lattices that carry the same meaning and can be used to combine lattices. Fourth, we will evaluate the effectiveness of concept lattices in improving information retrieval from heterogeneous data sources.
在现代信息系统中,数据捕获通常不能由旨在执行共同含义的传统组织机制来密切控制。此外,数据不再仅用于既定和预定的目的。事实上,大数据和开放数据的关键承诺之一是有机会将数据重新用于收集和存储数据的系统设计时没有预料到的用途。这种日益多样化的信息格局在以保留底层数据的语义(含义)的方式集成或组合来自各种来源的信息方面带来了重大挑战。集成的困难来自于源的独立性,导致源之间的高度语义异构性。实现高质量的语义集成对于充分利用大数据提供的机会来支持决策至关重要。
拟议的研究引入了“概念格”的概念,作为一种机制,以提供语义的数据,可用于集成异构的独立源。在概念格中,节点是概念,被解释为应用于现象(实例)的谓词(例如,男(约翰))。有向弧表示两个节点之间的“优先”关系;也就是说,拥有一个属性意味着拥有另一个属性(例如,选民(约翰)->成人(约翰))。在这个框架中,节点的语义完全由进入和离开它的弧的模式(以及弧之间的关系)决定。
概念格是领域的“轻量级”概念模型,与依赖于“重量级”模型(全局或中介模式)的传统方法相反。节点可以被认为是类或属性,这仅取决于传入和传出弧的模式。这种语义相对主义促进了概念格的整合,其中概念可以是一个格中的类,但在另一个格中是属性。此外,来自异构源的两个语义上等同的节点可以具有不同的表现形式,指示对属性的值的不同语义解释。
拟议的研究计划将建立在我早期的工作,重点是四个目标,与研究生和学术界的同事。首先,我们将正式定义概念格结构的核心元素,并开发推理机制来导航格。其次,我们将设计和实现一个原型来存储和处理概念格。第三,我们将使用概念格作为基础,通过定义“合并节点”--来自独立格的节点,这些节点具有相同的含义,可以用于联合收割机格,来整合跨异构源的信息。第四,我们将评估概念格在改善异构数据源信息检索方面的有效性。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Parsons, Jeffrey其他文献
Randomized trial to reduce club drug use and HIV risk behaviors among men who have sex with men.
- DOI:
10.1037/a0015588 - 发表时间:
2009-08 - 期刊:
- 影响因子:5.9
- 作者:
Morgenstern, Jon;Bux, Donald A., Jr.;Parsons, Jeffrey;Hagman, Brett T.;Wainberg, Milton;Irwin, Thomas - 通讯作者:
Irwin, Thomas
USING EYE TRACKING TO EXPOSE COGNITIVE PROCESSES IN UNDERSTANDING CONCEPTUAL MODELS
- DOI:
10.25300/misq/2019/14163 - 发表时间:
2019-12-01 - 期刊:
- 影响因子:7.3
- 作者:
Bera, Palash;Soffer, Pnina;Parsons, Jeffrey - 通讯作者:
Parsons, Jeffrey
"HIV Is Still Real": Perceptions of HIV Testing and HIV Prevention Among Black Men Who Have Sex With Men in New York City
- DOI:
10.1177/1557988308315154 - 发表时间:
2009-06-01 - 期刊:
- 影响因子:2.3
- 作者:
Nanin, Jose;Osubu, Tokes;Parsons, Jeffrey - 通讯作者:
Parsons, Jeffrey
Rules About Casual Sex Partners, Relationship Satisfaction, and HIV Risk in Partnered Gay and Bisexual Men
- DOI:
10.1080/0092623x.2012.691948 - 发表时间:
2014-03-01 - 期刊:
- 影响因子:2.5
- 作者:
Grov, Christian;Starks, Tyrel J.;Parsons, Jeffrey - 通讯作者:
Parsons, Jeffrey
The effect of tracking technique on the quality of user experience for augmented reality mobile navigation
- DOI:
10.1007/s11042-017-4810-y - 发表时间:
2018-05-01 - 期刊:
- 影响因子:3.6
- 作者:
Sekhavat, Yoones A.;Parsons, Jeffrey - 通讯作者:
Parsons, Jeffrey
Parsons, Jeffrey的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Parsons, Jeffrey', 18)}}的其他基金
A design theory for observational crowdsourcing
观察众包的设计理论
- 批准号:
RGPIN-2020-04916 - 财政年份:2022
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
A design theory for observational crowdsourcing
观察众包的设计理论
- 批准号:
RGPIN-2020-04916 - 财政年份:2021
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
A design theory for observational crowdsourcing
观察众包的设计理论
- 批准号:
RGPIN-2020-04916 - 财政年份:2020
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using Concept Lattices to Reconcile Semantic Heterogeneity in Data
使用概念格来协调数据中的语义异质性
- 批准号:
RGPIN-2015-03929 - 财政年份:2019
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using Concept Lattices to Reconcile Semantic Heterogeneity in Data
使用概念格来协调数据中的语义异质性
- 批准号:
RGPIN-2015-03929 - 财政年份:2018
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using Concept Lattices to Reconcile Semantic Heterogeneity in Data
使用概念格来协调数据中的语义异质性
- 批准号:
RGPIN-2015-03929 - 财政年份:2017
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using Concept Lattices to Reconcile Semantic Heterogeneity in Data
使用概念格来协调数据中的语义异质性
- 批准号:
RGPIN-2015-03929 - 财政年份:2015
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using classification principles to improve semantic information integration
利用分类原则提高语义信息集成
- 批准号:
155362-2010 - 财政年份:2014
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using classification principles to improve semantic information integration
利用分类原则提高语义信息集成
- 批准号:
155362-2010 - 财政年份:2013
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
Using classification principles to improve semantic information integration
利用分类原则提高语义信息集成
- 批准号:
155362-2010 - 财政年份:2012
- 资助金额:
$ 1.31万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
Developing Teaching Tools to Promote Transfer of Core Concept Knowledge Across Biological Scales and Sub-disciplines.
开发教学工具以促进跨生物尺度和子学科的核心概念知识的转移。
- 批准号:
2336776 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Standard Grant
Individualism and Intentionality: A Research on the Genealogy and Political Ramifications of the Market Concept in Neoliberal Thought
个人主义与意向性:新自由主义思想中市场概念的谱系及其政治影响研究
- 批准号:
24K03432 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Concept Driftの網羅探索
概念漂移综合搜索
- 批准号:
24K15082 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Repurposing Sodium Cromoglycate For Lymphangioleiomyomatosis (LAM): An Open Label, Proof Of Concept And Feasibility Study
重新利用色甘酸钠治疗淋巴管平滑肌瘤病 (LAM):开放标签、概念验证和可行性研究
- 批准号:
MR/Y008618/1 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Research Grant
A Space Warehouse Concept and Ecosystem to Energize European OSAM (STARFAB)
为欧洲 OSAM 注入活力的太空仓库概念和生态系统 (STARFAB)
- 批准号:
10092765 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
EU-Funded
Use and Concept in Neural Machine Translation and Cross-Linguistic Divergence
神经机器翻译和跨语言分歧中的使用和概念
- 批准号:
23K21872 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Proactive Ex Ante Digital Platform Regulations and the Concept of “Fairness”
积极主动的事前数字平台监管和“公平”理念
- 批准号:
24K16261 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Oncological Engineering - A new concept in the treatment of bone metastases
肿瘤工程——治疗骨转移的新概念
- 批准号:
EP/W007096/2 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Research Grant
Developing Teaching Tools to Promote Transfer of Core Concept Knowledge Across Biological Scales and Sub-disciplines.
开发教学工具以促进跨生物尺度和子学科的核心概念知识的转移。
- 批准号:
2336777 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Standard Grant
Developing Teaching Tools to Promote Transfer of Core Concept Knowledge Across Biological Scales and Sub-disciplines.
开发教学工具以促进跨生物尺度和子学科的核心概念知识的转移。
- 批准号:
2336778 - 财政年份:2024
- 资助金额:
$ 1.31万 - 项目类别:
Standard Grant