Aggregating and Semanticizing Unstructured Online Event Information

非结构化在线事件信息的聚合和语义化

基本信息

  • 批准号:
    445911-2012
  • 负责人:
  • 金额:
    $ 1.82万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Engage Grants Program
  • 财政年份:
    2012
  • 资助国家:
    加拿大
  • 起止时间:
    2012-01-01 至 2013-12-31
  • 项目状态:
    已结题

项目摘要

Nightbat is a Web 2.0 company that specializes in the area of online sale of tickets for local events and gather-ings. Local event organizers can manage announcement, registration and scheduling through Nightbat. Currently the services provided through Nightbat are free for event organizers and only a fix per registration charge is made for the participants. The success of Nightbat relies heavily on the number of events that are hosted on its website and the number of people who visit the website on a daily basis, which is now dependent on the mar-keting that happens through different channels. More recently, Nightbat has become interested in developing intelligent Web crawler technology that would automatically browse Websites, identify whether they contain local event information to extract such information to be added to their event information base. The advantage of this approach is that Nightbat will be able to offer its users a one-stop shop for local events that have been automatically identified and collocated from various sources over the Web. The main challenges that Nightbat faces are: i) event information are not centrally gathered in specific websites at the moment and they are dis-persed over various locations across the Web. Some organizers might only advertise on their own Website, while others might announce their event on social networks or online classified advertisement websites; there-fore, finding the right places to automatically extract event information is a challenge; ii) Once an event is identi-fied on a Website, the main challenge is to extract structured information from the unstructured event infor-mation that has been posted by the organizers. Given there are no standards for publishing event information online, event organizers publish their events as unstructured textual data often in HTML format that is under-standable for humans but not interpretable by machines. Therefore, for Nightbat to be able to effectively aggre-gate event information from the Web, it will need to first develop intelligent crawling technology that enables it to find information related to local events online and second be able to semantically extract useful event infor-mation from the otherwise unstructured event information.
Nightbat是一家Web 2.0公司,专门从事当地活动和庆典的在线销售门票。当地活动组织者可以通过Nightbat管理公告、注册和日程安排。目前,通过Nightbat提供的服务对活动组织者是免费的,对参与者只收取固定的注册费。Nightbat的成功在很大程度上依赖于其网站上举办的活动数量和每天访问网站的人数,而这现在取决于通过不同渠道进行的营销。最近,Nightbat开始对开发智能网络爬虫技术感兴趣,该技术将自动浏览网站,识别它们是否包含本地事件信息,以提取此类信息添加到其事件信息库中。这种方法的优点是,Nightbat将能够为用户提供一站式的本地活动,这些活动已经通过网络从各种来源自动识别和配置。Nightbat面临的主要挑战是:i)事件信息目前没有集中收集在特定的网站上,它们分散在网络上的各个位置。一些组织者可能只在他们自己的网站上做广告,而另一些组织者可能在社交网络或在线分类广告网站上宣布他们的活动;因此,找到正确的地方来自动提取活动信息是一个挑战; ii)一旦在网站上识别出活动,主要的挑战是从组织者发布的非结构化活动信息中提取结构化信息。由于没有在线发布事件信息的标准,事件组织者通常以HTML格式发布非结构化文本数据,这些数据对于人类来说是可以理解的,但机器无法解释。因此,Nightbat要想有效地从Web上聚合事件信息,首先需要开发智能爬行技术,使其能够在线查找与本地事件相关的信息,其次才能从非结构化事件信息中提取有用的事件信息。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Bagheri, Ebrahim其他文献

Semantic tagging and linking of software engineering social content
  • DOI:
    10.1007/s10515-014-0146-2
  • 发表时间:
    2016-06-01
  • 期刊:
  • 影响因子:
    3.4
  • 作者:
    Bagheri, Ebrahim;Ensan, Faezeh
  • 通讯作者:
    Ensan, Faezeh
RysannMD: A biomedical semantic annotator balancing speed and accuracy
  • DOI:
    10.1016/j.jbi.2017.05.016
  • 发表时间:
    2017-07-01
  • 期刊:
  • 影响因子:
    4.5
  • 作者:
    Cuzzola, John;Jovanovic, Jelena;Bagheri, Ebrahim
  • 通讯作者:
    Bagheri, Ebrahim
Query expansion using pseudo relevance feedback on wikipedia
  • DOI:
    10.1007/s10844-017-0466-3
  • 发表时间:
    2018-06-01
  • 期刊:
  • 影响因子:
    3.4
  • 作者:
    Keikha, Andisheh;Ensan, Faezeh;Bagheri, Ebrahim
  • 通讯作者:
    Bagheri, Ebrahim
Message from KAMIoT-2012 workshop chairs
KAMIoT-2012 研讨会主席致辞
The state of the art in critical infrastructure protection: a framework for convergence

Bagheri, Ebrahim的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Bagheri, Ebrahim', 18)}}的其他基金

Dynamic Runtime Software Architecture Adaptation
动态运行时软件架构适配
  • 批准号:
    RGPIN-2015-06118
  • 财政年份:
    2022
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Social Information Retrieval
社会信息检索
  • 批准号:
    CRC-2020-00040
  • 财政年份:
    2022
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Canada Research Chairs
Data analytics for device identification
用于设备识别的数据分析
  • 批准号:
    560268-2020
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Alliance Grants
NSERC CREATE in Responsible Development of AI (RAI)
NSERC CREATE 人工智能负责任开发 (RAI)
  • 批准号:
    554764-2021
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative Research and Training Experience
Dynamic Runtime Software Architecture Adaptation
动态运行时软件架构适配
  • 批准号:
    RGPIN-2015-06118
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Social Information Retrieval
社会信息检索
  • 批准号:
    CRC-2020-00040
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Canada Research Chairs
NSERC/Warranty Life Industrial Research Chair in Social Media Analytics
NSERC/保修生命工业研究社交媒体分析主席
  • 批准号:
    513204-2016
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Industrial Research Chairs
Social Information Retrieval
社会信息检索
  • 批准号:
    1000233085-2019
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Canada Research Chairs
Data analytics for device identification
用于设备识别的数据分析
  • 批准号:
    560268-2020
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Alliance Grants
NSERC/Warranty Life Industrial Research Chair in Social Media Analytics
NSERC/保修生命工业研究社交媒体分析主席
  • 批准号:
    513204-2016
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Industrial Research Chairs
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了