III: Medium: Collaborative Research: Connecting the Ephemeral and Archival Information Networks
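Basic Information
- Award number: 1160894
- Principal investigator:
- Amount: $663,500
- Host institution:
- Host institution country: United States
- Project type: Continuing Grant
- Fiscal year: 2012
- Funding country: United States
- Project period: 2012-08-01 to 2018-07-31
- Project status: Completed
- Source:
- Keywords:
Project Abstract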
This collaborative research project (IIS-1160894, W. Bruce Croft, University of Massachusetts Amherst, and IIS-1160862, Jamie Callan, Carnegie Mellon University) addresses the complex issues that arise because ephemeral information generated as part of social interactions differs in time scale, quantity, and quality from archival information found on the web. The project investigates the hypothesis that, because of the context they provide, the connections between ephemeral and archival information enhance search over either type. It develops new retrieval models and ranking-function features for a range of search tasks that can exploit an integrated ephemeral/archival network. Some search tasks are based on previous TREC blog, microblog, and web activities. The project also investigates two new tasks, conversation retrieval and aggregated social search. Conversation retrieval targets information units in the form of "conversations" or "events" instead of simply retrieving social postings or web pages. Aggregated social search ranks information at different granularities, such as sentence, posting, conversation, or thread, based on the underlying query intent. Research that explores the connections between ephemeral and archival information requires a dataset that contains both types of information. A crucial part of this project extends the archival ClueWeb12 dataset with ephemeral microblog, blog, and discussion forum data that links to the web data. This extension is distributed to the research community as the ClueWeb12++ dataset. This project (http://ciir.cs.umass.edu/research/ephemeral/) is the first to address the full possibilities of search that exploits all the connections and contexts created by bringing together the two "worlds" of information. It also develops and distributes a unique new dataset that supports the development of a new generation of tools to access a broad range of information.
Students at the collaborating institutions, University of Massachusetts Amherst and Carnegie Mellon University, will be involved in educational activities and benefit from research experience.
Project Outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by W. Bruce Croft
The Darwinization of Linguistics
- DOI: 10.1556/select.3.2002.1.7
- Published: 2002
- Journal:
- Impact factor: 0
- Authors: W. Bruce Croft
- Corresponding author: W. Bruce Croft
Clustering large files of documents using the single-link method
- DOI: 10.1002/asi.4630280606
- Published: 1977-11
- Journal:
- Impact factor: 0
- Authors: W. Bruce Croft
- Corresponding author: W. Bruce Croft
Methods for Finding Language Universals in Syntax
- DOI: 10.1007/978-1-4020-8825-4_8
- Published: 2009
- Journal:
- Impact factor: 0
- Authors: W. Bruce Croft
- Corresponding author: W. Bruce Croft
The Speech Community in Evolutionary Language Dynamics
- DOI: 10.1111/j.1467-9922.2009.00535.x
- Published: 2009
- Journal:
- Impact factor: 4.4
- Authors: R. Blythe; W. Bruce Croft
- Corresponding author: W. Bruce Croft
Evolution: Language Use and the Evolution of Languages
- DOI:
- Published: 2013
- Journal:
- Impact factor: 0
- Authors: W. Bruce Croft
- Corresponding author: W. Bruce Croft
Other grants by W. Bruce Croft
III: Small: Searching for Answers through Iterative Feedback
- Award number: 1715095
- Fiscal year: 2017
- Award amount: $663,500
- Project type: Continuing Grant
CI-EN-Collaborative Research: Supporting Research and Teaching for Next-Generation Search Engines in Lemur
- Award number: 1405829
- Fiscal year: 2014
- Award amount: $663,500
- Project type: Standard Grant
III: Small: Understanding the Relevance of Text Passages
- Award number: 1419693
- Fiscal year: 2014
- Award amount: $663,500
- Project type: Standard Grant
CI-ADDO-EN: Collaborative Proposal: Supporting Web-Scale Experimentation using the Lemur Toolkit
- Award number: 0934322
- Fiscal year: 2010
- Award amount: $663,500
- Project type: Continuing Grant
III: Small: Transforming Long Queries
- Award number: 0914442
- Fiscal year: 2009
- Award amount: $663,500
- Project type: Standard Grant
III-COR: Searching Archives of Community Knowledge
- Award number: 0711348
- Fiscal year: 2007
- Award amount: $663,500
- Project type: Continuing Grant
CRI: CRD - Supporting User Data, Privacy, and Evaluation in the Lemur Toolkit
- Award number: 0707801
- Fiscal year: 2007
- Award amount: $663,500
- Project type: Standard Grant
SGER: Breaking the Keyword Bottleneck: Towards More Effective Access of Government Information
- Award number: 0527159
- Fiscal year: 2005
- Award amount: $663,500
- Project type: Standard Grant
Question Triage for Experts and Documents: Expanding the Information Retrieval Function of the NSDL
- Award number: 0226144
- Fiscal year: 2002
- Award amount: $663,500
- Project type: Standard Grant
Similar overseas grants
III : Medium: Collaborative Research: From Open Data to Open Data Curation
- Award number: 2420691
- Fiscal year: 2024
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: Medium: Designing AI Systems with Steerable Long-Term Dynamics
- Award number: 2312865
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
- Award number: 2312932
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
III: Medium: Collaborative Research: Integrating Large-Scale Machine Learning and Edge Computing for Collaborative Autonomous Vehicles
- Award number: 2348169
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Continuing Grant
Collaborative Research: III: Medium: Algorithms for scalable inference and phylodynamic analysis of tumor haplotypes using low-coverage single cell sequencing data
- Award number: 2415562
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: Medium: VirtualLab: Integrating Deep Graph Learning and Causal Inference for Multi-Agent Dynamical Systems
- Award number: 2312501
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: Medium: Knowledge discovery from highly heterogeneous, sparse and private data in biomedical informatics
- Award number: 2312862
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
- Award number: 2312930
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: Medium: New Machine Learning Empowered Nanoinformatics System for Advancing Nanomaterial Design
- Award number: 2347592
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant
Collaborative Research: III: Medium: Graph Neural Networks for Heterophilous Data: Advancing the Theory, Models, and Applications
- Award number: 2406648
- Fiscal year: 2023
- Award amount: $663,500
- Project type: Standard Grant