RIDIR: Collaborative Research: Analytical tools for text based social data integration
RIDIR:协作研究:基于文本的社交数据集成的分析工具
基本信息
- 批准号:1738411
- 负责人:
- 金额:$ 119.48万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2017
- 资助国家:美国
- 起止时间:2017-09-01 至 2021-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
When something happens in the world -- such as a natural disaster, an election, a protest, or a policy change -- many types of media record different accounts of the same event. Newspapers, social media posts and government documents all provide unique versions of events stored in different formats. Because each source provides its own perspective, synthesizing these stories vastly increase our ability to learn about both events and the dynamics of the media environment. Yet, social scientists are limited in their capacity to access these myriad perspectives because there are few tools for automatically combining these accounts into one integrated analysis. This project will provide a rich infrastructure for integrating texts from diverse sources documenting the same social phenomenon. Such integration often reveals much about underlying social dynamics.This project will develop a tool to integrate documents with different formats with accounts of the same or closely related events through four main methods. First, the tool will allow users to align documents by topic, while accounting for structural and stylistic differences between documents. Second, the tool will compile different types of documents by a shared event or entity. Third, the tool will allow for user-provided schema to combine semi-structured documents. Last, the tool will facilitate data fusion, by identifying and resolving contradictions from multiple sources. The tool will be sufficiently flexible to fit multiple research purposes, allow for human feedback to assist with integration, and facilitate reproducibility by creating a common resource that can be the basis of future research by a whole community of scholars. The system itself will be applicable to almost any set of unstructured text data and will have broad applicability for questions across the social sciences.
当世界上发生一些事情时,比如自然灾害、选举、抗议或政策变化,许多类型的媒体都会对同一事件进行不同的报道。报纸、社交媒体帖子和政府文件都提供了以不同格式存储的事件的独特版本。因为每个来源都提供了自己的视角,综合这些故事大大提高了我们了解事件和媒体环境动态的能力。然而,社会科学家获得这些无数观点的能力有限,因为很少有工具可以自动将这些观点组合成一个综合分析。该项目将提供丰富的基础设施,用于整合记录同一社会现象的不同来源的文本。这种整合往往揭示了许多潜在的社会动态。该项目将开发一种工具,通过四种主要方法将不同格式的文档与相同或密切相关事件的描述集成在一起。首先,该工具将允许用户按主题对齐文档,同时考虑文档之间的结构和风格差异。其次,该工具将根据共享事件或实体编译不同类型的文档。第三,该工具将允许用户提供的模式来组合半结构化文档。最后,该工具将通过识别和解决来自多个来源的矛盾,促进数据融合。该工具将足够灵活,以适应多种研究目的,允许人类的反馈来协助整合,并通过创建一个共同的资源来促进可重复性,该资源可以成为整个学者社区未来研究的基础。该系统本身将适用于几乎任何一组非结构化文本数据,并将广泛适用于整个社会科学的问题。
项目成果
期刊论文数量(10)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Adjusting for Confounding with Text Matching
调整文本匹配的混淆
- DOI:10.1111/ajps.12526
- 发表时间:2020
- 期刊:
- 影响因子:4.2
- 作者:Roberts, Margaret E.;Stewart, Brandon M.;Nielsen, Richard A.
- 通讯作者:Nielsen, Richard A.
Censorship of Online Encyclopedias: Implications for NLP Models
- DOI:10.1145/3442188.3445916
- 发表时间:2021-01
- 期刊:
- 影响因子:0
- 作者:Eddie Yang;Margaret E. Roberts
- 通讯作者:Eddie Yang;Margaret E. Roberts
Mass Digitization of Chinese Court Decisions: How to Use Text as Data in the Field of Chinese Law
中国法院判决的大规模数字化:如何在中国法律领域使用文本作为数据
- DOI:10.1086/709916
- 发表时间:2020
- 期刊:
- 影响因子:1.4
- 作者:Liebman, Benjamin L.;Roberts, Margaret E.;Stern, Rachel E.;Wang, Alice Z.
- 通讯作者:Wang, Alice Z.
Social network of extreme tweeters: a case study
极端推特用户的社交网络:案例研究
- DOI:10.1145/3341161.3342909
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Zheng, Xiuwen;Gupta, Amarnath
- 通讯作者:Gupta, Amarnath
Multi-model Investigative Exploration of Social Media Data with BOUTIQUE: A Case Study in Public Health
- DOI:10.1109/escience.2019.00030
- 发表时间:2019-05
- 期刊:
- 影响因子:0
- 作者:Junan Guo;S. Dasgupta;Amarnath Gupta
- 通讯作者:Junan Guo;S. Dasgupta;Amarnath Gupta
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Margaret Roberts其他文献
Scoping the Institute for Research on the Information Environment
确定信息环境研究所的范围
- DOI:
- 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Nils B. Weidmann;Margaret Roberts;Zachary C. Steinert;S. Hellmeier - 通讯作者:
S. Hellmeier
Efficacy of home-based visuomotor feedback training in stroke patients with chronic hemispatial neglect
家庭视觉运动反馈训练对慢性半侧空间忽视的卒中患者的疗效
- DOI:
10.1080/09602011.2016.1273119 - 发表时间:
2019 - 期刊:
- 影响因子:2.7
- 作者:
Stéphanie Rossit;C. Benwell;Larissa Szymanek;G. Learmonth;Laura McKernan;E. Corrigan;K. Muir;I. Reeves;George Duncan;P. Birschel;Margaret Roberts;K. Livingstone;Hazel Jackson;P. Castle;M. Harvey - 通讯作者:
M. Harvey
Powerful knowledge and geographical education
强大的知识和地理教育
- DOI:
- 发表时间:
2014 - 期刊:
- 影响因子:0
- 作者:
Margaret Roberts - 通讯作者:
Margaret Roberts
News from OMEP
- DOI:
10.1007/bf03174543 - 发表时间:
1987-09-01 - 期刊:
- 影响因子:1.800
- 作者:
Margaret Roberts;Margaret Weiser - 通讯作者:
Margaret Weiser
OMEP — What it is and how it works
- DOI:
10.1007/bf03174942 - 发表时间:
1983-03-01 - 期刊:
- 影响因子:1.800
- 作者:
Margaret Roberts - 通讯作者:
Margaret Roberts
Margaret Roberts的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
相似海外基金
RIDIR: Collaborative Research: Bayesian analytical tools to improve survey estimates for subpopulations and small areas
RIDIR:协作研究:贝叶斯分析工具,用于改进亚人群和小区域的调查估计
- 批准号:
1926424 - 财政年份:2019
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR Collaborative Research: Building a Database to Determine Environmental and Familial Effects on Social and Biological Factors
RIDIR 合作研究:建立数据库以确定环境和家庭对社会和生物因素的影响
- 批准号:
1926601 - 财政年份:2019
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR Collaborative Research: Building a Database to Determine Environmental and Familial Effects on Social and Biological Factors
RIDIR 合作研究:建立数据库以确定环境和家庭对社会和生物因素的影响
- 批准号:
1926481 - 财政年份:2019
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR Collaborative Research: Building a Database to Determine Environmental and Familial Effects on Social and Biological Factors
RIDIR 合作研究:建立数据库以确定环境和家庭对社会和生物因素的影响
- 批准号:
1926528 - 财政年份:2019
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR: Collaborative Research: Bayesian analytical tools to improve survey estimates for subpopulations and small areas
RIDIR:协作研究:贝叶斯分析工具,用于改进亚人群和小区域的调查估计
- 批准号:
1926578 - 财政年份:2019
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR Collaborative Research: Building a Database to Determine Environmental and Familial Effects on Social and Biological Factors
RIDIR 合作研究:建立数据库以确定环境和家庭对社会和生物因素的影响
- 批准号:
1926402 - 财政年份:2019
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR: Collaborative Research: Data Science Tools for Policy Analyses
RIDIR:协作研究:用于政策分析的数据科学工具
- 批准号:
1831921 - 财政年份:2018
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR: Collaborative Research: Enabling Access to and Analysis of Shared Daylong Child and Family Audio Data
RIDIR:协作研究:能够访问和分析共享的全天儿童和家庭音频数据
- 批准号:
1827744 - 财政年份:2018
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR: Collaborative Research: A Data Science Platform and Mechanisms for Its Sustainability
RIDIR:协作研究:数据科学平台及其可持续性机制
- 批准号:
1831551 - 财政年份:2018
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant
RIDIR: Collaborative Research: Integrated Communication Database and Computational Tools
RIDIR:协作研究:集成通信数据库和计算工具
- 批准号:
1831481 - 财政年份:2018
- 资助金额:
$ 119.48万 - 项目类别:
Standard Grant