Collaborative Research: Updating the Militarized Dispute Data Through Crowdsourcing: MID5

协作研究:通过众包更新军事化争端数据:MID5

基本信息

  • 批准号:
    1528409
  • 负责人:
  • 金额:
    $ 69.04万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2015
  • 资助国家:
    美国
  • 起止时间:
    2015-09-15 至 2019-08-31
  • 项目状态:
    已结题

项目摘要

General Summary The Correlates of War Project's Militarized Interstate Dispute (MID) Data is the most prominent and heavily used data collection in the study of international conflict. The most recent version (MID4) was released in 2014 and brings the period covered to 1816-2010. The MID4 project utilized automated text classification procedures to make the process of identifying relevant news stories more efficient. Over the course of that project, the PIs determined the primary bottleneck in the workflow was the coding of those news documents. To address this inefficiency, The PIs completed a pilot project to determine whether crowdsourcing techniques could be used to code these documents. In the pilot, non-expert workers were paid small sums to read documents and to answer sets of questions, the answers to which were used to identify features of possible militarized incidents (the events that comprise MIDs). A systematic comparison of the crowdsourced responses with those of MID4 Project's trained coders revealed that the crowdsourced codings were completely accurate for 68 percent of the news reports coded; more importantly, high agreement among crowd responses on specific reports was strongly associated with correct coding. This enables the PIs to detect which documents require further expert involvement. As a result, the PIs can produce a majority of the MID data in near-realtime and at limited financial cost. These procedures are applied on the MID5 Project, which will update the MID data for the period 2011-2017.Technical Summary The MID5 project workflow begins with document retrieval from LexisNexis and document classification using the software and methods implemented in MID4. We discard the negatively classified documents, and proceed to extract metadata from the positively classified documents including the document title, the news agency that published the report, the date, and any actors mentioned in the text. Crowd workers are recruited through Amazon's Mechanical Turk and paid a wage to read one of these documents and answer a line of simple, objective questions about it. The questionnaire is predefined, but some extracted metadata is automatically inserted into the questionnaire to improve the quality of responses. Several workers complete a questionnaire for each document, leaving the PIs with problems of aggregation: how to combine multiple worker responses, possibly regarding multiple related questions, into usable data necessary to code the militarized incident. In the pilot study, the PIs show that Bayesian networks are the most effective way to achieve this aggregation. Recently, the PIs have made advances in semi-supervised text classification with hybrid, Deep Restricted Boltzmann Machines, which outperform previous methods in this task.
一般摘要 战争相关项目的军事化州际争端 (MID) 数据是国际冲突研究中最突出且使用最多的数据集合。最新版本 (MID4) 于 2014 年发布,涵盖的时期为 1816 年至 2010 年。 MID4 项目利用自动文本分类程序来提高识别相关新闻报道的过程的效率。在该项目的过程中,PI 确定工作流程中的主要瓶颈是这些新闻文档的编码。为了解决效率低下的问题,PI 完成了一个试点项目,以确定是否可以使用众包技术对这些文档进行编码。在试点中,非专业工作人员获得少量报酬来阅读文件并回答一系列问题,这些问题的答案被用来识别可能的军事化事件(构成 MID 的事件)的特征。对众包响应与 MID4 项目训练有素的编码员的响应进行系统比较后发现,众包编码对于 68% 的新闻报道来说是完全准确的;更重要的是,人群对特定报告的反应高度一致与正确编码密切相关。这使得 PI 能够检测哪些文档需要进一步的专家参与。因此,PI 可以以有限的财务成本近乎实时地生成大部分 MID 数据。这些程序适用于 MID5 项目,该项目将更新 2011-2017 年期间的 MID 数据。技术摘要 MID5 项目工作流程首先从 LexisNexis 进行文档检索,然后使用 MID4 中实现的软件和方法进行文档分类。我们丢弃负面分类的文件,并继续从正面分类的文件中提取元数据,包括文件标题、发布报告的新闻机构、日期以及文本中提到的任何参与者。人群工作人员是通过亚马逊的 Mechanical Turk 招募的,并支付工资来阅读其中一份文件并回答一系列简单、客观的问题。调查问卷是预定义的,但一些提取的元数据会自动插入到调查问卷中,以提高答复质量。多名工作人员针对每个文档完成一份调查问卷,这给 PI 留下了聚合问题:如何将多个工作人员的回答(可能涉及多个相关问题)组合成对军事化事件进行编码所需的可用数据。在试点研究中,PI 表明贝叶斯网络是实现这种聚合的最有效方法。最近,PI 在使用混合深度受限玻尔兹曼机的半监督文本分类方面取得了进展,在该任务中优于以前的方法。

项目成果

期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Learning a Deep Hybrid Model for Semi-Supervised Text Classification
  • DOI:
    10.18653/v1/d15-1053
  • 发表时间:
    2015-09
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Alexander Ororbia;C. Lee Giles;D. Reitter
  • 通讯作者:
    Alexander Ororbia;C. Lee Giles;D. Reitter
Event Ordering with a Generalized Model for Sieve Prediction Ranking
使用筛预测排序的广义模型进行事件排序
A Framework for Computational Models of Human Memory
人类记忆计算模型的框架
Degrees of Separation in Semantic and Syntactic Relationships
语义和句法关系的分离程度
Holographic Declarative Memory: Using distributional semantics within ACT-R
全息陈述性记忆:在 ACT-R 中使用分布式语义
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Glenn Palmer其他文献

Updating the Militarized Interstate Dispute Data: A Response to Gibler, Miller, and Little
更新军事化州际争端数据:对吉布勒、米勒和利特尔的回应
  • DOI:
    10.1093/isq/sqz045
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    2.6
  • 作者:
    Glenn Palmer;Vito D'Orazio;Michael R. Kenwick;Roseanne W. McManus
  • 通讯作者:
    Roseanne W. McManus
The MID4 dataset, 2002–2010: Procedures, coding rules and description
MID4 数据集,2002-2010:程序、编码规则和描述
  • DOI:
    10.1177/0738894214559680
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    2.1
  • 作者:
    Glenn Palmer;Vito D'Orazio;Michael R. Kenwick;M. Lane
  • 通讯作者:
    M. Lane
To Protect and to Serve
保护和服务
  • DOI:
    10.1177/0022002702251028
  • 发表时间:
    2003
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Trefor Owen Morgan;Glenn Palmer
  • 通讯作者:
    Glenn Palmer
Downwelling spectral irradiance during evening twilight as a function of the lunar phase.
黄昏期间下降的光谱辐照度作为月相的函数。
  • DOI:
    10.1364/ao.54.000b85
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    1.9
  • 作者:
    Glenn Palmer;S. Johnsen
  • 通讯作者:
    S. Johnsen
A Theory of Foreign Policy
外交政策理论
  • DOI:
    10.1515/9781400832644
  • 发表时间:
    2006
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Glenn Palmer;T. Morgan
  • 通讯作者:
    T. Morgan

Glenn Palmer的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Glenn Palmer', 18)}}的其他基金

MID4: UPDATING THE MILITARIZED DISPUTE DATA SET, 2002-2010
MID4:更新军事化争端数据集,2002-2010
  • 批准号:
    0924240
  • 财政年份:
    2009
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Improving the Efficiency of Militarized Interstate Dispute Data Collection using Automated Textual Analysis
使用自动文本分析提高军事化州际争端数据收集的效率
  • 批准号:
    0719634
  • 财政年份:
    2007
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Doctoral Dissertation Research in Political Science: In It to Win It? Domestic Politics and Signaling Long Term Resolve in International Crises
政治学博士论文研究:赢得胜利?
  • 批准号:
    0719769
  • 财政年份:
    2007
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research on Updating the Militarized Interstate Dispute Data
更新军事化州际争端数据的合作研究
  • 批准号:
    0002568
  • 财政年份:
    2000
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Beyond the Water's Edge: Individual Preferences, Domestic Institutions, System Structure and Foreign Policy
合作研究:超越水边:个人偏好、国内制度、制度结构和外交政策
  • 批准号:
    9507909
  • 财政年份:
    1995
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: Updating iVirus - the CyVerse-powered analytical toolkit for viruses of microbes
协作研究:更新 iVirus - CyVerse 支持的微生物病毒分析工具包
  • 批准号:
    2149505
  • 财政年份:
    2022
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: Updating iVirus - the CyVerse-powered analytical toolkit for viruses of microbes
协作研究:更新 iVirus - CyVerse 支持的微生物病毒分析工具包
  • 批准号:
    2149506
  • 财政年份:
    2022
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: A New Nonlinear Modal Updating Framework for Soft, Hydrated Materials
协作研究:用于软水合材料的新型非线性模态更新框架
  • 批准号:
    1728186
  • 财政年份:
    2017
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: A New Nonlinear Modal Updating Framework for Soft, Hydrated Materials
协作研究:用于软水合材料的新型非线性模态更新框架
  • 批准号:
    1727761
  • 财政年份:
    2017
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Updating the Militarized Dispute Data Through Crowdsourcing: MID5
协作研究:通过众包更新军事化争端数据:MID5
  • 批准号:
    1528624
  • 财政年份:
    2015
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: Updating the WeBWorK National Problem Library
合作研究:更新WeBWorK国家问题库
  • 批准号:
    1226081
  • 财政年份:
    2012
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Updating the WeBWorK National Problem Library
合作研究:更新WeBWorK国家问题库
  • 批准号:
    1226176
  • 财政年份:
    2012
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Contentious Issues in World Politics: Updating the ICOW Dataset
合作研究:世界政治中有争议的问题:更新 ICOW 数据集
  • 批准号:
    0960567
  • 财政年份:
    2010
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Contingent Reasoning and Bayesian Updating in Games of Incomplete Information: An Experimental Analysis
协作研究:不完全信息博弈中的条件推理和贝叶斯更新:实验分析
  • 批准号:
    1031101
  • 财政年份:
    2010
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Contingent Reasoning and Bayesian Updating in Games of Incomplete Information: An Experimental Analysis
协作研究:不完全信息博弈中的条件推理和贝叶斯更新:实验分析
  • 批准号:
    1030467
  • 财政年份:
    2010
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了