Collaborative Research: Updating the Militarized Dispute Data Through Crowdsourcing: MID5

协作研究:通过众包更新军事化争端数据:MID5

基本信息

  • 批准号:
    1528409
  • 负责人:
  • 金额:
    $ 69.04万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2015
  • 资助国家:
    美国
  • 起止时间:
    2015-09-15 至 2019-08-31
  • 项目状态:
    已结题

项目摘要

General Summary The Correlates of War Project's Militarized Interstate Dispute (MID) Data is the most prominent and heavily used data collection in the study of international conflict. The most recent version (MID4) was released in 2014 and brings the period covered to 1816-2010. The MID4 project utilized automated text classification procedures to make the process of identifying relevant news stories more efficient. Over the course of that project, the PIs determined the primary bottleneck in the workflow was the coding of those news documents. To address this inefficiency, The PIs completed a pilot project to determine whether crowdsourcing techniques could be used to code these documents. In the pilot, non-expert workers were paid small sums to read documents and to answer sets of questions, the answers to which were used to identify features of possible militarized incidents (the events that comprise MIDs). A systematic comparison of the crowdsourced responses with those of MID4 Project's trained coders revealed that the crowdsourced codings were completely accurate for 68 percent of the news reports coded; more importantly, high agreement among crowd responses on specific reports was strongly associated with correct coding. This enables the PIs to detect which documents require further expert involvement. As a result, the PIs can produce a majority of the MID data in near-realtime and at limited financial cost. These procedures are applied on the MID5 Project, which will update the MID data for the period 2011-2017.Technical Summary The MID5 project workflow begins with document retrieval from LexisNexis and document classification using the software and methods implemented in MID4. We discard the negatively classified documents, and proceed to extract metadata from the positively classified documents including the document title, the news agency that published the report, the date, and any actors mentioned in the text. Crowd workers are recruited through Amazon's Mechanical Turk and paid a wage to read one of these documents and answer a line of simple, objective questions about it. The questionnaire is predefined, but some extracted metadata is automatically inserted into the questionnaire to improve the quality of responses. Several workers complete a questionnaire for each document, leaving the PIs with problems of aggregation: how to combine multiple worker responses, possibly regarding multiple related questions, into usable data necessary to code the militarized incident. In the pilot study, the PIs show that Bayesian networks are the most effective way to achieve this aggregation. Recently, the PIs have made advances in semi-supervised text classification with hybrid, Deep Restricted Boltzmann Machines, which outperform previous methods in this task.
战争相关者项目的军事化州际争端(MID)数据是国际冲突研究中最突出和使用最频繁的数据收集。最新的版本(MID4)于2014年发布,使所涵盖的时期达到1816-2010年。MID4项目利用自动文本分类程序,使确定相关新闻故事的过程更有效率。在该项目的整个过程中,PIS确定工作流程中的主要瓶颈是这些新闻文档的编码。为了解决这种效率低下的问题,私人投资司完成了一个试点项目,以确定是否可以使用众包技术对这些文件进行编码。在试点中,非专家工作人员获得少量报酬,让他们阅读文件和回答一系列问题,这些问题的答案被用来确定可能发生的军事化事件(构成MID的事件)的特征。对众包反应与MID4项目训练有素的编码员的系统比较表明,众包编码对68%的新闻报道完全准确;更重要的是,群众对特定报道的高度一致与正确的编码密切相关。这使私人投资机构能够检测哪些文件需要进一步的专家参与。因此,PI可以以有限的财务成本近乎实时地产生大部分中期数据。这些程序适用于MID5项目,该项目将更新2011-2017年期间的中期数据。技术摘要MID5项目的工作流程始于从LexisNexis检索文件,并使用MID4中实施的软件和方法对文件进行分类。我们丢弃负面分类的文档,然后从肯定分类的文档中提取元数据,包括文档标题、发布报道的新闻机构、日期以及文本中提到的任何参与者。众包工作人员通过亚马逊的机械土耳其人招募,并支付一定的工资来阅读其中一份文件,并回答一系列简单、客观的问题。调查问卷是预定义的,但一些提取的元数据会自动插入到调查问卷中,以提高回答的质量。几名工作人员为每个文件完成一份调查问卷,给PI留下了汇总的问题:如何将多个工作人员的答复(可能涉及多个相关问题)合并为对军事化事件进行编码所需的可用数据。在试点研究中,PI表明贝叶斯网络是实现这种聚合的最有效方式。最近,PI在混合深度受限Boltzmann机器的半监督文本分类方面取得了进展,在这一任务中表现优于以往的方法。

项目成果

期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Learning a Deep Hybrid Model for Semi-Supervised Text Classification
  • DOI:
    10.18653/v1/d15-1053
  • 发表时间:
    2015-09
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Alexander Ororbia;C. Lee Giles;D. Reitter
  • 通讯作者:
    Alexander Ororbia;C. Lee Giles;D. Reitter
Event Ordering with a Generalized Model for Sieve Prediction Ranking
使用筛预测排序的广义模型进行事件排序
A Framework for Computational Models of Human Memory
人类记忆计算模型的框架
Degrees of Separation in Semantic and Syntactic Relationships
语义和句法关系的分离程度
Holographic Declarative Memory: Using distributional semantics within ACT-R
全息陈述性记忆:在 ACT-R 中使用分布式语义
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Glenn Palmer其他文献

Updating the Militarized Interstate Dispute Data: A Response to Gibler, Miller, and Little
更新军事化州际争端数据:对吉布勒、米勒和利特尔的回应
  • DOI:
    10.1093/isq/sqz045
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    2.6
  • 作者:
    Glenn Palmer;Vito D'Orazio;Michael R. Kenwick;Roseanne W. McManus
  • 通讯作者:
    Roseanne W. McManus
The MID4 dataset, 2002–2010: Procedures, coding rules and description
MID4 数据集,2002-2010:程序、编码规则和描述
  • DOI:
    10.1177/0738894214559680
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    2.1
  • 作者:
    Glenn Palmer;Vito D'Orazio;Michael R. Kenwick;M. Lane
  • 通讯作者:
    M. Lane
To Protect and to Serve
保护和服务
  • DOI:
    10.1177/0022002702251028
  • 发表时间:
    2003
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Trefor Owen Morgan;Glenn Palmer
  • 通讯作者:
    Glenn Palmer
Downwelling spectral irradiance during evening twilight as a function of the lunar phase.
黄昏期间下降的光谱辐照度作为月相的函数。
  • DOI:
    10.1364/ao.54.000b85
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    1.9
  • 作者:
    Glenn Palmer;S. Johnsen
  • 通讯作者:
    S. Johnsen
Defense allocations in Eastern Europe: Alliance politics and leadership change∗
东欧的国防分配:联盟政治和领导层变化*
  • DOI:
    10.1080/03050629008434742
  • 发表时间:
    1990
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Glenn Palmer;W. Reisinger
  • 通讯作者:
    W. Reisinger

Glenn Palmer的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Glenn Palmer', 18)}}的其他基金

MID4: UPDATING THE MILITARIZED DISPUTE DATA SET, 2002-2010
MID4:更新军事化争端数据集,2002-2010
  • 批准号:
    0924240
  • 财政年份:
    2009
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Doctoral Dissertation Research in Political Science: In It to Win It? Domestic Politics and Signaling Long Term Resolve in International Crises
政治学博士论文研究:赢得胜利?
  • 批准号:
    0719769
  • 财政年份:
    2007
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Improving the Efficiency of Militarized Interstate Dispute Data Collection using Automated Textual Analysis
使用自动文本分析提高军事化州际争端数据收集的效率
  • 批准号:
    0719634
  • 财政年份:
    2007
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research on Updating the Militarized Interstate Dispute Data
更新军事化州际争端数据的合作研究
  • 批准号:
    0002568
  • 财政年份:
    2000
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Beyond the Water's Edge: Individual Preferences, Domestic Institutions, System Structure and Foreign Policy
合作研究:超越水边:个人偏好、国内制度、制度结构和外交政策
  • 批准号:
    9507909
  • 财政年份:
    1995
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: Updating iVirus - the CyVerse-powered analytical toolkit for viruses of microbes
协作研究:更新 iVirus - CyVerse 支持的微生物病毒分析工具包
  • 批准号:
    2149505
  • 财政年份:
    2022
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: Updating iVirus - the CyVerse-powered analytical toolkit for viruses of microbes
协作研究:更新 iVirus - CyVerse 支持的微生物病毒分析工具包
  • 批准号:
    2149506
  • 财政年份:
    2022
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: A New Nonlinear Modal Updating Framework for Soft, Hydrated Materials
协作研究:用于软水合材料的新型非线性模态更新框架
  • 批准号:
    1728186
  • 财政年份:
    2017
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: A New Nonlinear Modal Updating Framework for Soft, Hydrated Materials
协作研究:用于软水合材料的新型非线性模态更新框架
  • 批准号:
    1727761
  • 财政年份:
    2017
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Updating the Militarized Dispute Data Through Crowdsourcing: MID5
协作研究:通过众包更新军事化争端数据:MID5
  • 批准号:
    1528624
  • 财政年份:
    2015
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: Updating the WeBWorK National Problem Library
合作研究:更新WeBWorK国家问题库
  • 批准号:
    1226081
  • 财政年份:
    2012
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Updating the WeBWorK National Problem Library
合作研究:更新WeBWorK国家问题库
  • 批准号:
    1226176
  • 财政年份:
    2012
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Contentious Issues in World Politics: Updating the ICOW Dataset
合作研究:世界政治中有争议的问题:更新 ICOW 数据集
  • 批准号:
    0960567
  • 财政年份:
    2010
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Contingent Reasoning and Bayesian Updating in Games of Incomplete Information: An Experimental Analysis
协作研究:不完全信息博弈中的条件推理和贝叶斯更新:实验分析
  • 批准号:
    1031101
  • 财政年份:
    2010
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
Collaborative Research: Contingent Reasoning and Bayesian Updating in Games of Incomplete Information: An Experimental Analysis
协作研究:不完全信息博弈中的条件推理和贝叶斯更新:实验分析
  • 批准号:
    1030467
  • 财政年份:
    2010
  • 资助金额:
    $ 69.04万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了