Constructing and Validating an Automated Coding System for Electronic News Sources
构建和验证电子新闻来源自动编码系统
基本信息
- 批准号:1423784
- 负责人:
- 金额:$ 34.18万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2014
- 资助国家:美国
- 起止时间:2014-08-15 至 2017-07-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
This project builds, tests, and validates an open-source automated system for coding social movement data from electronically available news sources. Although no source is perfect, scholars agree that the best general source of such data, specifically that on social protest, is a compilation of a large number of news sources. This project draws on the advances in machine learning developed in computer science and statistics and combining them with our deep substantive knowledge as sociologists of the problems of identifying and coding collective action events in news sources. An existing highly-regarded hand-coded data set based on the New York Times is used as the reference for "training" machine learning algorithms -- called "classifiers" -- to recognize elements of an action event in a news article and extract relevant information. We also collect and hand-code new data drawn from other regional, national, and international news sources to provide additional training sets to increase the range and variety of protests we are able to detect. A supplement to the project provides research experience for undergraduates who will be involved in collecting and coding these new data.This project builds, tests, and validates an open-source automated system for coding data from electronically available news sources. It advances the state of data collection in social science and employs the latest developments in natural language processing and supervised machine learning within computer science and statistics. The result will be an open-source, publicly available system that may be used by other researchers and further improved and expanded.This project promises to provide an important new methodological tool of broad interdisciplinary value to social scientists and to open the door to more efficiently compiling collective action-data from news sources that can improve both academic scholarship and public policy. All of the code for this project will be released under an open-source license and publicly accessible through a public source code repository. This work will be accessible and useful to scholars in social movements, international relations, and foreign policy. The work will also be of use to a large number of non-academics, such as foreign policy analysts and decision-makers, journalists, and those interested in computational methods of textual analysis and classification. The ability to code collective action data more efficiently and accurately from news sources has broad policy applicability.
该项目构建、测试和验证一个开源自动化系统,用于对来自电子新闻源的社会运动数据进行编码。虽然没有一个来源是完美的,但学者们一致认为,这类数据的最佳一般来源,特别是关于社会抗议的数据,是大量新闻来源的汇编。该项目借鉴了计算机科学和统计学中机器学习的进步,并将其与我们作为社会学家对识别和编码新闻源中集体行动事件的问题的深刻实质性知识相结合。基于纽约时报的现有高度重视的手工编码数据集被用作“训练”机器学习算法的参考-称为“分类器”-以识别新闻文章中的动作事件的元素并提取相关信息。我们还收集和手工编码来自其他地区,国家和国际新闻来源的新数据,以提供额外的训练集,以增加我们能够检测到的抗议活动的范围和种类。该项目的补充为将参与收集和编码这些新数据的本科生提供了研究经验。该项目构建、测试和验证了一个开源自动化系统,用于对电子新闻源的数据进行编码。它推进了社会科学中数据收集的状态,并采用了计算机科学和统计学中自然语言处理和监督机器学习的最新发展。其结果将是一个开源的,公开可用的系统,可用于其他研究人员和进一步改进和扩展,该项目有望提供一个重要的新的方法工具,广泛的跨学科价值的社会科学家和打开大门,更有效地汇编集体行动的数据,从新闻来源,可以改善学术奖学金和公共政策。该项目的所有代码都将在开源许可证下发布,并通过公共源代码库公开访问。这项工作将是访问和有用的学者在社会运动,国际关系和外交政策。这项工作也将用于大量的非学者,如外交政策分析师和决策者,记者,以及那些有兴趣在文本分析和分类的计算方法。从新闻来源更有效、更准确地编码集体行动数据的能力具有广泛的政策适用性。
项目成果
期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Black Protests in the United States, 1994 to 2010
1994 年至 2010 年美国黑人抗议活动
- DOI:10.15195/v9.a12
- 发表时间:2022
- 期刊:
- 影响因子:3.4
- 作者:Oliver, Pamela;Lim, Chaeyoon;Matthews, Morgan;Hanna, Alex
- 通讯作者:Hanna, Alex
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Pamela Oliver其他文献
The Revolt of the Reviewers: Towards Fixing a Broken Publishing Process
- DOI:
10.1007/s12108-016-9319-8 - 发表时间:
2016-05-16 - 期刊:
- 影响因子:1.100
- 作者:
Pamela Oliver - 通讯作者:
Pamela Oliver
Poster 8: Low Vision Rehabilitation of an HIV+ Patient With Retinal Necrosis and Optic Atrophy
- DOI:
10.1016/j.optm.2008.04.015 - 发表时间:
2008-06-01 - 期刊:
- 影响因子:
- 作者:
Jamie Althoff;Pamela Oliver - 通讯作者:
Pamela Oliver
Poster 49: Papillophlebitis Associated With Autoimmune Factors
- DOI:
10.1016/j.optm.2007.04.051 - 发表时间:
2007-06-01 - 期刊:
- 影响因子:
- 作者:
Sarah E. Hill;Joseph Sowka;Pamela Oliver - 通讯作者:
Pamela Oliver
Pamela Oliver的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Pamela Oliver', 18)}}的其他基金
Comparing Coverage of Public Events Across Different Types of News Sources
比较不同类型新闻来源对公共事件的报道
- 批准号:
2214160 - 财政年份:2022
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Generating High-Quality Verifiable Relational Data About News Coverage of Social Movement Events
生成有关社会运动事件新闻报道的高质量可验证关系数据
- 批准号:
1918342 - 财政年份:2019
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Doctoral Dissertation Research: Residential Segregation and Policing Styles
博士论文研究:居住隔离和警务风格
- 批准号:
1602697 - 财政年份:2016
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Doctoral Dissertation Research: Filipino Military Service & The Promise of Benefits
博士论文研究:菲律宾兵役
- 批准号:
1519125 - 财政年份:2015
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Prison Privatization and Public Discourse
监狱私有化和公共话语
- 批准号:
0925328 - 财政年份:2009
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Doctoral Dissertation Research: Media, Social Context and Public Discourse
博士论文研究:媒体、社会背景和公共话语
- 批准号:
0828479 - 财政年份:2008
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Tracking the Causes and Consequences of Racial Disparities in Imprisonment
追踪监狱中种族差异的原因和后果
- 批准号:
0136833 - 财政年份:2002
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Doctoral Dissertation Research: Cooperation and Conflict between "Old" and "New" Social Movements: The Case of Organized Labor and the Environmental Movement
博士论文研究:“旧”与“新”社会运动的合作与冲突:有组织劳工与环保运动的案例
- 批准号:
9900608 - 财政年份:1999
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
The Content and Timing of Media Coverage of Message Events: Cycles and Comparisions
消息事件媒体报道的内容和时机:周期和比较
- 批准号:
9819884 - 财政年份:1999
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Models of the Diffusion of Collective Action
集体行动扩散模型
- 批准号:
9601409 - 财政年份:1996
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
相似海外基金
Validating FilaChar Use in Wastewater Treatment
验证 FilaChar 在废水处理中的使用
- 批准号:
10106623 - 财政年份:2024
- 资助金额:
$ 34.18万 - 项目类别:
Launchpad
Validating the efficacy of SITREX in preventing heterotopic ossification
验证 SITREX 在预防异位骨化方面的功效
- 批准号:
MR/Z503782/1 - 财政年份:2024
- 资助金额:
$ 34.18万 - 项目类别:
Research Grant
NSF Postdoctoral Fellowship in Biology: Identifying and Validating Missing Links in the Global Bat-Virus Network
美国国家科学基金会生物学博士后奖学金:识别和验证全球蝙蝠病毒网络中缺失的环节
- 批准号:
2305782 - 财政年份:2024
- 资助金额:
$ 34.18万 - 项目类别:
Fellowship Award
Validating the Use of Point-Of-Care Diagnostics for Early Detection of Human Papillomavirus
验证床旁诊断在人乳头瘤病毒早期检测中的应用
- 批准号:
EP/Y003225/1 - 财政年份:2024
- 资助金额:
$ 34.18万 - 项目类别:
Research Grant
Innovating and Validating Scalable Monte Carlo Methods
创新和验证可扩展的蒙特卡罗方法
- 批准号:
DE240101190 - 财政年份:2024
- 资助金额:
$ 34.18万 - 项目类别:
Discovery Early Career Researcher Award
Validating the performance of graphene sensors using advanced metrology
使用先进计量学验证石墨烯传感器的性能
- 批准号:
10039216 - 财政年份:2023
- 资助金额:
$ 34.18万 - 项目类别:
Collaborative R&D
Collaborative Research: SaTC: CORE: Small: Measuring, Validating and Improving upon App-Based Privacy Nutrition Labels
合作研究:SaTC:核心:小型:测量、验证和改进基于应用程序的隐私营养标签
- 批准号:
2247952 - 财政年份:2023
- 资助金额:
$ 34.18万 - 项目类别:
Standard Grant
Developing and validating a training program to improve domain-specific working memory efficiency in second language.
开发和验证培训计划,以提高第二语言特定领域的工作记忆效率。
- 批准号:
23K17499 - 财政年份:2023
- 资助金额:
$ 34.18万 - 项目类别:
Grant-in-Aid for Challenging Research (Exploratory)
A synthetic data and generative A.I approach to verifying and validating A.I
用于验证和验证人工智能的合成数据和生成人工智能方法
- 批准号:
10065801 - 财政年份:2023
- 资助金额:
$ 34.18万 - 项目类别:
Collaborative R&D
Battery Materials R&D Centre of Excellence, UK: Validating the lithium refining process flowsheet and designing a facility for rapid scale up
电池材料研究
- 批准号:
10077163 - 财政年份:2023
- 资助金额:
$ 34.18万 - 项目类别:
BEIS-Funded Programmes