HNDS-I: From Stacks to Stats: Unlocking International Census Data from Print Volumes
HNDS-I:从堆栈到统计:从印刷卷中解锁国际人口普查数据
基本信息
- 批准号:2121891
- 负责人:
- 金额:$ 100万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2021
- 资助国家:美国
- 起止时间:2021-09-01 至 2025-08-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Nearly every country in the world conducts a census. These censuses generate information about who people are, their education, and where they work and live. Census data are important for understanding how human populations change over time. However, most of these data are found only in printed volumes on library shelves. It is challenging to get data from printed pages ready for analysis, so these data are difficult to use for research. This project will collect census volumes; scan the data inside; convert the data into digital formats that are easy to analyze; and make them freely available to researchers worldwide. Making these data easy to access will allow researchers, policy makers, and others to answer questions about important issues like aging, migration, fertility, and mortality. This project builds on work that created the IPUMS International Historical Geographic Information System (IHGIS). IHGIS software currently works to process and document data from PDF documents. This project extends the IHGIS tools to work with print volumes by using optical character recognition to convert scanned images into digital data tables. To do so, special problems related to scanning data tables are addressed. For example, there is no way to tell from the number itself that a scanned 3,222 should be 8,222 instead. This project develops software to determine the right value; for example, using a digitized table, the software might determine consistency, within a given age group, among the number of people attending school, the number of people not attending school, and the total number of people in the age group to determine the correct 8,322 value. More complicated problems arise with multidimensional tables, such as a table that contains the levels of education attained for people in different age groups across many different geographic regions. The software uses structured information about the content of tables to determine consistency across specific row and column elements and performs the checks to find scanning errors. Automating otherwise labor-intensive problems like these will provide a large collection of data for countries and time periods where digitally published data are not available. Thanks to the geographic detail, historical depth, and global coverage, researchers will be able to use the data to study change over time and differences between places within and between countries.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
世界上几乎每个国家都进行人口普查。这些人口普查产生了关于人们是谁、他们的教育以及他们在哪里工作和生活的信息。人口普查数据对于了解人口如何随时间变化很重要。然而,这些数据中的大多数只能在图书馆书架上的印刷书籍中找到。从打印的页面中获取数据以供分析是一项挑战,因此这些数据很难用于研究。该项目将收集人口普查卷;扫描其中的数据;将数据转换为易于分析的数字格式;并向全世界的研究人员免费提供。使这些数据易于访问将使研究人员,政策制定者和其他人能够回答有关老龄化,移民,生育率和死亡率等重要问题的问题。该项目建立在创建IPUMS国际历史地理信息系统(IHGIS)的基础上。IHGIS软件目前用于处理和记录PDF文档中的数据。该项目通过使用光学字符识别将扫描图像转换为数字数据表,扩展了IHGIS工具,使其能够处理印刷卷。为此,解决了与扫描数据表相关的特殊问题。例如,无法从数字本身判断扫描的3,222应该是8,222。该项目开发了确定正确数值的软件;例如,使用数字化表格,该软件可以确定某一年龄组内上学人数、不上学人数以及该年龄组总人数之间的一致性,以确定正确的8 322数值。多维表会出现更复杂的问题,例如包含许多不同地理区域不同年龄组的人所达到的教育水平的表。该软件使用有关表内容的结构化信息来确定特定行和列元素之间的一致性,并执行检查以查找扫描错误。自动化这些劳动密集型问题将为无法获得数字发布数据的国家和时间段提供大量数据。由于地理细节、历史深度和全球覆盖范围,研究人员将能够使用这些数据来研究随着时间的推移而发生的变化以及国家内部和国家之间的差异。该奖项反映了NSF的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Steven Manson其他文献
Steven Manson的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Steven Manson', 18)}}的其他基金
National Historical Geographic Information System
国家历史地理信息系统
- 批准号:
2316650 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
National Historical Geographic Information System
国家历史地理信息系统
- 批准号:
1825768 - 财政年份:2018
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
RIDIR: IPUMS-Terra: Global Population and Agricultural Data
RIDIR:IPUMS-Terra:全球人口和农业数据
- 批准号:
1738369 - 财政年份:2017
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
National Historical Geographic Information System
国家历史地理信息系统
- 批准号:
1324875 - 财政年份:2013
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Doctoral Dissertation Research: Modeling the Impact of Urbanization on Ecosystem Services in the Twin Cities of Minnesota
博士论文研究:模拟城市化对明尼苏达州双城生态系统服务的影响
- 批准号:
1003138 - 财政年份:2010
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
The Interaction of Radiation With Free and Confined Atoms and Ions
辐射与自由和受限原子和离子的相互作用
- 批准号:
0852786 - 财政年份:2009
- 资助金额:
$ 100万 - 项目类别:
Continuing Grant
Interaction of Radiation with Free and Confined Atoms
辐射与自由原子和受限原子的相互作用
- 批准号:
0555430 - 财政年份:2006
- 资助金额:
$ 100万 - 项目类别:
Continuing Grant
The Interaction of Radiation with Free and Confined Atoms and Ions
辐射与自由和受限原子和离子的相互作用
- 批准号:
0244394 - 财政年份:2003
- 资助金额:
$ 100万 - 项目类别:
Continuing Grant
US-India Cooperative Research: Relativistic Effects in the Photoionization of Free and Confined Atoms
美印合作研究:自由原子和受限原子光电离的相对论效应
- 批准号:
0138115 - 财政年份:2002
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
The Interaction of Radiation With Free and Confined Atoms and Ions
辐射与自由和受限原子和离子的相互作用
- 批准号:
0070646 - 财政年份:2000
- 资助金额:
$ 100万 - 项目类别:
Continuing Grant
相似海外基金
CAREER: Datacenter-Aware Local Storage Stacks
职业:数据中心感知的本地存储堆栈
- 批准号:
2340218 - 财政年份:2024
- 资助金额:
$ 100万 - 项目类别:
Continuing Grant
Collaborative Research: Slopes of Modular Forms and Moduli Stacks of Galois Representations
合作研究:伽罗瓦表示的模形式和模栈的斜率
- 批准号:
2302284 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Geometry of moduli stacks of Galois representations
伽罗瓦表示的模栈的几何
- 批准号:
2302623 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Electrically Conductive 2D Metal-Organic Frameworks and Covalent Organic Frameworks Featuring Built-in Alternating pi-Donor/Acceptor Stacks with Efficient Charge Transport Capacity
导电二维金属有机框架和共价有机框架,具有内置交替 pi 供体/受体堆栈,具有高效的电荷传输能力
- 批准号:
2321365 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Collaborative Research: Slopes of Modular Forms and Moduli Stacks of Galois Representations
合作研究:伽罗瓦表示的模形式和模栈的斜率
- 批准号:
2302285 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Moduli stacks: curves, stable reduction and arithmetic
模数堆栈:曲线、稳定归约和算术
- 批准号:
22KF0205 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Grant-in-Aid for JSPS Fellows
High Durability Solid Oxide Electrolyser Stacks with Enhanced Coated Interconnects and Metal Ion Infiltrated Electrodes - HiDroConnect
具有增强涂层互连和金属离子渗透电极的高耐用性固体氧化物电解槽堆栈 - HiDroConnect
- 批准号:
10080289 - 财政年份:2023
- 资助金额:
$ 100万 - 项目类别:
Collaborative R&D
Synthesis and Assembly 2D Heterostructured Hybrid Stacks
合成和组装 2D 异质结构混合堆栈
- 批准号:
2200366 - 财政年份:2022
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: High-performance Network Stacks for the Edge
合作研究:CNS 核心:中:边缘的高性能网络堆栈
- 批准号:
2212098 - 财政年份:2022
- 资助金额:
$ 100万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: High-performance Network Stacks for the Edge
合作研究:CNS 核心:中:边缘的高性能网络堆栈
- 批准号:
2212099 - 财政年份:2022
- 资助金额:
$ 100万 - 项目类别:
Standard Grant