Big Data for Population Research
人口研究大数据
基本信息
- 批准号:8865394
- 负责人:
- 金额:$ 62.54万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2013
- 资助国家:美国
- 起止时间:2013-09-21 至 2016-05-31
- 项目状态:已结题
- 来源:
- 关键词:AmericanAutomatic Data ProcessingBehaviorBehavioral ResearchBig DataBiological PreservationCategoriesCensusesCharacteristicsChurchClassificationCodeCollaborationsCommunitiesComputer softwareDataData ProtectionDatabasesDocumentationEconomicsEducationElementsFamilyFertilityFundingGenetic TranscriptionGeographic LocationsHealthHuman ActivitiesIndividualKnowledgeLife Cycle StagesManualsMedical ResearchMetadataMethodsMinnesotaMissionNeighborhoodsPersonsPoliciesPolicy MakingPopulationPopulation DynamicsPopulation ResearchProceduresProcessPublishingRecordsResearchResearch InfrastructureResearch PersonnelResourcesRunningSecureSeriesSocial MobilitySocietiesSystemTechnologyTimeUnited States National Institutes of HealthWorkbehavioral/social sciencecomputerized data processingcostcost effectiveexperienceinnovationinsightintergenerationalmigrationmortalitypopulation healthpublic health relevanceracial and ethnicsegregationsocialspatiotemporaltooltrend
项目摘要
DESCRIPTION (provided by applicant): This proposal seeks funding to expand the Integrated Public Use Micro-data Series (IPUMS) by adding demographic and geographic data describing the entire enumerated population of the U.S. from 1790 to 1930. The project will provide data on the characteristics of over 600 million persons, quadrupling the quantity of U.S. census micro-data available for scientific research. The data will cover entire populations with full geographic
detail, providing contextual information on neighborhood characteristics, including ethnic composition, demographic behavior, and population mobility. These data offer the earliest information available on key social and economic characteristics, and they will provide invaluable insight into processes of long-run demographic and economic change. The data will make a permanent and substantial addition to the nation's statistical infrastructure and will have far-reaching implications for research across the social and behavioral sciences. The project is made possible by the donation of a massive high-quality verified transcription of information in the U.S. censuses, prepared by two major genealogical organizations. Converting this immense body of raw data into a format suitable for scientific analysis will require the following tasks: () classify and code geographic locations to be compatible with categories used in the published census returns; (2) assess completeness and accuracy of the data transcription; (3) convert alphabetic string data into numeric categories that are comparable over time; (4) employ new data cleaning software to identify and correct common enumeration and transcription errors; (5) develop documentation, including full descriptions of data processing methods, detailed analysis of comparability issues, and comprehensive machine-processable metadata; (6) incorporate the data into the IPUMS data access system for free dissemination to the scientific community; and (7) implement secure data protection and preservation policies. The project will be executed by a team of highly-experienced researchers with exceptional proficiency in large- scale data creation, integration, and dissemination and will leverage cutting-edge technology to process an unprecedented volume of data at reasonable cost. The project is a collaboration of the Minnesota Population Center with the world's largest producers of genealogical data, allowing cost-effective use of scarce resources to develop shared infrastructure for population and health research.
描述(申请人提供):这项提案寻求资金,通过增加描述1790年至1930年美国全部被统计人口的人口和地理数据,来扩大综合公共用途微观数据系列(IPUMS)。该项目将提供有关6亿多人特征的数据,使可用于科学研究的美国人口普查微观数据的数量翻两番。数据将覆盖所有人口,包括完整的地理位置
详细信息,提供有关社区特征的上下文信息,包括种族构成、人口行为和人口流动。这些数据提供了有关关键社会和经济特征的最早信息,它们将为长期人口和经济变化的进程提供宝贵的洞察力。这些数据将成为国家统计基础设施的永久性和实质性补充,并将对社会科学和行为科学的研究产生深远影响。该项目是通过捐赠由两个主要家谱组织准备的美国人口普查信息的大规模高质量经核实的抄本而成为可能的。将这一庞大的原始数据转换为适合科学分析的格式将需要以下任务:()对地理位置进行分类和编码,使之与已公布的普查报告中使用的类别相一致;(2)评估数据转录的完整性和准确性;(3)将字母串数据转换为随时间推移可以比较的数字类别;(4)使用新的数据清理软件以查明和纠正常见的计数和抄写错误;(5)编写文件,包括对数据处理方法的全面说明、对可比性问题的详细分析以及全面的机器可处理的元数据;(6)将数据纳入IPUMS数据访问系统,免费向科学界传播;和(7)实施安全的数据保护和保存政策。该项目将由一支经验丰富的研究人员团队执行,他们在大规模数据创建、集成和传播方面具有非凡的熟练程度,并将利用尖端技术以合理的成本处理前所未有的数据量。该项目是明尼苏达州人口中心与世界上最大的家谱数据生产商合作的项目,允许经济高效地使用稀缺资源来开发人口和健康研究的共享基础设施。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
CATHERINE A FITCH其他文献
CATHERINE A FITCH的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('CATHERINE A FITCH', 18)}}的其他基金
Microdata for Analysis of Early Life Conditions, Health, and Population
用于分析早期生活状况、健康和人口的微观数据
- 批准号:
10371054 - 财政年份:2012
- 资助金额:
$ 62.54万 - 项目类别:
Microdata for Analysis of Early Life Conditions, Health, and Population
用于分析早期生活状况、健康和人口的微观数据
- 批准号:
9903193 - 财政年份:2012
- 资助金额:
$ 62.54万 - 项目类别:
Microdata for Analysis of Early Life Conditions, Health, and Population
用于分析早期生活状况、健康和人口的微观数据
- 批准号:
9750600 - 财政年份:2012
- 资助金额:
$ 62.54万 - 项目类别:
Marriage and Economic Opportunity in the U.S., 1960-2000
美国的婚姻和经济机会,1960-2000 年
- 批准号:
7184455 - 财政年份:2007
- 资助金额:
$ 62.54万 - 项目类别:
Marriage and Economic Opportunity in the U.S., 1960-2000
美国的婚姻和经济机会,1960-2000 年
- 批准号:
7600587 - 财政年份:2007
- 资助金额:
$ 62.54万 - 项目类别:
相似海外基金
Automatic Data Processing of Four University Survey Systems
四个大学调查系统的自动数据处理
- 批准号:
8420649 - 财政年份:1984
- 资助金额:
$ 62.54万 - 项目类别:
Contract