Collaborative Research III-COR: From a Pile of Documents to a Collection of Information: A Framework for Multi-Dimensional Text Analysis
协作研究III-COR:从一堆文档到信息集合:多维文本分析框架
基本信息
- 批准号:0917773
- 负责人:
- 金额:$ 14.1万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2008
- 资助国家:美国
- 起止时间:2008-11-15 至 2011-07-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Many information workers are swamped with unfamiliar collections of text. One challenge is to obtain an accurate overview of a large text collection, such as the public comments collected in ''''''''notice and comment'''''''' rulemaking. No single tool currently provides a sufficiently diversified picture of such a corpus, and no adequate theory exists to help people explore and form a deep and nuanced understanding of such a text collection. This research seeks to develop a computational framework that allows further exploration of this problem from multiple, integrated perspectives. All the assembled perspectives will be brought together into a single overall supra-document structure that is dynamically constructed under user guidance. In this structure, hierarchical topic clusters will be cross-linked by opinion and argumentation links, using two classes of text analysis engines: one for topics and subtopics, and the other for argument structures. The research team will design, develop, build, and systematically test an overall text exploration framework, an application to support federal regulation writersone called the Rule-Writers Workbench. There is a strong collaboration with Federal government officials who will provide data and participate in user testing. The three PIs have successfully collaborated on a related project under previous NSF funding. Intellectual Merit: This is a sustainable collaboration between computer science and political/social science research, rooted in a challenging and important real world application and informed by years of end user research. Dynamic, user-driven subtopic definition and clustering algorithms coupled withlanguage modeling are an innovative yet reachable set of goals. The framework to be developed will be grounded in the humanities disciplines'' expertise in rhetoric, discourse structure, and subjectivity.Broader Impacts: The Rule-Writers Workbench will allow federal government regulation writers to employ a suite of technical tools that perform independent analyses of public responses to proposed regulations, including near-duplicate detection and clustering, user-based topic selection from dynamically extracted keywords, opinion identification, and subtopic clustering. These capabilities will open new avenues for federal comment analysis.
许多信息工作者被不熟悉的文本淹没了。一个挑战是获得大量文本集的准确概述,例如在规则制定中收集的公众意见。目前还没有一个单一的工具可以提供这样一个语料库的足够多样化的图片,也没有足够的理论可以帮助人们探索和形成对这样一个文本集合的深刻而细致的理解。本研究旨在开发一个计算框架,允许进一步探索这个问题,从多个,综合的角度。所有组装的透视图将被合并到一个单一的整体超文档结构中,该结构是在用户指导下动态构建的。在这种结构中,层次主题集群将通过意见和论证链接进行交叉链接,使用两类文本分析引擎:一类用于主题和子主题,另一类用于论证结构。该研究团队将设计,开发,构建和系统地测试一个整体的文本探索框架,一个应用程序,以支持联邦法规作家被称为规则作家。与联邦政府官员有着密切的合作,他们将提供数据并参与用户测试。这三个PI在之前的NSF资助下成功地合作了一个相关项目。智力优势:这是计算机科学和政治/社会科学研究之间的可持续合作,植根于具有挑战性和重要的真实的世界应用程序,并通过多年的最终用户研究提供信息。动态的,用户驱动的子主题定义和聚类算法与语言建模相结合,是一个创新的,但可以达到的目标。该框架将以人文学科在修辞、话语结构和主观性方面的专业知识为基础。规则编写器将允许联邦政府法规编写者使用一套技术工具,对公众对拟议法规的反应进行独立分析,包括近似重复检测和聚类,从动态提取的关键字中基于用户的主题选择,意见识别和子主题聚类。这些能力将为联邦评论分析开辟新的途径。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Stuart Shulman其他文献
Stuart Shulman的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Stuart Shulman', 18)}}的其他基金
Workshop: YouTube and the 2008 Election Cycle in the United States, April 3-4, 2009
研讨会:YouTube 与美国 2008 年选举周期,2009 年 4 月 3 日至 4 日
- 批准号:
0903886 - 财政年份:2009
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research III-COR: From a Pile of Documents to a Collection of Information: A Framework for Multi-Dimensional Text Analysis
协作研究III-COR:从一堆文档到信息集合:多维文本分析框架
- 批准号:
0705566 - 财政年份:2007
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Coding Across the Disciplines: A Project-Based Workshop on Manual Text Annotation Techniques
跨学科编码:基于项目的手动文本注释技术研讨会
- 批准号:
0620673 - 财政年份:2006
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
ITR/PE:Digital Citizenship: Expanding Information Technology Literacy with a Service-Learning Approach
ITR/PE:数字公民:通过服务学习方法扩大信息技术素养
- 批准号:
0503997 - 财政年份:2004
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: Language Processing Technology for Electronic Rulemaking
合作研究:电子规则制定的语言处理技术
- 批准号:
0429293 - 财政年份:2004
- 资助金额:
$ 14.1万 - 项目类别:
Continuing Grant
SGER COLLABORATIVE: A Testbed for eRulemaking Data
SGER Collaborative:电子规则制定数据的测试平台
- 批准号:
0502121 - 财政年份:2004
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
SGER COLLABORATIVE: A Testbed for eRulemaking Data
SGER Collaborative:电子规则制定数据的测试平台
- 批准号:
0328914 - 财政年份:2003
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
ITR/PE:Digital Citizenship: Expanding Information Technology Literacy with a Service-Learning Approach
ITR/PE:数字公民:通过服务学习方法扩大信息技术素养
- 批准号:
0113718 - 财政年份:2001
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Digital Government: SGER: Citizen Agenda-Setting in the Regulatory Process: Electronic Collection and Synthesis of Public Commentary
数字政府:SGER:监管过程中的公民议程设置:公众评论的电子收集和综合
- 批准号:
0089892 - 财政年份:2000
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
相似国自然基金
Research on Quantum Field Theory without a Lagrangian Description
- 批准号:24ZR1403900
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
Cell Research
- 批准号:31224802
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research
- 批准号:31024804
- 批准年份:2010
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research (细胞研究)
- 批准号:30824808
- 批准年份:2008
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
- 批准号:10774081
- 批准年份:2007
- 资助金额:45.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: Conference: DESC: Type III: Eco Edge - Advancing Sustainable Machine Learning at the Edge
协作研究:会议:DESC:类型 III:生态边缘 - 推进边缘的可持续机器学习
- 批准号:
2342498 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: Conference: DESC: Type III: Eco Edge - Advancing Sustainable Machine Learning at the Edge
协作研究:会议:DESC:类型 III:生态边缘 - 推进边缘的可持续机器学习
- 批准号:
2342497 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
III : Medium: Collaborative Research: From Open Data to Open Data Curation
III:媒介:协作研究:从开放数据到开放数据管理
- 批准号:
2420691 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: High-Performance Scheduling for Modern Database Systems
协作研究:III:小型:现代数据库系统的高性能调度
- 批准号:
2322973 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: High-Performance Scheduling for Modern Database Systems
协作研究:III:小型:现代数据库系统的高性能调度
- 批准号:
2322974 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: A DREAM Proactive Conversational System
合作研究:III:小型:一个梦想的主动对话系统
- 批准号:
2336769 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: A DREAM Proactive Conversational System
合作研究:III:小型:一个梦想的主动对话系统
- 批准号:
2336768 - 财政年份:2024
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Designing AI Systems with Steerable Long-Term Dynamics
合作研究:III:中:设计具有可操纵长期动态的人工智能系统
- 批准号:
2312865 - 财政年份:2023
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
合作研究:III:媒介:算法排序器的负责任设计和验证
- 批准号:
2312932 - 财政年份:2023
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Algorithms for scalable inference and phylodynamic analysis of tumor haplotypes using low-coverage single cell sequencing data
合作研究:III:中:使用低覆盖率单细胞测序数据对肿瘤单倍型进行可扩展推理和系统动力学分析的算法
- 批准号:
2415562 - 财政年份:2023
- 资助金额:
$ 14.1万 - 项目类别:
Standard Grant