ITR: Mining the Bibliome -- Information Extraction from the Biomedical Literature
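Basic information
- Award number: 0205448
- Principal investigator: Aravind Joshi
- Amount: $3.4998 million
- Host institution: University of Pennsylvania
- Host institution country: United States
- Project type: Continuing Grant
- Fiscal year: 2002
- Funding country: United States
- Duration: 2002-09-01 to 2008-08-31
- Project status: Completed
Project abstract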
EIA-0205448
Joshi, Aravind
University of Pennsylvania
ITR: Mining the Bibliome -- Information Extraction from the Biomedical Literature

The major goal is the development of qualitatively better methods for automatically extracting information from the biomedical literature, relying on recent research in high-accuracy parsing and shallow semantic analysis. The special focus will be on information relevant to drug development, in collaboration with researchers in the Knowledge Integration and Discovery Systems group at GlaxoSmithKline. This project will also address several database research problems, including methods for modeling complex, incomplete and changing information using semistructured data, and ways to connect the text analysis process to an information integration environment that can deal with the wide variety of extant bioinformatic data models, formats, languages and interfaces.

The engine of recent progress in language processing research has been linguistic data: text corpora, treebanks, lexicons, test corpora for information retrieval and information extraction, and so on. Much of this data has been created by Penn researchers and published by Penn's Linguistic Data Consortium. Hence, one of our major goals is to develop and publish new linguistic resources in three categories: a large corpus of biomedical text annotated with syntactic structures ("Treebank") and shallow semantic structures (proposition bank or "Propbank"); several large sets of biomedical abstracts and full-text articles annotated with entities and relations of interest to drug developers, such as enzyme inhibition by various compounds or genotype/phenotype connections ("Factbanks"); and broad-coverage lexicons and tools for the analysis of biomedical texts.
Project outcomes
Journal papers (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Aravind Joshi
Cogniac: a discourse processing engine
- Published: 1995
- Impact factor: 0
- Authors: F. B. Baldwin; Aravind Joshi
- Corresponding author: Aravind Joshi
Quantum Circuit Optimization of Arithmetic circuits using ZX Calculus
- DOI: 10.48550/arxiv.2306.02264
- Published: 2023
- Impact factor: 0
- Authors: Aravind Joshi; Akshara Kairali; Renju Raju; A. Athreya; R. Monica; Sanjay Vishwakarma; Srinjoy Ganguly
- Corresponding author: Srinjoy Ganguly
Other grants held by Aravind Joshi
CI: ADDO-EN: Significant Enhancement of the Existing Penn Discourse Treebank
- Award number: 1059353
- Fiscal year: 2011
- Amount: $3.4998 million
- Project type: Standard Grant
RI: Exploiting and Exploring Discourse Connectivity: Deriving New Technology and Knowledge from the Penn Discourse Treebank
- Award number: 0705671
- Fiscal year: 2007
- Amount: $3.4998 million
- Project type: Continuing Grant
Metagrammatical Knowledge for Grammars and Corpora
- Award number: 0414409
- Fiscal year: 2004
- Amount: $3.4998 million
- Project type: Continuing Grant
CISE Research Resources: Discourse Penn Treebank and Multimodal FORM: Development of Two Richly Annotated Corpora
- Award number: 0224417
- Fiscal year: 2002
- Amount: $3.4998 million
- Project type: Continuing Grant
ITR: Language, Learning, and Modeling Biological Sequences
- Award number: 0205456
- Fiscal year: 2002
- Amount: $3.4998 million
- Project type: Continuing Grant
Constructing Science: Materials and Activities for Kindergarten and First-Grade
- Award number: 9252885
- Fiscal year: 1992
- Amount: $3.4998 million
- Project type: Continuing Grant
Research in Natural Language Processing: Mathematical and Computational Investigations in Constrained Grammatical Formalisms
- Award number: 9016592
- Fiscal year: 1991
- Amount: $3.4998 million
- Project type: Continuing Grant
Center for Research in Cognitive Science
- Award number: 8920230
- Fiscal year: 1991
- Amount: $3.4998 million
- Project type: Cooperative Agreement
Natural Language Processing (Computer Research)
- Award number: 8410413
- Fiscal year: 1984
- Amount: $3.4998 million
- Project type: Continuing Grant
Modelling Interactive Processes: Flexible Communication With Knowledge Bases
- Award number: 8219196
- Fiscal year: 1983
- Amount: $3.4998 million
- Project type: Continuing Grant
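Similar National Natural Science Foundation of China (NSFC) grants
Genome mining-based study of secondary metabolites that inhibit Staphylococcus epidermidis biofilm formation
- Award number: 21242003
- Approval year: 2012
- Amount: CNY 100,000
- Project type: Special Fund Project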
Similar overseas grants
NeTS: Small: NSF-DST: Modernizing Underground Mining Operations with Millimeter-Wave Imaging and Networking
- Award number: 2342833
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Standard Grant
Development of social attention indicators of emerging technologies and science policies with network analysis and text mining
- Award number: 24K16438
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Grant-in-Aid for Early-Career Scientists
ART: Mining the Rich Vein of Research in Montana
- Award number: 2331325
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Cooperative Agreement
FightAMR: Novel global One Health surveillance approach to fight AMR using Artificial Intelligence and big data mining
- Award number: MR/Y034422/1
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Research Grant
DISES Investigating mercury biogeochemical cycling via mixed-methods in complex artisanal gold mining landscapes and implications for community health
- Award number: 2307870
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Standard Grant
Toward carbon-neutral society: Development of a full-sustainable eco-friendly green mining process for gold recovery
- Award number: 24K17540
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Grant-in-Aid for Early-Career Scientists
Generating green hydrogen from mining wastes
- Award number: IM240100202
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Mid-Career Industry Fellowships
Novel Hydrophobic Concrete for Durable and Resilient Mining Infrastructure
- Award number: LP230100288
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Linkage Projects
SBIR Phase I: Electromagnetic-ablative PGM Refining for In-situ Asteroid Mining
- Award number: 2327078
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Standard Grant
Temporal Graph Mining for Anomaly Detection
- Award number: DP240101547
- Fiscal year: 2024
- Amount: $3.4998 million
- Project type: Discovery Projects