Semi-structured Information Retrieval in Clinical Text for Cohort Identification
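Basic Information
- Grant number: 10450805
- Principal investigator:
- Amount: $597,100
- Host institution:
- Host institution country: United States
- Project category:
- Fiscal year: 2014
- Funding country: United States
- Project period: 2014-09-20 to 2023-04-30
- Project status: Completed
- Source: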
- Keywords: Address; Adopted; Adoption; Algorithms; Architecture; COVID-19; Clinic; Clinical; Clinical Data; Clinical Research; Clinical Trials; Cohort Studies; Collection; Communities; Complex; Data; Development; Electronic Health Record; Evaluation; Feedback; Formulation; Goals; Healthcare; Healthcare Systems; Human; Informatics; Information Retrieval; Institution; Language; Learning; Medical; Methods; Modeling; Morphologic artifacts; Natural Language Processing; Outcome; Patient Recruitments; Patients; Performance; Pharmaceutical Preparations; Process; Research; Resources; Retrieval; Semantics; Site; Structure; System; Techniques; Text; Time; Training; Translational Research; Visit; Work; base; clinical data warehouse; clinical research site; cohort; data modeling; data standards; density; design; experimental study; health care delivery; heterogenous data; indexing; innovation; learning strategy; novel; open source; portability; predictive modeling; query tools; recruit; relating to nervous system; search engine; structured data; tool
Project Summary
The widespread adoption of Electronic Health Records (EHRs) has enabled the use of clinical data for clinical
research and healthcare delivery. Many institutions have established clinical data warehouses (CDWs) in
conjunction with cohort discovery tools (e.g., i2b2) to support the use of clinical data for clinical research
including retrospective clinical studies as well as feasibility assessment or patient recruitment for clinical trials.
However, a significant portion of relevant patient information is embedded in clinical narratives, so natural
language processing (NLP) techniques such as information extraction are critical when using EHR data for
clinical research. Many clinical NLP systems have been developed to extract information from text for various
downstream applications, but they have suffered from unsatisfactory performance and limited portability. Information retrieval
(IR), a technique used in search engines for storing, retrieving, and ranking documents from a large collection of
text documents based on users’ queries, can provide an alternative approach to leverage clinical narratives for
cohort discovery as it is less dependent on semantics. Accomplishing this requires additional work:
current IR approaches are generally document-based, and formulating cohort discovery as an IR
task requires innovative IR approaches that can handle complex EHR data and cohort criteria with
contextual (e.g., spatial or temporal) constraints.
Our long-term goal is to develop informatics solutions to accelerate the use of EHR data for clinical research.
The main goal of this proposal is to develop innovative IR methods, which formulate cohort discovery from EHR
data as an IR task, aiming to accelerate the identification of patient cohorts for cohort studies or the recruitment
of eligible patients for clinical trials. In our current R01-supported study (R01LM011934), we introduced novel
language models to enable the reuse of NLP-produced artifacts for IR-based cohort retrieval and developed
parallel resources for IR evaluation at two institutions (Mayo Clinic and OHSU). We hypothesize that, given
complex cohort criteria with contextual constraints, an IR framework with tailored architecture components (e.g.,
indexing, ranking, evaluation, and query processing) for storing and querying EHR data has an advantage over
traditional cohort discovery tools for querying unstructured EHR data as well as an advantage over text-based
search engines for querying both structured and unstructured EHR data. For the proposed renewal, we plan to
i) adopt common data models (CDMs) and deploy the framework at one additional site to assess the
generalizability of methods, ii) extend the IR framework to incorporate contextual information, and iii)
incorporate deep semantic representations into the IR framework. If successful, the proposed project will
advance informatics research on cohort discovery and identification, which affects many applications based on
EHR data, such as learning healthcare systems, predictive modeling, and AI in healthcare.
Other grants by WILLIAM R HERSH
Attracting Talented and Diverse Students to Biomedical Informatics and Data Science Careers Through Short-Term Study at OHSU
- Grant number: 10630618
- Fiscal year: 2022
- Amount: $597,100
- Project category:
Attracting Talented and Diverse Students to Biomedical Informatics and Data Science Careers Through Short-Term Study at OHSU
- Grant number: 10701083
- Fiscal year: 2022
- Amount: $597,100
- Project category:
Computational Omics and Biomedical Informatics Program (COBIP)
- Grant number: 10319196
- Fiscal year: 2021
- Amount: $597,100
- Project category:
Computational Omics and Biomedical Informatics Program (COBIP)
- Grant number: 10490403
- Fiscal year: 2021
- Amount: $597,100
- Project category:
Computational Omics and Biomedical Informatics Program (COBIP)
- Grant number: 10676322
- Fiscal year: 2021
- Amount: $597,100
- Project category:
Research Training in Biomedical Informatics and Data Science at Oregon Health & Science University
- Grant number: 9524502
- Fiscal year: 2017
- Amount: $597,100
- Project category:
Biomedical Informatics Research Training at Oregon Health & Science University
- Grant number: 9369268
- Fiscal year: 2016
- Amount: $597,100
- Project category:
Semi-structured Information Retrieval in Clinical Text for Cohort Identification
- Grant number: 10207950
- Fiscal year: 2014
- Amount: $597,100
- Project category:
Semi-structured Information Retrieval in Clinical Text for Cohort Identification
- Grant number: 10879792
- Fiscal year: 2014
- Amount: $597,100
- Project category:
OHSU Summer Internship in Biomedical Informatics for College Undergraduates
- Grant number: 8281433
- Fiscal year: 2011
- Amount: $597,100
- Project category:
Similar overseas grants
How novices write code: discovering best practices and how they can be adopted
- Grant number: 2315783
- Fiscal year: 2023
- Amount: $597,100
- Project category: Standard Grant
One or Several Mothers: The Adopted Child as Critical and Clinical Subject
- Grant number: 2719534
- Fiscal year: 2022
- Amount: $597,100
- Project category: Studentship
A material investigation of the ceramic shards excavated from the Omuro Ninsei kiln site: Production techniques adopted by Nonomura Ninsei.
- Grant number: 20K01113
- Fiscal year: 2020
- Amount: $597,100
- Project category: Grant-in-Aid for Scientific Research (C)
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
- Grant number: 2633211
- Fiscal year: 2020
- Amount: $597,100
- Project category: Studentship
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
- Grant number: 2436895
- Fiscal year: 2020
- Amount: $597,100
- Project category: Studentship
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
- Grant number: 2633207
- Fiscal year: 2020
- Amount: $597,100
- Project category: Studentship
A Study on Mutual Funds Adopted for Individual Defined Contribution Pension Plans
- Grant number: 19K01745
- Fiscal year: 2019
- Amount: $597,100
- Project category: Grant-in-Aid for Scientific Research (C)
The limits of development: State structural policy, comparing systems adopted in two European mountain regions (1945-1989)
- Grant number: 426559561
- Fiscal year: 2019
- Amount: $597,100
- Project category: Research Grants
Securing a Sense of Safety for Adopted Children in Middle Childhood
- Grant number: 2236701
- Fiscal year: 2019
- Amount: $597,100
- Project category: Studentship
Structural and functional analyses of a bacterial protein translocation domain that has adopted diverse pathogenic effector functions within host cells
- Grant number: 415543446
- Fiscal year: 2019
- Amount: $597,100
- Project category: Research Fellowships


















