CAREER: Towards Interactive and Transparent Question Answering with Applications in the Clinical Domain
职业:在临床领域应用交互式和透明的问答
基本信息
- 批准号:1942980
- 负责人:
- 金额:$ 50万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-06-01 至 2025-05-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Finding relevant information quickly is integral to effective and efficient decision making. This becomes increasingly difficult as the scale and heterogeneity of data continue to grow rapidly. Question answering (QA) systems, which aim to find precise answers to natural language questions from users, have shown great potential to address this problem. However, state-of-the-art QA systems still largely fall short in the following scenarios: (1) when questions are ambiguous and/or complex (e.g., involving multiple relations and operators), (2) when answering questions requires background knowledge that is not readily available in the data, and (3) when users need to understand the system’s answering process in order to better judge its trustworthiness. Such scenarios are prevalent in real application domains of QA (such as healthcare, finance, and sciences), and must be addressed in building practical systems. This project aims to develop a new QA model that can interact with users to resolve ambiguity and uncertainty during the answering process, and can tackle challenging problems such as identifying when requesting feedback from the user is necessary while achieving the optimal trade-off between answer quality and interaction cost. The project further aims to improve the QA model’s transparency by decomposing a complex question into several intermediate sub-questions and allowing users to validate them. The expected results can thus contribute to future human-technology partnership by enabling QA models to be more interactive, more transparent, and hence more trustworthy. The proposed QA model will be tested in a clinical domain, where doctors often ask questions about a patient and look for answers from his/her clinical notes in Electronic Medical Records (EMRs). Such a QA model can enable doctors to effectively and efficiently query EMRs and gather relevant evidence for critical decision making. The project plans to engage high school students and undergraduates, especially from underrepresented groups, and prepare them for future education and employment opportunities. This project will contribute a new, learnable interactive QA model, which will detect the ambiguities and uncertainties during the answering process and interact with users in a natural fashion to seek clarifications. Moreover, the QA model will learn from such interactions to simultaneously improve answer quality and reduce human intervention over time, using imitation and reinforcement learning based frameworks. This project will further advance the QA model with a novel question decomposition component, which decomposes a compositional question into simpler sub-questions and can enhance the transparency of the answering procedure by allowing users to validate the sub-questions (i.e., confirming or correcting the sub-questions). To effectively train the QA model with limited human cost (for providing feedback or training data), the team will explore new learning strategies such as designing user simulators and weak supervision mechanisms. When applying the QA model to the clinical domain, this project will develop novel solutions to domain-specific challenges, such as how to incorporate background biomedical knowledge into a general QA model and how to create high-quality clinical QA datasets at a low cost. The team will closely collaborate with doctors and physicians for model evaluation and actively seek technology transfer opportunities. All datasets, software and demos will be publicly accessible via the investigator’s website. Potential research findings will be disseminated in computer science and medical informatics related venues and will be integrated into existing and new courses.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
快速找到相关信息是有效和高效决策的组成部分。随着数据的规模和异构性持续快速增长,这变得越来越困难。问答(QA)系统,其目的是从用户那里找到自然语言问题的精确答案,已经显示出解决这个问题的巨大潜力。然而,最先进的QA系统在以下场景中仍然很大程度上不足:(1)当问题是模糊的和/或复杂的(例如,涉及多个关系和运算符),(2)当回答问题需要在数据中不容易获得的背景知识时,以及(3)当用户需要理解系统的回答过程以便更好地判断其可信度时。这样的场景在QA的真实的应用领域(例如医疗保健、金融和科学)中很普遍,并且必须在构建实用系统时加以解决。该项目旨在开发一种新的QA模型,该模型可以与用户进行交互,以解决回答过程中的模糊性和不确定性,并可以解决具有挑战性的问题,例如确定何时需要向用户请求反馈,同时实现回答质量和交互成本之间的最佳权衡。该项目进一步旨在通过将复杂问题分解为几个中间子问题并允许用户验证它们来提高QA模型的透明度。因此,预期的结果可以通过使QA模型更具交互性,更透明,从而更值得信赖,从而为未来的人类-技术伙伴关系做出贡献。拟议的QA模型将在临床领域进行测试,医生经常询问有关患者的问题,并从电子病历(EMR)中的临床记录中寻找答案。这样的QA模型可以使医生能够有效和高效地查询EMR,并为关键决策收集相关证据。 该项目计划吸引高中生和大学生,特别是代表性不足的群体,并为他们未来的教育和就业机会做好准备。该项目将提供一个新的、可学习的交互式QA模型,该模型将检测回答过程中的模糊性和不确定性,并以自然的方式与用户交互以寻求澄清。此外,QA模型将从这种交互中学习,同时提高答案质量并减少人工干预,使用基于模仿和强化学习的框架。 这个项目将进一步推进QA模型,其中包括一个新的问题分解组件,该组件将组合问题分解为更简单的子问题,并通过允许用户验证子问题(即,确认或纠正子问题)。为了以有限的人力成本(用于提供反馈或训练数据)有效地训练QA模型,该团队将探索新的学习策略,例如设计用户模拟器和弱监督机制。在将QA模型应用于临床领域时,该项目将针对特定领域的挑战开发新的解决方案,例如如何将背景生物医学知识纳入通用QA模型,以及如何以低成本创建高质量的临床QA数据集。该团队将与医生和内科医生密切合作进行模型评估,并积极寻求技术转让机会。所有数据集,软件和演示将通过研究者的网站公开访问。潜在的研究成果将在计算机科学和医学信息学相关的场所传播,并将被整合到现有的和新的courses.This奖项反映了NSF的法定使命,并已被认为是值得通过使用基金会的智力价值和更广泛的影响审查标准进行评估的支持。
项目成果
期刊论文数量(9)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization
- DOI:10.18653/v1/2020.findings-emnlp.361
- 发表时间:2020-10
- 期刊:
- 影响因子:0
- 作者:Jie Zhao;Huan Sun
- 通讯作者:Jie Zhao;Huan Sun
ReasonBERT: Pre-trained to Reason with Distant Supervision
- DOI:10.18653/v1/2021.emnlp-main.494
- 发表时间:2021-09
- 期刊:
- 影响因子:0
- 作者:Xiang Deng;Yu Su;Alyssa Lees;You Wu;Cong Yu;Huan Sun
- 通讯作者:Xiang Deng;Yu Su;Alyssa Lees;You Wu;Cong Yu;Huan Sun
An Imitation Game for Learning Semantic Parsers from User Interaction
- DOI:10.18653/v1/2020.emnlp-main.559
- 发表时间:2020-05
- 期刊:
- 影响因子:0
- 作者:Ziyu Yao;Yiqi Tang;Wen-tau Yih;Huan Sun;Yu Su
- 通讯作者:Ziyu Yao;Yiqi Tang;Wen-tau Yih;Huan Sun;Yu Su
Learning a Cost-Effective Annotation Policy for Question Answering
- DOI:10.18653/v1/2020.emnlp-main.246
- 发表时间:2020-10
- 期刊:
- 影响因子:0
- 作者:Bernhard Kratzwald;S. Feuerriegel;Huan Sun
- 通讯作者:Bernhard Kratzwald;S. Feuerriegel;Huan Sun
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering
- DOI:10.1109/bibm52615.2021.9669300
- 发表时间:2020-10
- 期刊:
- 影响因子:0
- 作者:Xiang Yue;Xinliang Frederick Zhang;Ziyu Yao;Simon M. Lin;Huan Sun
- 通讯作者:Xiang Yue;Xinliang Frederick Zhang;Ziyu Yao;Simon M. Lin;Huan Sun
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Huan Sun其他文献
Evaluation of azithromycin or hydroxychloroquine plus azithromycin combination therapy on cardiac conduction and function in guinea pigs
阿奇霉素或羟氯喹联合阿奇霉素联合治疗对豚鼠心脏传导和功能的评价
- DOI:
10.1101/2020.10.31.362566 - 发表时间:
2020-11 - 期刊:
- 影响因子:0
- 作者:
Xiang Li;Weijiang Tan;Shuang Zheng;Huan Sun;Xiaosheng Zhang;Xiaohui Li;Honghua Chen;Xuecong Ren;Tianzhen He;Caiyi Zhu;Yu Zhang;Feng Hua Yang - 通讯作者:
Feng Hua Yang
Simulation Study on the Heat Transfer Characteristics of a Spray-Cooled Single-Pipe Cooling Tower
喷雾冷却单管冷却塔传热特性模拟研究
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
Kaiyong Hu;Zhaoyi Chen;Yunqing Hu;Huan Sun;Zhili Sun;Tonghua Zou;Jinghong Ning - 通讯作者:
Jinghong Ning
Dabigatran as an alternative for atrial thrombosis resistant to rivaroxaban
达比加群作为治疗对利伐沙班耐药的心房血栓的替代药物
- DOI:
10.1097/md.0000000000013623 - 发表时间:
2018 - 期刊:
- 影响因子:1.6
- 作者:
Huan Sun;Qini Zhao;Yanjing Wang;R. Lakin;Xueyan Liu;Ming Yu;Hongliang Yang;Dongmei Gao;Weiwei Chen;Guangyuan Gao;M. Yan;Yuquan He;Ping Yang - 通讯作者:
Ping Yang
Performance Evaluation of Distributed Scheduling for Downlink Coherent Joint Transmission
下行相干联合传输分布式调度性能评估
- DOI:
10.1109/vtcfall.2015.7391076 - 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Huan Sun;Tao Yang - 通讯作者:
Tao Yang
Shape Evolution of Unstable, Flexural Cracks in Brittle Materials
脆性材料中不稳定弯曲裂纹的形状演变
- DOI:
10.1007/s11665-020-04657-5 - 发表时间:
2020 - 期刊:
- 影响因子:2.3
- 作者:
Lingyue Ma;Huan Sun;R. Dugnani - 通讯作者:
R. Dugnani
Huan Sun的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Huan Sun', 18)}}的其他基金
III: Small: Towards Resolving Ad-hoc Concept Queries with Table Answers via Multi-source Data Mining
III:小:通过多源数据挖掘解决带有表答案的临时概念查询
- 批准号:
1815674 - 财政年份:2018
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
相似海外基金
CAREER: Towards Harnessing the Motility of Microorganisms: Fast Algorithms, Data-Driven Models, and 3D Interactive Visual Computing
职业:利用微生物的运动性:快速算法、数据驱动模型和 3D 交互式视觉计算
- 批准号:
2408964 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
Strategy and policy design towards zero-emission maritime transportation system by interactive simulation
通过交互式模拟进行零排放海上运输系统的战略和政策设计
- 批准号:
22H01693 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Towards Highly Interactive Networked Multimedia Services with Crowd Intelligence
通过群体智能实现高度交互的网络多媒体服务
- 批准号:
RGPIN-2019-04040 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Discovery Grants Program - Individual
CAREER: Towards Harnessing the Motility of Microorganisms: Fast Algorithms, Data-Driven Models, and 3D Interactive Visual Computing
职业:利用微生物的运动性:快速算法、数据驱动模型和 3D 交互式视觉计算
- 批准号:
2146191 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
FAI: Towards Adaptive and Interactive Post Hoc Explanations
FAI:迈向自适应和交互式事后解释
- 批准号:
2040989 - 财政年份:2021
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Beyond Individual Persuasion: Towards a Paradigm Shift in Interactive Visualisation and Sensing for Environmental Change
超越个人说服:交互式可视化和环境变化感知的范式转变
- 批准号:
EP/V042327/1 - 财政年份:2021
- 资助金额:
$ 50万 - 项目类别:
Research Grant
Towards Highly Interactive Networked Multimedia Services with Crowd Intelligence
通过群体智能实现高度交互的网络多媒体服务
- 批准号:
RGPIN-2019-04040 - 财政年份:2021
- 资助金额:
$ 50万 - 项目类别:
Discovery Grants Program - Individual
Towards a Framework for the Methodology, Design, and Evaluation of novel interactive technologies to support short , n-of-1 study approaches for indep
建立新型交互技术的方法、设计和评估框架,以支持独立的短期、n-of-1研究方法
- 批准号:
2481048 - 财政年份:2020
- 资助金额:
$ 50万 - 项目类别:
Studentship
Towards Highly Interactive Networked Multimedia Services with Crowd Intelligence
通过群体智能实现高度交互的网络多媒体服务
- 批准号:
RGPIN-2019-04040 - 财政年份:2020
- 资助金额:
$ 50万 - 项目类别:
Discovery Grants Program - Individual
Towards Highly Interactive Networked Multimedia Services with Crowd Intelligence
通过群体智能实现高度交互的网络多媒体服务
- 批准号:
RGPIN-2019-04040 - 财政年份:2019
- 资助金额:
$ 50万 - 项目类别:
Discovery Grants Program - Individual