融合领域知识和多级注意力机制的生物医学事件联合抽取研究

批准号:
62006108
项目类别:
青年科学基金项目
资助金额:
24.0 万元
负责人:
何馨宇
依托单位:
学科分类:
自然语言处理
结题年份:
2023
批准年份:
2020
项目状态:
已结题
项目参与者:
何馨宇
国基评审专家1V1指导 中标率高出同行96.8%
结合最新热点,提供专业选题建议
深度指导申报书撰写,确保创新可行
指导项目中标800+,快速提高中标率
微信扫码咨询
中文摘要
基于文献的生物医学事件抽取是生物医学自然语言处理领域的重要研究课题,为新药的研发和疾病的辅助诊断、预防、治疗提供启发和依据。目前的方法存在数据表示语义信息不足、复杂事件抽取精度较低、级联错误和冗余信息等亟待解决的关键问题。由此本项目提出:(1)融入领域知识获取领域扩展词特征,为生物事件抽取研究构建全新的多信息数据向量表示,并将领域知识语义与文本语义有机结合,改善数据语义信息不足的问题;(2)根据简单事件和复杂事件的要素结构特点分别构建针对性的要素检测模型,同时设计多级注意力机制加强要素之间的相互作用,进一步提升复杂事件抽取性能;(3)提出新的基于混合神经网络和动态路径规划策略的联合事件抽取方法,减少冗余信息,避免分阶段方法的级联错误,最终获得高性能的生物事件抽取模型。本项目以癌症相关的生物医学文献为主要研究对象,通过构建生物医学事件数据库和交互网络,为相关疾病的研究提供有力支持。
英文摘要
Biomedical event extraction from literature is an important research topic in the field of biomedical natural language processing, which provides inspirations and basis for the research and development of new drugs as well as the auxiliary diagnosis, prevention and treatment of diseases. By far, there are some key issues needed to be solved urgently: insufficient semantic information of data representation, lower performance of complex event extraction, cascading errors and redundant information. Therefore, this project proposes: (1) a new multi-information data vector representation for biomedical event extraction research by integrating features of domain-extended words obtained from domain knowledge and combined domain knowledge semantics with text semantics, which improves the insufficiency of data semantic information; (2) a new targeted argument detection model designed according to the structural features of simple events and complex events, which integrates a multi-level attention mechanism, aiming to further improve the performance of complex biomedical event extraction; (3) a novel joint event extraction method based on hybrid neural network and dynamic path strategy to reduce redundant information and avoid the cascading errors in pipeline approach, so a high performance biomedical event extraction model will be obtained. This project takes cancer-related biomedical literature as the main research object, and provides strong support for related research by constructing biomedical event databases and an interactive network.
期刊论文列表
专著列表
科研奖励列表
会议论文列表
专利列表
DOI:10.1007/s13042-023-01900-y
发表时间:2023-06
期刊:International Journal of Machine Learning and Cybernetics
影响因子:5.6
作者:Xinyu He;Ge Yan;Changfu Si;Yonggong Ren
通讯作者:Xinyu He;Ge Yan;Changfu Si;Yonggong Ren
DOI:10.1186/s12859-022-04854-0
发表时间:2022-07-29
期刊:BMC bioinformatics
影响因子:3
作者:
通讯作者:
DOI:https://doi.org/10.1007/s11042-023-16679-x
发表时间:2023
期刊:Multimedia Tools and Applications
影响因子:--
作者:Hui Shi;Kexun Yan;Jianning Geng;Yonggong Ren
通讯作者:Yonggong Ren
DOI:10.1109/tcbb.2022.3176319
发表时间:2023-03-01
期刊:IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS
影响因子:4.5
作者:Huang, Xin;Su, Benzhe;Lin, Xiaohui
通讯作者:Lin, Xiaohui
DOI:10.1080/13682199.2023.2195090
发表时间:2023-04
期刊:The Imaging Science Journal
影响因子:--
作者:Hui Shi;Baoyue Hu;Yanli Li;Jianing Geng;Yonggong Ren
通讯作者:Hui Shi;Baoyue Hu;Yanli Li;Jianing Geng;Yonggong Ren
国内基金
海外基金
