BERT with Character - Knowledge Graph infused neural language models to analyse the depiction of literary characters (LitBERT)


Basic information

Project abstract

This project is a collaboration between computer science and computational literary studies (CLS), an emergent field that analyses larger collections of literary texts using a wide set of tools from computational linguistics, computer science and its own tradition. We focus on the computational literary analysis of character, one of the most important descriptors of narrative and dramatic texts. The project will investigate the textual description of characters' internal and external features, actions and further character-specific information using knowledge-infused language models. We aim to create a character knowledge graph by extracting character information from text, to find different character types through data-driven clustering, and to leverage this information to develop a character-attentive, "literary" language model ("LitBERT") for automatic literary analysis. The project will significantly advance the state of the art in combining language models and knowledge graphs, showing how to improve the performance of language models for the analysis of entities and their attributes by (a) integrating knowledge graphs and (b) enriching domain-specific knowledge graphs based on text analysis using language models. Additionally, we want to improve the handling of longer texts such as novels by advancing the capabilities of language models to represent knowledge, such as the representation and types of characters in the text world (i.e., the world described in the text).

Project outputs

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)



Other grants held by Professor Dr. Andreas Hotho

Learning Environmental Maps - Integrating Participatory Sensing and Human Perception
  • Grant number:
    314699772
  • Fiscal year:
    2016
  • Funding amount:
    --
  • Project category:
    Priority Programmes
Pragmatics and Semantics in Social Tagging Systems II
  • Grant number:
    196648487
  • Fiscal year:
    2011
  • Funding amount:
    --
  • Project category:
    Research Grants
Methods for Hypothesis-driven Analysis of Sequential Data (HydrAS)
  • Grant number:
    438232455
  • Fiscal year:
  • Funding amount:
    --
  • Project category:
    Research Grants

Similar overseas grants

Parahoric Character Sheaves and Representations of p-Adic Groups
  • Grant number:
    2401114
  • Fiscal year:
    2024
  • Funding amount:
    --
  • Project category:
    Continuing Grant
Evolutionary and neurogenomic mechanisms of species discrimination during character displacement
  • Grant number:
    2338043
  • Fiscal year:
    2024
  • Funding amount:
    --
  • Project category:
    Standard Grant
RII Track-4:@NASA: Automating Character Extraction for Taxonomic Species Descriptions Using Neural Networks, Transformer, and Computer Vision Signal Processing Architectures
  • Grant number:
    2327168
  • Fiscal year:
    2024
  • Funding amount:
    --
  • Project category:
    Standard Grant
Development and improvement of a descriptive automatic scoring system incorporating handwritten character recognition
  • Grant number:
    23H03511
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Grant-in-Aid for Scientific Research (B)
Development of a character strengths intervention program in school that combines effectiveness and usefulness for students' well-being
  • Grant number:
    23K12873
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Grant-in-Aid for Early-Career Scientists
A Study of the Japanese National Character: Succession and Development
  • Grant number:
    23H00062
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Grant-in-Aid for Scientific Research (A)
CAREER: From the forest to the stream: Exploring forest land cover controls on dissolved organic matter character and aquatic ecosystem respiration in headwater streams
  • Grant number:
    2333030
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Continuing Grant
A Study on Character Learning Support Methods to Improve Character Identification Skills of Foreign Residents in Japan
  • Grant number:
    23K00613
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Synthverse - AI-driven character interaction
  • Grant number:
    10081870
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Collaborative R&D
Structure and Electronic Character of Multi-radicals Embedded in Cyclic Paraphenylene Units
  • Grant number:
    22KJ2323
  • Fiscal year:
    2023
  • Funding amount:
    --
  • Project category:
    Grant-in-Aid for JSPS Fellows