Generating Linguistic Insights in Question Classification throughCombining Explainable Machine Learning and Visualization

通过结合可解释的机器学习和可视化来生成问题分类中的语言见解

基本信息

项目摘要

This project tackles the question classification task: the task of automatically distinguishing between different types of canonical and non-canonical questions. The project encompasses three parts. One part focuses on the extraction and collection of linguistic information to be used as features for Machine Learning models (ML) and Visual Analytics (VA) techniques for the classification task. The second part includes VA techniques for the interactive adjustment of the ML models to improve question classification. Through an interactive manipulation of the model's visual representation we enable the user to interactively adapt and improve the learned model. The third part deals with making the ML techniques transparent in order to generate additional linguistic insights for question classification. We aim to develop novel methods to communicate decisions made by a ML model and provide linguistic insights for the task at hand. By pursuing these goals we are contributing to the research of questions by developing novel tools and methodology within computational linguistics and VA for the analysis and classification of question types. At the same time, we interact with other projects of the Research Unit in terms of carrying the possibilities opened up by LingVis (Visualization for Linguistics) into individual projects and applying them towards the work pursued by these projects. This project benefits from a feedback cycle in that it can integrate insights on the structure of questions coming from other projects into its own classificatory work and can in turn produce results that can then be carried into the other projects.
这个项目处理问题分类任务:自动区分不同类型的规范和非规范问题的任务。该项目包括三个部分。一部分侧重于语言信息的提取和收集,这些信息将用作机器学习模型(ML)和视觉分析(VA)技术用于分类任务的特征。第二部分包括用于交互式调整ML模型的VA技术,以改进问题分类。通过对模型可视化表示的交互式操作,我们使用户能够交互式地适应和改进学习到的模型。第三部分涉及使机器学习技术透明,以便为问题分类生成额外的语言见解。我们的目标是开发新的方法来传达由ML模型做出的决策,并为手头的任务提供语言见解。通过追求这些目标,我们正在通过在计算语言学和VA中开发用于分析和分类问题类型的新工具和方法,为问题的研究做出贡献。与此同时,我们与研究部门的其他项目进行互动,将LingVis(语言学可视化)所带来的可能性应用到各个项目中,并将其应用到这些项目所追求的工作中。该项目受益于反馈循环,因为它可以将来自其他项目的问题结构的见解集成到自己的分类工作中,并且可以反过来产生可用于其他项目的结果。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Professorin Dr. Miriam Butt其他文献

Professorin Dr. Miriam Butt的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Professorin Dr. Miriam Butt', 18)}}的其他基金

Visual Analytics and Linguistics for Interpreting Deliberative Argumentation (VALIDA)
用于解释协商论证的视觉分析和语言学(VALIDA)
  • 批准号:
    376714276
  • 财政年份:
    2017
  • 资助金额:
    --
  • 项目类别:
    Priority Programmes
Information Structure and Questions in Urdu/Hindi
乌尔都语/印地语的信息结构和问题
  • 批准号:
    276392517
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Research Units
Coordination Funds
协调基金
  • 批准号:
    276396713
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Research Units
Visual Analysis of Language Change and Use Patterns
语言变化和使用模式的可视化分析
  • 批准号:
    218458885
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Research Grants
Computerlinguistische Implementierung einer großen, robusten Grammatik für Urdu/Hindi im Kontext paralleler Grammatikentwicklung
在并行语法开发的背景下,大型、稳健的乌尔都语/印地语语法的计算语言实现
  • 批准号:
    77719491
  • 财政年份:
    2009
  • 资助金额:
    --
  • 项目类别:
    Research Grants
CUEPAQ: Visual Analytics and Linguistics for Capturing, Understanding, and Explaining Personalized Argument Quality
CUEPAQ:用于捕获、理解和解释个性化论证质量的视觉分析和语言学
  • 批准号:
    455910360
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
    Priority Programmes

相似海外基金

A Cross-Linguistic Study on Speech Fluency in L1 Japanese and L2 English
日语一级和英语二级的言语流利度跨语言研究
  • 批准号:
    24K04169
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Use and Concept in Neural Machine Translation and Cross-Linguistic Divergence
神经机器翻译和跨语言分歧中的使用和概念
  • 批准号:
    23K21872
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Doctoral Dissertation Research: Place, Persona, and Linguistic Style among College Student Border Commuters
博士论文研究:大学生边境通勤者的地点、角色和语言风格
  • 批准号:
    2336567
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAREER: Investigating linguistic and cognitive abstractions for solving word problems in minds and machines
职业:研究语言和认知抽象以解决大脑和机器中的文字问题
  • 批准号:
    2339729
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Characteristics of media use and linguistic trajectories during early childhood
幼儿期媒体使用特征和语言轨迹
  • 批准号:
    2235083
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Linguistic discrimination and migrant youth in regional Australia
澳大利亚乡村地区的语言歧视和移民青年
  • 批准号:
    DE230101209
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Discovery Early Career Researcher Award
Applying an equity and diversity lens to understand the care experiences and healthcare outcomes of low income and linguistic minority groups in Ontario retirement homes: A mixed methods study
应用公平和多样性的视角来了解安大略省养老院中低收入和语言少数群体的护理体验和医疗保健结果:一项混合方法研究
  • 批准号:
    484613
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Fellowship Programs
Building an Error-Annotated Corpus of Learner Indonesian and Developing an Automated Writing Support for Japanese Students Using Deep Linguistic Indonesian Parsers
建立一个错误注释的印尼语学习者语料库,并使用深度语言印尼语解析器为日本学生开发自动写作支持
  • 批准号:
    23K12235
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Modelling bottom-up and top-down linguistic knowledge across different contexts of bilingual development
在双语发展的不同背景下对自下而上和自上而下的语言知识进行建模
  • 批准号:
    ES/X008266/1
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Research Grant
Linguistic knowledge and language change: Testing and forming a theory of social meaning formation based on Irish English usage data
语言知识和语言变化:基于爱尔兰英语使用数据测试和形成社会意义形成理论
  • 批准号:
    22KK0193
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Fund for the Promotion of Joint International Research (Fostering Joint International Research (A))
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了