EAGER: Identifying and Producing Code-Switching in Languages from Spoken, Lexical and Socio-linguistic Features

EAGER:根据口语、词汇和社会语言特征识别和产生语言中的语码转换

基本信息

  • 批准号:
    2327564
  • 负责人:
  • 金额:
    $ 10.89万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2023
  • 资助国家:
    美国
  • 起止时间:
    2023-06-15 至 2024-05-31
  • 项目状态:
    已结题

项目摘要

This Early Grant for Exploratory Research investigates conversations between the vast number of persons in our world who speak multiple languages and who frequently switch back and forth between those languages in what is called “code-switching”. It is important for speech dialogue systems and voice assistants to not only be able to identify when, why, and to what effect code-switching occurs, but also to correctly interpret what is said and to be able to generate similarly code-switched responses when interacting with such users. Advances in speech technology in recent years have resulted in widespread use of voice assistants such as Siri, Google Assistant and Alexa. They enable vast improvement in information access by voice for languages such as English, French, German, Cantonese, Mandarin, and Spanish. However, such access is limited to monolingual speech, which for many multilingual speakers is not the most natural form of speech production. Thus, code-switched speech is rarely understood correctly and is never able to be produced in assistant responses. A major barrier to enabling naturalistic and comfortable communication for these speakers is the lack of speech technology that can not only understand code-switched input but also produce similar human-like output. This project addresses these issues by examining how spoken and written code-switching interacts with other aspects of language communication. It will explore research questions not yet studied in code-switching research including (1) whether speakers entrain, speak more similarly, on pronunciation and other strategies of code-switching in speech; (2) whether there is a quantifiable relationship between code-switching and empathy in speech, where empathy is a speaker's intention to convey that they understand another's problems and want to help address them; (3) whether the presence of named entities, such as names or geographical locations, primes code-switching; (4) which dialogue acts, such as questions or statements or backchannels, tend to be produced most often in code-switched speech; and (5) how speakers produce intonational contours when they code-switch (via choosing their intonation production to match either of the languages they are producing or by being different from both?) Statistical and machine-learning techniques will both be used to address these questions in the context of spoken and lexical-feature-tagged code-switched speech in Standard American English, Spanish, Mandarin Chinese, and Hindi. By identifying new aspects of code-switching, the project will seed further exploration of this phenomenon by the research community.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
这项探索性研究的早期拨款调查了我们这个世界上讲多种语言的大量人之间的对话,这些人经常在这些语言之间来回切换,这被称为“代码转换”。语音对话系统和语音助手不仅要能够识别代码转换发生的时间、原因和效果,而且要正确解释所说的内容,并能够在与此类用户交互时生成类似的代码转换响应,这一点很重要。近年来,语音技术的进步导致了Siri、b谷歌Assistant和Alexa等语音助手的广泛使用。它们极大地改善了英语、法语、德语、广东话、普通话和西班牙语等语言的语音信息获取。然而,这种访问仅限于单语语音,对于许多多语使用者来说,单语语音并不是最自然的语音产生形式。因此,代码转换语音很少被正确理解,并且永远无法在辅助响应中产生。对于这些说话者来说,实现自然和舒适的交流的一个主要障碍是缺乏语音技术,这种技术不仅可以理解代码切换的输入,还可以产生类似人类的输出。本项目通过考察口语和书面语代码转换如何与语言交流的其他方面相互作用来解决这些问题。它将探讨语码转换研究中尚未研究的研究问题,包括(1)说话者是否在语音中的发音和其他语码转换策略上更相似;(2)语码转换与言语共情之间是否存在可量化的关系,其中共情是说话者意图传达他们理解他人的问题并希望帮助解决问题;(3)名称实体(如名称或地理位置)的存在是否会促进代码转换;(4)在语码转换言语中,哪些对话行为(如提问、陈述或反向通道)最常发生;(5)说话者在进行语码转换时是如何产生语调轮廓的(通过选择语调产生来匹配他们所产生的其中一种语言,还是通过与两种语言都不同?)统计和机器学习技术都将用于在标准美式英语、西班牙语、普通话和印地语的口语和词汇特征标记的代码转换语音环境中解决这些问题。通过识别代码转换的新方面,该项目将为研究界对这一现象的进一步探索奠定基础。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Julia Hirschberg其他文献

A Novel Methodology for Developing Automatic Harassment Classifiers for Twitter
一种为 Twitter 开发自动骚扰分类器的新方法
  • DOI:
    10.18653/v1/2020.alw-1.2
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Ishaan Arora;Julia Guo;Sarah Ita Levitan;Susan E Mcgregor;Julia Hirschberg
  • 通讯作者:
    Julia Hirschberg
Detecting Inappropriate Clarification Requests in Spoken Dialogue Systems
检测口语对话系统中不适当的澄清请求
  • DOI:
    10.3115/v1/w14-4331
  • 发表时间:
    2014
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Alex Liu;Rose Sloan;M. Then;Svetlana Stoyanchev;Julia Hirschberg;Elizabeth Shriberg
  • 通讯作者:
    Elizabeth Shriberg
The Prosody of Backchannels in American English
美式英语中 Backchannels 的韵律
  • DOI:
    10.7916/d8ww7s1p
  • 发表时间:
    2007
  • 期刊:
  • 影响因子:
    3.6
  • 作者:
    S. Benus;Agustin Gravano;Julia Hirschberg
  • 通讯作者:
    Julia Hirschberg
A Speech-First Model for Repair Detection and Correction
用于修复检测和纠正的语音优先模型
Intonation and discourse processing
语调和话语处理
  • DOI:
    10.7916/d8pv6tp1
  • 发表时间:
    2003
  • 期刊:
  • 影响因子:
    1.1
  • 作者:
    Julia Hirschberg;J. Venditti
  • 通讯作者:
    J. Venditti

Julia Hirschberg的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Julia Hirschberg', 18)}}的其他基金

RI: Small: Creating Text-to-Speech Synthesis for Low Resource Languages
RI:小型:为低资源语言创建文本到语音合成
  • 批准号:
    1717680
  • 财政年份:
    2017
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
EAGER: Creating Speech Synthesizers for Low Resource Languages
EAGER:为低资源语言创建语音合成器
  • 批准号:
    1548092
  • 财政年份:
    2015
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
Collaborative Research: CI-P: Reciprosody - A Repository for Prosodically Annotated Material
合作研究:CI-P:Reciprosody - 韵律注释材料存储库
  • 批准号:
    1205450
  • 财政年份:
    2012
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
Using Computational Tools to Facilitate Corpus Collection and Language Use in Arrernte (aer)
使用计算工具促进 Arrernte (aer) 中的语料库收集和语言使用
  • 批准号:
    1160700
  • 财政年份:
    2012
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
IGERT: From Data to Solutions: A New PhD Program in Transformational Data & Information Sciences Research and Innovation
IGERT:从数据到解决方案:一个新的转型数据博士项目
  • 批准号:
    1144854
  • 财政年份:
    2012
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Continuing Grant
CI-P: Collaborative Research: Summarizing Opinion and Speaker Attitude in Speech
CI-P:协作研究:总结观点和演讲者在演讲中的态度
  • 批准号:
    1059260
  • 财政年份:
    2011
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
EAGER: Using Social Media and Crowdsourcing to Create a New Affect Dictionary
EAGER:利用社交媒体和众包创建新的情感词典
  • 批准号:
    1145505
  • 财政年份:
    2011
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
RI: Medium: Collaborative Research: From Text to Pictures
RI:媒介:协作研究:从文本到图片
  • 批准号:
    0904361
  • 财政年份:
    2009
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
RI-Medium: Collaborative: Corpus-Based Studies of Lexical, Acoustic-Prosodic, and Discourse Entrainment in Spoken Dialogue
RI-Medium:协作:基于语料库的口语对话中的词汇、声学韵律和话语夹带研究
  • 批准号:
    0803148
  • 财政年份:
    2008
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
Doctoral Consortium at The Human Language Technology Conference - North American chapter of the Association for Computational Linguistics annual meeting (NAACL HLT) 2007.
人类语言技术会议博士联盟 - 计算语言学协会年会 (NAACL HLT) 2007 年北美分会。
  • 批准号:
    0707305
  • 财政年份:
    2007
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant

相似海外基金

Identifying potential trade-offs of adapting to climate change
确定适应气候变化的潜在权衡
  • 批准号:
    DP240100230
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Discovery Projects
Identifying key fire drivers in Australia; biomass, climate or people
确定澳大利亚的主要火灾驱动因素;
  • 批准号:
    DE240100340
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Discovery Early Career Researcher Award
Collaborative Research: Dynamic connectivity of river networks as a framework for identifying controls on flux propagation and assessing landscape vulnerability to change
合作研究:河流网络的动态连通性作为识别通量传播控制和评估景观变化脆弱性的框架
  • 批准号:
    2342936
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Continuing Grant
Collaborative Research: Dynamic connectivity of river networks as a framework for identifying controls on flux propagation and assessing landscape vulnerability to change
合作研究:河流网络的动态连通性作为识别通量传播控制和评估景观变化脆弱性的框架
  • 批准号:
    2342937
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Continuing Grant
Identifying the goals and strategies people use to make others feel worse
确定人们用来让别人感觉更糟的目标和策略
  • 批准号:
    FT230100401
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    ARC Future Fellowships
NPBactID - Differential binding of peptoid functionalized nanoparticles to bacteria for identifying specific strains
NPBactID - 类肽功能化纳米粒子与细菌的差异结合,用于识别特定菌株
  • 批准号:
    EP/Y029542/1
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Fellowship
Developing Algorithms for Identifying Gene Modules in Single-Cell RNA-Seq Using Signed Graphs
开发使用符号图识别单细胞 RNA-Seq 中基因模块的算法
  • 批准号:
    24K18100
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Postdoctoral Fellowship: OPP-PRF: Identifying Central and Peripheral Thermosensors in Eurythermal and Stenothermal Arctic Fishes
博士后奖学金:OPP-PRF:识别广温和窄温北极鱼类的中央和外周热传感器
  • 批准号:
    2317970
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Standard Grant
Antecedents and consequences of cruise travel experience: Identifying contributors to well-being of cruise tourists.
邮轮旅行体验的前因和后果:确定邮轮游客福祉的贡献者。
  • 批准号:
    24K15536
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
CAREER: Identifying reaction mechanisms for the formation of stable interphases in lithium metal batteries
职业:确定锂金属电池中形成稳定界面的反应机制
  • 批准号:
    2338202
  • 财政年份:
    2024
  • 资助金额:
    $ 10.89万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了