Collaborative Research: Improving speech technology for better learning outcomes: the case of AAE child speakers
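Basic information
- Award number: 2202585
- Principal investigator:
- Amount: $318,900
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2022
- Funding country: United States
- Project period: 2022-05-01 to 2025-04-30
- Project status: Not yet concluded
- Source:
- Keywords:
Abstract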
The lack of reading proficiency seen in children of underserved school districts has lasting impacts on students’ performance in various subjects. Low literacy is an especially pressing issue for African American students. Interactive spoken language systems offer the possibility of a powerful tool for assisting in early childhood education, freeing up teachers’ time, and engaging students in repeated opportunities for learning. These systems involve both Automatic Speech Recognition and Text-to-Speech Systems. The goal of this research is to improve the performance of such systems for young speakers of African American English (AAE) such that automated oral literacy assessment can be developed. The research has important societal and technological impacts. It will enhance the usability of speech technology in early education for AAE-speaking children, providing a model for better supporting students with diverse dialects. Many under-resourced children do not have access to adequate reading and language assessments, and the proposed work will address these issues by creating methods for adapting spoken language technology to AAE children, increasing fairness in speech technology on a broader scale. The work has strong outreach and dissemination programs and will train undergraduate and graduate students in interdisciplinary research in Electrical and Computer Engineering, Linguistics, Education, and Psychology.
Challenges facing children’s Automatic Speech Recognition (ASR) are due to (1) a lack of child speech data, so that current recognition models are trained on data collected from adult speakers, and (2) the wider range of intra- and inter-speaker variability that children display compared with adults. ASR performance is especially poor for children who are non-native English speakers or those who at times transition into dialects such as AAE that are different from what ASR systems are typically trained on. In addition, most dialog systems built on text-to-speech (TTS) technology are designed using General American English (GAE) voices, which minority children may not identify with. In the high-stakes area of education, these considerations impact the effectiveness of technology for different groups. The work will utilize a new and continuously developing database of AAE children's speech to research the impact of spoken language systems on children’s learning outcomes. On the learning side, the research will highlight the impact of dialect on literacy assessment. On the technology side, the work will yield novel machine learning algorithms for low-resource tasks. Specifically, this project will develop data augmentation techniques that can increase the amount of training data available for low-resource tasks, and data normalization techniques so that ASR performance is improved for AAE child speakers. The work on TTS will explore new methods of disentangling speaker and dialect impacts on spectral realization of phrases that model dialect density (rather than treating dialect as a categorical variable) and separately accounting for pronunciation and prosodic factors. Methods found to be effective for TTS will be leveraged in the data augmentation work for ASR and explored as a diagnostic in literacy assessment.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project outputs
Journal articles (3)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Towards Effective Speech-based AI in the Classroom: The Case of AAE-Speaking Children
- DOI:
- Published: 2022
- Journal:
- Impact factor: 0
- Authors: Alexander Johnson; Julie Washington; Robin Morris; Mari Ostendorf; Alison Bailey; Abeer Alwan
- Corresponding author: Abeer Alwan
Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and Children
- DOI:
- Published: 2023
- Journal:
- Impact factor: 0
- Authors: Alexander Johnson; Vishwas Shetty; Mari Ostendorf; Abeer Alwan
- Corresponding author: Abeer Alwan
Other publications by Abeer Alwan
Modeling auditory perception to improve robust speech recognition
- DOI:
- Published: 1997
- Journal:
- Impact factor: 0
- Authors: B. Strope; Abeer Alwan
- Corresponding author: Abeer Alwan
Unraveling the associations between voice pitch and major depressive disorder: a multisite genetic study
- DOI: 10.1038/s41380-024-02877-y
- Published: 2024-12-31
- Journal:
- Impact factor: 10.100
- Authors: Yazheng Di; Elior Rahmani; Joel Mefford; Jinhan Wang; Vijay Ravi; Aditya Gorla; Abeer Alwan; Kenneth S. Kendler; Tingshao Zhu; Jonathan Flint
- Corresponding author: Jonathan Flint
Optical Phonetics and Visual Percep Stress in Eng
- DOI:
- Published: 2003
- Journal:
- Impact factor: 0
- Authors: P. Keating; Marco Baroni; Sven Matty; E. T. Auer; Rebecca Scarborough; Abeer Alwan; E. Bernstein
- Corresponding author: E. Bernstein
Towards Automatically Assessing Children’s Picture Description Tasks
- DOI:
- Published:
- Journal:
- Impact factor: 0
- Authors: Hariram Veeramani; Natarajan Balaji Shankar; Alexander Johnson; Abeer Alwan
- Corresponding author: Abeer Alwan
Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals.
- DOI:
- Published: 1997
- Journal:
- Impact factor: 2.4
- Authors: Shrikanth S. Narayanan; Abeer Alwan; K. Haker
- Corresponding author: K. Haker
Other grants by Abeer Alwan
Collaborative Research: RI: Small: From Ultrasound and MRI to articulatory and acoustic models of child speech development
- Award number: 2006979
- Fiscal year: 2020
- Funding amount: $318,900
- Award type: Standard Grant
Workshop for Undergraduate and MS Female Students in Speech Science and Technology
- Award number: 1745166
- Fiscal year: 2017
- Funding amount: $318,900
- Award type: Standard Grant
NRI: INT: COLLAB: Development, Deployment and Evaluation of Personalized Learning Companion Robots for Early Literacy and Language Learning
- Award number: 1734380
- Fiscal year: 2017
- Funding amount: $318,900
- Award type: Standard Grant
RI: Medium: Collaborative Research: Variance and Invariance in Voice Quality: Implications for Machine and Human Speaker Identification
- Award number: 1704167
- Fiscal year: 2017
- Funding amount: $318,900
- Award type: Continuing Grant
A Workshop for Junior Female Researchers in Speech Science and Technology
- Award number: 1637240
- Fiscal year: 2016
- Funding amount: $318,900
- Award type: Standard Grant
The Role of Speech Science in Developing Robust Speech Technology Applications
- Award number: 1543522
- Fiscal year: 2015
- Funding amount: $318,900
- Award type: Standard Grant
EAGER: Collaborative Research: Models of Child Speech
- Award number: 1551113
- Fiscal year: 2015
- Funding amount: $318,900
- Award type: Standard Grant
EAGER: Variance and Invariance in Voice Quality
- Award number: 1450992
- Fiscal year: 2014
- Funding amount: $318,900
- Award type: Standard Grant
EAGER: Collaborative Research: Towards Modeling Human Speech Confusions in Noise
- Award number: 1247809
- Fiscal year: 2012
- Funding amount: $318,900
- Award type: Standard Grant
RI: Small: A New Voice Source Model: From Glottal Areas to Better Speech Synthesis
- Award number: 1018863
- Fiscal year: 2010
- Funding amount: $318,900
- Award type: Continuing Grant
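Similar NSFC (National Natural Science Foundation of China) grants
Research on Quantum Field Theory without a Lagrangian Description
- Award number: 24ZR1403900
- Approval year: 2024
- Funding amount: CNY 0
- Project type: Provincial/municipal project
Cell Research
- Award number: 31224802
- Approval year: 2012
- Funding amount: CNY 240,000
- Project type: Special Fund Project
Cell Research
- Award number: 31024804
- Approval year: 2010
- Funding amount: CNY 240,000
- Project type: Special Fund Project
Cell Research
- Award number: 30824808
- Approval year: 2008
- Funding amount: CNY 240,000
- Project type: Special Fund Project
Research on the Rapid Growth Mechanism of KDP Crystal
- Award number: 10774081
- Approval year: 2007
- Funding amount: CNY 450,000
- Project type: General Program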
Similar overseas grants
Collaborative Research: Improving Upper Division Physics Education and Strengthening Student Research Opportunities at 14 HSIs in California
- Award number: 2345092
- Fiscal year: 2024
- Funding amount: $318,900
- Award type: Standard Grant
Collaborative Research: Improving Upper Division Physics Education and Strengthening Student Research Opportunities at 14 HSIs in California
- Award number: 2345093
- Fiscal year: 2024
- Funding amount: $318,900
- Award type: Standard Grant
SBP: Collaborative Research: Improving Engagement with Professional Development Programs by Attending to Teachers' Psychosocial Experiences
- Award number: 2314254
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Standard Grant
Collaborative Research: Improving Worker Safety by Understanding Risk Compensation as a Latent Precursor of At-risk Decisions
- Award number: 2326937
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Continuing Grant
Collaborative Research: Improving Model Representations of Antarctic Ice-shelf Instability and Break-up due to Surface Meltwater Processes
- Award number: 2213704
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Standard Grant
Collaborative Research: SaTC: CORE: Small: Measuring, Validating and Improving upon App-Based Privacy Nutrition Labels
- Award number: 2247952
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Standard Grant
Collaborative Research: Reducing Model Uncertainty by Improving Understanding of Pacific Meridional Climate Structure during Past Warm Intervals
- Award number: 2303568
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Continuing Grant
Collaborative Research: SitS: Improving Rice Cultivation by Observing Dynamic Soil Chemical Processes from Grain to Landscape Scales
- Award number: 2226647
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Standard Grant
Collaborative Research: SitS: Improving Rice Cultivation by Observing Dynamic Soil Chemical Processes from Grain to Landscape Scales
- Award number: 2226648
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Standard Grant
Collaborative Research: CISE-MSI: RCBP-RF: CPS: Socially Informed Traffic Signal Control for Improving Near Roadway Air Quality
- Award number: 2318696
- Fiscal year: 2023
- Funding amount: $318,900
- Award type: Standard Grant