Social Perceptions of Synthetic Speakers
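Basic information
- Grant number: 423651352
- Principal investigator:
- Amount: --
- Host institution:
- Host institution country: Germany
- Project type: Research Grants
- Fiscal year: 2019
- Funding country: Germany
- Duration: 2018-12-31 to 2021-12-31
- Project status: Completed
- Source:
- Keywords:
Project abstract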
Speech signals automatically induce social perceptions of the speaker in listeners. Through acoustic analysis and signal manipulation, a large body of knowledge has accumulated on the acoustic correlates of social perceptions, such as spectral and prosodic parameters, and on the perceptual dimensions of natural speech. However, although modern speech synthesis paradigms now deliver very high quality, it is not yet understood whether results from natural speech also hold for synthesized speech. Hence, the major research question is: “Which acoustic features of synthesized speech affect subjective perceptions of social speaker characteristics?”
To answer this question, this project studies the social perception of the two basic social attributions, competence and benevolence, for text-to-speech (TTS) synthesizers in two potential application domains, with stimuli drawn from the topics of healthcare and customer service. Results are compared to those obtained from natural speech in earlier projects. It is tested whether competence and benevolence also emerge as basic social attributions, or whether other dimensions are more relevant. Regarding the speech signal, similarities and differences in acoustic parameters and their systematics are identified. A mid-term result is an acoustic prediction model of the identified social dimensions for synthesized speech.
On a methodological level, utterances are created with state-of-the-art TTS systems and systematically modified on the signal level in order to produce stimuli for empirical testing with human listeners. Crowd-sourcing techniques are applied for the required listening and rating tests. The final goal is to examine how acoustic features and patterns can be incorporated directly into modern TTS methodologies (Hidden Markov Models, Deep Neural Networks) instead of being applied through post-processing signal manipulation. This leads to the secondary research question: “Which alterations of the synthesis procedure lead to positive perceptions of speakers?” To this end, current approaches from speaker conversion are applied.
Apart from the fundamental knowledge gained from this research, results will be relevant for TTS system developers, in order to efficiently improve voices for particular service domains.
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants held by Professor Dr.-Ing. Sebastian Möller
Quantification of perceived location privacy, and its relationship to privacy behaviour
- Grant number: 409241470
- Fiscal year: 2019
- Funding amount: --
- Project type: Research Grants
Simulation of Conversation Behavior in Case of Impaired Telephone Transmission
- Grant number: 320253669
- Fiscal year: 2016
- Funding amount: --
- Project type: Research Grants
Quality Attributes and Overall Quality of Transmitted Speech
- Grant number: 289919134
- Fiscal year: 2016
- Funding amount: --
- Project type: Research Grants (Transfer Project)
Subjective measurement and instrumental estimation of mobile online gaming quality based on perceptual dimensions
- Grant number: 279244726
- Fiscal year: 2015
- Funding amount: --
- Project type: Research Grants
Subjective measurement and instrumental estimation of conversational speech quality based on perceptual dimensions
- Grant number: 251103195
- Fiscal year: 2014
- Funding amount: --
- Project type: Research Grants
Modellierung von Benutzerverhalten zur Usability-Evaluierung von Sprachdialogdiensten mit Hilfe von techniksoziologisch ermittelten Regeln
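Modeling user behavior for the usability evaluation of spoken dialogue services using rules derived from the sociology of technology
- Grant number: 152700694
- Fiscal year: 2009
- Funding amount: --
- Project type: Research Grants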
Qualitätsmessung multimodaler Mensch-Maschine-Interaktion
Quality measurement of multimodal human-machine interaction
- Grant number: 55252204
- Fiscal year: 2008
- Funding amount: --
- Project type: Research Grants
Qualität multimodaler Mensch-Maschine-Interaktion
Quality of multimodal human-machine interaction
- Grant number: 5454604
- Fiscal year: 2005
- Funding amount: --
- Project type: Heisenberg Fellowships
Sprachsignal-Qualitätsmessung auf der Grundlage auditiv und messtechnisch definierter Qualitätsattribute
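Speech signal quality measurement based on auditorily and instrumentally defined quality attributes
- Grant number: 5427824
- Fiscal year: 2004
- Funding amount: --
- Project type: Research Grants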
Knowledge-enhanced information extraction across languages for pharmacovigilance
- Grant number: 442445488
- Fiscal year:
- Funding amount: --
- Project type: Research Grants
Similar overseas grants
Postdoctoral Fellowship: STEMEdIPRF: Towards a Diverse Professoriate: Experiences that Inform Underrepresented Scholars' Perceptions of Value Alignment and Career Decisions
- Grant number: 2327411
- Fiscal year: 2024
- Funding amount: --
- Project type: Standard Grant
Heian Jingu and Jidai Matsuri: Changing Meanings and Perceptions from Their Creation to the Present
- Grant number: 24K03398
- Fiscal year: 2024
- Funding amount: --
- Project type: Grant-in-Aid for Scientific Research (C)
Perceptions, practices, autonomy levels and other variations among teachers who use AI teaching tools
- Grant number: 24K16628
- Fiscal year: 2024
- Funding amount: --
- Project type: Grant-in-Aid for Early-Career Scientists
Postdoctoral Fellowship: STEMEdIPRF: Increasing geoscience enrollment and engagement by transforming perceptions of geoscience among students and the general public
- Grant number: 2327348
- Fiscal year: 2024
- Funding amount: --
- Project type: Standard Grant
Collaborative Research: U.S. institutions after COVID-19: Trust, accountability, and public perceptions
- Grant number: 2422394
- Fiscal year: 2024
- Funding amount: --
- Project type: Standard Grant
Mindset Dynamics: Using the Perception Clarity Methodology (PCM) to shift perceptions
- Grant number: ES/Y011015/1
- Fiscal year: 2024
- Funding amount: --
- Project type: Research Grant
Reframing arrival: Transnational perspectives on perceptions, governance and practices - REFRAME.
- Grant number: AH/Y00759X/1
- Fiscal year: 2024
- Funding amount: --
- Project type: Research Grant
RAPID: DRL AI: Understanding Perceptions and Use of AI in K-12 Education Using a Nationally Representative Sample
- Grant number: 2334172
- Fiscal year: 2024
- Funding amount: --
- Project type: Standard Grant
Research Initiation: Addressing the Readiness Gap through Examining Engineering Students' Perceptions of their Future Professional Selves
- Grant number: 2306178
- Fiscal year: 2023
- Funding amount: --
- Project type: Standard Grant
Broadening Participation Research: Understanding faculty attitudes, competency, and perceptions of providing career advising to African American STEM students at HBCUs
- Grant number: 2306671
- Fiscal year: 2023
- Funding amount: --
- Project type: Continuing Grant