Automatic Estimation of Fundamental Frequency Contour Parameters and Automatic Acquisition of Generative rules

基频轮廓参数自动估计及生成规则自动获取

基本信息

  • 批准号:
    11480090
  • 负责人:
  • 金额:
    $ 7.81万
  • 依托单位:
  • 依托单位国家:
    日本
  • 项目类别:
    Grant-in-Aid for Scientific Research (B).
  • 财政年份:
    1999
  • 资助国家:
    日本
  • 起止时间:
    1999 至 2000
  • 项目状态:
    已结题

项目摘要

(1) Smoothing and interpolation of measured F_0 contours of read speech of JapaneseMedian-smoothing for removing gross errors of pitch detection, linear interpolation of voiceless intervals, and recursive piecewise approximation of the resulting contour by third-order polynomials, were combined to obtain a mathematical approximation to the measured F_0 contour that is continuous and differentiable everywhere.(2) Automatic estimation of parameters using the derivative of the smoothed F_0 contourFirst-order approximations to the timings and amplitudes of the accent commands were obtained from the derivative of the smoothed F_0 contour, while those of the phrase commands were obtained from the residual. These first-order estimations were then refined by the method of Analysis-by-Synthesis to obtain optimum estimations.(3) Automatic acquisition of rules for prosody generationAutomatic acquisition of rules for prosody generation were investigated. From analysis results of a large amount of speech material obtained by the above-mentioned methods, it was found that a three-level quantization was perceptually acceptable both for the amplitudes of the accent commands and the magnitudes of the phrase commands for synthesis of read speech of Japanese.
(1)日语朗读语音测量F_0轮廓的平滑和内插消除基音检测粗差的中值平滑、清音区间的线性内插和三阶多项式对测量轮廓的递归分段逼近,得到连续且处处可微的测量F_0轮廓的数学逼近。(2)利用平滑后的F_0轮廓的导数自动估计参数由平滑的F_0轮廓的导数得到重音命令的定时和幅度的一阶近似,而短语命令的一阶近似由平滑的F_0轮廓的导数得到。通过综合分析的方法对这些一阶估计进行精化,得到最优估计。(3)韵律生成规则的自动获取研究韵律生成规则的自动获取。从用上述方法获得的大量语音材料的分析结果中发现,对于合成日语朗读语音的重音命令的幅度和短语命令的幅度,三级量化在感知上都是可以接受的。

项目成果

期刊论文数量(42)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Hiroya Fujisaki: "Pre-processing of fundamental frequency contours of speech for automatic parameter extraction"Proceedings of the 5th International Conference of Signal Processing. 2. 722-725 (2000)
Hiroya Fujisaki:“用于自动参数提取的语音基频轮廓的预处理”第五届国际信号处理会议论文集。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
Shuichi Narusawa: "A method for automatic extraction of parameters of the fundamental frequency contour"Proceedings of the 6th International Conference on Spoken Language Processing. 1. 649-652 (2000)
Shuichi Narusawa:“一种自动提取基频轮廓参数的方法”第六届国际口语处理会议论文集。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
成澤修一: "日本語朗読音声の基本周波数パターンのパラメータ自動推定およびその生成規則の自動獲得"日本音響学会2001年春季研究発表会講演論文集. 1. 259-260 (2001)
Shuichi Narisawa:“日语背诵语音基频模式的自动参数估计及其生成规则的自动获取”日本声学学会 2001 年春季研究会议论文集 1. 259-260 (2001)。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
Hiroya FUJISAKI, Sumio OHNO, and Shuichi NARUSAWA: "A automatic estimation of input commands for generation of fundamental frequency contours in Japanese text reading"Record of 1999 Autumn Meeting, Acoustical Society of Japan. vol.1. 269-270 (1999)
Hiroya FUJISAKI、Sumio OHNO 和 Shuichi NARUSAWA:“在日语文本阅读中自动估计用于生成基频轮廓的输入命令”日本声学学会 1999 年秋季会议记录。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
Hiroya FUJISAKI, Shuichi NARUSAWA, Masako MARUNO, and Sumio OHNO: "Estimation of model parameters from smoothed fundamental frequency contours of Japanese text reading"Record of 2000 Spring Meeting, Acoustical Society of Japan. vol.1. 225-226 (2000)
Hiroya FUJISAKI、Shuichi NARUSAWA、Masako MARUNO 和 Sumio OHNO:“根据日语文本阅读的平滑基频轮廓估计模型参数”日本声学学会 2000 年春季会议记录。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

FUJISAKI Hiroya其他文献

FUJISAKI Hiroya的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('FUJISAKI Hiroya', 18)}}的其他基金

Construction of an Intelligent System for information Retrieval in an Environment of Information Network
信息网络环境下智能信息检索系统的构建
  • 批准号:
    09558041
  • 财政年份:
    1998
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
A System for Rule Synthesis of Prosodic Features of Speech of Multiple Language Based on a Generative Model of Fundamental Frequency Contours
基于基频轮廓生成模型的多语言语音韵律特征规则综合系统
  • 批准号:
    08458090
  • 财政年份:
    1996
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
International Coordination of Speech Databases, Prosodic Labeling, and Speech Input/Output Systems Assessment
语音数据库、韵律标记和语音输入/输出系统评估的国际协调
  • 批准号:
    08044173
  • 财政年份:
    1996
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for international Scientific Research
Trial Construction of an Advanced Computer-readable Lexical Database Capable of Automatic Acquisition of Lexical Information
自动获取词汇信息的先进计算机可读词汇数据库的试建
  • 批准号:
    07558274
  • 财政年份:
    1995
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for Scientific Research (A)
International Standardization of Spoken Language Detabases
口语数据库国际标准化
  • 批准号:
    05044112
  • 财政年份:
    1993
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for international Scientific Research
Production of a Prototype Lexical Database Featuring High-speed, High-accuracy Access and Lexical Knowledge Acquisition
高速、高精度访问和词汇知识获取的原型词汇数据库的制作
  • 批准号:
    05558038
  • 财政年份:
    1993
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for Developmental Scientific Research (B)
A scheme for continuous speech recognition in a large context based on the human process of spoken language recognition
基于人类口语识别过程的大上下文连续语音识别方案
  • 批准号:
    03452164
  • 财政年份:
    1991
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for General Scientific Research (B)
Research on International Standardization of Spoken Language Database and Assessment Techniques for Speech Input/Output
口语数据库国际标准化及语音输入输出评估技术研究
  • 批准号:
    02044041
  • 财政年份:
    1990
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for international Scientific Research
Co-operative Research on Modeling of Language Acquisition and Concept Formation Process in Engineering
工程中语言习得和概念形成过程建模的合作研究
  • 批准号:
    01300004
  • 财政年份:
    1989
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for Co-operative Research (A)
Research on Synthesis Method for Spoken Sentences from Knowledge Representation
知识表示的口语句子合成方法研究
  • 批准号:
    63420051
  • 财政年份:
    1988
  • 资助金额:
    $ 7.81万
  • 项目类别:
    Grant-in-Aid for General Scientific Research (A)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了