Large-Scale Discriminative Modelling for Data-Intensive Speech and Language Processing

数据密集型语音和语言处理的大规模判别建模

基本信息

  • 批准号:
    261540-2013
  • 负责人:
  • 金额:
    $ 1.82万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2015
  • 资助国家:
    加拿大
  • 起止时间:
    2015-01-01 至 2016-12-31
  • 项目状态:
    已结题

项目摘要

As the wireless Internet and smartphones have become more and more prevalent in our daily life, many service providers have used the so-called cloud-computing framework to deliver various types of speech and language related services to customers. Under this background, it is very easy and fast for these providers to accumulate massive amount of real-world data in their central servers. Therefore, it has become a very interesting research topic regarding how to take advantage of the mass data available in this unprecedented big data era to further boost up performance to next level for many real world applications. In this research program, we will study this problem in the context of data-intensive speech and language applications, such as automatic speech recognition, text categorization and spoken language processing. We will first focus our research on investigating new discriminative models that are effective to model the most important and pertinent information available in the massive training data, which may be noisy and unlabelled. Meanwhile, we will also design efficient learning algorithms that are flexible enough to take full advantage of currently popular parallel computing facilities, such as multi-core CPUs, GPUs and map-reduce based distributed computing platform. More specifically, based on our previous research on margin-based discriminative learning, we will study three important sub-topics in this research program: i) how to extract discriminative and compact features for speech and language data; ii) how to build flexible discriminative models capable of dealing with a vast quantity of data in data-intensive speech and language applications; iii) how to parallelize learning algorithms to efficiently solve large-scale optimization in model estimation. We believe these issues are critical for us to fully benefit from massive real-world data in many data-intensive speech and language applications. This research program will help to advance large-scale learning and automatic knowledge discovery technologies for many Canadian and International industrial IT sections that are based on the more and more popular cloud-computing model.
随着无线互联网和智能手机在我们的日常生活中越来越普遍,许多服务提供商使用所谓的云计算框架为客户提供各种类型的语音和语言相关服务。在这种背景下,这些提供商可以非常容易和快速地在其中央服务器中积累大量真实世界的数据。因此,在这个前所未有的大数据时代,如何利用海量的可用数据,进一步将许多现实应用的性能提升到一个新的水平,已经成为一个非常有趣的研究课题。在本研究计划中,我们将在数据密集型语音和语言应用的背景下研究这个问题,如自动语音识别、文本分类和口语处理。我们首先将研究重点放在研究新的判别模型上,这些模型可以有效地对大量训练数据中最重要和最相关的信息进行建模,这些数据可能是嘈杂的和未标记的。同时,我们还将设计高效的学习算法,这些算法足够灵活,可以充分利用当前流行的并行计算设施,如多核cpu、gpu和基于map-reduce的分布式计算平台。更具体地说,在我们之前基于边缘的判别学习研究的基础上,我们将研究三个重要的子主题:1)如何提取语音和语言数据的判别和紧凑特征;Ii)如何在数据密集型语音和语言应用中构建灵活的判别模型,能够处理大量数据;Iii)如何并行化学习算法以有效解决模型估计中的大规模优化问题。我们认为,这些问题对于我们充分受益于许多数据密集型语音和语言应用程序中的大量真实数据至关重要。这一研究项目将有助于推动基于越来越流行的云计算模型的许多加拿大和国际工业IT部门的大规模学习和自动知识发现技术。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Jiang, Hui其他文献

Transmission of multidrug-resistant tuberculosis in Beijing, China: An epidemiological and genomic analysis.
  • DOI:
    10.3389/fpubh.2022.1019198
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    5.2
  • 作者:
    Yin, Jinfeng;Zhang, Hongwei;Gao, Zhidong;Jiang, Hui;Qin, Liyi;Zhu, Chendi;Gao, Qian;He, Xiaoxin;Li, Weimin
  • 通讯作者:
    Li, Weimin
Optimization of a multilayer Laue lens system for a hard x-ray nanoprobe
用于硬 X 射线纳米探针的多层劳厄透镜系统的优化
  • DOI:
    10.1088/2040-8978/16/1/015002
  • 发表时间:
    2013
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Jiang, Hui;Wang, Hua;Mao, Chengwen;Li, Aiguo;He, Yan;Dong, Zhaohui;Zheng, Yi
  • 通讯作者:
    Zheng, Yi
Liver serine palmitoyltransferase activity deficiency in early life impairs adherens junctions and promotes tumorigenesis.
  • DOI:
    10.1002/hep.28845
  • 发表时间:
    2016-12
  • 期刊:
  • 影响因子:
    13.5
  • 作者:
    Li, Zhiqiang;Kabir, Inamul;Jiang, Hui;Zhou, Hongwen;Libien, Jenny;Zeng, Jianying;Stanek, Albert;Ou, Peiqi;Li, Kailyn R.;Zhang, Shane;Bui, Hai H.;Kuo, Ming-Shang;Park, Tae-Sik;Kim, Benjamin;Worgall, Tilla S.;Huan, Chongmin;Jiang, Xian-Cheng
  • 通讯作者:
    Jiang, Xian-Cheng
An acyltransferase domain of FK506 polyketide synthase recognizing both an acyl carrier protein and coenzymeA as acyl donors to transfer allylmalonyl and ethylmalonyl units
FK506 聚酮合酶的酰基转移酶结构域识别酰基载体蛋白和辅酶 A 作为酰基供体以转移烯丙基丙二酰基和乙基丙二酰基单位
  • DOI:
    10.1111/febs.13296
  • 发表时间:
    2015-07-01
  • 期刊:
  • 影响因子:
    5.4
  • 作者:
    Jiang, Hui;Wang, Yue-Yue;Li, Yong-Quan
  • 通讯作者:
    Li, Yong-Quan
High level of intraoperative lactate might predict acute kidney injury in aortic arch surgery via minimally invasive approach in patients with type A dissection.
  • DOI:
    10.3389/fcvm.2023.1188393
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    3.6
  • 作者:
    Lyu, Ying;Liu, Yu;Xiao, Xiong;Yang, Zhonglu;Ge, Yuguang;Jiang, Hui
  • 通讯作者:
    Jiang, Hui

Jiang, Hui的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Jiang, Hui', 18)}}的其他基金

Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    RGPIN-2018-05870
  • 财政年份:
    2022
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    RGPIN-2018-05870
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    RGPIN-2018-05870
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    522577-2018
  • 财政年份:
    2019
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    RGPIN-2018-05870
  • 财政年份:
    2019
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    522577-2018
  • 财政年份:
    2018
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Exploring New Neural Computing Models for Natural Language Understanding
探索自然语言理解的新神经计算模型
  • 批准号:
    RGPIN-2018-05870
  • 财政年份:
    2018
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Large-Scale Discriminative Modelling for Data-Intensive Speech and Language Processing
数据密集型语音和语言处理的大规模判别建模
  • 批准号:
    261540-2013
  • 财政年份:
    2017
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Large-Scale Discriminative Modelling for Data-Intensive Speech and Language Processing
数据密集型语音和语言处理的大规模判别建模
  • 批准号:
    261540-2013
  • 财政年份:
    2016
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Large-Scale Discriminative Modelling for Data-Intensive Speech and Language Processing
数据密集型语音和语言处理的大规模判别建模
  • 批准号:
    261540-2013
  • 财政年份:
    2014
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual

相似国自然基金

基于热量传递的传统固态发酵过程缩小(Scale-down)机理及调控
  • 批准号:
    22108101
  • 批准年份:
    2021
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
基于Multi-Scale模型的轴流血泵瞬变流及空化机理研究
  • 批准号:
    31600794
  • 批准年份:
    2016
  • 资助金额:
    22.0 万元
  • 项目类别:
    青年科学基金项目
针对Scale-Free网络的紧凑路由研究
  • 批准号:
    60673168
  • 批准年份:
    2006
  • 资助金额:
    25.0 万元
  • 项目类别:
    面上项目

相似海外基金

Renewal application: How do ecological trade-offs drive ectomycorrhizal fungal community assembly? Fine- scale processes with large-scale implications
更新应用:生态权衡如何驱动外生菌根真菌群落组装?
  • 批准号:
    MR/Y011503/1
  • 财政年份:
    2025
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Fellowship
Flok Health: Developing a pocket physiotherapist to deliver personalised healthcare at population scale
Flok Health:开发袖珍物理治疗师以在人口规模上提供个性化医疗保健
  • 批准号:
    10100082
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative R&D
LSS_BeyondAverage: Probing cosmic large-scale structure beyond the average
LSS_BeyondAverage:探测超出平均水平的宇宙大尺度结构
  • 批准号:
    EP/Y027906/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Research Grant
SAFER - Secure Foundations: Verified Systems Software Above Full-Scale Integrated Semantics
SAFER - 安全基础:高于全面集成语义的经过验证的系统软件
  • 批准号:
    EP/Y035976/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Research Grant
Imaging for Multi-scale Multi-modal and Multi-disciplinary Analysis for EnGineering and Environmental Sustainability (IM3AGES)
工程和环境可持续性多尺度、多模式和多学科分析成像 (IM3AGES)
  • 批准号:
    EP/Z531133/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Research Grant
Optimising Nature's pharmacies: plant chemicals and pollinator health at the landscape scale
优化大自然的药房:景观尺度上的植物化学物质和传粉媒介健康
  • 批准号:
    NE/Y000285/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Research Grant
EDIBLES: Environmentally Driven Body-Scale Electromagnetic Co-Sensing
食用:环境驱动的人体规模电磁协同感应
  • 批准号:
    EP/Y002008/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Research Grant
SBIR Phase I: Industrial-Scale Technology for Drug Development in Mature Human Fat Cells
SBIR 第一阶段:成熟人类脂肪细胞药物开发的工业规模技术
  • 批准号:
    2322443
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Standard Grant
CSR: Small: Multi-FPGA System for Real-time Fraud Detection with Large-scale Dynamic Graphs
CSR:小型:利用大规模动态图进行实时欺诈检测的多 FPGA 系统
  • 批准号:
    2317251
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Standard Grant
Traversing the Gray Zone with Scale-aware Turbulence Closures
通过尺度感知的湍流闭合穿越灰色区域
  • 批准号:
    2337399
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了