Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning

用于机器学习和决策理论人工智能规划的连续决策图

基本信息

  • 批准号:
    RGPIN-2016-05705
  • 负责人:
  • 金额:
    $ 3.35万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2019
  • 资助国家:
    加拿大
  • 起止时间:
    2019-01-01 至 2020-12-31
  • 项目状态:
    已结题

项目摘要

A key challenge in both Machine Learning and Decision-theoretic AI Planning is the inability of existing methods to efficiently and accurately reason about piecewise continuous functions. Such functions arise in diverse tasks such as preference learning and real-time optimization of traffic signals. For example, in the latter application area, optimal planners must reason about piecewise continuous bursts of traffic flow that occur when signals change. The proposed research program directly attacks this challenge through the further development of continuous decision diagrams and their application to problems ranging from preference learning and elicitation critical for online commerce to optimized traffic signal control critical for highly congested urban environments.****Continuous decision diagrams such as the extended algebraic decision diagram (XADD) were invented by the author to address deficiencies in compactly representing and performing efficient closed-form computation with piecewise continuous functions. XADDs have achieved some of the first exact solutions to learning, inference and decision-making problems in piecewise graphical models and (partially observed) Markov decision processes (PO)(MDPs). However, XADD use is currently limited to (a) relatively small problems and (b) highly restricted classes of piecewise continuous functions.****This proposal significantly advances the expressiveness and scalability of XADDs for both exact and bounded approximate Machine Learning and Decision-theoretic AI Planning with piecewise continuous functions along the following technical research thrusts:****Thrust 1 -- Compact, Expressive Representations for XADDs. We will develop novel expressive classes of XADDs and bounded approximation schemes to support improved tractability and scalability over the existing XADD.****Thrust 2 -- Scalable, Expressive Learning and Inference with XADDs. We will leverage XADDs to develop novel message-passing and Markov Chain Monte Carlo (MCMC) learning and inference algorithms to overcome existing tractability and expressiveness drawbacks.****Thrust 3 -- Enhanced Decision-theoretic AI Planning with XADDs. We will leverage extensions of the XADD to develop novel dynamic programming solutions and compact mixed-integer linear programming (MILP) compilations of piecewise continuous (PO)MDPs yielding substantial improvements in both model expressivity and solution tractability.****Industrial collaborations will serve as a motivator and testbed for the research. Specifically, the research will be grounded in (i) personalized online e-book search via an ongoing collaboration with Kobo, Inc. and (ii) in traffic modeling, prediction, and signal control studies in collaboration with the University of Toronto Intelligent Transportation Systems Centre and Testbed.**
机器学习和决策理论人工智能规划的一个关键挑战是现有方法无法有效和准确地推断分段连续函数。这些功能出现在各种任务中,如偏好学习和交通信号的实时优化。例如,在后一个应用领域,最优规划者必须对信号变化时出现的分段连续交通流进行推理。拟议的研究计划通过进一步发展连续决策图及其应用于从在线商务关键的偏好学习和启发到高度拥挤的城市环境关键的优化交通信号控制等问题,直接应对这一挑战。****连续决策图,如扩展代数决策图(XADD)是由作者发明的,以解决在用分段连续函数紧凑地表示和执行有效的封闭形式计算方面的缺陷。xadd已经在分段图形模型和(部分观察到的)马尔可夫决策过程(PO)(mdp)中实现了一些学习、推理和决策问题的首批精确解决方案。然而,XADD的使用目前仅限于(a)相对较小的问题和(b)分段连续函数的高度受限类。****该提案显著提高了xadd的表达性和可扩展性,用于精确和有界近似机器学习和决策理论AI规划,具有分段连续函数,沿着以下技术研究方向:****Thrust 1—xadd的紧凑,表达性表示。我们将开发新的XADD表达类和有界近似方案,以支持在现有XADD上改进的可跟踪性和可伸缩性。****Thrust 2—使用xadd进行可扩展、表达性学习和推理。我们将利用xadd开发新的消息传递和马尔可夫链蒙特卡罗(Markov Chain Monte Carlo, MCMC)学习和推理算法,以克服现有的可跟踪性和表达性缺点。****推力3—增强决策理论AI规划与xadd。我们将利用XADD的扩展来开发新的动态规划解决方案和分段连续(PO) mdp的紧凑混合整数线性规划(MILP)编译,从而在模型表达性和解决方案可追溯性方面得到实质性改进。****工业合作将成为这项研究的动力和试验台。具体而言,该研究将基于(1)通过与Kobo公司的持续合作进行个性化在线电子书搜索,以及(2)与多伦多大学智能交通系统中心和试验台合作进行交通建模、预测和信号控制研究

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Sanner, Scott其他文献

Evaluation of Machine Learning Algorithms for Predicting Readmission After Acute Myocardial Infarction Using Routinely Collected Clinical Data
  • DOI:
    10.1016/j.cjca.2019.10.023
  • 发表时间:
    2020-06-01
  • 期刊:
  • 影响因子:
    6.2
  • 作者:
    Gupta, Shagun;Ko, Dennis T.;Sanner, Scott
  • 通讯作者:
    Sanner, Scott
Online continual learning in image classification: An empirical survey
  • DOI:
    10.1016/j.neucom.2021.10.021
  • 发表时间:
    2021-11-05
  • 期刊:
  • 影响因子:
    6
  • 作者:
    Mai, Zheda;Li, Ruiwen;Sanner, Scott
  • 通讯作者:
    Sanner, Scott
Relevance- and interface-driven clustering for visual information retrieval
  • DOI:
    10.1016/j.is.2020.101592
  • 发表时间:
    2020-12-01
  • 期刊:
  • 影响因子:
    3.7
  • 作者:
    Bouadjenek, Mohamed Reda;Sanner, Scott;Du, Yihao
  • 通讯作者:
    Du, Yihao
A longitudinal study of topic classification on Twitter.
Twitter上的主题分类的纵向研究。
  • DOI:
    10.7717/peerj-cs.991
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    3.8
  • 作者:
    Bouadjenek, Mohamed Reda;Sanner, Scott;Iman, Zahra;Xie, Lexing;Shi, Daniel Xiaoliang
  • 通讯作者:
    Shi, Daniel Xiaoliang
Comparison of machine learning models for occupancy prediction in residential buildings using connected thermostat data
  • DOI:
    10.1016/j.buildenv.2019.106177
  • 发表时间:
    2019-08-01
  • 期刊:
  • 影响因子:
    7.4
  • 作者:
    Huchuk, Brent;Sanner, Scott;O'Brien, William
  • 通讯作者:
    O'Brien, William

Sanner, Scott的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Sanner, Scott', 18)}}的其他基金

Unifying Recent Advances in Deep Learning with Decision-theoretic Planning for Learned MDPs and POMDPs
将深度学习的最新进展与学习 MDP 和 POMDP 的决策理论规划相结合
  • 批准号:
    RGPIN-2022-04377
  • 财政年份:
    2022
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2021
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2020
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Machine learning for residential building HVAC analytics platform
用于住宅建筑 HVAC 分析平台的机器学习
  • 批准号:
    508857-2017
  • 财政年份:
    2020
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Collaborative Research and Development Grants
Machine learning for residential building HVAC analytics platform
用于住宅建筑 HVAC 分析平台的机器学习
  • 批准号:
    508857-2017
  • 财政年份:
    2019
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Collaborative Research and Development Grants
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2018
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Machine Learning, Sentiment, and Social Media Analysis for Financial Analytics
用于财务分析的机器学习、情绪和社交媒体分析
  • 批准号:
    531275-2018
  • 财政年份:
    2018
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Engage Grants Program
Machine learning for residential building HVAC analytics platform
用于住宅建筑 HVAC 分析平台的机器学习
  • 批准号:
    508857-2017
  • 财政年份:
    2018
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Collaborative Research and Development Grants
Machine learning for residential building HVAC analytics platform
用于住宅建筑 HVAC 分析平台的机器学习
  • 批准号:
    508857-2017
  • 财政年份:
    2017
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Collaborative Research and Development Grants
Deep Unsupervised Learning for Network Anomaly Detection
用于网络异常检测的深度无监督学习
  • 批准号:
    514078-2017
  • 财政年份:
    2017
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Engage Grants Program

相似国自然基金

Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    万元
  • 项目类别:
    合作创新研究团队

相似海外基金

SHF: Medium: Improving the Efficiency and Applicability of Decision Diagrams
SHF:中:提高决策图的效率和适用性
  • 批准号:
    2212142
  • 财政年份:
    2022
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Standard Grant
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2021
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2020
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Optimization with Decision Diagrams: Theory and Applications
使用决策图进行优化:理论与应用
  • 批准号:
    RGPIN-2015-04152
  • 财政年份:
    2019
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Decision Diagrams for 0-1 Polynomial Programming
0-1 多项式规划的决策图
  • 批准号:
    540367-2019
  • 财政年份:
    2019
  • 资助金额:
    $ 3.35万
  • 项目类别:
    University Undergraduate Student Research Awards
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2018
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Optimization with Decision Diagrams: Theory and Applications
使用决策图进行优化:理论与应用
  • 批准号:
    RGPIN-2015-04152
  • 财政年份:
    2018
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2017
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Optimization with Decision Diagrams: Theory and Applications
使用决策图进行优化:理论与应用
  • 批准号:
    RGPIN-2015-04152
  • 财政年份:
    2017
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
Continuous Decision Diagrams for Machine Learning and Decision-theoretic AI Planning
用于机器学习和决策理论人工智能规划的连续决策图
  • 批准号:
    RGPIN-2016-05705
  • 财政年份:
    2016
  • 资助金额:
    $ 3.35万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了