Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
基本信息
- 批准号:RGPIN-2020-06907
- 负责人:
- 金额:$ 1.68万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2020
- 资助国家:加拿大
- 起止时间:2020-01-01 至 2021-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Suppose that we wish to have a computer program automatically organize a large collection of websites by grouping them according to topic. The program observes that the words “clinical” and “injury” often occur together on the same pages, while the words “baste” and “chop” often occur together on others, thus forming the basis for two categories: medical and cooking websites.
We could define our categories in advance, or we could allow the program to identify categories organically. In the latter case, we would use a non-parametric Bayesian (NPB) clustering algorithm. Such algorithms are based on probabilistic models that are exchangeable, meaning, loosely, that the order in which we receive data is meaningless i.e. the first and second websites are as likely to share the same topic as are the eighth and one hundredth.
The Chinese restaurant process (CRP) is one such model. Imagine customers entering a vast dim sum restaurant with giant tables. The first customer must sit alone. Subsequent customers choose seats at random according to the following rule: the nth customer will join a table with m other customers with probability m/n, or will sit alone with probability 1/n. In our example above, customers and tables represent websites and categories.
It is not obvious, but the CRP is exchangeable: customers 1 and 2 are as likely to sit together as customers 8 and 100.
My research concerns models for random growth or change in exchangeable structures. The CRP is a growth process on exchangeable partitions. Another example is the reseating Chinese restaurant process (RCRP), in which, instead of beginning with an empty restaurant and having new customers enter, we begin with customers already seated, and at each step a randomly chosen customer stands up and randomly chooses a new seat. This can be used as a model for updating our guesses on which websites belong to which categories.
In the next five years, I will address the following problems:
(1) describe, in various ways, the limit of the RCRP on vast numbers of customers, and
(2) introduce and study growth processes, like the CRP, for exchangeable hierarchies: partitions in which we sub-partition the segments repeatedly, in order give models for documents partitioned according to topic, sub-topic, sub-sub-topic, etc..
Problem (1) has been widely studied since 2010. In recent years, I've made major progress towards describing how the numbers of customers at each table change over time, in the limit. I believe that I am close to completing that description. Afterwards, I will work on describing a version in which we can see where each customer is seated.
For problem (2), only one model for growing exchangeable hierarchies has been well-studied: the nested Chinese restaurant process. But I have written two papers that find that exchangeable hierarchies can exhibit other behaviors. I plan to introduce new, more flexible models, which can then be applied to nested clustering problems.
假设我们希望有一个计算机程序通过根据主题对网站进行分组来自动组织大量网站。该程序观察到,“临床”和“伤害”这两个词经常在同一页面上同时出现,而“酱”和“剁”这两个词经常在其他页面上同时出现,从而形成了两类网站的基础:医疗和烹饪网站。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Forman, Noah其他文献
A two-parameter family of measure-valued diffusions with Poisson–Dirichlet stationary distributions
具有泊松狄利克雷平稳分布的二参数测值扩散族
- DOI:
10.1214/21-aap1732 - 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Forman, Noah;Rizzolo, Douglas;Shi, Quan;Winkel, Matthias - 通讯作者:
Winkel, Matthias
Metrics on sets of interval partitions with diversity
具有多样性的区间划分集的度量
- DOI:
10.1214/20-ecp317 - 发表时间:
2020 - 期刊:
- 影响因子:0.5
- 作者:
Forman, Noah;Pal, Soumik;Rizzolo, Douglas;Winkel, Matthias - 通讯作者:
Winkel, Matthias
Projections of the Aldous chain on binary trees: Intertwining and consistency
奥尔德斯链在二叉树上的投影:交织和一致性
- DOI:
10.1002/rsa.20930 - 发表时间:
2020 - 期刊:
- 影响因子:1
- 作者:
Forman, Noah;Pal, Soumik;Rizzolo, Douglas;Winkel, Matthias - 通讯作者:
Winkel, Matthias
Diffusions on a space of interval partitions: the two-parameter model
区间分区空间上的扩散:二参数模型
- DOI:
10.1214/23-ejp946 - 发表时间:
2023 - 期刊:
- 影响因子:1.4
- 作者:
Forman, Noah;Rizzolo, Douglas;Shi, Quan;Winkel, Matthias - 通讯作者:
Winkel, Matthias
Ranked masses in two-parameter Fleming–Viot diffusions
双参数 Fleming-Viot 扩散中的质量排名
- DOI:
10.1090/tran/8764 - 发表时间:
2023 - 期刊:
- 影响因子:1.3
- 作者:
Forman, Noah;Pal, Soumik;Rizzolo, Douglas;Winkel, Matthias - 通讯作者:
Winkel, Matthias
Forman, Noah的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Forman, Noah', 18)}}的其他基金
Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
- 批准号:
RGPIN-2020-06907 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
- 批准号:
RGPIN-2020-06907 - 财政年份:2021
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
- 批准号:
DGECR-2020-00371 - 财政年份:2020
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Launch Supplement
相似海外基金
A novel through-the-scope exchangeable double balloon catheter to guide endoscopic bypass: a practice-changing technology in the management of malignant gastric outlet obstruction
一种新型的通过镜可交换双球囊导管来引导内窥镜旁路:治疗恶性胃出口梗阻的一种改变实践的技术
- 批准号:
498860 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Operating Grants
Managed Exchangeable Solid State Fuel Storage Technology for Rail Vehicles
轨道车辆托管可交换固态燃料存储技术
- 批准号:
10060552 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Collaborative R&D
A novel through-the-scope exchangeable double balloon catheter to guide endoscopic bypass: a practice-changing technology in the management of malignant gastric outlet obstruction
一种新型的通过镜可交换双球囊导管来引导内窥镜旁路:治疗恶性胃出口梗阻的一种改变实践的技术
- 批准号:
489994 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Operating Grants
Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
- 批准号:
RGPIN-2020-06907 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Charting a New Paradigm for Large Non-Exchangeable Multi-Agent and Many-Particle Systems
为大型不可交换多代理和多粒子系统绘制新范式
- 批准号:
2205694 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Standard Grant
Re-evaluation of exchangeable cations in forest soils using Sr and Cs isotopic compositions
使用 Sr 和 Cs 同位素组合物重新评估森林土壤中的可交换阳离子
- 批准号:
22K03741 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
- 批准号:
RGPIN-2020-06907 - 财政年份:2021
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Dynamics in exchangeable partitions and hierarchies
可交换分区和层次结构中的动态
- 批准号:
DGECR-2020-00371 - 财政年份:2020
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Launch Supplement
Bleaching-independent, multi-color, whole-cell STED microscopy using exchangeable fluorophores
使用可交换荧光团的不依赖漂白的多色全细胞 STED 显微镜
- 批准号:
422735238 - 财政年份:2019
- 资助金额:
$ 1.68万 - 项目类别:
Research Grants
Design of ligand-tag exchangeable new photoaffinity probe utilizing nosyl chemistry
利用 nosyl 化学设计配体标签可交换的新型光亲和探针
- 批准号:
18K14350 - 财政年份:2018
- 资助金额:
$ 1.68万 - 项目类别:
Grant-in-Aid for Early-Career Scientists