Repetitions in Words: Branching Out from Dejean's theorem

文字中的重复:德让定理的分支

基本信息

  • 批准号:
    RGPIN-2021-04084
  • 负责人:
  • 金额:
    $ 0.78万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2021
  • 资助国家:
    加拿大
  • 起止时间:
    2021-01-01 至 2022-12-31
  • 项目状态:
    已结题

项目摘要

The proposed research is in a relatively new field of mathematics called combinatorics on words. A word is a sequence of letters taken from some finite alphabet. Relevant examples are the long binary strings read by computers, or the long sequences of DNA/RNA that are analysed by biologists. Broadly speaking, combinatorics on words is the study of patterns and structures in words. In recent years, computers have played an increasingly important role in the field, both for generating experimental evidence, and as tools for proving results. At the same time, the need to understand the structure of long words has become a key challenge with the rise of big data. This has led to several practical applications of results from combinatorics on words in diverse areas including bioinformatics, text and natural language processing, communication technology, and crystallography. The prototypical example of a pattern studied in combinatorics on words is the square. A square is a word of the form xx, where x is a nonempty word. For example, the English word murmur is a square. A word is square-free if it contains no squares as factors. For example, the English words apple and banana are not square-free, since they contain the squares pp and anan as factors, respectively, while the word cantaloupe is square-free. In the early twentieth century, Axel Thue established the surprising result that there are arbitrarily long square-free words over alphabets with just three letters. Squares fit into the more general theme of repetitions in words. Squares have exponent 2, since they are made up of a shorter word that is repeated exactly twice. The English word alfalfa has exponent 7/3, as it can be formed by repeating the shorter word alf exactly 7/3 times. While Thue showed that there are arbitrarily long square-free words over three letters, a stronger result is actually true - there are arbitrarily long words over three letters that contain no factor of exponent greater than 7/4. In fact, this is best possible, since every long enough word on three letters contains a factor of exponent at least 7/4. This says that the repetition threshold for three letters is 7/4. Dejean's theorem, which was proven through the work of many authors over approximately 40 years, gives the value of the repetition threshold for every possible alphabet size. In other words, over any fixed alphabet, Dejean's theorem describes exactly the repeitions that can be avoided, and the repetitions that must inevitably occur in long enough words. The proposed research will make progress on several strengthenings and variations of Dejean's theorem through both theoretical and computational methods. This work will expand our knowledge of repetitions in words, and will lead to the development of new methods and techniques in combinatorics on words. This work has anticipated theoretical applications in game theory and graph theory, and the potential for practical applications.
这项被提议的研究是在一个相对较新的数学领域,称为单词组合学。单词是从某个有限的字母表中取出的字母序列。相关的例子是由计算机读取的长二进制字符串,或由生物学家分析的长DNA/RNA序列。从广义上讲,词的组合学是研究词的模式和结构。近年来,计算机在该领域扮演着越来越重要的角色,无论是在生成实验证据方面,还是在作为证明结果的工具方面。与此同时,随着大数据的兴起,理解长词结构的需求已成为一项关键挑战。这导致了组合学的结果在不同领域的几个实际应用,包括生物信息学,文本和自然语言处理,通信技术和晶体学。在单词组合学中研究的模式的典型例子是正方形。正方形是形式为xx的单词,其中x是一个非空单词。例如,英语单词“杂音”是一个正方形。如果一个单词不包含作为因数的平方,那么它就是无平方的。例如,英语单词apple和banana不是无方形的,因为它们分别包含正方形pp和anan作为因子,而单词cantaloupe是无方形的。在20世纪早期,阿克塞尔·图伊(Axel Thue)建立了一个令人惊讶的结果,即在只有三个字母的字母表上存在任意长的无方形单词。方形符合单词重复的更普遍的主题。正方形的指数是2,因为它们是由一个恰好重复两次的较短单词组成的。英语单词alfalfa的指数是7/3,因为它可以通过重复较短的单词half正好7/3次而形成。虽然Thue证明了存在超过三个字母的任意长无平方词,但一个更强大的结果实际上是正确的——存在超过三个字母的任意长词,其指数因子不大于7/4。事实上,这是最好的选择,因为每个足够长的单词都包含至少7/4的指数因子。这说明三个字母的重复阈值是7/4。德让定理(Dejean’s theorem)是经过许多作者近40年的努力证明的,它给出了每种可能字母表大小的重复阈值。换句话说,在任何固定的字母表中,德让定理精确地描述了可以避免的重复,以及在足够长的单词中不可避免地出现的重复。本研究将从理论和计算两方面对德让定理进行若干强化和改进。这项工作将扩大我们对单词重复的认识,并将导致单词组合学新方法和新技术的发展。这项工作预期了博弈论和图论的理论应用,以及实际应用的潜力。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Mol, Lucas其他文献

Mol, Lucas的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Mol, Lucas', 18)}}的其他基金

Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
  • 批准号:
    RGPIN-2021-04084
  • 财政年份:
    2022
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Discovery Grants Program - Individual
Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
  • 批准号:
    RGPIN-2021-04084
  • 财政年份:
    2021
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Discovery Grants Program - Individual
Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
  • 批准号:
    DGECR-2021-00304
  • 财政年份:
    2021
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Discovery Launch Supplement
Seepage in Directed Acyclic Graphs
有向无环图中的渗流
  • 批准号:
    425438-2012
  • 财政年份:
    2014
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Seepage in Directed Acyclic Graphs
有向无环图中的渗流
  • 批准号:
    425438-2012
  • 财政年份:
    2013
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Seepage in Directed Acyclic Graphs
有向无环图中的渗流
  • 批准号:
    425438-2012
  • 财政年份:
    2012
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Seepage in directed acyclic graphs
有向无环图中的渗流
  • 批准号:
    408910-2011
  • 财政年份:
    2011
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Master's

相似海外基金

Lost For Words: Cognitive Ageing And Language Control In Bilingual Older Adults With And Without Cognitive Impairment
失语:有或没有认知障碍的双语老年人的认知衰老和语言控制
  • 批准号:
    EP/Y036522/1
  • 财政年份:
    2024
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Research Grant
Exploratory research on "words" and "concepts" for cross-curricular citizenship education
跨课程公民教育的“言语”与“观念”探索性研究
  • 批准号:
    23K17613
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
An empirical study of the borrowing process of foreign words in Japanese from the perspective of English and French phonetics
英法语音学视角下日语外来词借用过程的实证研究
  • 批准号:
    23K00549
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Plaintive Words: Legal and Literary Complaints, 1550-1625
哀怨的话语:法律和文学的抱怨,1550-1625
  • 批准号:
    2877040
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Studentship
MiDiario: Mobile Intervention for Diabetes via Reflection and Introspection in My Own Words
MiDiario:用我自己的话反思和自省的糖尿病移动干预
  • 批准号:
    10698632
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
Co-designing cultural adaptations of the 'F-words for Child Development': Advancing knowledge and improving the cultural relevance, safety, and inclusivity of F-words-based services in two child/family health organizations in Ontario and Manitoba
共同设计“儿童发展的脏话”的文化适应:在安大略省和马尼托巴省的两个儿童/家庭健康组织中促进知识并提高基于脏话的服务的文化相关性、安全性和包容性
  • 批准号:
    484638
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Fellowship Programs
A Diachronic study of the word base of the Japanese Kango words and a study about the derivation of the modern new Sino-Japanese words through it.
日语暗语词基的历时研究及现代汉日新词的衍生研究
  • 批准号:
    23K00559
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
What percentage of words (tokens) should learners know to comprehend listening materials? An updated examination for teachers and researchers concerning input mode, meaning senses of words, and genre
学习者应该知道多少百分比的单词(标记)才能理解听力材料?
  • 批准号:
    23K00712
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The first English speakers in their own words
第一个说英语的人用自己的话来说
  • 批准号:
    DP230101057
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
    Discovery Projects
Overcoming early childhood caries disease burden through the words of mothers (WOM project)
通过母亲的话语克服儿童早期龋齿疾病负担(WOM项目)
  • 批准号:
    10810957
  • 财政年份:
    2023
  • 资助金额:
    $ 0.78万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了