Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
基本信息
- 批准号:RGPIN-2021-04084
- 负责人:
- 金额:$ 0.53万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2021
- 资助国家:加拿大
- 起止时间:2021-01-01 至 2022-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The proposed research is in a relatively new field of mathematics called combinatorics on words. A word is a sequence of letters taken from some finite alphabet. Relevant examples are the long binary strings read by computers, or the long sequences of DNA/RNA that are analysed by biologists. Broadly speaking, combinatorics on words is the study of patterns and structures in words. In recent years, computers have played an increasingly important role in the field, both for generating experimental evidence, and as tools for proving results. At the same time, the need to understand the structure of long words has become a key challenge with the rise of big data. This has led to several practical applications of results from combinatorics on words in diverse areas including bioinformatics, text and natural language processing, communication technology, and crystallography. The prototypical example of a pattern studied in combinatorics on words is the square. A square is a word of the form xx, where x is a nonempty word. For example, the English word murmur is a square. A word is square-free if it contains no squares as factors. For example, the English words apple and banana are not square-free, since they contain the squares pp and anan as factors, respectively, while the word cantaloupe is square-free. In the early twentieth century, Axel Thue established the surprising result that there are arbitrarily long square-free words over alphabets with just three letters. Squares fit into the more general theme of repetitions in words. Squares have exponent 2, since they are made up of a shorter word that is repeated exactly twice. The English word alfalfa has exponent 7/3, as it can be formed by repeating the shorter word alf exactly 7/3 times. While Thue showed that there are arbitrarily long square-free words over three letters, a stronger result is actually true - there are arbitrarily long words over three letters that contain no factor of exponent greater than 7/4. In fact, this is best possible, since every long enough word on three letters contains a factor of exponent at least 7/4. This says that the repetition threshold for three letters is 7/4. Dejean's theorem, which was proven through the work of many authors over approximately 40 years, gives the value of the repetition threshold for every possible alphabet size. In other words, over any fixed alphabet, Dejean's theorem describes exactly the repeitions that can be avoided, and the repetitions that must inevitably occur in long enough words. The proposed research will make progress on several strengthenings and variations of Dejean's theorem through both theoretical and computational methods. This work will expand our knowledge of repetitions in words, and will lead to the development of new methods and techniques in combinatorics on words. This work has anticipated theoretical applications in game theory and graph theory, and the potential for practical applications.
拟议的研究属于一个相对较新的数学领域,称为单词组合学。单词是取自某个有限字母表的字母序列。相关示例包括计算机读取的长二进制字符串,或生物学家分析的长 DNA/RNA 序列。从广义上讲,单词组合学是对单词模式和结构的研究。近年来,计算机在该领域发挥着越来越重要的作用,既可以生成实验证据,也可以作为证明结果的工具。与此同时,随着大数据的兴起,理解长词结构的需求已成为一个关键挑战。这导致了单词组合学结果在生物信息学、文本和自然语言处理、通信技术和晶体学等不同领域的一些实际应用。单词组合学中研究的模式的典型例子是正方形。方块是 xx 形式的单词,其中 x 是非空单词。例如,英语单词 murmur 是一个正方形。如果一个单词不包含平方作为因子,则该单词是无平方的。例如,英语单词 apple 和 Banana 就不是无平方的,因为它们分别包含平方 pp 和 anan 作为因子,而单词 cantaloupe 则不是无平方的。二十世纪初,Axel Thue 得出了一个令人惊讶的结果:在只有三个字母的字母表上存在任意长的无方形单词。方块符合更普遍的单词重复主题。正方形的指数为 2,因为它们是由恰好重复两次的较短单词组成的。英语单词 alfalfa 的指数为 7/3,因为它可以通过重复较短的单词 alf 恰好 7/3 次来形成。虽然 Thue 表明存在超过三个字母的任意长的无平方单词,但实际上有一个更强的结果是正确的 - 存在超过三个字母的任意长的单词,且不包含大于 7/4 的指数因子。事实上,这是最好的可能,因为三个字母上的每个足够长的单词都包含至少 7/4 的指数因子。这表示三个字母的重复阈值是 7/4。德让定理经过许多作者大约 40 年的工作证明,给出了每种可能的字母大小的重复阈值。换句话说,在任何固定的字母表上,德让定理准确地描述了可以避免的重复,以及在足够长的单词中必然出现的重复。 拟议的研究将通过理论和计算方法在德让定理的几个强化和变化方面取得进展。这项工作将扩展我们对单词重复的了解,并将导致单词组合学新方法和技术的发展。这项工作预见了博弈论和图论中的理论应用以及实际应用的潜力。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Mol, Lucas其他文献
Mol, Lucas的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Mol, Lucas', 18)}}的其他基金
Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
- 批准号:
RGPIN-2021-04084 - 财政年份:2022
- 资助金额:
$ 0.53万 - 项目类别:
Discovery Grants Program - Individual
Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
- 批准号:
DGECR-2021-00304 - 财政年份:2021
- 资助金额:
$ 0.53万 - 项目类别:
Discovery Launch Supplement
Repetitions in Words: Branching Out from Dejean's theorem
文字中的重复:德让定理的分支
- 批准号:
RGPIN-2021-04084 - 财政年份:2021
- 资助金额:
$ 0.53万 - 项目类别:
Discovery Grants Program - Individual
Seepage in Directed Acyclic Graphs
有向无环图中的渗流
- 批准号:
425438-2012 - 财政年份:2014
- 资助金额:
$ 0.53万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Seepage in Directed Acyclic Graphs
有向无环图中的渗流
- 批准号:
425438-2012 - 财政年份:2013
- 资助金额:
$ 0.53万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Seepage in Directed Acyclic Graphs
有向无环图中的渗流
- 批准号:
425438-2012 - 财政年份:2012
- 资助金额:
$ 0.53万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Seepage in directed acyclic graphs
有向无环图中的渗流
- 批准号:
408910-2011 - 财政年份:2011
- 资助金额:
$ 0.53万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Master's
相似海外基金
Lost For Words: Cognitive Ageing And Language Control In Bilingual Older Adults With And Without Cognitive Impairment
失语:有或没有认知障碍的双语老年人的认知衰老和语言控制
- 批准号:
EP/Y036522/1 - 财政年份:2024
- 资助金额:
$ 0.53万 - 项目类别:
Research Grant
Exploratory research on "words" and "concepts" for cross-curricular citizenship education
跨课程公民教育的“言语”与“观念”探索性研究
- 批准号:
23K17613 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Grant-in-Aid for Challenging Research (Exploratory)
An empirical study of the borrowing process of foreign words in Japanese from the perspective of English and French phonetics
英法语音学视角下日语外来词借用过程的实证研究
- 批准号:
23K00549 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Plaintive Words: Legal and Literary Complaints, 1550-1625
哀怨的话语:法律和文学的抱怨,1550-1625
- 批准号:
2877040 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Studentship
MiDiario: Mobile Intervention for Diabetes via Reflection and Introspection in My Own Words
MiDiario:用我自己的话反思和自省的糖尿病移动干预
- 批准号:
10698632 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Co-designing cultural adaptations of the 'F-words for Child Development': Advancing knowledge and improving the cultural relevance, safety, and inclusivity of F-words-based services in two child/family health organizations in Ontario and Manitoba
共同设计“儿童发展的脏话”的文化适应:在安大略省和马尼托巴省的两个儿童/家庭健康组织中促进知识并提高基于脏话的服务的文化相关性、安全性和包容性
- 批准号:
484638 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Fellowship Programs
A Diachronic study of the word base of the Japanese Kango words and a study about the derivation of the modern new Sino-Japanese words through it.
日语暗语词基的历时研究及现代汉日新词的衍生研究
- 批准号:
23K00559 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
What percentage of words (tokens) should learners know to comprehend listening materials? An updated examination for teachers and researchers concerning input mode, meaning senses of words, and genre
学习者应该知道多少百分比的单词(标记)才能理解听力材料?
- 批准号:
23K00712 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
The first English speakers in their own words
第一个说英语的人用自己的话来说
- 批准号:
DP230101057 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:
Discovery Projects
Overcoming early childhood caries disease burden through the words of mothers (WOM project)
通过母亲的话语克服儿童早期龋齿疾病负担(WOM项目)
- 批准号:
10810957 - 财政年份:2023
- 资助金额:
$ 0.53万 - 项目类别:














{{item.name}}会员




