Computational Analysis of Short Repetitive Motifs in DNA Sequences

DNA 序列中短重复基序的计算分析

基本信息

  • 批准号:
    7196374
  • 负责人:
  • 金额:
    $ 7.4万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2007
  • 资助国家:
    美国
  • 起止时间:
    2007-05-01 至 2009-04-30
  • 项目状态:
    已结题

项目摘要

DESCRIPTION (provided by applicant): Computational analysis of various aspects of gene regulation, transcription factor binding in particular, is an important and well known problem. Adequately addressed, it would greatly improve our understanding of diseases and help with the development of treatments. However, despite the intensive efforts and the application of sophisticated models, the identification of the binding motifs in DMA sequences remains elusive. The raw sequence likely carries only a part of the regulatory signal, and it is often too short and subtle to be detected even by the most sensitive algorithms. Many software tools developed for this purpose exploit the clustering and over-representation of motifs in promoter regions, sometimes combining this method with other experimental or phylogenetic information. However, the fact that many short sequences appear to be over-represented in any segment of DNA, at least in comparison with completely random model, impedes the reliable discovery. This proposal seeks support to develop new software and apply it to the identification, visualization and analysis of repeated short (approximately 5-25 bases) degenerate motifs, in short (a few hundred bases) and long (entire chromosomes) DNA sequences. We intend to use this software on the human and other genomes, as well as on sequences of interest to our collaborators in biology and chemistry, in an attempt to systematically characterize short over-represented sequences. We shall determine which of these motifs correspond to the experimentally confirmed transcription factor binding consensuses, study their phylogenetic conservation and investigate their possible association with repeat families. Special attention will be paid to the upstream sequences of genes, and tools will be developed for a genome-wide search for related motif layouts. Our software will be based on an adaptation of classic string processing algorithms to address the inexact matches in a novel way, by combining the seed elements into statistically significant degenerate motifs. In addition to performing analysis with our collaborators, we will place the programs in the public domain, along with the other tools which we have already developed and published, inviting other investigators to use them on their own data.
描述(由申请人提供): 计算分析基因调控的各个方面,特别是转录因子结合,是一个重要的和众所周知的问题。如果处理得当,它将极大地提高我们对疾病的了解,并有助于开发治疗方法。然而,尽管进行了大量的努力并应用了复杂的模型,但对DMA序列中的结合基序的识别仍然难以捉摸。原始序列可能只携带部分调控信号,而且它往往太短太微妙,即使是最敏感的算法也无法检测到。为此目的开发的许多软件工具利用启动子区域中基序的聚集和过度表达,有时将这种方法与其他实验或系统发育信息结合起来。然而,至少与完全随机的模型相比,许多短序列似乎在DNA的任何片段中过度表达的事实阻碍了可靠的发现。 这项提议寻求支持开发新的软件,并将其应用于识别、可视化和分析重复的短(约5-25个碱基)简并基序,即短(数百个碱基)和长(整个染色体)DNA序列。我们打算在人类和其他基因组以及我们的生物和化学合作者感兴趣的序列上使用这个软件,试图系统地描述短的过度表达的序列。我们将确定这些基序中的哪些与实验确认的转录因子结合共识相对应,研究它们的系统发育保守性,并调查它们与重复家族的可能联系。将特别关注基因的上游序列,并将开发工具,在全基因组范围内搜索相关的基序布局。 我们的软件将基于经典字符串处理算法的改编,通过将种子元素组合成统计上显著的退化主题,以一种新颖的方式解决不精确匹配问题。除了与我们的合作者一起进行分析外,我们还将把这些程序与我们已经开发和发布的其他工具一起放在公共领域,邀请其他调查人员在他们自己的数据上使用它们。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Nikola Stojanovic其他文献

Nikola Stojanovic的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

相似海外基金

How novices write code: discovering best practices and how they can be adopted
新手如何编写代码:发现最佳实践以及如何采用它们
  • 批准号:
    2315783
  • 财政年份:
    2023
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Standard Grant
One or Several Mothers: The Adopted Child as Critical and Clinical Subject
一位或多位母亲:收养的孩子作为关键和临床对象
  • 批准号:
    2719534
  • 财政年份:
    2022
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Studentship
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
英法浪漫主义文学中残疾儿童及其收养母亲形象的比较研究
  • 批准号:
    2633211
  • 财政年份:
    2020
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Studentship
A material investigation of the ceramic shards excavated from the Omuro Ninsei kiln site: Production techniques adopted by Nonomura Ninsei.
对大室仁清窑遗址出土的陶瓷碎片进行材质调查:野野村仁清采用的生产技术。
  • 批准号:
    20K01113
  • 财政年份:
    2020
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
英法浪漫主义文学中残疾儿童及其收养母亲形象的比较研究
  • 批准号:
    2436895
  • 财政年份:
    2020
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Studentship
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
英法浪漫主义文学中残疾儿童及其收养母亲形象的比较研究
  • 批准号:
    2633207
  • 财政年份:
    2020
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Studentship
The limits of development: State structural policy, comparing systems adopted in two European mountain regions (1945-1989)
发展的限制:国家结构政策,比较欧洲两个山区采用的制度(1945-1989)
  • 批准号:
    426559561
  • 财政年份:
    2019
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Research Grants
Securing a Sense of Safety for Adopted Children in Middle Childhood
确保被收养儿童的中期安全感
  • 批准号:
    2236701
  • 财政年份:
    2019
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Studentship
A Study on Mutual Funds Adopted for Individual Defined Contribution Pension Plans
个人设定缴存养老金计划采用共同基金的研究
  • 批准号:
    19K01745
  • 财政年份:
    2019
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Structural and functional analyses of a bacterial protein translocation domain that has adopted diverse pathogenic effector functions within host cells
对宿主细胞内采用多种致病效应功能的细菌蛋白易位结构域进行结构和功能分析
  • 批准号:
    415543446
  • 财政年份:
    2019
  • 资助金额:
    $ 7.4万
  • 项目类别:
    Research Fellowships
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了