SGER: Algorithmic Challenges in Computational Biology

SGER:计算生物学中的算法挑战

基本信息

  • 批准号:
    0305444
  • 负责人:
  • 金额:
    $ 7.5万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2003
  • 资助国家:
    美国
  • 起止时间:
    2003-05-15 至 2006-04-30
  • 项目状态:
    已结题

项目摘要

EIA-0305444Bruce R DonaldDartmouth CollegeProject SummaryThe expression patterns of many genes associated with circannual (yearly), circadian (daily), cell-cycle and other periodic biological processes are known to be rhythmic. Conversely, the expression profiles of genes associated with aperiodic biological processes (e.g., tissue repair) are not rhythmic. The functional significance of previously uncharacterized genes, therefore, may be inferred if they exhibit rhythmic patterns of expression synchronized to some ongoing biological process.DNA microarray experiments are an effective tool for identifying rhythmic genes when a time-series of expression levels are collected. Unlike Northern blots and real-time PCR, which study one gene at a time, DNA microarray hybridization experiments can reveal the expression patterns of entire genomes. Chronobiologists are therefore able to assign putative functional properties to large numbers of genes based on the results of a single experiment. However, the large volume of data generated by hybridization experiments makes manual inspection of individual expression profiles impractical. Separating the subset of genes whose expression profiles are rhythmic from the thousands or tens of thousands that are not requires computer assistance. Ideally, the algorithms for providing such assistance should be efficient and have well-understood performance guarantees.We propose to design and implement algorithms to identify and characterize the properties of rhythmic genes from DNA microarray hybridization time-series data. Our approach will build on our recent papers in {\em The International Conference on Research in Computational Molecular Biology (RECOMB)}~\cite{recomb02} and the {\em IEEE Computer Society Bioinformatics Conference}~\cite{csb02}. We will specifically addresses issues of computational complexity, statistical significance and morphological similarity. We hope our techniques will aid efforts in functional genomics, by developing new algorithmic techniques for the analysis of massively-parallel DNA microarray expression data.We will develop a model-based analysis technique for extracting and characterizing rhythmic expression profiles from genome-wide DNA microarray hybridization data. Our approach, called {\sc rage}(Rhythmic Analysis of Gene Expression), decouples the problems of estimating a pattern's wavelength and phase. Specifically (I) we propose the {\em autocorrelation} to render our search algorithm phase-independent, and (II) we propose the {\em Hausdorff distance} to measure the similarity of the autocorrelated signals. By attacking the problem of microarray gene-expression time-series analysis using these new methods, we hope to strengthen the computational armamentarium of the chronobiologist. Our {\sc rage} algorithm is linear-time in frequency and phase resolution, an improvement over previous quadratic-time approaches. Unlike previous approaches, {\sc rage} uses a true distance metric for measuring expression profile similarity, based on the Hausdorff distance. This results in better clustering of expression profiles for rhythmic analysis. The confidence of each frequency estimate is computed using $Z$-scores. In preliminary results,{\sc rage}performed better than competing techniques on synthetic and actual DNA microarray hybridization data. Employing results on combinatorial bounds for Voronoi diagrams~\cite{Huttenlocher}, we can replace the discretized phase search in our method with an exact (combinatorially precise) phase search~\cite{recomb02}, resulting in a faster algorithm with no complexity dependence on phase resolution. Thus, one emphasis of this proposal is the development of combinatorially-precise, provable algorithms for analyzing expression patterns.Surprisingly, maximum entropy spectral analysis (MESA) has not been applied to massively-parallel gene expression time-series analysis before. Therefore, we will also develop a maximum entropy-based analysis technique for extracting and characterizing rhythmic expression profiles from DNA microarray hybridization data. This approach, called {\sc enrage} (Entropy-based Rhythmic Analysis of Gene Expression), treats the task of estimating an expression profile's periodicity and phase as a simultaneous bicriterion optimization problem. Specifically, a frequency domain spectrum is reconstructed from a time-series of gene expression data, subject to two constraints: (a) the likelihood of the spectrum and (b) the Shannon entropy of the reconstructed spectrum. Unlike Fourier-based spectral analysis, maximum entropy spectral reconstruction is well-suited to signals of the type generated in DNA microarray experiments. The {\sc enrage} algorithm is optimal, running in linear time in the number of expression profiles. Moreover, a preliminary implementation of our algorithm runs an order of magnitude faster than previous methods. In preliminary results, we found that {\sc enrage} performed better than previous methods in identifying and characterizing periodic expression profiles on both synthetic and actual DNA microarray hybridization data. Thus, a second thrust of this proposal is the development of novel signal-processing approaches to analyze gene expression patterns, and their integration with combinatorial algorithms from computational geometry.
EIA-0305444Bruce R DonaldDartmouth CollegeProject SummaryThe expression patterns of many genes associated with circannual (yearly), circadian (daily), cell-cycle and other periodic biological processes are known to be rhythmic. Conversely, the expression profiles of genes associated with aperiodic biological processes (e.g., tissue repair) are not rhythmic. The functional significance of previously uncharacterized genes, therefore, may be inferred if they exhibit rhythmic patterns of expression synchronized to some ongoing biological process.DNA microarray experiments are an effective tool for identifying rhythmic genes when a time-series of expression levels are collected. Unlike Northern blots and real-time PCR, which study one gene at a time, DNA microarray hybridization experiments can reveal the expression patterns of entire genomes. Chronobiologists are therefore able to assign putative functional properties to large numbers of genes based on the results of a single experiment. However, the large volume of data generated by hybridization experiments makes manual inspection of individual expression profiles impractical. Separating the subset of genes whose expression profiles are rhythmic from the thousands or tens of thousands that are not requires computer assistance. Ideally, the algorithms for providing such assistance should be efficient and have well-understood performance guarantees.We propose to design and implement algorithms to identify and characterize the properties of rhythmic genes from DNA microarray hybridization time-series data. Our approach will build on our recent papers in {\em The International Conference on Research in Computational Molecular Biology (RECOMB)}~\cite{recomb02} and the {\em IEEE Computer Society Bioinformatics Conference}~\cite{csb02}. We will specifically addresses issues of computational complexity, statistical significance and morphological similarity. We hope our techniques will aid efforts in functional genomics, by developing new algorithmic techniques for the analysis of massively-parallel DNA microarray expression data.We will develop a model-based analysis technique for extracting and characterizing rhythmic expression profiles from genome-wide DNA microarray hybridization data. Our approach, called {\sc rage}(Rhythmic Analysis of Gene Expression), decouples the problems of estimating a pattern's wavelength and phase. Specifically (I) we propose the {\em autocorrelation} to render our search algorithm phase-independent, and (II) we propose the {\em Hausdorff distance} to measure the similarity of the autocorrelated signals. By attacking the problem of microarray gene-expression time-series analysis using these new methods, we hope to strengthen the computational armamentarium of the chronobiologist. Our {\sc rage} algorithm is linear-time in frequency and phase resolution, an improvement over previous quadratic-time approaches. Unlike previous approaches, {\sc rage} uses a true distance metric for measuring expression profile similarity, based on the Hausdorff distance. This results in better clustering of expression profiles for rhythmic analysis. The confidence of each frequency estimate is computed using $Z$-scores. In preliminary results,{\sc rage}performed better than competing techniques on synthetic and actual DNA microarray hybridization data. Employing results on combinatorial bounds for Voronoi diagrams~\cite{Huttenlocher}, we can replace the discretized phase search in our method with an exact (combinatorially precise) phase search~\cite{recomb02}, resulting in a faster algorithm with no complexity dependence on phase resolution. Thus, one emphasis of this proposal is the development of combinatorially-precise, provable algorithms for analyzing expression patterns.Surprisingly, maximum entropy spectral analysis (MESA) has not been applied to massively-parallel gene expression time-series analysis before. Therefore, we will also develop a maximum entropy-based analysis technique for extracting and characterizing rhythmic expression profiles from DNA microarray hybridization data. This approach, called {\sc enrage} (Entropy-based Rhythmic Analysis of Gene Expression), treats the task of estimating an expression profile's periodicity and phase as a simultaneous bicriterion optimization problem. Specifically, a frequency domain spectrum is reconstructed from a time-series of gene expression data, subject to two constraints: (a) the likelihood of the spectrum and (b) the Shannon entropy of the reconstructed spectrum. Unlike Fourier-based spectral analysis, maximum entropy spectral reconstruction is well-suited to signals of the type generated in DNA microarray experiments. The {\sc enrage} algorithm is optimal, running in linear time in the number of expression profiles. Moreover, a preliminary implementation of our algorithm runs an order of magnitude faster than previous methods. In preliminary results, we found that {\sc enrage} performed better than previous methods in identifying and characterizing periodic expression profiles on both synthetic and actual DNA microarray hybridization data. Thus, a second thrust of this proposal is the development of novel signal-processing approaches to analyze gene expression patterns, and their integration with combinatorial algorithms from computational geometry.

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Bruce Donald其他文献

'Myths and facts' education is comparable to 'facts only' for recall of back pain information but may improve fear-avoidance beliefs: an embedded randomized trial.
在回忆背痛信息方面,“神话与事实”教育与“仅事实”教育相当,但可能会提高避免恐惧的信念:一项嵌入式随机试验。
  • DOI:
    10.2519/jospt.2022.10989
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Priscilla Viana da Silva;S. Kamper;E. Robson;S. Davidson;C. Gleadhill;Bruce Donald;Tiê Parma Yamato;Erin Nolan;Hopin Lee;C. Williams
  • 通讯作者:
    C. Williams
Predicting the Effect of Mutations in the KRas/c-Raf-RBD Protein-Protein Interface
  • DOI:
    10.1016/j.bpj.2017.11.3151
  • 发表时间:
    2018-02-02
  • 期刊:
  • 影响因子:
  • 作者:
    Anna Lowegard;Marcel Frenkel;Bruce Donald
  • 通讯作者:
    Bruce Donald

Bruce Donald的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Bruce Donald', 18)}}的其他基金

Postdoctoral: Physical Geometric Algorithms and Systems for High-Throughput NMR Structural Biology
博士后:高通量核磁共振结构生物学的物理几何算法和系统
  • 批准号:
    0102710
  • 财政年份:
    2001
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
Postdoctoral: Physical Geometric Algorithms and Systems for Structural Biology Using Mass Spectrometry
博士后:利用质谱进行结构生物学的物理几何算法和系统
  • 批准号:
    0102712
  • 财政年份:
    2001
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
CISE Postdoctoral Research Associates in Experimental Computer Science: Challenges in Micromanipulation: Massively Parallel MEMS Algorithms and Systems
CISE 实验计算机科学博士后研究员:微操作的挑战:大规模并行 MEMS 算法和系统
  • 批准号:
    9901407
  • 财政年份:
    1999
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
REU: MEMS Algorithms and Systems for Distributed Manipulation
REU:分布式操纵的 MEMS 算法和系统
  • 批准号:
    9906790
  • 财政年份:
    1999
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Continuing Grant
CISE PostDoc: Microrobotics and MEMS
CISE 博士后:微型机器人和 MEMS
  • 批准号:
    9805548
  • 财政年份:
    1998
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
Distributed Manipulation Algorithms for Massively Parallel Microfabricated Actuator Arrays
大规模并行微加工执行器阵列的分布式操纵算法
  • 批准号:
    9896020
  • 财政年份:
    1997
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Continuing Grant
Distributed Manipulation Algorithms for Massively Parallel Microfabricated Actuator Arrays
大规模并行微加工执行器阵列的分布式操纵算法
  • 批准号:
    9530785
  • 财政年份:
    1996
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Continuing grant
A Computational Approach to the Design of Dynamic Microelectronmechanial Structures
动态微机电结构设计的计算方法
  • 批准号:
    9403903
  • 财政年份:
    1994
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
Task-Level Planning and Task-Directed Sensing for Robots in Uncertain Environments
不确定环境中机器人的任务级规划和任务导向感知
  • 批准号:
    9201699
  • 财政年份:
    1993
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Continuing grant
CISE Research Instrumentation: Research in Mobile Autonomous Robotic Motion, Sensing, and Planning in Unstructured Environment
CISE 研究仪器:非结构化环境中的移动自主机器人运动、传感和规划研究
  • 批准号:
    9121783
  • 财政年份:
    1992
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant

相似海外基金

NSF-BSF: AF: Small: Algorithmic and Information-Theoretic Challenges in Causal Inference
NSF-BSF:AF:小:因果推理中的算法和信息论挑战
  • 批准号:
    2321079
  • 财政年份:
    2023
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
CAREER: Addressing Algorithmic Challenges in Computational Genomic Epidemiology
职业:解决计算基因组流行病学中的算法挑战
  • 批准号:
    2415564
  • 财政年份:
    2023
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Continuing Grant
Addressing algorithmic and data challenges to deep learning based segmentation of spine anatomy
解决基于深度学习的脊柱解剖分割的算法和数据挑战
  • 批准号:
    10367207
  • 财政年份:
    2022
  • 资助金额:
    $ 7.5万
  • 项目类别:
CAREER: Addressing Algorithmic Challenges in Computational Genomic Epidemiology
职业:解决计算基因组流行病学中的算法挑战
  • 批准号:
    2047828
  • 财政年份:
    2021
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Continuing Grant
AF: Small: Algorithmic and Market Design Challenges in Cloud Computing
AF:小:云计算中的算法和市场设计挑战
  • 批准号:
    2110707
  • 财政年份:
    2021
  • 资助金额:
    $ 7.5万
  • 项目类别:
    Standard Grant
Stakeholder Guidance to Anticipate and Address Ethical Challenges in Applications of Machine Learning and Artificial Intelligence in Algorithmic Medicine: a Novel Empirical Approach
利益相关者指导预测和解决机器学习和人工智能在算法医学中的应用中的伦理挑战:一种新颖的经验方法
  • 批准号:
    10367404
  • 财政年份:
    2021
  • 资助金额:
    $ 7.5万
  • 项目类别:
Stakeholder Guidance to Anticipate and Address Ethical Challenges in Applications of Machine Learning and Artificial Intelligence in Algorithmic Medicine: a Novel Empirical Approach
利益相关者指导预测和解决机器学习和人工智能在算法医学中的应用中的伦理挑战:一种新颖的经验方法
  • 批准号:
    10674548
  • 财政年份:
    2020
  • 资助金额:
    $ 7.5万
  • 项目类别:
Stakeholder Guidance to Anticipate and Address Ethical Challenges in Applications of Machine Learning and Artificial Intelligence in Algorithmic Medicine: a Novel Empirical Approach
利益相关者指导预测和解决机器学习和人工智能在算法医学中的应用中的伦理挑战:一种新颖的经验方法
  • 批准号:
    10267034
  • 财政年份:
    2020
  • 资助金额:
    $ 7.5万
  • 项目类别:
Stakeholder Guidance to Anticipate and Address Ethical Challenges in Applications of Machine Learning and Artificial Intelligence in Algorithmic Medicine: a Novel Empirical Approach
利益相关者指导预测和解决机器学习和人工智能在算法医学中的应用中的伦理挑战:一种新颖的经验方法
  • 批准号:
    10099785
  • 财政年份:
    2020
  • 资助金额:
    $ 7.5万
  • 项目类别:
Stakeholder Guidance to Anticipate and Address Ethical Challenges in Applications of Machine Learning and Artificial Intelligence in Algorithmic Medicine: a Novel Empirical Approach
利益相关者指导预测和解决机器学习和人工智能在算法医学中的应用中的伦理挑战:一种新颖的经验方法
  • 批准号:
    10455006
  • 财政年份:
    2020
  • 资助金额:
    $ 7.5万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了