Adapting Phred/Phrap/Consed for NextGen Sequencing
调整 Phred/Phrap/Consed 进行下一代测序
基本信息
- 批准号:7847401
- 负责人:
- 金额:$ 59.13万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2010
- 资助国家:美国
- 起止时间:2010-09-18 至 2013-06-30
- 项目状态:已结题
- 来源:
- 关键词:AddressBioinformaticsBiomedical ResearchCommunitiesComputer softwareConsensusDNA SequenceDataData Storage and RetrievalDetectionEnvironmentGenerationsGeneticGenomeGenomicsImageImage AnalysisImageryMapsMethodsOutputPerformanceProbabilityProductionProtocols documentationReadingRunningSequence AlignmentSpeedStatistical ModelsTechnologyVariantbasecomputerized data processingcostfile formatflexibilityhuman diseaseimprovedmeetingsnew technologynext generationprogramspublic health relevancetool
项目摘要
DESCRIPTION (provided by applicant): Adapting Phred/Phrap/Consed to Next-Generation Sequencing New methods for DNA sequencing are allowing the production of much more data at a fraction of the cost of traditional technologies, such that DNA sequencing is now being used more than ever before in biomedical research. However software to analyze the output from these new technologies could be significantly improved. This proposal is to upgrade the widely used phred/phrap/consed package for these "next-generation" sequencers. We have developed a new base-calling and image analysis program, next_phred, for the Illumina sequencer which gives 80%-90% more reads than the Illumina software and 50% fewer base- calling errors, thus significantly reducing sequencing costs and allowing more confident detection of sequence variants. We will make further performance improvements and investigate whether changes to the Illumina experimental protocol can increase yield still further. We will also calibrate the error probabilities for the base-callers of other next-generation sequencers. We will enable consed (the visualization, finishing, and analysis tool) to nimbly handle assemblies of up to several billion reads, a large reference sequence, and high depth of coverage; to detect structural variants and determine SNPs using a probabilistic model; to directly read the output of assemblers commonly used with next-generation data; and to perform batch correction of erroneous assemblies and consensus bases. We will further improve cross_match (the flexible sequence alignment program which is part of phred/phrap/consed) and our new ultrafast aligner phaster for mapping large numbers of genomic or RNA-Seq reads to a reference genome. Both programs will be given speed and functionality enhancements, including the capability to handle paired reads and to output alignments in a more compact file format. We will create a bioinformatics environment allowing even small labs to manage the massive amounts of data from next-generation sequencers. This will include the implementation of compact file formats, prescriptions for data storage, generation of files usable in a variety of applications, and pipelines for Illumina and 454 data processing.
PUBLIC HEALTH RELEVANCE: New DNA sequencing technologies are vastly increasing the amount of data available to decipher the genetic basis of human disease. Software able to fully exploit this data is currently lacking. Our software, commonly used for older types of sequencing machines, will be improved to meet this challenge and to significantly lower sequencing costs.
描述(由申请人提供):使Phred/Phrap/Consed适应下一代测序DNA测序的新方法允许以传统技术的一小部分成本产生更多的数据,使得DNA测序现在比以往任何时候都更多地用于生物医学研究。然而,分析这些新技术输出的软件还可以大大改进。这项建议是为了升级这些“下一代”测序仪广泛使用的phred/phrap/consed软件包。 我们已经为Illumina测序仪开发了一种新的碱基识别和图像分析程序next_phred,其给出比Illumina软件多80%-90%的读数和少50%的碱基识别错误,从而显著降低测序成本并允许更有信心地检测序列变体。我们将进一步改进性能,并研究对Illumina实验协议的更改是否可以进一步提高产量。我们还将校准其他下一代测序仪的碱基调用者的错误概率。 我们将使consed(可视化,整理和分析工具)能够灵活地处理高达数十亿读取,大参考序列和高覆盖深度的组件;使用概率模型检测结构变异并确定SNP;直接读取下一代数据常用的汇编程序的输出;并对错误的组件和共识碱基进行批量校正。 我们将进一步改进cross_match(phred/phrap/consed的一部分,灵活的序列比对程序)和我们新的超快比对器phaster,用于将大量基因组或RNA-Seq读数映射到参考基因组。这两个程序将被赋予速度和功能增强,包括处理配对读取和以更紧凑的文件格式输出比对的能力。 我们将创建一个生物信息学环境,使即使是小实验室也能管理来自下一代测序仪的大量数据。这将包括实现紧凑的文件格式,数据存储的处方,可用于各种应用程序的文件的生成,以及Illumina和454数据处理的管道。
公共卫生关系:新的DNA测序技术大大增加了可用于破译人类疾病遗传基础的数据量。目前缺乏能够充分利用这些数据的软件。我们的软件通常用于较旧类型的测序仪,将得到改进,以应对这一挑战,并显着降低测序成本。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
PHILIP P GREEN其他文献
PHILIP P GREEN的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('PHILIP P GREEN', 18)}}的其他基金
Adapting Phred/Phrap/Consed for NextGen Sequencing
调整 Phred/Phrap/Consed 进行下一代测序
- 批准号:
8144487 - 财政年份:2010
- 资助金额:
$ 59.13万 - 项目类别:
Adapting Phred/Phrap/Consed for NextGen Sequencing
调整 Phred/Phrap/Consed 进行下一代测序
- 批准号:
8298629 - 财政年份:2010
- 资助金额:
$ 59.13万 - 项目类别:
AUTOMATED DATA PROCESSING FOR GENOME SEQUENCING
基因组测序的自动化数据处理
- 批准号:
2889651 - 财政年份:1992
- 资助金额:
$ 59.13万 - 项目类别:
AUTOMATED DATA PROCESSING FOR GENOME SEQUENCING
基因组测序的自动化数据处理
- 批准号:
2026820 - 财政年份:1992
- 资助金额:
$ 59.13万 - 项目类别:
相似海外基金
Conference: Global Bioinformatics Education Summit 2024 — Energizing Communities to Power the Bioeconomy Workforce
会议:2024 年全球生物信息学教育峰会 — 激励社区为生物经济劳动力提供动力
- 批准号:
2421267 - 财政年份:2024
- 资助金额:
$ 59.13万 - 项目类别:
Standard Grant
Open Access Block Award 2024 - EMBL - European Bioinformatics Institute
2024 年开放获取区块奖 - EMBL - 欧洲生物信息学研究所
- 批准号:
EP/Z532678/1 - 财政年份:2024
- 资助金额:
$ 59.13万 - 项目类别:
Research Grant
Conference: The 9th Workshop on Biostatistics and Bioinformatics
会议:第九届生物统计与生物信息学研讨会
- 批准号:
2409876 - 财政年份:2024
- 资助金额:
$ 59.13万 - 项目类别:
Standard Grant
PAML 5: A friendly and powerful bioinformatics resource for phylogenomics
PAML 5:用于系统基因组学的友好且强大的生物信息学资源
- 批准号:
BB/X018571/1 - 财政年份:2024
- 资助金额:
$ 59.13万 - 项目类别:
Research Grant
PDB Management by The Research Collaboratory for Structural Bioinformatics
结构生物信息学研究合作实验室的 PDB 管理
- 批准号:
2321666 - 财政年份:2024
- 资助金额:
$ 59.13万 - 项目类别:
Cooperative Agreement
Building a Bioinformatics Ecosystem for Agri-Ecologists
为农业生态学家构建生物信息学生态系统
- 批准号:
BB/X018768/1 - 财政年份:2023
- 资助金额:
$ 59.13万 - 项目类别:
Research Grant
Integrative viral genomics and bioinformatics platform
综合病毒基因组学和生物信息学平台
- 批准号:
MC_UU_00034/5 - 财政年份:2023
- 资助金额:
$ 59.13万 - 项目类别:
Intramural
Collaborative Research: IIBR: Innovation: Bioinformatics: Linking Chemical and Biological Space: Deep Learning and Experimentation for Property-Controlled Molecule Generation
合作研究:IIBR:创新:生物信息学:连接化学和生物空间:属性控制分子生成的深度学习和实验
- 批准号:
2318829 - 财政年份:2023
- 资助金额:
$ 59.13万 - 项目类别:
Continuing Grant
Planning Proposal: CREST Center in Bioinformatics
规划方案:CREST生物信息学中心
- 批准号:
2334642 - 财政年份:2023
- 资助金额:
$ 59.13万 - 项目类别:
Standard Grant