Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement
提高序列注释准确性和速度的机器学习方法:软件增强的补充
基本信息
- 批准号:10406630
- 负责人:
- 金额:$ 22.19万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-09-20 至 2023-07-31
- 项目状态:已结题
- 来源:
- 关键词:AddressBiologicalCentromereClassificationCloud ComputingComplexComputer softwareConsensusDNA Transposable ElementsDataDevelopmentDocumentationElementsError SourcesGalaxyGenetic RecombinationGenomeGoalsHumanLabelLibrariesMachine LearningMaintenanceMethodsModernizationMolecular BiologyOutputParentsPathway interactionsPatternProcessRepetitive SequenceReportingSequence AlignmentSequence AnalysisSoftware EngineeringSoftware ToolsSourceSpeedStructureSystemTestingUncertaintyVisualizationadjudicatebasecloud basedgenome annotationimprovedmolecular sequence databaseparent grantsearchable databasesoftware developmentsupport toolstelomeretoolweb server
项目摘要
Summary
The goal of this parent grant for this supplement request is to develop Machine Learning approaches to
improve both accuracy and speed of highly-sensitive sequence database search and alignment. We have
developed three software tools associated with this effort of correctly annotating genomes: (i) ULTRA, which
labels repetitive sequence, (ii) PolyA which integrates such labels with other sequence annotations in a
probabilistic framework, computing uncertainty and improving accuracy, and (iii) SODA, which aids in
visualization of annotations and supporting evidence. Here, we describe a plan to refactor these software
tools and their documentation to improve robustness and reliability, and to improve their availability through
package management systems and incorporation into cloud-based analysis frameworks.
总结
这项补充申请的父母补助金的目标是开发机器学习方法,
提高了高灵敏度序列数据库搜索和比对准确性和速度。我们有
开发了三种与正确注释基因组的努力相关的软件工具:(i)ULTRA,
标记重复序列,(ii)PolyA,其将这些标记与其他序列注释整合在一个或多个标记中,
概率框架,计算不确定性和提高准确性,以及(iii)SODA,这有助于
注释和支持证据的可视化。在这里,我们描述了一个重构这些软件的计划
工具及其文档,以提高健壮性和可靠性,并通过
软件包管理系统,并纳入基于云的分析框架。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Travis John Wheeler其他文献
Travis John Wheeler的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Travis John Wheeler', 18)}}的其他基金
Building Knowledge About Alternatively-spliced Dual-Coding Exons
建立关于选择性剪接双编码外显子的知识
- 批准号:
10363514 - 财政年份:2022
- 资助金额:
$ 22.19万 - 项目类别:
Building Knowledge About Alternatively-spliced Dual-Coding Exons
建立关于选择性剪接双编码外显子的知识
- 批准号:
10701663 - 财政年份:2022
- 资助金额:
$ 22.19万 - 项目类别:
Machine learning approaches for improved accuracy and speed in sequence annotation
用于提高序列注释的准确性和速度的机器学习方法
- 批准号:
10838066 - 财政年份:2019
- 资助金额:
$ 22.19万 - 项目类别:
Machine learning approaches for improved accuracy and speed in sequence annotation
用于提高序列注释的准确性和速度的机器学习方法
- 批准号:
10465048 - 财政年份:2019
- 资助金额:
$ 22.19万 - 项目类别:
Machine learning approaches for improved accuracy and speed in sequence annotation
用于提高序列注释的准确性和速度的机器学习方法
- 批准号:
10020995 - 财政年份:2019
- 资助金额:
$ 22.19万 - 项目类别:
Machine learning approaches for improved accuracy and speed in sequence annotation
用于提高序列注释的准确性和速度的机器学习方法
- 批准号:
10231149 - 财政年份:2019
- 资助金额:
$ 22.19万 - 项目类别:
相似海外基金
NSF/BIO-DFG: Biological Fe-S intermediates in the synthesis of nitrogenase metalloclusters
NSF/BIO-DFG:固氮酶金属簇合成中的生物 Fe-S 中间体
- 批准号:
2335999 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
Collaborative Research: Conference: Large Language Models for Biological Discoveries (LLMs4Bio)
合作研究:会议:生物发现的大型语言模型 (LLMs4Bio)
- 批准号:
2411529 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
Collaborative Research: Conference: Large Language Models for Biological Discoveries (LLMs4Bio)
合作研究:会议:生物发现的大型语言模型 (LLMs4Bio)
- 批准号:
2411530 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
Collaborative Research: NSF-ANR MCB/PHY: Probing Heterogeneity of Biological Systems by Force Spectroscopy
合作研究:NSF-ANR MCB/PHY:通过力谱探测生物系统的异质性
- 批准号:
2412551 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
Elucidating mechanisms of biological hydrogen conversion through model metalloenzymes
通过模型金属酶阐明生物氢转化机制
- 批准号:
2419343 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
Collaborative Research: The Interplay of Water Condensation and Fungal Growth on Biological Surfaces
合作研究:水凝结与生物表面真菌生长的相互作用
- 批准号:
2401507 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
DESIGN: Driving Culture Change in a Federation of Biological Societies via Cohort-Based Early-Career Leaders
设计:通过基于队列的早期职业领袖推动生物协会联盟的文化变革
- 批准号:
2334679 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
REU Site: Modeling the Dynamics of Biological Systems
REU 网站:生物系统动力学建模
- 批准号:
2243955 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Standard Grant
Defining the biological boundaries to sustain extant life on Mars
定义维持火星现存生命的生物边界
- 批准号:
DP240102658 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Discovery Projects
Advanced Multiscale Biological Imaging using European Infrastructures
利用欧洲基础设施进行先进的多尺度生物成像
- 批准号:
EP/Y036654/1 - 财政年份:2024
- 资助金额:
$ 22.19万 - 项目类别:
Research Grant