GAIUS - Maintenance activities for the sustainability of AUGUSTUS

GAIUS - AUGUSTUS 可持续发展的维护活动

基本信息

  • 批准号:
    391397397
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    德国
  • 项目类别:
    Research data and software (Scientific Library Services and Information Systems)
  • 财政年份:
    2018
  • 资助国家:
    德国
  • 起止时间:
    2017-12-31 至 2020-12-31
  • 项目状态:
    已结题

项目摘要

AUGUSTUS is a tool for the structural annotation of genes in genomic sequences. Structural genome annotation is the classical bioinformatics task of finding genes and identifying their exon-intron structure in a genome. It is commonly aided by RNA-Seq and by homology data from related genomes. Comprehensive state of the art genome annotation also requires that a clade-specific statistical model with thousands of parameters is adjusted to the target genome. The task of gene prediction is carried out frequently and on most newly sequenced genomes. In the most recent independent assessment of genome annotation methods, AUGUSTUS belonged to the most accurate programs, for example, it achieved with 61% the highest gene-level sensitivity on human protein-coding genes. For years, we have been observing a rising number of citations as well as a very high number of downloads and web service submissions.The development of AUGUSTUS was performed in different research projects, e.g. on new methods for homology integration, for automatic training, for RNA-Seq integration, alternative splicing or multi-genome annotation. Currently, AUGUSTUS is available to users as open source code written in C++ and through two web services. However, the usability of AUGUSTUS and code quality were not yet addressed. The lack of usability makes it especially difficult for less IT literate biologists to use AUGUSTUS. Current usability deficiencies cost many users valuable time and push other users towards choosing a more convenient but a less suited choice, that may ultimately result in wrong conclusions or failed experiments at a later time. Furthermore, there is currently no system in place to share species-specific parameters across different research groups that study related species. The lack of focus on the code quality makes it difficult for other researchers to contribute. Moreover, since the source code development currently happens in internal repositories at the University of Greifswald and there is no issue management in place to handle change requests and bug reports, it is difficult for other researchers to get involved in the development of AUGUSTUS.We therefore propose to address the above mentioned issues in the GAIUS project. We will improve usability through better documentation, development of easier interfaces and a unified pipeline script. A repository for parameter and data sharing will allow users to benefit directly of other users' work on related genomes. This will also support replicable research. The local installation of AUGUSTUS will become simpler via a Debian package and pipeline virtualization. Source code and issue management will be addressed e.g. by using GitHub. The WebAUGUSTUS deployment infrastructure will be improved through the adoption of DevOps methods to facilitate future updates; and very importantly, the technical depth of source code will be reduced through additional tests and refactoring of existing code.
AUGUSTUS是一种用于基因组序列中基因结构注释的工具。结构基因组注释是在基因组中发现基因并识别其外显子-内含子结构的经典生物信息学任务。它通常由RNA-Seq和相关基因组的同源性数据辅助。现有技术的基因组注释的综合状态还需要具有数千个参数的进化枝特异性统计模型被调整到靶基因组。基因预测的任务经常在大多数新测序的基因组上进行。在最近对基因组注释方法的独立评估中,AUGUSTUS属于最准确的程序,例如,它在人类蛋白质编码基因上实现了61%的最高基因水平灵敏度。多年来,我们一直观察到越来越多的引用以及非常高的下载和Web服务提交量。AUGUSTUS的开发在不同的研究项目中进行,例如同源整合的新方法,自动训练,RNA-Seq整合,选择性剪接或多基因组注释。目前,AUGUSTUS以C++编写的开源代码和两个Web服务的形式提供给用户。然而,AUGUSTUS的可用性和代码质量尚未得到解决。缺乏可用性使得不太懂IT的生物学家很难使用AUGUSTUS。当前的可用性缺陷花费了许多用户宝贵的时间,并促使其他用户选择更方便但不太合适的选择,这最终可能导致错误的结论或以后失败的实验。此外,目前还没有一个系统,可以让研究相关物种的不同研究小组分享特定物种的参数。缺乏对代码质量的关注使得其他研究人员很难做出贡献。此外,由于源代码开发目前发生在内部仓库在格赖夫斯瓦尔德大学,并没有问题管理到位,以处理更改请求和错误报告,这是很难让其他研究人员参与到开发的AUGUSTUS。因此,我们建议在GAIUS项目中解决上述问题。我们将通过更好的文档、更简单的界面开发和统一的管道脚本来提高可用性。一个参数和数据共享库将使用户能够直接受益于其他用户在相关基因组方面的工作。这也将支持可复制的研究。通过Debian软件包和管道虚拟化,AUGUSTUS的本地安装将变得更加简单。源代码和问题管理将通过例如使用GitHub来解决。WebAUGUSTUS部署基础设施将通过采用DevOps方法进行改进,以促进未来的更新;非常重要的是,源代码的技术深度将通过对现有代码的额外测试和重构来降低。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Professor Dr. Steffen Herbold其他文献

Professor Dr. Steffen Herbold的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Professor Dr. Steffen Herbold', 18)}}的其他基金

DEFECTS - Comparable and Externally Valid Software Defect Prediction
DEFECTS - 可比较且外部有效的软件缺陷预测
  • 批准号:
    402774445
  • 财政年份:
    2018
  • 资助金额:
    --
  • 项目类别:
    Research Grants
SENLP - Software Engineering knowledge of NLP models
SENLP - NLP 模型的软件工程知识
  • 批准号:
    524228075
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
    Research Grants

相似海外基金

The Role of the Amino Acid Hypusine in the Maintenance and Function of Tissue-Resident Macrophages
氨基酸马尿苷在组织驻留巨噬细胞的维持和功能中的作用
  • 批准号:
    10656730
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
NERBL Core 1: Facility Management, Maintenance and Operations
NERBL 核心 1:设施管理、维护和运营
  • 批准号:
    10793932
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
BRCA-dependent Mechanisms of Genome Maintenance and Repair
BRCA 依赖的基因组维护和修复机制
  • 批准号:
    10516336
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
Maintenance of the Animal Feed Regulatory Program Standards with Coordinated Preventive Control Regulatory Activities and Capacity Building in South Carolina
通过协调南卡罗来纳州的预防控制监管活动和能力建设来维持动物饲料监管计划标准
  • 批准号:
    10662499
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
Maintenance of the Animal Feed Regulatory Program Standards with Coordinated Preventive Control Regulatory Activities and Capacity Building in South Carolina
通过协调南卡罗来纳州的预防控制监管活动和能力建设来维持动物饲料监管计划标准
  • 批准号:
    10573118
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
BRCA-dependent Mechanisms of Genome Maintenance and Repair
BRCA 依赖的基因组维护和修复机制
  • 批准号:
    10704127
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
IDALS/FDA AFRPS Maintenance with PC Regulatory Activities
IDALS/FDA AFRPS 维护与 PC 监管活动
  • 批准号:
    10175212
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
NEBRASKA'S MAINTENANCE OF THE ANIMAL FEED REGULATORY PROGRAM STANDARDS WITH PREVENTIVE CONTROL REGULATORY ACTIVITIES AND CAPACITY BUILDING
内布拉斯加州通过预防性控制监管活动和能力建设维持动物饲料监管计划标准
  • 批准号:
    10457341
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
Animal Feed Regulatory Program Standards (AFRPS) Maintenance and the Enhancement of Current Good Manufacturing Practices (cGMP) and Preventive Controls for Animal Food Regulatory (PCAF) Activities
动物饲料监管计划标准 (AFRPS) 维持和加强现行良好生产规范 (cGMP) 以及动物食品监管 (PCAF) 活动的预防控制
  • 批准号:
    10254364
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
FDACS-Maintenance AFRPS w/Optional Coordinated PC Regulatory Activities
FDACS-维护 AFRPS 以及可选的协调 PC 监管活动
  • 批准号:
    10254360
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了