Búsqueda | Biblioteca Virtual en Salud Fronteriza

Metagenome-assembled genome extraction and analysis from microbiomes using KBase.

Chivian, Dylan; Jungbluth, Sean P; Dehal, Paramvir S; Wood-Charlson, Elisha M; Canon, Richard S; Allen, Benjamin H; Clark, Mikayla M; Gu, Tianhao; Land, Miriam L; Price, Gavin A; Riehl, William J; Sneddon, Michael W; Sutormin, Roman; Zhang, Qizhi; Cottingham, Robert W; Henry, Chris S; Arkin, Adam P.

Nat Protoc ; 18(1): 208-238, 2023 01.

Artículo en Inglés | MEDLINE | ID: mdl-36376589

RESUMEN

Uncultivated Bacteria and Archaea account for the vast majority of species on Earth, but obtaining their genomes directly from the environment, using shotgun sequencing, has only become possible recently. To realize the hope of capturing Earth's microbial genetic complement and to facilitate the investigation of the functional roles of specific lineages in a given ecosystem, technologies that accelerate the recovery of high-quality genomes are necessary. We present a series of analysis steps and data products for the extraction of high-quality metagenome-assembled genomes (MAGs) from microbiomes using the U.S. Department of Energy Systems Biology Knowledgebase (KBase) platform ( http://www.kbase.us/ ). Overall, these steps take about a day to obtain extracted genomes when starting from smaller environmental shotgun read libraries, or up to about a week from larger libraries. In KBase, the process is end-to-end, allowing a user to go from the initial sequencing reads all the way through to MAGs, which can then be analyzed with other KBase capabilities such as phylogenetic placement, functional assignment, metabolic modeling, pangenome functional profiling, RNA-Seq and others. While portions of such capabilities are available individually from other resources, the combination of the intuitive usability, data interoperability and integration of tools in a freely available computational resource makes KBase a powerful platform for obtaining MAGs from microbiomes. While this workflow offers tools for each of the key steps in the genome extraction process, it also provides a scaffold that can be easily extended with additional MAG recovery and analysis tools, via the KBase software development kit (SDK).

Asunto(s)

Metagenoma , Microbiota , Filogenia , Genoma Bacteriano , Microbiota/genética , Bacterias/genética , Metagenómica

Publisher Correction: Metagenome-assembled genome extraction and analysis from microbiomes using KBase.

Nat Protoc ; 18(2): 658, 2023 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-36451056

Statistical evaluation of pairwise protein sequence comparison with the Bayesian bootstrap.

Price, Gavin A; Crooks, Gavin E; Green, Richard E; Brenner, Steven E.

Bioinformatics ; 21(20): 3824-31, 2005 Oct 15.

Artículo en Inglés | MEDLINE | ID: mdl-16105900

RESUMEN

MOTIVATION: Protein sequence comparison methods are routinely used to infer the intricate network of evolutionary relationships found within the rapidly growing library of protein sequences, and thereby to predict the structure and function of uncharacterized proteins. In the present study, we detail an improved statistical benchmark of pairwise protein sequence comparison algorithms. We use bootstrap resampling techniques to determine standard statistical errors and to estimate the confidence of our conclusions. We show that the underlying structure within benchmark databases causes Efron's standard, non-parametric bootstrap to be biased. Consequently, the standard bootstrap underpredicts average performance when used in the context of evaluating sequence comparison methods. We have developed, as an alternative, an unbiased statistical evaluation based on the Bayesian bootstrap, a resampling method operationally similar to the standard bootstrap. RESULTS: We apply our analysis to the comparative study of amino acid substitution matrix families and find that using modern matrices results in a small, but statistically significant improvement in remote homology detection compared with the classic PAM and BLOSUM matrices. AVAILABILITY: The sequence sets and code for performing these analyses are available from http://compbio.berkeley.edu/. CONTACT: brenner@compbio.berkeley.edu.

Asunto(s)

Algoritmos , Modelos Moleculares , Proteínas/química , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Teorema de Bayes , Interpretación Estadística de Datos , Modelos Estadísticos , Datos de Secuencia Molecular , Proteínas/análisis

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA