Pesquisa | Biblioteca Virtual em Saúde

Inferring microbial co-occurrence networks from amplicon data: a systematic evaluation.

Kishore, Dileep; Birzu, Gabriel; Hu, Zhenjun; DeLisi, Charles; Korolev, Kirill S; Segrè, Daniel.

mSystems ; 8(4): e0096122, 2023 08 31.

Artigo em Inglês | MEDLINE | ID: mdl-37338270

RESUMO

Microbes commonly organize into communities consisting of hundreds of species involved in complex interactions with each other. 16S ribosomal RNA (16S rRNA) amplicon profiling provides snapshots that reveal the phylogenies and abundance profiles of these microbial communities. These snapshots, when collected from multiple samples, can reveal the co-occurrence of microbes, providing a glimpse into the network of associations in these communities. However, the inference of networks from 16S data involves numerous steps, each requiring specific tools and parameter choices. Moreover, the extent to which these steps affect the final network is still unclear. In this study, we perform a meticulous analysis of each step of a pipeline that can convert 16S sequencing data into a network of microbial associations. Through this process, we map how different choices of algorithms and parameters affect the co-occurrence network and identify the steps that contribute substantially to the variance. We further determine the tools and parameters that generate robust co-occurrence networks and develop consensus network algorithms based on benchmarks with mock and synthetic data sets. The Microbial Co-occurrence Network Explorer, or MiCoNE (available at https://github.com/segrelab/MiCoNE) follows these default tools and parameters and can help explore the outcome of these combinations of choices on the inferred networks. We envisage that this pipeline could be used for integrating multiple data sets and generating comparative analyses and consensus networks that can guide our understanding of microbial community assembly in different biomes. IMPORTANCE Mapping the interrelationships between different species in a microbial community is important for understanding and controlling their structure and function. The surge in the high-throughput sequencing of microbial communities has led to the creation of thousands of data sets containing information about microbial abundances. These abundances can be transformed into co-occurrence networks, providing a glimpse into the associations within microbiomes. However, processing these data sets to obtain co-occurrence information relies on several complex steps, each of which involves numerous choices of tools and corresponding parameters. These multiple options pose questions about the robustness and uniqueness of the inferred networks. In this study, we address this workflow and provide a systematic analysis of how these choices of tools affect the final network and guidelines on appropriate tool selection for a particular data set. We also develop a consensus network algorithm that helps generate more robust co-occurrence networks based on benchmark synthetic data sets.

Assuntos

Consórcios Microbianos , Microbiota , RNA Ribossômico 16S/genética , Microbiota/genética , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala

Toward FAIR Representations of Microbial Interactions.

Pacheco, Alan R; Pauvert, Charlie; Kishore, Dileep; Segrè, Daniel.

mSystems ; 7(5): e0065922, 2022 10 26.

Artigo em Inglês | MEDLINE | ID: mdl-36005399

RESUMO

Despite an ever-growing number of data sets that catalog and characterize interactions between microbes in different environments and conditions, many of these data are neither easily accessible nor intercompatible. These limitations present a major challenge to microbiome research by hindering the streamlined drawing of inferences across studies. Here, we propose guiding principles to make microbial interaction data more findable, accessible, interoperable, and reusable (FAIR). We outline specific use cases for interaction data that span the diverse space of microbiome research, and discuss the untapped potential for new insights that can be fulfilled through broader integration of microbial interaction data. These include, among others, the design of intercompatible synthetic communities for environmental, industrial, or medical applications, and the inference of novel interactions from disparate studies. Lastly, we envision potential trajectories for the deployment of FAIR microbial interaction data based on existing resources, reporting standards, and current momentum within the community.

Assuntos

Interações Microbianas , Microbiota

Pipeliner: A Nextflow-Based Framework for the Definition of Sequencing Data Processing Pipelines.

Federico, Anthony; Karagiannis, Tanya; Karri, Kritika; Kishore, Dileep; Koga, Yusuke; Campbell, Joshua D; Monti, Stefano.

Front Genet ; 10: 614, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-31316552

RESUMO

The advent of high-throughput sequencing technologies has led to the need for flexible and user-friendly data preprocessing platforms. The Pipeliner framework provides an out-of-the-box solution for processing various types of sequencing data. It combines the Nextflow scripting language and Anaconda package manager to generate modular computational workflows. We have used Pipeliner to create several pipelines for sequencing data processing including bulk RNA-sequencing (RNA-seq), single-cell RNA-seq, as well as digital gene expression data. This report highlights the design methodology behind Pipeliner that enables the development of highly flexible and reproducible pipelines that are easy to extend and maintain on multiple computing environments. We also provide a quick start user guide demonstrating how to setup and execute available pipelines with toy datasets.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA