Your browser doesn't support javascript.
loading
Reference-free Structural Variant Detection in Microbiomes via Long-read Coassembly Graphs.
Curry, Kristen D; Yu, Feiqiao Brian; Vance, Summer E; Segarra, Santiago; Bhaya, Devaki; Chikhi, Rayan; Rocha, Eduardo P C; Treangen, Todd J.
Afiliação
  • Curry KD; Rice University, Department of Computer Science, Houston, TX 77005, United States.
  • Yu FB; Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Microbial Evolutionary Genomics, 75015 Paris, France.
  • Vance SE; Arc Institute, Palo Alto, CA 94304, United States.
  • Segarra S; University of California, Berkeley, Department of Environmental Science, Policy, and Management, Berkeley, CA 94720, United States.
  • Bhaya D; Rice University, Department of Electrical and Computer Engineering, Houston, TX 77005, United States.
  • Chikhi R; Carnegie Institution for Science, Department of Plant Biology, Stanford, CA 94305, United States.
  • Rocha EPC; Institut Pasteur, Université Paris Cité, Sequence Bioinformatics unit, 75015 Paris, France.
  • Treangen TJ; Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Microbial Evolutionary Genomics, 75015 Paris, France.
bioRxiv ; 2024 Jan 30.
Article em En | MEDLINE | ID: mdl-38352454
ABSTRACT
Bacterial genome dynamics are vital for understanding the mechanisms underlying microbial adaptation, growth, and their broader impact on host phenotype. Structural variants (SVs), genomic alterations of 10 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to absence of clear reference genomes and presence of mixed strains. In response, our proposed method rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by encompassing a single metagenome coassembly graph constructed from all samples in a series. The log fold change in graph coverage between subsequent samples is then calculated to call SVs that are thriving or declining throughout the series. We show rhea to outperform existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, which is particularly noticeable as the simulated reads diverge from reference genomes and an increase in strain diversity is incorporated. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between subsequent time and temperature samples, suggesting host advantage. Our innovative approach leverages raw read patterns rather than references or MAGs to include all sequencing reads in analysis, and thus provide versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial genome dynamics.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article