Your browser doesn't support javascript.
loading
An overlooked phenomenon: complex interactions of potential error sources on the quality of bacterial de novo genome assemblies.
Rádai, Zoltán; Váradi, Alex; Takács, Péter; Nagy, Nikoletta Andrea; Schmitt, Nicholas; Prépost, Eszter; Kardos, Gábor; Laczkó, Levente.
Affiliation
  • Rádai Z; Institute of Metagenomics, University of Debrecen, Debrecen, Hungary. zozi.web@gmail.com.
  • Váradi A; Department of Dermatology, University Hospital Düsseldorf, Heinrich-Heine-University, Düsseldorf, Germany. zozi.web@gmail.com.
  • Takács P; Institute of Metagenomics, University of Debrecen, Debrecen, Hungary.
  • Nagy NA; Department of Laboratory Medicine, Medical School, University of Pécs, Pécs, Hungary.
  • Schmitt N; Institute of Metagenomics, University of Debrecen, Debrecen, Hungary.
  • Prépost E; Department of Health Informatics, Institute of Health Sciences, Faculty of Health, University of Debrecen, Debrecen, Hungary.
  • Kardos G; Institute of Metagenomics, University of Debrecen, Debrecen, Hungary.
  • Laczkó L; Department of Evolutionary Zoology, ELKH-DE Behavioural Ecology Research Group, University of Debrecen, Debrecen, Hungary.
BMC Genomics ; 25(1): 45, 2024 Jan 09.
Article in En | MEDLINE | ID: mdl-38195441
ABSTRACT

BACKGROUND:

Parameters adversely affecting the contiguity and accuracy of the assemblies from Illumina next-generation sequencing (NGS) are well described. However, past studies generally focused on their additive effects, overlooking their potential interactions possibly exacerbating one another's effects in a multiplicative manner. To investigate whether or not they act interactively on de novo genome assembly quality, we simulated sequencing data for 13 bacterial reference genomes, with varying levels of error rate, sequencing depth, PCR and optical duplicate ratios.

RESULTS:

We assessed the quality of assemblies from the simulated sequencing data with a number of contiguity and accuracy metrics, which we used to quantify both additive and multiplicative effects of the four parameters. We found that the tested parameters are engaged in complex interactions, exerting multiplicative, rather than additive, effects on assembly quality. Also, the ratio of non-repeated regions and GC% of the original genomes can shape how the four parameters affect assembly quality.

CONCLUSIONS:

We provide a framework for consideration in future studies using de novo genome assembly of bacterial genomes, e.g. in choosing the optimal sequencing depth, balancing between its positive effect on contiguity and negative effect on accuracy due to its interaction with error rate. Furthermore, the properties of the genomes to be sequenced also should be taken into account, as they might influence the effects of error sources themselves.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Design / Genome, Bacterial Language: En Journal: BMC Genomics Journal subject: GENETICA Year: 2024 Document type: Article Affiliation country: Hungria

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Design / Genome, Bacterial Language: En Journal: BMC Genomics Journal subject: GENETICA Year: 2024 Document type: Article Affiliation country: Hungria