RESUMO
Bacterial genetic diversity is often described solely using base-pair changes despite a wide variety of other mutation types likely being major contributors. Tandem duplication/amplifications are thought to be widespread among bacteria but due to their often-intractable size and instability, comprehensive studies of these mutations are rare. We define a methodology to investigate amplifications in bacterial genomes based on read depth of genome sequence data as a proxy for copy number. We demonstrate the approach with Bordetella pertussis, whose insertion sequence element-rich genome provides extensive scope for amplifications to occur. Analysis of data for 2430 B. pertussis isolates identified 272 putative amplifications, of which 94â% were located at 11 hotspot loci. We demonstrate limited phylogenetic connection for the occurrence of amplifications, suggesting unstable and sporadic characteristics. Genome instability was further described in vitro using long-read sequencing via the Nanopore platform, which revealed that clonally derived laboratory cultures produced heterogenous populations rapidly. We extended this research to analyse a population of 1000 isolates of another important pathogen, Mycobacterium tuberculosis. We found 590 amplifications in M. tuberculosis, and like B. pertussis, these occurred primarily at hotspots. Genes amplified in B. pertussis include those involved in motility and respiration, whilst in M. tuberuclosis, functions included intracellular growth and regulation of virulence. Using publicly available short-read data we predicted previously unrecognized, large amplifications in B. pertussis and M. tuberculosis. This reveals the unrecognized and dynamic genetic diversity of B. pertussis and M. tuberculosis, highlighting the need for a more holistic understanding of bacterial genetics.
Assuntos
Bordetella pertussis/genética , Variação Genética , Mycobacterium tuberculosis/genética , Bordetella pertussis/classificação , Genes Bacterianos/genética , Genoma Bacteriano , Instabilidade Genômica , Mutação , Mycobacterium tuberculosis/classificação , Filogenia , Virulência/genética , Coqueluche/microbiologiaRESUMO
Despite sharing many of the traits that have allowed the genus Bacillus to gain recognition for its agricultural relevance, the genus Lysinibacillus is not as well-known and studied. The present study employs in vitro, in vivo, in planta, and in silico approaches to characterize Lysinibacillus fusiformis strain S4C11, isolated from the roots of an apple tree in northern Italy. The in vitro and in vivo assays demonstrated that strain S4C11 possesses an antifungal activity against different fungal pathogens, and is capable of interfering with the germination of Botrytis cinerea conidia, as well as of inhibiting its growth through the production of volatile organic molecules. In planta assays showed that the strain possesses the ability to promote plant growth, that is not host-specific, both in controlled conditions and in a commercial nursery. Biocontrol assays carried out against phytopathogenic viruses gave contrasting results, suggesting that the strain does not activate the host's defense pathways. The in silico analyses were carried out by sequencing the genome of the strain through an innovative approach that combines Illumina and High-Definition Mapping methods, allowing the reconstruction of a main chromosome and two plasmids from strain S4C11. The analysis of the genes encoded by the genome contributed to the characterization of the strain, detecting genes related to the biocontrol effect detected in the experimental trials.
Assuntos
Bacillaceae/fisiologia , Antibiose , Bacillaceae/genética , Bacillaceae/isolamento & purificação , Botrytis/crescimento & desenvolvimento , Botrytis/fisiologia , Simulação por Computador , Genoma Bacteriano , Itália , Malus/microbiologia , Doenças das Plantas/microbiologia , Raízes de Plantas/microbiologiaRESUMO
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMO
New technologies and analysis methods are enabling genomic structural variants (SVs) to be detected with ever-increasing accuracy, resolution and comprehensiveness. To help translate these methods to routine research and clinical practice, we developed a sequence-resolved benchmark set for identification of both false-negative and false-positive germline large insertions and deletions. To create this benchmark for a broadly consented son in a Personal Genome Project trio with broadly available cells and DNA, the Genome in a Bottle Consortium integrated 19 sequence-resolved variant calling methods from diverse technologies. The final benchmark set contains 12,745 isolated, sequence-resolved insertion (7,281) and deletion (5,464) calls ≥50 base pairs (bp). The Tier 1 benchmark regions, for which any extra calls are putative false positives, cover 2.51 Gbp and 5,262 insertions and 4,095 deletions supported by ≥1 diploid assembly. We demonstrate that the benchmark set reliably identifies false negatives and false positives in high-quality SV callsets from short-, linked- and long-read sequencing and optical mapping.