RESUMO
BACKGROUND: Multi-locus sequence typing (MLST) has become the gold standard for population analyses of bacterial pathogens. This method focuses on the sequences of a small number of loci (usually seven) to divide the population and is simple, robust and facilitates comparison of results between laboratories and over time. Over the last decade, researchers and population health specialists have invested substantial effort in building up public MLST databases for nearly 100 different bacterial species, and these databases contain a wealth of important information linked to MLST sequence types such as time and place of isolation, host or niche, serotype and even clinical or drug resistance profiles. Recent advances in sequencing technology mean it is increasingly feasible to perform bacterial population analysis at the whole genome level. This offers massive gains in resolving power and genetic profiling compared to MLST, and will eventually replace MLST for bacterial typing and population analysis. However given the wealth of data currently available in MLST databases, it is crucial to maintain backwards compatibility with MLST schemes so that new genome analyses can be understood in their proper historical context. RESULTS: We present a software tool, SRST, for quick and accurate retrieval of sequence types from short read sets, using inputs easily downloaded from public databases. SRST uses read mapping and an allele assignment score incorporating sequence coverage and variability, to determine the most likely allele at each MLST locus. Analysis of over 3,500 loci in more than 500 publicly accessible Illumina read sets showed SRST to be highly accurate at allele assignment. SRST output is compatible with common analysis tools such as eBURST, Clonal Frame or PhyloViz, allowing easy comparison between novel genome data and MLST data. Alignment, fastq and pileup files can also be generated for novel alleles. CONCLUSIONS: SRST is a novel software tool for accurate assignment of sequence types using short read data. Several uses for the tool are demonstrated, including quality control for high-throughput sequencing projects, plasmid MLST and analysis of genomic data during outbreak investigation. SRST is open-source, requires Python, BWA and SamTools, and is available from http://srst.sourceforge.net.
Assuntos
Tipagem de Sequências Multilocus/métodos , Análise de Sequência de DNA/métodos , SoftwareRESUMO
MOTIVATION: Second-generation sequencing technology makes it feasible for many researches to obtain enough sequence reads to attempt the de novo assembly of higher eukaryotes (including mammals). De novo assembly not only provides a tool for understanding wide scale biological variation, but within human biomedicine, it offers a direct way of observing both large-scale structural variation and fine-scale sequence variation. Unfortunately, improvements in the computational feasibility for de novo assembly have not matched the improvements in the gathering of sequence data. This is for two reasons: the inherent computational complexity of the problem and the in-practice memory requirements of tools. RESULTS: In this article, we use entropy compressed or succinct data structures to create a practical representation of the de Bruijn assembly graph, which requires at least a factor of 10 less storage than the kinds of structures used by deployed methods. Moreover, because our representation is entropy compressed, in the presence of sequencing errors it has better scaling behaviour asymptotically than conventional approaches. We present results of a proof-of-concept assembly of a human genome performed on a modest commodity server.
Assuntos
Genômica/métodos , Análise de Sequência de DNA/métodos , Software , Biologia Computacional/métodos , Genoma Humano , HumanosRESUMO
Mycobacterium tuberculosis (M. tuberculosis) is considered innately resistant to ß-lactam antibiotics. However, there is evidence that susceptibility to ß-lactam antibiotics in combination with ß-lactamase inhibitors is variable among clinical isolates, and these may present therapeutic options for drug-resistant cases. Here we report our investigation of susceptibility to ß-lactam/ß-lactamase inhibitor combinations among clinical isolates of M. tuberculosis, and the use of comparative genomics to understand the observed heterogeneity in susceptibility. Eighty-nine South African clinical isolates of varying first and second-line drug susceptibility patterns and two reference strains of M. tuberculosis underwent minimum inhibitory concentration (MIC) determination to two ß-lactams: amoxicillin and meropenem, both alone and in combination with clavulanate, a ß-lactamase inhibitor. 41/91 (45%) of tested isolates were found to be hypersusceptible to amoxicillin/clavulanate relative to reference strains, including 14/24 (58%) of multiple drug-resistant (MDR) and 22/38 (58%) of extensively drug-resistant (XDR) isolates. Genome-wide polymorphisms identified using whole-genome sequencing were used in a phylogenetically-aware linear mixed model to identify polymorphisms associated with amoxicillin/clavulanate susceptibility. Susceptibility to amoxicillin/clavulanate was over-represented among isolates within a specific clade (LAM4), in particular among XDR strains. Twelve sets of polymorphisms were identified as putative markers of amoxicillin/clavulanate susceptibility, five of which were confined solely to LAM4. Within the LAM4 clade, 'paradoxical hypersusceptibility' to amoxicillin/clavulanate has evolved in parallel to first and second-line drug resistance. Given the high prevalence of LAM4 among XDR TB in South Africa, our data support an expanded role for ß-lactam/ß-lactamase inhibitor combinations for treatment of drug-resistant M. tuberculosis.
Assuntos
Antibacterianos/farmacologia , Mycobacterium tuberculosis/efeitos dos fármacos , Amoxicilina/farmacologia , Teorema de Bayes , Ácido Clavulânico/farmacologia , Farmacorresistência Bacteriana Múltipla/efeitos dos fármacos , Farmacorresistência Bacteriana Múltipla/genética , Genes Bacterianos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Meropeném , Testes de Sensibilidade Microbiana , Mutação , Mycobacterium tuberculosis/enzimologia , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/isolamento & purificação , Filogenia , Análise de Sequência de DNA , Tienamicinas/farmacologia , Tuberculose/diagnóstico , Tuberculose/microbiologia , beta-Lactamases/química , beta-Lactamases/metabolismoRESUMO
Recent advances in DNA sequencing technologies have the potential to transform the field of clinical and public health microbiology, and in the last few years numerous case studies have demonstrated successful applications in this context. Among other considerations, a lack of user-friendly data analysis and interpretation tools has been frequently cited as a major barrier to routine use of these techniques. Here we consider the requirements of microbiology laboratories for the analysis, clinical interpretation and management of bacterial whole-genome sequence (WGS) data. Then we discuss relevant, existing WGS analysis tools. We highlight many essential and useful features that are represented among existing tools, but find that no single tool fulfils all of the necessary requirements. We conclude that to fully realise the potential of WGS analyses for clinical and public health microbiology laboratories of all scales, we will need to develop tools specifically with the needs of these laboratories in mind.