Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
1.
Sci Rep ; 11(1): 10590, 2021 05 19.
Article in English | MEDLINE | ID: mdl-34012005

ABSTRACT

Despite the advent of whole genome metagenomics, targeted approaches (such as 16S rRNA gene amplicon sequencing) continue to be valuable for determining the microbial composition of samples. Amplicon microbiome sequencing can be performed on clinical samples from a normally sterile site to determine the aetiology of an infection (usually single pathogen identification) or samples from more complex niches such as human mucosa or environmental samples where multiple microorganisms need to be identified. The methodologies are frequently applied to determine both presence of micro-organisms and their quantity or relative abundance. There are a number of technical steps required to perform microbial community profiling, many of which may have appreciable precision and bias that impacts final results. In order for these methods to be applied with the greatest accuracy, comparative studies across different laboratories are warranted. In this study we explored the impact of the bioinformatic approaches taken in different laboratories on microbiome assessment using 16S rRNA gene amplicon sequencing results. Data were generated from two mock microbial community samples which were amplified using primer sets spanning five different variable regions of 16S rRNA genes. The PCR-sequencing analysis included three technical repeats of the process to determine the repeatability of their methods. Thirteen laboratories participated in the study, and each analysed the same FASTQ files using their choice of pipeline. This study captured the methods used and the resulting sequence annotation and relative abundance output from bioinformatic analyses. Results were compared to digital PCR assessment of the absolute abundance of each target representing each organism in the mock microbial community samples and also to analyses of shotgun metagenome sequence data. This ring trial demonstrates that the choice of bioinformatic analysis pipeline alone can result in different estimations of the composition of the microbiome when using 16S rRNA gene amplicon sequencing data. The study observed differences in terms of both presence and abundance of organisms and provides a resource for ensuring reproducible pipeline development and application. The observed differences were especially prevalent when using custom databases and applying high stringency operational taxonomic unit (OTU) cut-off limits. In order to apply sequencing approaches with greater accuracy, the impact of different analytical steps needs to be clearly delineated and solutions devised to harmonise microbiome analysis results.


Subject(s)
Computational Biology , Metagenomics , Microbiota , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA
2.
Dev Biol ; 473: 1-14, 2021 05.
Article in English | MEDLINE | ID: mdl-33453264

ABSTRACT

Correct vascular differentiation requires distinct patterns of gene expression in different subtypes of endothelial cells. Members of the ETS transcription factor family are essential for the transcriptional activation of arterial and angiogenesis-specific gene regulatory elements, leading to the hypothesis that they play lineage-defining roles in arterial and angiogenic differentiation directly downstream of VEGFA signalling. However, an alternative explanation is that ETS binding at enhancers and promoters is a general requirement for activation of many endothelial genes regardless of expression pattern, with subtype-specificity provided by additional factors. Here we use analysis of Ephb4 and Coup-TFII (Nr2f2) vein-specific enhancers to demonstrate that ETS factors are equally essential for vein, arterial and angiogenic-specific enhancer activity patterns. Further, we show that ETS factor binding at these vein-specific enhancers is enriched by VEGFA signalling, similar to that seen at arterial and angiogenic enhancers. However, while arterial and angiogenic enhancers can be activated by VEGFA in vivo, the Ephb4 and Coup-TFII venous enhancers are not, suggesting that the specificity of VEGFA-induced arterial and angiogenic enhancer activity occurs via non-ETS transcription factors. These results support a model in which ETS factors are not the primary regulators of specific patterns of gene expression in different endothelial subtypes.


Subject(s)
Endothelial Cells/metabolism , Neovascularization, Physiologic/physiology , Proto-Oncogene Proteins c-ets/metabolism , Animals , Arteries/metabolism , Cell Differentiation/physiology , Endothelial Cells/physiology , Endothelium/metabolism , Enhancer Elements, Genetic/genetics , Female , Gene Expression/genetics , Gene Expression Regulation, Developmental/genetics , Male , Mice , Mice, Inbred C57BL , Mice, Transgenic , Proto-Oncogene Proteins c-ets/physiology , Signal Transduction , Transcription Factors/metabolism , Transcriptional Activation , Vascular Endothelial Growth Factor A/metabolism , Veins/metabolism , Zebrafish/embryology , Zebrafish/metabolism , Zebrafish Proteins/metabolism
3.
Sci Rep ; 9(1): 15330, 2019 10 25.
Article in English | MEDLINE | ID: mdl-31653906

ABSTRACT

The major global pathogen Clostridium difficile (recently renamed Clostridioides difficile) has large genetic diversity including multiple mobile genetic elements. In this study, whole genome sequencing of 86 strains from the poorly characterised clade 3, predominantly PCR ribotype (RT)023, of C. difficile revealed distinctive surface architecture characteristics and a large mobile genetic island. These strains have a unique sortase substrate phenotype compared with well-characterised strains of C. difficile, and loss of the phage protection protein CwpV. A large genetic insertion (023_CTnT) comprised of three smaller elements (023_CTn1-3) is present in 80/86 strains analysed in this study, with genes common among other bacterial strains in the gut microbiome. Novel cargo regions of 023_CTnT include genes encoding a sortase, putative sortase substrates, lantibiotic ABC transporters and a putative siderophore biosynthetic cluster. We demonstrate the excision of 023_CTnT and sub-elements 023_CTn2 and 023_CTn3 from the genome of RT023 reference strain CD305 and the transfer of 023_CTn3 to a non-toxigenic C. difficile strain, which may have implications for the use of non-toxigenic C. difficile strains as live attenuated vaccines. Finally, we show that the genes within the island are expressed in a regulated manner in C. difficile RT023 strains conferring a distinct "niche adaptation".


Subject(s)
Cell Membrane/metabolism , Clostridioides difficile/genetics , DNA Transposable Elements/genetics , Amino Acid Sequence , Anaerobiosis , Bacterial Proteins/chemistry , Bacterial Proteins/metabolism , Base Sequence , Cell Wall/metabolism , Genes, Bacterial , Genomic Islands , Humans , Microbiota/genetics , Peptide Hydrolases/chemistry , Peptide Hydrolases/metabolism
4.
PLoS Pathog ; 14(9): e1007191, 2018 09.
Article in English | MEDLINE | ID: mdl-30208103

ABSTRACT

Clostridium difficile is a Gram-positive spore-forming anaerobe and a major cause of antibiotic-associated diarrhoea. Disruption of the commensal microbiota, such as through treatment with broad-spectrum antibiotics, is a critical precursor for colonisation by C. difficile and subsequent disease. Furthermore, failure of the gut microbiota to recover colonisation resistance can result in recurrence of infection. An unusual characteristic of C. difficile among gut bacteria is its ability to produce the bacteriostatic compound para-cresol (p-cresol) through fermentation of tyrosine. Here, we demonstrate that the ability of C. difficile to produce p-cresol in vitro provides a competitive advantage over gut bacteria including Escherichia coli, Klebsiella oxytoca and Bacteroides thetaiotaomicron. Metabolic profiling of competitive co-cultures revealed that acetate, alanine, butyrate, isobutyrate, p-cresol and p-hydroxyphenylacetate were the main metabolites responsible for differentiating the parent strain C. difficile (630Δerm) from a defined mutant deficient in p-cresol production. Moreover, we show that the p-cresol mutant displays a fitness defect in a mouse relapse model of C. difficile infection (CDI). Analysis of the microbiome from this mouse model of CDI demonstrates that colonisation by the p-cresol mutant results in a distinctly altered intestinal microbiota, and metabolic profile, with a greater representation of Gammaproteobacteria, including the Pseudomonales and Enterobacteriales. We demonstrate that Gammaproteobacteria are susceptible to exogenous p-cresol in vitro and that there is a clear divide between bacterial Phyla and their susceptibility to p-cresol. In general, Gram-negative species were relatively sensitive to p-cresol, whereas Gram-positive species were more tolerant. This study demonstrates that production of p-cresol by C. difficile has an effect on the viability of intestinal bacteria as well as the major metabolites produced in vitro. These observations are upheld in a mouse model of CDI, in which p-cresol production affects the biodiversity of gut microbiota and faecal metabolite profiles, suggesting that p-cresol production contributes to C. difficile survival and pathogenesis.


Subject(s)
Clostridioides difficile/metabolism , Clostridium Infections/microbiology , Cresols/metabolism , Gastrointestinal Microbiome/physiology , Gram-Negative Bacteria/physiology , Animals , Anti-Bacterial Agents/adverse effects , Biodiversity , Cell Membrane/drug effects , Clostridioides difficile/genetics , Clostridioides difficile/pathogenicity , Cresols/pharmacology , Disease Models, Animal , Female , Gastrointestinal Microbiome/drug effects , Gram-Negative Bacteria/drug effects , Humans , Metabolome , Mice , Mice, Inbred C57BL , Mutation
5.
Mediators Inflamm ; 2018: 3434101, 2018.
Article in English | MEDLINE | ID: mdl-29967566

ABSTRACT

Chlamydia trachomatis is the most commonly diagnosed bacterial sexually transmitted infection and can lead to tubal factor infertility, a disease characterised by fibrosis of the fallopian tubes. Genetic polymorphisms in molecular pathways involving G protein-coupled receptor signalling, the Akt/PI3K cascade, the mitotic cell cycle, and immune response have been identified in association with the development of trachomatous scarring, an ocular form of chlamydia-related fibrotic pathology. In this case-control study, we performed genome-wide association and pathways-based analysis in a sample of 71 Dutch women who attended an STI clinic who were seropositive for Chlamydia trachomatis antibodies and 169 high-risk Dutch women who sought similar health services but who were seronegative. We identified two regions of within-gene SNP association with Chlamydia trachomatis serological response and found that GPCR signalling and cell cycle pathways were also associated with the trait. These pathway-level associations appear to be common to immunological sequelae of chlamydial infections in both ocular and urogenital tropisms. These pathways may be central mediators of human refractoriness to chlamydial diseases.


Subject(s)
Chlamydia Infections/genetics , Genome-Wide Association Study/methods , Polymorphism, Single Nucleotide/genetics , Adolescent , Adult , Case-Control Studies , Chlamydia trachomatis/pathogenicity , Female , Genotype , Humans , In Vitro Techniques , Receptors, G-Protein-Coupled/genetics , Risk Factors , Young Adult
6.
Clin Infect Dis ; 66(6): 840-848, 2018 03 05.
Article in English | MEDLINE | ID: mdl-29095965

ABSTRACT

Background: Recent evidence suggests that hospital transmission of methicillin-resistant Staphylococcus aureus (MRSA) is uncommon in UK centers that have implemented sustained infection control programs. We investigated whether a healthcare-network analysis could shed light on transmission paths currently sustaining MRSA levels in UK hospitals. Methods: A cross-sectional observational study was performed in 2 National Health Service hospital groups and a general district hospital in Southeast London. All MRSA patients identified at inpatient, outpatient, and community settings between 1 November 2011 and 29 February 2012 were included. We identified genetically defined MRSA transmission clusters in individual hospitals and across the healthcare network, and examined genetic differentiation of sequence type (ST) 22 MRSA isolates within and between hospitals and inpatient or outpatient and community settings, as informed by average and median pairwise single-nucleotide polymorphisms (SNPs) and SNP-based proportions of nearly identical isolates. Results: Two hundred forty-eight of 610 (40.7%) MRSA patients were linked in 90 transmission clusters, of which 27 spanned multiple hospitals. Analysis of a large 32 patient ST22-MRSA cluster showed that 26 of 32 patients (81.3%) had multiple contacts with one another during ward stays at any hospital. No residential, outpatient, or significant community healthcare contacts were identified. Genetic differentiation between ST22 MRSA inpatient isolates from different hospitals was less than between inpatient isolates from the same hospitals (P ≤ .01). Conclusions: There is evidence of frequent ward-based transmission of MRSA brought about by frequent patient admissions to multiple hospitals. Limiting in-ward transmission requires sharing of MRSA status data between hospitals.


Subject(s)
Cross Infection/microbiology , Cross Infection/transmission , Methicillin-Resistant Staphylococcus aureus/genetics , Methicillin-Resistant Staphylococcus aureus/isolation & purification , Staphylococcal Infections/transmission , Aged , Aged, 80 and over , Anti-Bacterial Agents/pharmacology , Cross Infection/epidemiology , Cross-Sectional Studies , Disease Outbreaks/prevention & control , Female , Genome, Bacterial , Hospitals/statistics & numerical data , Humans , Infection Control , Inpatients , London/epidemiology , Male , Methicillin/pharmacology , Middle Aged , Multigene Family , Polymorphism, Single Nucleotide , Staphylococcal Infections/epidemiology , Whole Genome Sequencing
7.
Mol Biosyst ; 13(5): 852-865, 2017 May 02.
Article in English | MEDLINE | ID: mdl-28317949

ABSTRACT

In multicellular organisms, a large number of proteins interact with the polyanionic polysaccharides heparan sulphate (HS) and heparin. These interactions are usually assumed to be dominated by charge-charge interactions between the anionic carboxylate and/or sulfate groups of the polysaccharide and cationic amino acids of the protein. A major question is whether there exist conserved amino acid sequences for HS/heparin binding among these diverse proteins. Potentially conserved HS/heparin binding sequences were sought amongst 437 HS/heparin binding proteins. Amino acid sequences were extracted and compared using a Levenshtein distance metric. The resultant similarity matrices were visualised as graphs, enabling extraction of strongly conserved sequences from highly variable primary sequences while excluding short, core regions. This approach did not reveal extensive, conserved HS/heparin binding sequences, rather a number of shorter, more widely spaced sequences that may work in unison to form heparin-binding sites on protein surfaces, arguing for convergent evolution. Thus, it is the three-dimensional arrangement of these conserved motifs on the protein surface, rather than the primary sequence per se, which are the evolutionary elements.


Subject(s)
Heparin/metabolism , Heparitin Sulfate/metabolism , Proteins/chemistry , Proteins/metabolism , Amino Acid Sequence , Binding Sites , Computational Biology/methods , Conserved Sequence , Humans , Models, Molecular , Protein Binding , Protein Interaction Maps
8.
Methods Mol Biol ; 1476: 23-33, 2016.
Article in English | MEDLINE | ID: mdl-27507331

ABSTRACT

This chapter presents a method to process C. difficile whole-genome next-generation sequencing data straight from the sequencer. Quality control processing and de novo assembly of these data enable downstream analyses such as gene annotation and in silico multi-locus strain-type identification.


Subject(s)
Clostridioides difficile/classification , DNA, Bacterial/analysis , Genome, Bacterial , Multilocus Sequence Typing/methods , Sequence Analysis, DNA/methods , Software , Base Sequence , Clostridioides difficile/genetics , Contig Mapping , DNA, Bacterial/genetics , High-Throughput Nucleotide Sequencing , Humans , Molecular Sequence Annotation , Quality Control
9.
Vaccine ; 34(17): 2035-43, 2016 Apr 12.
Article in English | MEDLINE | ID: mdl-26709640

ABSTRACT

BACKGROUND: Unbiased deep sequencing offers the potential for improved adventitious virus screening in vaccines and biotherapeutics. Successful implementation of such assays will require appropriate control materials to confirm assay performance and sensitivity. METHODS: A common reference material containing 25 target viruses was produced and 16 laboratories were invited to process it using their preferred adventitious virus detection assay. RESULTS: Fifteen laboratories returned results, obtained using a wide range of wet-lab and informatics methods. Six of 25 target viruses were detected by all laboratories, with the remaining viruses detected by 4-14 laboratories. Six non-target viruses were detected by three or more laboratories. CONCLUSION: The study demonstrated that a wide range of methods are currently used for adventitious virus detection screening in biological products by deep sequencing and that they can yield significantly different results. This underscores the need for common reference materials to ensure satisfactory assay performance and enable comparisons between laboratories.


Subject(s)
Biological Products/standards , Drug Contamination/prevention & control , Vaccines/standards , Viruses/isolation & purification , High-Throughput Nucleotide Sequencing , Laboratories , Reference Standards
10.
Proc Natl Acad Sci U S A ; 112(42): 13027-32, 2015 Oct 20.
Article in English | MEDLINE | ID: mdl-26438871

ABSTRACT

Malaria cases caused by the zoonotic parasite Plasmodium knowlesi are being increasingly reported throughout Southeast Asia and in travelers returning from the region. To test for evidence of signatures of selection or unusual population structure in this parasite, we surveyed genome sequence diversity in 48 clinical isolates recently sampled from Malaysian Borneo and in five lines maintained in laboratory rhesus macaques after isolation in the 1960s from Peninsular Malaysia and the Philippines. Overall genomewide nucleotide diversity (π = 6.03 × 10(-3)) was much higher than has been seen in worldwide samples of either of the major endemic malaria parasite species Plasmodium falciparum and Plasmodium vivax. A remarkable substructure is revealed within P. knowlesi, consisting of two major sympatric clusters of the clinical isolates and a third cluster comprising the laboratory isolates. There was deep differentiation between the two clusters of clinical isolates [mean genomewide fixation index (FST) = 0.21, with 9,293 SNPs having fixed differences of FST = 1.0]. This differentiation showed marked heterogeneity across the genome, with mean FST values of different chromosomes ranging from 0.08 to 0.34 and with further significant variation across regions within several chromosomes. Analysis of the largest cluster (cluster 1, 38 isolates) indicated long-term population growth, with negatively skewed allele frequency distributions (genomewide average Tajima's D = -1.35). Against this background there was evidence of balancing selection on particular genes, including the circumsporozoite protein (csp) gene, which had the top Tajima's D value (1.57), and scans of haplotype homozygosity implicate several genomic regions as being under recent positive selection.


Subject(s)
Genome, Protozoan , Plasmodium knowlesi/genetics , Adaptation, Physiological , Animals , Genetics, Population , Plasmodium knowlesi/physiology , Polymorphism, Single Nucleotide
11.
PLoS Genet ; 11(4): e1005131, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25928499

ABSTRACT

Whole-genome sequencing technologies are being increasingly applied to Plasmodium falciparum clinical isolates to identify genetic determinants of malaria pathogenesis. However, genome-wide discovery methods, such as haplotype scans for signatures of natural selection, are hindered by missing genotypes in sequence data. Poor correlation between single nucleotide polymorphisms (SNPs) in the P. falciparum genome complicates efforts to apply established missing-genotype imputation methods that leverage off patterns of linkage disequilibrium (LD). The accuracy of state-of-the-art, LD-based imputation methods (IMPUTE, Beagle) was assessed by measuring allelic r2 for 459 P. falciparum samples from malaria patients in 4 countries: Thailand, Cambodia, Gambia, and Malawi. In restricting our analysis to 86 k high-quality SNPs across the populations, we found that the complete-case analysis was restricted to 21k SNPs (24.5%), despite no single SNP having more than 10% missing genotypes. The accuracy of Beagle in filling in missing genotypes was consistently high across all populations (allelic r2, 0.87-0.96), but the performance of IMPUTE was mixed (allelic r2, 0.34-0.99) depending on reference haplotypes and population. Positive selection analysis using Beagle-imputed haplotypes identified loci involved in resistance to chloroquine (crt) in Thailand, Cambodia, and Gambia, sulfadoxine-pyrimethamine (dhfr, dhps) in Cambodia, and artemisinin (kelch13) in Cambodia. Tajima's D-based analysis identified genes under balancing selection that encode well-characterized vaccine candidates: apical merozoite antigen 1 (ama1) and merozoite surface protein 1 (msp1). In contrast, the complete-case analysis failed to identify any well-validated drug resistance or candidate vaccine loci, except kelch13. In a setting of low LD and modest levels of missing genotypes, using Beagle to impute P. falciparum genotypes is a viable strategy for conducting accurate large-scale population genetics and association analyses, and supporting global surveillance for drug resistance markers and candidate vaccine antigens.


Subject(s)
Genetics, Population , Malaria, Falciparum/genetics , Plasmodium falciparum/genetics , Selection, Genetic/genetics , Alleles , Animals , Cambodia , Female , Gambia , Genotype , Haplotypes , Humans , Malaria, Falciparum/parasitology , Malawi , Male , Plasmodium falciparum/pathogenicity , Polymorphism, Single Nucleotide , Thailand
12.
Genome Med ; 7(1): 51, 2015.
Article in English | MEDLINE | ID: mdl-26019726

ABSTRACT

Mycobacterium tuberculosis drug resistance (DR) challenges effective tuberculosis disease control. Current molecular tests examine limited numbers of mutations, and although whole genome sequencing approaches could fully characterise DR, data complexity has restricted their clinical application. A library (1,325 mutations) predictive of DR for 15 anti-tuberculosis drugs was compiled and validated for 11 of them using genomic-phenotypic data from 792 strains. A rapid online 'TB-Profiler' tool was developed to report DR and strain-type profiles directly from raw sequences. Using our DR mutation library, in silico diagnostic accuracy was superior to some commercial diagnostics and alternative databases. The library will facilitate sequence-based drug-susceptibility testing.

13.
Nat Commun ; 5: 4052, 2014 Jun 13.
Article in English | MEDLINE | ID: mdl-24923250

ABSTRACT

Malaria is a major public health problem that is actively being addressed in a global eradication campaign. Increased population mobility through international air travel has elevated the risk of re-introducing parasites to elimination areas and dispersing drug-resistant parasites to new regions. A simple genetic marker that quickly and accurately identifies the geographic origin of infections would be a valuable public health tool for locating the source of imported outbreaks. Here we analyse the mitochondrion and apicoplast genomes of 711 Plasmodium falciparum isolates from 14 countries, and find evidence that they are non-recombining and co-inherited. The high degree of linkage produces a panel of relatively few single-nucleotide polymorphisms (SNPs) that is geographically informative. We design a 23-SNP barcode that is highly predictive (~92%) and easily adapted to aid case management in the field and survey parasite migration worldwide.


Subject(s)
Genome, Protozoan , Malaria, Falciparum/parasitology , Plasmodium falciparum/genetics , Geography , Humans , Phylogeny , Plasmodium falciparum/classification , Plasmodium falciparum/isolation & purification , Polymorphism, Single Nucleotide
14.
J Infect Dis ; 210(12): 1991-2000, 2014 Dec 15.
Article in English | MEDLINE | ID: mdl-24948693

ABSTRACT

BACKGROUND: Selection by host immunity and antimalarial drugs has driven extensive adaptive evolution in Plasmodium falciparum and continues to produce ever-changing landscapes of genetic variation. METHODS: We performed whole-genome sequencing of 69 P. falciparum isolates from Malawi and used population genetics approaches to investigate genetic diversity and population structure and identify loci under selection. RESULTS: High genetic diversity (π = 2.4 × 10(-4)), moderately high multiplicity of infection (2.7), and low linkage disequilibrium (500-bp) were observed in Chikhwawa District, Malawi, an area of high malaria transmission. Allele frequency-based tests provided evidence of recent population growth in Malawi and detected potential targets of host immunity and candidate vaccine antigens. Comparison of the sequence variation between isolates from Malawi and those from 5 geographically dispersed countries (Kenya, Burkina Faso, Mali, Cambodia, and Thailand) detected population genetic differences between Africa and Asia, within Southeast Asia, and within Africa. Haplotype-based tests of selection to sequence data from all 6 populations identified signals of directional selection at known drug-resistance loci, including pfcrt, pfdhps, pfmdr1, and pfgch1. CONCLUSIONS: The sequence variations observed at drug-resistance loci reflect differences in each country's historical use of antimalarial drugs and may be useful in formulating local malaria treatment guidelines.


Subject(s)
Adaptation, Biological , Evolution, Molecular , Genome, Protozoan , Malaria, Falciparum/parasitology , Plasmodium falciparum/genetics , Antimalarials/therapeutic use , Child , Child, Preschool , Female , Genetic Variation , Humans , Malaria, Falciparum/drug therapy , Malaria, Falciparum/epidemiology , Malawi , Male , Molecular Epidemiology , Plasmodium falciparum/isolation & purification , Selection, Genetic , Sequence Analysis, DNA
15.
Bioinformatics ; 30(15): 2227-9, 2014 Aug 01.
Article in English | MEDLINE | ID: mdl-24700318

ABSTRACT

SUMMARY: SVAMP is a stand-alone desktop application to visualize genomic variants (in variant call format) in the context of geographical metadata. Users of SVAMP are able to generate phylogenetic trees and perform principal coordinate analysis in real time from variant call format (VCF) and associated metadata files. Allele frequency map, geographical map of isolates, Tajima's D metric, single nucleotide polymorphism density, GC and variation density are also available for visualization in real time. We demonstrate the utility of SVAMP in tracking a methicillin-resistant Staphylococcus aureus outbreak from published next-generation sequencing data across 15 countries. We also demonstrate the scalability and accuracy of our software on 245 Plasmodium falciparum malaria isolates from three continents. AVAILABILITY AND IMPLEMENTATION: The Qt/C++ software code, binaries, user manual and example datasets are available at http://cbrc.kaust.edu.sa/svamp CONTACT: arnab.pain@kaust.edu.sa or arnab.pain@cantab.net SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genetic Variation , Genomics/methods , Phylogeny , Software , Disease Outbreaks , High-Throughput Nucleotide Sequencing , Humans , Malaria, Falciparum/epidemiology , Methicillin-Resistant Staphylococcus aureus/genetics , Methicillin-Resistant Staphylococcus aureus/physiology , Plasmodium falciparum/genetics , Plasmodium falciparum/physiology , Polymorphism, Single Nucleotide , Staphylococcal Infections/epidemiology , Time Factors
16.
Ann Hum Genet ; 78(2): 129-40, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24571231

ABSTRACT

Rare genetic variants are thought to be important components in the causality of many diseases but discovering these associations is challenging. We demonstrate how best to use family-based designs to improve the power to detect rare variant disease associations. We show that using genetic data from enriched families (those pedigrees with greater than one affected member) increases the power and sensitivity of existing case-control rare variant tests. However, we show that transmission- (or within-family-) based tests do not benefit from this enrichment. This means that, in studies where a limited amount of genotyping is available, choosing a single case from each of many pedigrees has greater power than selecting multiple cases from fewer pedigrees. Finally, we show how a pseudo-case-control design allows a greater range of statistical tests to be applied to family data.


Subject(s)
Disease/genetics , Genetic Association Studies/methods , Pedigree , Case-Control Studies , Genetic Variation , Genotype , Humans , Models, Genetic
17.
Bioinformatics ; 30(9): 1292-4, 2014 May 01.
Article in English | MEDLINE | ID: mdl-24443379

ABSTRACT

Individuals living in endemic areas generally harbour multiple parasite strains. Multiplicity of infection (MOI) can be an indicator of immune status and transmission intensity. It has a potentially confounding effect on a number of population genetic analyses, which often assume isolates are clonal. Polymerase chain reaction-based approaches to estimate MOI can lack sensitivity. For example, in the human malaria parasite Plasmodium falciparum, genotyping of the merozoite surface protein (MSP1/2) genes is a standard method for assessing MOI, despite the apparent problem of underestimation. The availability of deep coverage data from massively parallizable sequencing technologies means that MOI can be detected genome wide by considering the abundance of heterozygous genotypes. Here, we present a method to estimate MOI, which considers unique combinations of polymorphisms from sequence reads. The method is implemented within the estMOI software. When applied to clinical P.falciparum isolates from three continents, we find that multiple infections are common, especially in regions with high transmission.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Malaria, Falciparum/parasitology , Plasmodium falciparum/genetics , Algorithms , Animals , Genome, Protozoan , Genotype , Humans , Malaria, Falciparum/epidemiology , Plasmodium falciparum/isolation & purification , Polymorphism, Genetic , Software
18.
J Infect Dis ; 209(11): 1808-15, 2014 Jun 01.
Article in English | MEDLINE | ID: mdl-24338354

ABSTRACT

Malaria is a global public health challenge, with drug resistance a major barrier to disease control and elimination. To meet the urgent need for better treatments and vaccines, a deeper knowledge of Plasmodium biology and malaria epidemiology is required. An improved understanding of the genomic variation of malaria parasites, especially the most virulent Plasmodium falciparum (Pf) species, has the potential to yield new insights in these areas. High-throughput sequencing and genotyping is generating large amounts of genomic data across multiple parasite populations. The resulting ability to identify informative variants, particularly single-nucleotide polymorphisms (SNPs), will lead to the discovery of intra- and inter-population differences and thus enable the development of genetic barcodes for diagnostic assays and clinical studies. Knowledge of genetic variability underlying drug resistance and other differential phenotypes will also facilitate the identification of novel mutations and contribute to surveillance and stratified medicine applications. The PlasmoView interactive web-browsing tool enables the research community to visualise genomic variation and annotation (eg, biological function) in a geographic setting. The first release contains over 600,000 high-quality SNPs in 631 Pf isolates from laboratory strains and four malaria-endemic regions (West Africa, East Africa, Southeast Asia and Oceania).


Subject(s)
Genome, Protozoan/genetics , Internet , Malaria, Falciparum/parasitology , Plasmodium falciparum/genetics , Polymorphism, Single Nucleotide/genetics , Animals , Base Sequence , DNA, Protozoan/genetics , Humans , Malaria, Falciparum/epidemiology , Molecular Sequence Annotation
19.
Bioinformatics ; 28(22): 2991-3, 2012 Nov 15.
Article in English | MEDLINE | ID: mdl-23014632

ABSTRACT

SUMMARY: Spoligotyping is a well-established genotyping technique based on the presence of unique DNA sequences in Mycobacterium tuberculosis (Mtb), the causal agent of tuberculosis disease (TB). Although advances in sequencing technologies are leading to whole-genome bacterial characterization, tens of thousands of isolates have been spoligotyped, giving a global view of Mtb strain diversity. To bridge the gap, we have developed SpolPred, a software to predict the spoligotype from raw sequence reads. Our approach is compared with experimentally and de novo assembly determined strain types in a set of 44 Mtb isolates. In silico and experimental results are identical for almost all isolates (39/44). However, SpolPred detected five experimentally false spoligotypes and was more accurate and faster than the assembling strategy. Application of SpolPred to an additional seven isolates with no laboratory data led to types that clustered with identical experimental types in a phylogenetic analysis using single-nucleotide polymorphisms. Our results demonstrate the usefulness of the tool and its role in revealing experimental limitations. AVAILABILITY AND IMPLEMENTATION: SpolPred is written in C and is available from www.pathogenseq.org/spolpred. CONTACT: francesc.coll@lshtm.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Online.


Subject(s)
Mycobacterium tuberculosis/classification , Mycobacterium tuberculosis/genetics , Software , Tuberculosis/microbiology , Bacterial Typing Techniques , Genome, Bacterial , Genotype , Humans , Mycobacterium tuberculosis/isolation & purification , Phylogeny
20.
Bioinformatics ; 28(22): 2983-5, 2012 Nov 15.
Article in English | MEDLINE | ID: mdl-22976080

ABSTRACT

SUMMARY: There is an immediate need for tools to both analyse and visualize in real-time single-nucleotide polymorphisms, insertions and deletions, and other structural variants from new sequence file formats. We have developed VarB software that can be used to visualize variant call format files in real time, as well as identify regions under balancing selection and informative markers to differentiate user-defined groups (e.g. populations). We demonstrate its utility using sequence data from 50 Plasmodium falciparum isolates comprising two different continents and confirm known signals from genomic regions that contain important antigenic and anti-malarial drug-resistance genes. AVAILABILITY AND IMPLEMENTATION: The C++-based software VarB and user manual are available from www.pathogenseq.org/varb. CONTACT: taane.clark@lshtm.ac.uk


Subject(s)
Genetic Variation , Plasmodium falciparum/genetics , Software , Genome, Protozoan , High-Throughput Nucleotide Sequencing , INDEL Mutation , Plasmodium falciparum/isolation & purification , Polymorphism, Single Nucleotide , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL
...