Search | VHL Search Portal

1.

Recruitment of multi-segment genomic RNAs by Bluetongue virus requires a preformed RNA network.

Sung, Po-Yu; Phelan, Jody E; Luo, Dongsheng; Kulasegaran-Shylini, Raghavendran; Bohn, Patrick; Smyth, Redmond P; Roy, Polly.

Nucleic Acids Res ; 2024 May 20.

Article in English | MEDLINE | ID: mdl-38769067

ABSTRACT

How do segmented RNA viruses correctly recruit their genome has yet to be clarified. Bluetongue virus is a double-stranded RNA virus with 10 segments of different sizes, but it assembles its genome in single-stranded form through a series of specific RNA-RNA interactions prior to packaging. In this study, we determined the structure of each BTV transcript, individually and in different combinations, using 2'-hydroxyl acylation analysed by primer extension and mutational profiling (SHAPE-MaP). SHAPE-MaP identified RNA structural changes during complex formation and putative RNA-RNA interaction sites. Our data also revealed a core RNA-complex of smaller segments which serves as the foundation ('anchor') for the assembly of a complete network composed of ten ssRNA segments. The same order of core RNA complex formation was identified in cells transfected with viral RNAs. No viral protein was required for these assembly reactions. Further, substitution mutations in the interacting bases within the core assemblies, altered subsequent segment addition and affected virus replication. These data identify a wholly RNA driven reaction that may offer novel opportunities for designed attenuation or antiviral therapeutics.

2.

Newly Identified Mycobacterium africanum Lineage 10, Central Africa.

Guyeux, Christophe; Senelle, Gaetan; Le Meur, Adrien; Supply, Philip; Gaudin, Cyril; Phelan, Jody E; Clark, Taane G; Rigouts, Leen; de Jong, Bouke; Sola, Christophe; Refrégier, Guislaine.

Emerg Infect Dis ; 30(3): 560-563, 2024 Oct.

Article in English | MEDLINE | ID: mdl-38407162

ABSTRACT

Analysis of genome sequencing data from >100,000 genomes of Mycobacterium tuberculosis complex using TB-Annotator software revealed a previously unknown lineage, proposed name L10, in central Africa. Phylogenetic reconstruction suggests L10 could represent a missing link in the evolutionary and geographic migration histories of M. africanum.

Subject(s)

Biological Evolution , Mycobacterium , Phylogeny , Mycobacterium/genetics , Software , Africa, Central/epidemiology

3.

Portable sequencing of Mycobacterium tuberculosis for clinical and epidemiological applications.

Gómez-González, Paula J; Campino, Susana; Phelan, Jody E; Clark, Taane G.

Brief Bioinform ; 23(5)2022 09 20.

Article in English | MEDLINE | ID: mdl-35894606

ABSTRACT

With >1 million associated deaths in 2020, human tuberculosis (TB) caused by the bacteria Mycobacterium tuberculosis remains one of the deadliest infectious diseases. A plethora of genomic tools and bioinformatics pipelines have become available in recent years to assist the whole genome sequencing of M. tuberculosis. The Oxford Nanopore Technologies (ONT) portable sequencer is a promising platform for cost-effective application in clinics, including personalizing treatment through detection of drug resistance-associated mutations, or in the field, to assist epidemiological and transmission investigations. In this study, we performed a comparison of 10 clinical isolates with DNA sequenced on both long-read ONT and (gold standard) short-read Illumina HiSeq platforms. Our analysis demonstrates the robustness of the ONT variant calling for single nucleotide polymorphisms, despite the high error rate. Moreover, because of improved coverage in repetitive regions where short sequencing reads fail to align accurately, ONT data analysis can incorporate additional regions of the genome usually excluded (e.g. pe/ppe genes). The resulting extra resolution can improve the characterization of transmission clusters and dynamics based on inferring closely related isolates. High concordance in variants in loci associated with drug resistance supports its use for the rapid detection of resistant mutations. Overall, ONT sequencing is a promising tool for TB genomic investigations, particularly to inform clinical and surveillance decision-making to reduce the disease burden.

Subject(s)

Mycobacterium tuberculosis , Tuberculosis , Computational Biology , Genomics , High-Throughput Nucleotide Sequencing , Humans , Mycobacterium tuberculosis/genetics , Sequence Analysis, DNA , Tuberculosis/drug therapy , Tuberculosis/epidemiology , Whole Genome Sequencing/methods

4.

Feature weighted models to address lineage dependency in drug-resistance prediction from Mycobacterium tuberculosis genome sequences.

Billows, Nina; Phelan, Jody E; Xia, Dong; Peng, Yonghong; Clark, Taane G; Chang, Yu-Mei.

Bioinformatics ; 39(7)2023 07 01.

Article in English | MEDLINE | ID: mdl-37428143

ABSTRACT

MOTIVATION: Tuberculosis (TB) is caused by members of the Mycobacterium tuberculosis complex (MTBC), which has a strain- or lineage-based clonal population structure. The evolution of drug-resistance in the MTBC poses a threat to successful treatment and eradication of TB. Machine learning approaches are being increasingly adopted to predict drug-resistance and characterize underlying mutations from whole genome sequences. However, such approaches may not generalize well in clinical practice due to confounding from the population structure of the MTBC. RESULTS: To investigate how population structure affects machine learning prediction, we compared three different approaches to reduce lineage dependency in random forest (RF) models, including stratification, feature selection, and feature weighted models. All RF models achieved moderate-high performance (area under the ROC curve range: 0.60-0.98). First-line drugs had higher performance than second-line drugs, but it varied depending on the lineages in the training dataset. Lineage-specific models generally had higher sensitivity than global models which may be underpinned by strain-specific drug-resistance mutations or sampling effects. The application of feature weights and feature selection approaches reduced lineage dependency in the model and had comparable performance to unweighted RF models. AVAILABILITY AND IMPLEMENTATION: https://github.com/NinaMercedes/RF_lineages.

Subject(s)

Mycobacterium tuberculosis , Tuberculosis, Multidrug-Resistant , Tuberculosis , Humans , Mycobacterium tuberculosis/genetics , Tuberculosis, Multidrug-Resistant/drug therapy , Tuberculosis/drug therapy , Mutation , Whole Genome Sequencing , Antitubercular Agents/pharmacology , Antitubercular Agents/therapeutic use

5.

Emergence of KPC-3- and OXA-181-producing ST13 and ST17 Klebsiella pneumoniae in Portugal: genomic insights on national and international dissemination.

Elias, Rita; Spadar, Anton; Hendrickx, Antoni P A; Bonnin, Remy A; Dortet, Laurent; Pinto, Margarida; Phelan, Jody E; Portugal, Isabel; Campino, Susana; da Silva, Gabriela Jorge; Clark, Taane G; Duarte, Aida; Perdigão, João.

J Antimicrob Chemother ; 78(5): 1300-1308, 2023 05 03.

Article in English | MEDLINE | ID: mdl-36999363

ABSTRACT

BACKGROUND: Carbapenem-resistant Klebsiella pneumoniae (CRKP) strains are of particular concern, especially strains with mobilizable carbapenemase genes such as blaKPC, blaNDM or blaOXA-48, given that carbapenems are usually the last line drugs in the ß-lactam class and, resistance to this sub-class is associated with increased mortality and frequently co-occurs with resistance to other antimicrobial classes. OBJECTIVES: To characterize the genomic diversity and international dissemination of CRKP strains from tertiary care hospitals in Lisbon, Portugal. METHODS: Twenty CRKP isolates obtained from different patients were subjected to WGS for species confirmation, typing, drug resistance gene detection and phylogenetic reconstruction. Two additional genomic datasets were included for comparative purposes: 26 isolates (ST13, ST17 and ST231) from our collection and 64 internationally available genomic assemblies (ST13). RESULTS: By imposing a 21 SNP cut-off on pairwise comparisons we identified two genomic clusters (GCs): ST13/GC1 (nâ=â11), all bearing blaKPC-3, and ST17/GC2 (nâ=â4) harbouring blaOXA-181 and blaCTX-M-15 genes. The inclusion of the additional datasets allowed the expansion of GC1/ST13/KPC-3 to 23 isolates, all exclusively from Portugal, France and the Netherlands. The phylogenetic tree reinforced the importance of the GC1/KPC-3-producing clones along with their rapid emergence and expansion across these countries. The data obtained suggest that the ST13 branch emerged over a decade ago and only more recently did it underpin a stronger pulse of transmission in the studied population. CONCLUSIONS: This study identifies an emerging OXA-181/ST17-producing strain in Portugal and highlights the ongoing international dissemination of a KPC-3/ST13-producing clone from Portugal.

Subject(s)

Carbapenem-Resistant Enterobacteriaceae , Klebsiella Infections , Humans , Klebsiella pneumoniae , Phylogeny , Portugal/epidemiology , beta-Lactamases/genetics , Bacterial Proteins/genetics , Carbapenems , Genomics , Microbial Sensitivity Tests , Klebsiella Infections/epidemiology , Anti-Bacterial Agents/pharmacology , Molecular Chaperones/genetics , Tumor Suppressor Proteins/genetics

6.

Molecular characterisation of second-line drug resistance among drug resistant tuberculosis patients tested in Uganda: a two and a half-year's review.

Mujuni, Dennis; Kasemire, Dianah Linda; Ibanda, Ivan; Kabugo, Joel; Nsawotebba, Andrew; Phelan, Jody E; Majwala, Robert Kaos; Tugumisirize, Didas; Nyombi, Abdunoor; Orena, Beatrice; Turyahabwe, Irene; Byabajungu, Henry; Nadunga, Diana; Musisi, Kenneth; Joloba, Moses Lutakoome; Ssengooba, Willy.

BMC Infect Dis ; 22(1): 363, 2022 Apr 11.

Article in English | MEDLINE | ID: mdl-35410160

ABSTRACT

BACKGROUND: Second-line drug resistance (SLD) among tuberculosis (TB) patients is a serious emerging challenge towards global control of the disease. We characterized SLD-resistance conferring-mutations among TB patients with rifampicin and/or isoniazid (RIF and/or INH) drug-resistance tested at the Uganda National TB Reference Laboratory (NTRL) between June 2017 and December 2019. METHODS: This was a descriptive cross-sectional secondary data analysis of 20,508 M. tuberculosis isolates of new and previously treated patients' resistant to RIF and/or INH. DNA strips with valid results to characterise the SLD resistance using the commercial Line Probe Assay Genotype MTBDRsl Version 2.0 Assay (Hain Life Science, Nehren, Germany) were reviewed. Data were analysed with STATAv15 using cross-tabulation for frequency and proportions of known resistance-conferring mutations to injectable agents (IA) and fluoroquinolones (FQ). RESULTS: Among the eligible participants, 12,993/20,508 (63.4%) were male and median (IQR) age 32 (24-43). A total of 576/20,508 (2.8%) of the M. tuberculosis isolates from participants had resistance to RIF and/or INH. These included; 102/576 (17.7%) single drug-resistant and 474/576 (82.3%) multidrug-resistant (MDR) strains. Only 102 patients had test results for FQ of whom 70/102 (68.6%) and 01/102 (0.98%) had resistance-conferring mutations in the gyrA locus and gyrB locus respectively. Among patients with FQ resistance, gyrAD94G 42.6% (30.0-55.9) and gyrA A90V 41.1% (28.6-54.3) mutations were most observed. Only one mutation, E540D was detected in the gyrB locus. A total of 26 patients had resistance-conferring mutations to IA in whom, 20/26 77.0% (56.4-91.0) had A1401G mutation in the rrs gene locus. CONCLUSIONS: Our study reveals a high proportion of mutations known to confer high-level fluoroquinolone drug-resistance among patients with rifampicin and/or isoniazid drug resistance. Utilizing routinely generated laboratory data from existing molecular diagnostic methods may aid real-time surveillance of emerging tuberculosis drug-resistance in resource-limited settings.

Subject(s)

Mycobacterium tuberculosis , Tuberculosis, Multidrug-Resistant , Adult , Antitubercular Agents/pharmacology , Antitubercular Agents/therapeutic use , Cross-Sectional Studies , Drug Resistance, Multiple, Bacterial/genetics , Female , Fluoroquinolones/therapeutic use , Humans , Isoniazid/pharmacology , Isoniazid/therapeutic use , Male , Microbial Sensitivity Tests , Mutation , Rifampin/pharmacology , Rifampin/therapeutic use , Tuberculosis, Multidrug-Resistant/drug therapy , Tuberculosis, Multidrug-Resistant/epidemiology , Uganda/epidemiology , Young Adult

7.

Clusters of Drug-Resistant Mycobacterium tuberculosis Detected by Whole-Genome Sequence Analysis of Nationwide Sample, Thailand, 2014-2017.

Nonghanphithak, Ditthawat; Chaiprasert, Angkana; Smithtikarn, Saijai; Kamolwat, Phalin; Pungrassami, Petchawan; Chongsuvivatwong, Virasakdi; Mahasirimongkol, Surakameth; Reechaipichitkul, Wipa; Leepiyasakulchai, Chaniya; Phelan, Jody E; Blair, David; Clark, Taane G; Faksri, Kiatichai.

Emerg Infect Dis ; 27(3): 813-822, 2021 03.

Article in English | MEDLINE | ID: mdl-33622486

ABSTRACT

Multidrug-resistant tuberculosis (MDR TB), pre-extensively drug-resistant tuberculosis (pre-XDR TB), and extensively drug-resistant tuberculosis (XDR TB) complicate disease control. We analyzed whole-genome sequence data for 579 phenotypically drug-resistant M. tuberculosis isolates (28% of available MDR/pre-XDR and all culturable XDR TB isolates collected in Thailand during 2014-2017). Most isolates were from lineage 2 (n = 482; 83.2%). Cluster analysis revealed that 281/579 isolates (48.5%) formed 89 clusters, including 205 MDR TB, 46 pre-XDR TB, 19 XDR TB, and 11 poly-drug-resistant TB isolates based on genotypic drug resistance. Members of most clusters had the same subset of drug resistance-associated mutations, supporting potential primary resistance in MDR TB (n = 176/205; 85.9%), pre-XDR TB (n = 29/46; 63.0%), and XDR TB (n = 14/19; 73.7%). Thirteen major clades were significantly associated with geography (p<0.001). Clusters of clonal origin contribute greatly to the high prevalence of drug-resistant TB in Thailand.

Subject(s)

Mycobacterium tuberculosis , Pharmaceutical Preparations , Tuberculosis, Multidrug-Resistant , Antitubercular Agents/therapeutic use , Drug Resistance, Multiple, Bacterial , Humans , Microbial Sensitivity Tests , Sequence Analysis , Thailand , Tuberculosis, Multidrug-Resistant/drug therapy

8.

Whole genome sequencing of Mycobacterium tuberculosis isolates and clinical outcomes of patients treated for multidrug-resistant tuberculosis in Tanzania.

Katale, Bugwesa Z; Mbelele, Peter M; Lema, Nsiande A; Campino, Susana; Mshana, Stephen E; Rweyemamu, Mark M; Phelan, Jody E; Keyyu, Julius D; Majigo, Mtebe; Mbugi, Erasto V; Dockrell, Hazel M; Clark, Taane G; Matee, Mecky I; Mpagama, Stellah.

BMC Genomics ; 21(1): 174, 2020 Feb 21.

Article in English | MEDLINE | ID: mdl-32085703

ABSTRACT

BACKGROUND: Tuberculosis (TB), particularly multi- and or extensive drug resistant TB, is still a global medical emergency. Whole genome sequencing (WGS) is a current alternative to the WHO-approved probe-based methods for TB diagnosis and detection of drug resistance, genetic diversity and transmission dynamics of Mycobacterium tuberculosis complex (MTBC). This study compared WGS and clinical data in participants with TB. RESULTS: This cohort study performed WGS on 87 from MTBC DNA isolates, 57 (66%) and 30 (34%) patients with drug resistant and susceptible TB, respectively. Drug resistance was determined by Xpert® MTB/RIF assay and phenotypic culture-based drug-susceptibility-testing (DST). WGS and bioinformatics data that predict phenotypic resistance to anti-TB drugs were compared with participant's clinical outcomes. They were 47 female participants (54%) and the median age was 35 years (IQR): 29-44). Twenty (23%) and 26 (30%) of participants had TB/HIV co-infection BMI < 18 kg/m2 respectively. MDR-TB participants had MTBC with multiple mutant genes, compared to those with mono or polyresistant TB, and the majority belonged to lineage 3 Central Asian Strain (CAS). Also, MDR-TB was associated with delayed culture-conversion (median: IQR (83: 60-180 vs. 51:30-66) days). WGS had high concordance with both culture-based DST and Xpert® MTB/RIF assay in detecting drug resistance (kappa = 1.00). CONCLUSION: This study offers comparison of mutations detected by Xpert and WGS with phenotypic DST of M. tuberculosis isolates in Tanzania. The high concordance between the different methods and further insights provided by WGS such as PZA-DST, which is not routinely performed in most resource-limited-settings, provides an avenue for inclusion of WGS into diagnostic matrix of TB including drug-resistant TB.

Subject(s)

Antitubercular Agents/therapeutic use , Drug Resistance, Multiple, Bacterial/genetics , Mutation , Mycobacterium tuberculosis/genetics , Tuberculosis, Multidrug-Resistant/drug therapy , Adult , Cohort Studies , Female , Humans , Male , Mycobacterium tuberculosis/physiology , Tanzania , Treatment Outcome , Tuberculosis, Multidrug-Resistant/microbiology , Whole Genome Sequencing

9.

Identifying mixed Mycobacterium tuberculosis infections from whole genome sequence data.

Sobkowiak, Benjamin; Glynn, Judith R; Houben, Rein M G J; Mallard, Kim; Phelan, Jody E; Guerra-Assunção, José Afonso; Banda, Louis; Mzembe, Themba; Viveiros, Miguel; McNerney, Ruth; Parkhill, Julian; Crampin, Amelia C; Clark, Taane G.

BMC Genomics ; 19(1): 613, 2018 Aug 14.

Article in English | MEDLINE | ID: mdl-30107785

ABSTRACT

BACKGROUND: Mixed, polyclonal Mycobacterium tuberculosis infection occurs in natural populations. Developing an effective method for detecting such cases is important in measuring the success of treatment and reconstruction of transmission between patients. Using whole genome sequence (WGS) data, we assess two methods for detecting mixed infection: (i) a combination of the number of heterozygous sites and the proportion of heterozygous sites to total SNPs, and (ii) Bayesian model-based clustering of allele frequencies from sequencing reads at heterozygous sites. RESULTS: In silico and in vitro artificially mixed and known pure M. tuberculosis samples were analysed to determine the specificity and sensitivity of each method. We found that both approaches were effective in distinguishing between pure strains and mixed infection where there was relatively high (> 10%) proportion of a minor strain in the mixture. A large dataset of clinical isolates (n = 1963) from the Karonga Prevention Study in Northern Malawi was tested to examine correlations with patient characteristics and outcomes with mixed infection. The frequency of mixed infection in the population was found to be around 10%, with an association with year of diagnosis, but no association with age, sex, HIV status or previous tuberculosis. CONCLUSIONS: Mixed Mycobacterium tuberculosis infection was identified in silico using whole genome sequence data. The methods presented here can be applied to population-wide analyses of tuberculosis to estimate the frequency of mixed infection, and to identify individual cases of mixed infections. These cases are important when considering the evolution and transmission of the disease, and in patient treatment.

Subject(s)

Mycobacterium tuberculosis/classification , Mycobacterium tuberculosis/genetics , Sequence Analysis, DNA/methods , Tuberculosis/diagnosis , Whole Genome Sequencing/methods , Adolescent , Adult , Bayes Theorem , DNA, Bacterial , Female , Genome, Bacterial , Humans , Male , Middle Aged , Mycobacterium tuberculosis/isolation & purification , Polymorphism, Single Nucleotide , Tuberculosis/genetics , Tuberculosis/microbiology , Young Adult

10.

Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages.

Phelan, Jody E; Coll, Francesc; Bergval, Indra; Anthony, Richard M; Warren, Rob; Sampson, Samantha L; Gey van Pittius, Nicolaas C; Glynn, Judith R; Crampin, Amelia C; Alves, Adriana; Bessa, Theolis Barbosa; Campino, Susana; Dheda, Keertan; Grandjean, Louis; Hasan, Rumina; Hasan, Zahra; Miranda, Anabela; Moore, David; Panaiotov, Stefan; Perdigao, Joao; Portugal, Isabel; Sheen, Patricia; de Oliveira Sousa, Erivelton; Streicher, Elizabeth M; van Helden, Paul D; Viveiros, Miguel; Hibberd, Martin L; Pain, Arnab; McNerney, Ruth; Clark, Taane G.

BMC Genomics ; 17: 151, 2016 Feb 29.

Article in English | MEDLINE | ID: mdl-26923687

ABSTRACT

BACKGROUND: Approximately 10% of the Mycobacterium tuberculosis genome is made up of two families of genes that are poorly characterized due to their high GC content and highly repetitive nature. The PE and PPE families are typified by their highly conserved N-terminal domains that incorporate proline-glutamate (PE) and proline-proline-glutamate (PPE) signature motifs. They are hypothesised to be important virulence factors involved with host-pathogen interactions, but their high genetic variability and complexity of analysis means they are typically disregarded in genome studies. RESULTS: To elucidate the structure of these genes, 518 genomes from a diverse international collection of clinical isolates were de novo assembled. A further 21 reference M. tuberculosis complex genomes and long read sequence data were used to validate the approach. SNP analysis revealed that variation in the majority of the 168 pe/ppe genes studied was consistent with lineage. Several recombination hotspots were identified, notably pe_pgrs3 and pe_pgrs17. Evidence of positive selection was revealed in 65 pe/ppe genes, including epitopes potentially binding to major histocompatibility complex molecules. CONCLUSIONS: This, the first comprehensive study of the pe and ppe genes, provides important insight into M. tuberculosis diversity and has significant implications for vaccine development.

Subject(s)

Genes, Bacterial , Multigene Family , Mycobacterium tuberculosis/genetics , Polymorphism, Single Nucleotide , Recombination, Genetic , DNA, Bacterial/genetics , Evolution, Molecular , Genome, Bacterial , Genomics/methods , Genotype , Mutation , Phylogeny , Selection, Genetic , Sequence Analysis, DNA

11.

New reference genomes to distinguish the sympatric malaria parasites, Plasmodium ovale curtisi and Plasmodium ovale wallikeri.

Higgins, Matthew; Manko, Emilia; Ward, Daniel; Phelan, Jody E; Nolder, Debbie; Sutherland, Colin J; Clark, Taane G; Campino, Susana.

Sci Rep ; 14(1): 3843, 2024 02 15.

Article in English | MEDLINE | ID: mdl-38360879

ABSTRACT

Despite Plasmodium ovale curtisi (Poc) and wallikeri (Pow) being important human-infecting malaria parasites that are widespread across Africa and Asia, little is known about their genome diversity. Morphologically identical, Poc and Pow are indistinguishable and commonly misidentified. Recent rises in the incidence of Poc/Pow infections have renewed efforts to address fundamental knowledge gaps in their biology, and to develop diagnostic tools to understand their epidemiological dynamics and malaria burden. A major roadblock has been the incompleteness of available reference assemblies (PocGH01, PowCR01; ~ 33.5 Mbp). Here, we applied multiple sequencing platforms and advanced bioinformatics tools to generate new reference genomes, Poc221 (South Sudan; 36.0 Mbp) and Pow222 (Nigeria; 34.3 Mbp), with improved nuclear genome contiguity (> 4.2 Mbp), annotation and completeness (> 99% Plasmodium spp., single copy orthologs). Subsequent sequencing of 6 Poc and 15 Pow isolates from Africa revealed a total of 22,517 and 43,855 high-quality core genome SNPs, respectively. Genome-wide levels of nucleotide diversity were determined to be 2.98 × 10-4 (Poc) and 3.43 × 10-4 (Pow), comparable to estimates for other Plasmodium species. Overall, the new reference genomes provide a robust foundation for dissecting the biology of Poc/Pow, their population structure and evolution, and will contribute to uncovering the recombination barrier separating these species.

Subject(s)

Malaria , Parasites , Plasmodium ovale , Animals , Humans , Parasites/genetics , Sequence Analysis, DNA , Malaria/parasitology , Nigeria

12.

Multi-platform whole genome sequencing for tuberculosis clinical and surveillance applications.

Thorpe, Joseph; Sawaengdee, Waritta; Ward, Daniel; Campos, Monica; Wichukchinda, Nuanjun; Chaiyasirinroje, Boonchai; Thanraka, Aungkana; Chumpol, Jaluporn; Phelan, Jody E; Campino, Susana; Mahasirimongkol, Surakameth; Clark, Taane G.

Sci Rep ; 14(1): 5201, 2024 03 03.

Article in English | MEDLINE | ID: mdl-38431684

ABSTRACT

Whole genome sequencing (WGS) of Mycobacterium tuberculosis offers valuable insights for tuberculosis (TB) control. High throughput platforms like Illumina and Oxford Nanopore Technology (ONT) are increasingly used globally, although ONT is known for higher error rates and is less established for genomic studies. Here we present a study comparing the sequencing outputs of both Illumina and ONT platforms, analysing DNA from 59 clinical isolates in highly endemic TB regions of Thailand. The resulting sequence data were used to profile the M. tuberculosis pairs for their lineage, drug resistance and presence in transmission chains, and were compared to publicly available WGS data from Thailand (n = 1456). Our results revealed isolates that are predominantly from lineages 1 and 2, with consistent drug resistance profiles, including six multidrug-resistant strains; however, analysis of ONT data showed longer phylogenetic branches, emphasising the technologies higher error rate. An analysis incorporating the larger dataset identified fifteen of our samples within six potential transmission clusters, including a significant clade of 41 multi-drug resistant isolates. ONT's extended sequences also revealed strain-specific structural variants in pe/ppe genes (e.g. ppe50), which are candidate loci for vaccine development. Despite some limitations, our results show that ONT sequencing is a promising approach for TB genomic research, supporting precision medicine and decision-making in areas with less developed infrastructure, which is crucial for tackling the disease's significant regional burden.

Subject(s)

Mycobacterium tuberculosis , Tuberculosis, Multidrug-Resistant , Tuberculosis , Humans , Tuberculosis, Multidrug-Resistant/microbiology , Antitubercular Agents/pharmacology , Drug Resistance, Multiple, Bacterial/genetics , Phylogeny , Tuberculosis/drug therapy , Whole Genome Sequencing/methods , Microbial Sensitivity Tests

13.

Large-scale genomic analysis of Mycobacterium tuberculosis reveals extent of target and compensatory mutations linked to multi-drug resistant tuberculosis.

Napier, Gary; Campino, Susana; Phelan, Jody E; Clark, Taane G.

Sci Rep ; 13(1): 623, 2023 01 12.

Article in English | MEDLINE | ID: mdl-36635309

ABSTRACT

Resistance to isoniazid (INH) and rifampicin (RIF) first-line drugs in Mycobacterium tuberculosis (Mtb), together called multi-drug resistance, threatens tuberculosis control. Resistance mutations in katG (for INH) and rpoB (RIF) genes often come with fitness costs. To overcome these costs, Mtb compensatory mutations have arisen in rpoC/rpoA (RIF) and ahpC (INH) loci. By leveraging the presence of known compensatory mutations, we aimed to detect novel resistance mutations occurring in INH and RIF target genes. Across ~ 32 k Mtb isolates with whole genome sequencing (WGS) data, there were 6262 (35.7%) with INH and 5435 (30.7%) with RIF phenotypic resistance. Known mutations in katG and rpoB explained ~ 99% of resistance. However, 188 (0.6%) isolates had ahpC compensatory mutations with no known resistance mutations in katG, leading to the identification of 31 putative resistance mutations in katG, each observed in at least 3 isolates. These putative katG mutations can co-occur with other INH variants (e.g., katG-Ser315Thr, fabG1 mutations). For RIF, there were no isolates with rpoC/rpoA compensatory mutations and unknown resistance mutations. Overall, using WGS data we identified putative resistance markers for INH that could be used for genotypic drug-resistance profiling. Establishing the complete repertoire of Mtb resistance mutations will assist the clinical management of tuberculosis.

Subject(s)

Mycobacterium tuberculosis , Tuberculosis, Multidrug-Resistant , Tuberculosis , Humans , Mycobacterium tuberculosis/genetics , Antitubercular Agents/pharmacology , Antitubercular Agents/therapeutic use , Tuberculosis, Multidrug-Resistant/drug therapy , Tuberculosis, Multidrug-Resistant/genetics , Tuberculosis, Multidrug-Resistant/microbiology , Mutation , Isoniazid/pharmacology , Isoniazid/therapeutic use , Tuberculosis/microbiology , Rifampin/pharmacology , Genomics , Bacterial Proteins/genetics , Microbial Sensitivity Tests

14.

Large-scale reference-free analysis of flavivirus sequences in Aedes aegypti whole genome DNA sequencing data.

Spadar, Anton; Phelan, Jody E; Clark, Taane G; Campino, Susana.

Parasit Vectors ; 16(1): 265, 2023 Aug 05.

Article in English | MEDLINE | ID: mdl-37543604

ABSTRACT

Flaviviruses are a diverse group of RNA viruses, which include the etiological agents of Zika, dengue and yellow fever that are transmitted by mosquitoes. Flaviviruses do not encode reverse transcriptase and cannot reverse transcribe into DNA, yet DNA sequences of flaviviruses are found both integrated in the chromosomes of Aedes aegypti mosquitoes and as extrachromosomal sequences. We have previously examined the Ae. aegypti reference genome to identify flavivirus integrations and analyzed conservation of these sequences among whole-genome data of 464 Ae. aegypti collected across 10 countries globally. Here, we extended this analysis by identifying flavivirus sequences in these samples independently of the Ae. aegypti reference assembly. Our aim was to identify the complete set of viral sequences, including those absent in the reference genome, and their geographical distribution. We compared the identified sequences using BLASTn and applied machine learning methods to identify clusters of similar sequences. Apart from clusters of sequences that correspond to the four viral integration events that we had previously described, we identified 19 smaller clusters. The only cluster with a strong geographic association consisted of Cell-fusing agent virus-like sequences specific to Thailand. The remaining clusters did not have a geographic association and mostly consisted of near identical short sequences without strong similarity to any known flaviviral genomes. The short read sequencing data did not permit us to determine whether identified sequences were extrachromosomal or integrated into Ae. aegypti chromosomes. Our results suggest that Liverpool strain and field Ae. aegypti mosquitoes have a similar variety of conserved flaviviral DNA, whose functional role should be investigated in follow-up studies.

Subject(s)

Aedes , Flavivirus , Zika Virus Infection , Zika Virus , Animals , Flavivirus/genetics , Aedes/genetics , Zika Virus/genetics , DNA, Viral , Sequence Analysis, DNA , Mosquito Vectors/genetics

15.

TB-ML-a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis.

Libiseller-Egger, Julian; Wang, Linfeng; Deelder, Wouter; Campino, Susana; Clark, Taane G; Phelan, Jody E.

Bioinform Adv ; 3(1): vbad040, 2023.

Article in English | MEDLINE | ID: mdl-37033466

ABSTRACT

Motivation: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researchers and clinicians to use, test or reproduce published models. Results: We packaged a number of published and unpublished ML models for predicting AMR of M.tuberculosis into Docker containers. Similarly, the pipelines required for pre-processing genomic data into the formats required by the models were also packaged into separate containers. By following a minimal container I/O standard, we ensured as much interoperability as possible. We also created a command-line application, TB-ML, which can be used to easily combine pre-processing and prediction containers into complete pipelines ready for predicting resistance from novel, raw data with a single command. As long as there is adherence to this minimal standard for the container interface, containers produced by researchers holding new models can likewise be included in these pipelines, making benchmark comparisons of different models simple and facilitating faster uptake in the clinic. Availability and implementation: TB-ML contains a simple Docker API written in Python and is available at https://github.com/jodyphelan/tb-ml. Example Docker containers for resistance prediction and corresponding data pre-processing as well as a tutorial on how to create new containers for TB-ML are available at https://tb-ml.github.io/tb-ml-containers/. Contact: jody.phelan@lshtm.ac.uk.

16.

Drug resistance profiling of asymptomatic and low-density Plasmodium falciparum malaria infections on Ngodhe island, Kenya, using custom dual-indexing next-generation sequencing.

Osborne, Ashley; Phelan, Jody E; Kaneko, Akira; Kagaya, Wataru; Chan, Chim; Ngara, Mtakai; Kongere, James; Kita, Kiyoshi; Gitaka, Jesse; Campino, Susana; Clark, Taane G.

Sci Rep ; 13(1): 11416, 2023 07 14.

Article in English | MEDLINE | ID: mdl-37452073

ABSTRACT

Malaria control initiatives require rapid and reliable methods for the detection and monitoring of molecular markers associated with antimalarial drug resistance in Plasmodium falciparum parasites. Ngodhe island, Kenya, presents a unique malaria profile, with lower P. falciparum incidence rates than the surrounding region, and a high proportion of sub-microscopic and low-density infections. Here, using custom dual-indexing and Illumina next generation sequencing, we generate resistance profiles on seventy asymptomatic and low-density P. falciparum infections from a mass drug administration program implemented on Ngodhe island between 2015 and 2016. Our assay encompasses established molecular markers on the Pfcrt, Pfmdr1, Pfdhps, Pfdhfr, and Pfk13 genes. Resistance markers for sulfadoxine-pyrimethamine were identified at high frequencies, including a quintuple mutant haplotype (Pfdhfr/Pfdhps: N51I, C59R, S108N/A437G, K540E) identified in 62.2% of isolates. The Pfdhps K540E biomarker, used to inform decision making for intermittent preventative treatment in pregnancy, was identified in 79.2% of isolates. Several variants on Pfmdr1, associated with reduced susceptibility to quinolones and lumefantrine, were also identified (Y184F 47.1%; D1246Y 16.0%; N86 98%). Overall, we have presented a low-cost and extendable approach that can provide timely genetic profiles to inform clinical and surveillance activities, especially in settings with abundant low-density infections, seeking malaria elimination.

Subject(s)

Antimalarials , Malaria, Falciparum , Malaria , Pregnancy , Female , Humans , Kenya/epidemiology , Malaria, Falciparum/drug therapy , Malaria, Falciparum/epidemiology , Malaria, Falciparum/parasitology , Antimalarials/pharmacology , Antimalarials/therapeutic use , Pyrimethamine/pharmacology , Pyrimethamine/therapeutic use , Sulfadoxine/pharmacology , Sulfadoxine/therapeutic use , Malaria/parasitology , Plasmodium falciparum , Drug Resistance/genetics , Drug Combinations , High-Throughput Nucleotide Sequencing

17.

Genome dynamics of high-risk resistant and hypervirulent Klebsiella pneumoniae clones in Dhaka, Bangladesh.

Hussain, Arif; Mazumder, Razib; Ahmed, Abdullah; Saima, Umme; Phelan, Jody E; Campino, Susana; Ahmed, Dilruba; Asadulghani, Md; Clark, Taane G; Mondal, Dinesh.

Front Microbiol ; 14: 1184196, 2023.

Article in English | MEDLINE | ID: mdl-37303793

ABSTRACT

Klebsiella pneumoniae is recognized as an urgent public health threat because of the emergence of difficult-to-treat (DTR) strains and hypervirulent clones, resulting in infections with high morbidity and mortality rates. Despite its prominence, little is known about the genomic epidemiology of K. pneumoniae in resource-limited settings like Bangladesh. We sequenced genomes of 32 K. pneumoniae strains isolated from patient samples at the International Center for Diarrhoeal Disease Research, Bangladesh (icddr,b). Genome sequences were examined for their diversity, population structure, resistome, virulome, MLST, O and K antigens and plasmids. Our results revealed the presence of two K. pneumoniae phylogroups, namely KpI (K. pneumoniae) (97%) and KpII (K. quasipneumoniae) (3%). The genomic characterization revealed that 25% (8/32) of isolates were associated with high-risk multidrug-resistant clones, including ST11, ST14, ST15, ST307, ST231 and ST147. The virulome analysis confirmed the presence of six (19%) hypervirulent K. pneumoniae (hvKp) and 26 (81%) classical K. pneumoniae (cKp) strains. The most common ESBL gene identified was blaCTX-M-15 (50%). Around 9% (3/32) isolates exhibited a difficult-to-treat phenotype, harboring carbapenem resistance genes (2 strains harbored blaNDM-5 plus blaOXA-232, one isolate blaOXA-181). The most prevalent O antigen was O1 (56%). The capsular polysaccharides K2, K20, K16 and K62 were enriched in the K. pneumoniae population. This study suggests the circulation of the major international high-risk multidrug-resistant and hypervirulent (hvKp) K. pneumoniae clones in Dhaka, Bangladesh. These findings warrant immediate appropriate interventions, which would otherwise lead to a high burden of untreatable life-threatening infections locally.

18.

Functional genetic variation in pe/ppe genes contributes to diversity in Mycobacterium tuberculosis lineages and potential interactions with the human host.

Gómez-González, Paula Josefina; Grabowska, Anna D; Tientcheu, Leopold D; Tsolaki, Anthony G; Hibberd, Martin L; Campino, Susana; Phelan, Jody E; Clark, Taane G.

Front Microbiol ; 14: 1244319, 2023.

Article in English | MEDLINE | ID: mdl-37876785

ABSTRACT

Introduction: Around 10% of the coding potential of Mycobacterium tuberculosisis constituted by two poorly understood gene families, the pe and ppe loci, thought to be involved in host-pathogen interactions. Their repetitive nature and high GC content have hindered sequence analysis, leading to exclusion from whole-genome studies. Understanding the genetic diversity of pe/ppe families is essential to facilitate their potential translation into tools for tuberculosis prevention and treatment. Methods: To investigate the genetic diversity of the 169 pe/ppe genes, we performed a sequence analysis across 73 long-read assemblies representing seven different lineages of M. tuberculosis and M. bovis BCG. Individual pe/ppe gene alignments were extracted and diversity and conservation across the different lineages studied. Results: The pe/ppe genes were classified into three groups based on the level of protein sequence conservation relative to H37Rv, finding that >50% were conserved, with indels in pe_pgrs and ppe_mptr sub-families being major drivers of structural variation. Gene rearrangements, such as duplications and gene fusions, were observed between pe and pe_pgrs genes. Inter-lineage diversity revealed lineage-specific SNPs and indels. Discussion: The high level of pe/ppe genes conservation, together with the lineage-specific findings, suggest their phylogenetic informativeness. However, structural variants and gene rearrangements differing from the reference were also identified, with potential implications for pathogenicity. Overall, improving our knowledge of these complex gene families may have insights into pathogenicity and inform the development of much-needed tools for tuberculosis control.

19.

High throughput human genotyping for variants associated with malarial disease outcomes using custom targeted amplicon sequencing.

Osborne, Ashley; Phelan, Jody E; Vanheer, Leen N; Manjurano, Alphaxard; Gitaka, Jesse; Drakeley, Christopher J; Kaneko, Akira; Kita, Kiyoshi; Campino, Susana; Clark, Taane G.

Sci Rep ; 13(1): 12062, 2023 07 26.

Article in English | MEDLINE | ID: mdl-37495620

ABSTRACT

Malaria has exhibited the strongest known selective pressure on the human genome in recent history and is the evolutionary driving force behind genetic conditions, such as sickle-cell disease, glucose-6-phosphatase deficiency, and some other erythrocyte defects. Genomic studies (e.g., The 1000 Genomes project) have provided an invaluable baseline for human genetics, but with an estimated two thousand ethno-linguistic groups thought to exist across the African continent, our understanding of the genetic differences between indigenous populations and their implications on disease is still limited. Low-cost sequencing-based approaches make it possible to target specific molecular markers and genes of interest, leading to potential insights into genetic diversity. Here we demonstrate the versatility of custom dual-indexing technology and Illumina next generation sequencing to generate a genetic profile of human polymorphisms associated with malaria pathology. For 100 individuals diagnosed with severe malaria in Northeast Tanzania, variants were successfully characterised on the haemoglobin subunit beta (HBB), glucose-6-phosphate dehydrogenase (G6PD), atypical chemokine receptor 1 (ACKR1) genes, and the intergenic Dantu genetic blood variant, then validated using pre-existing genotyping data. High sequencing coverage was observed across all amplicon targets in HBB, G6PD, ACKR1, and the Dantu blood group, with variants identified at frequencies previously observed within this region of Tanzania. Sequencing data exhibited high concordance rates to pre-existing genotyping data (> 99.5%). Our work demonstrates the potential utility of amplicon sequencing for applications in human genetics, including to personalise medicine and understand the genetic diversity of loci linked to important host phenotypes, such as malaria susceptibility.

Subject(s)

Malaria , Genotype , Malaria/epidemiology , Malaria/genetics , Humans , Polymorphism, Single Nucleotide , Tanzania/epidemiology , Male , Female , ABO Blood-Group System

20.

Genomic and functional portrait of multidrug-resistant, hydrogen sulfide (H₂S)-producing variants of Escherichia coli.

Mazumder, Razib; Hussain, Arif; Rahman, Mohammad Mustafizur; Phelan, Jody E; Campino, Susana; Abdullah, Ahmed; Clark, Taane G; Mondal, Dinesh.

Front Microbiol ; 14: 1206757, 2023.

Article in English | MEDLINE | ID: mdl-37577429

ABSTRACT

Atypical Escherichia coli forms exhibit unusual characteristics compared to typical strains. The H2S-producing variants of some atypical E. coli strains cause a wide range of illnesses in humans and animals. However, there are sparse reports on such strains worldwide. We performed whole-genome sequencing (WGS) and detailed characterization of four H2S-producing E. coli variants from poultry and human clinical sources in Dhaka, Bangladesh. All four isolates were confirmed as E. coli using biochemical tests and genomic analysis, and were multidrug-resistant (MDR). WGS analysis including an additional Chinese strain, revealed diverse STs among the five H2S-producing E. coli genomes, with clonal complex ST10 being detected in 2 out of 5 genomes. The predominant phylogroup detected was group A (n = 4/5). The blaTEM1B (n = 5/5) was the most predominant extended-spectrum beta-lactamase (ESBL) gene, followed by different alleles of blaCTX-M (blaCTX-M -55,-65,-123; n = 3/5). Multiple plasmid replicons were detected, with IncX being the most common. One E. coli strain was classified as enteropathogenic E. coli. The genomes of all five isolates harbored five primary and four secondary function genes related to H2S production. These findings suggest the potential of these isolates to cause disease and spread antibiotic resistance. Therefore, such atypical E. coli forms should be included in differential diagnosis to understand the pathogenicity, antimicrobial resistance and evolution of H2S-producing E. coli.

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL