RESUMO
The 2013-2015 West African epidemic of Ebola virus disease (EVD) reminds us of how little is known about biosafety level 4 viruses. Like Ebola virus, Lassa virus (LASV) can cause hemorrhagic fever with high case fatality rates. We generated a genomic catalog of almost 200 LASV sequences from clinical and rodent reservoir samples. We show that whereas the 2013-2015 EVD epidemic is fueled by human-to-human transmissions, LASV infections mainly result from reservoir-to-human infections. We elucidated the spread of LASV across West Africa and show that this migration was accompanied by changes in LASV genome abundance, fatality rates, codon adaptation, and translational efficiency. By investigating intrahost evolution, we found that mutations accumulate in epitopes of viral surface proteins, suggesting selection for immune escape. This catalog will serve as a foundation for the development of vaccines and diagnostics. VIDEO ABSTRACT.
Assuntos
Genoma Viral , Febre Lassa/virologia , Vírus Lassa/genética , RNA Viral/genética , África Ocidental/epidemiologia , Animais , Evolução Biológica , Reservatórios de Doenças , Ebolavirus/genética , Variação Genética , Glicoproteínas/genética , Doença pelo Vírus Ebola/virologia , Humanos , Febre Lassa/epidemiologia , Febre Lassa/transmissão , Vírus Lassa/classificação , Vírus Lassa/fisiologia , Murinae/genética , Mutação , Nigéria/epidemiologia , Proteínas Virais/genética , Zoonoses/epidemiologia , Zoonoses/virologiaRESUMO
Medulloblastomas are the most common malignant brain tumours in children. Identifying and understanding the genetic events that drive these tumours is critical for the development of more effective diagnostic, prognostic and therapeutic strategies. Recently, our group and others described distinct molecular subtypes of medulloblastoma on the basis of transcriptional and copy number profiles. Here we use whole-exome hybrid capture and deep sequencing to identify somatic mutations across the coding regions of 92 primary medulloblastoma/normal pairs. Overall, medulloblastomas have low mutation rates consistent with other paediatric tumours, with a median of 0.35 non-silent mutations per megabase. We identified twelve genes mutated at statistically significant frequencies, including previously known mutated genes in medulloblastoma such as CTNNB1, PTCH1, MLL2, SMARCA4 and TP53. Recurrent somatic mutations were newly identified in an RNA helicase gene, DDX3X, often concurrent with CTNNB1 mutations, and in the nuclear co-repressor (N-CoR) complex genes GPS2, BCOR and LDB1. We show that mutant DDX3X potentiates transactivation of a TCF promoter and enhances cell viability in combination with mutant, but not wild-type, ß-catenin. Together, our study reveals the alteration of WNT, hedgehog, histone methyltransferase and now N-CoR pathways across medulloblastomas and within specific subtypes of this disease, and nominates the RNA helicase DDX3X as a component of pathogenic ß-catenin signalling in medulloblastoma.
Assuntos
Neoplasias Cerebelares/genética , Exoma/genética , Genoma Humano/genética , Meduloblastoma/genética , Mutação/genética , Neoplasias Cerebelares/classificação , Criança , RNA Helicases DEAD-box/química , RNA Helicases DEAD-box/genética , RNA Helicases DEAD-box/metabolismo , DNA Helicases/química , DNA Helicases/genética , Proteínas de Ligação a DNA/genética , Proteínas Hedgehog/metabolismo , Histona Metiltransferases , Histona-Lisina N-Metiltransferase/genética , Histona-Lisina N-Metiltransferase/metabolismo , Humanos , Peptídeos e Proteínas de Sinalização Intracelular/genética , Proteínas com Domínio LIM/genética , Meduloblastoma/classificação , Modelos Moleculares , Proteínas de Neoplasias/genética , Proteínas Nucleares/química , Proteínas Nucleares/genética , Receptores Patched , Receptor Patched-1 , Regiões Promotoras Genéticas/genética , Estrutura Terciária de Proteína/genética , Proteínas Proto-Oncogênicas/genética , Receptores de Superfície Celular/genética , Proteínas Repressoras/genética , Transdução de Sinais , Fatores de Transcrição TCF/metabolismo , Fatores de Transcrição/química , Fatores de Transcrição/genética , Proteína Supressora de Tumor p53/genética , Proteínas Wnt/metabolismo , beta Catenina/genética , beta Catenina/metabolismoRESUMO
While high-throughput sequencing methods are revolutionizing fungal ecology, recovering accurate estimates of species richness and abundance has proven elusive. We sought to design internal transcribed spacer (ITS) primers and an Illumina protocol that would maximize coverage of the kingdom Fungi while minimizing nontarget eukaryotes. We inspected alignments of the 5.8S and large subunit (LSU) ribosomal genes and evaluated potential primers using PrimerProspector. We tested the resulting primers using tiered-abundance mock communities and five previously characterized soil samples. We recovered operational taxonomic units (OTUs) belonging to all 8 members in both mock communities, despite DNA abundances spanning 3 orders of magnitude. The expected and observed read counts were strongly correlated (r = 0.94 to 0.97). However, several taxa were consistently over- or underrepresented, likely due to variation in rRNA gene copy numbers. The Illumina data resulted in clustering of soil samples identical to that obtained with Sanger sequence clone library data using different primers. Furthermore, the two methods produced distance matrices with a Mantel correlation of 0.92. Nonfungal sequences comprised less than 0.5% of the soil data set, with most attributable to vascular plants. Our results suggest that high-throughput methods can produce fairly accurate estimates of fungal abundances in complex communities. Further improvements might be achieved through corrections for rRNA copy number and utilization of standardized mock communities. IMPORTANCE: Fungi play numerous important roles in the environment. Improvements in sequencing methods are providing revolutionary insights into fungal biodiversity, yet accurate estimates of the number of fungal species (i.e., richness) and their relative abundances in an environmental sample (e.g., soil, roots, water, etc.) remain difficult to obtain. We present improved methods for high-throughput Illumina sequencing of the species-diagnostic fungal ribosomal marker gene that improve the accuracy of richness and abundance estimates. The improvements include new PCR primers and library preparation, validation using a known mock community, and bioinformatic parameter tuning.
Assuntos
Biodiversidade , Primers do DNA/genética , Fungos/isolamento & purificação , DNA Fúngico/genética , DNA Espaçador Ribossômico/genética , Fungos/classificação , Fungos/genética , Variação Genética , Filogenia , Reação em Cadeia da Polimerase , Análise de Sequência de DNA , Especificidade da EspécieRESUMO
The degree to which molecular epidemiology reveals information about the sources and transmission patterns of an outbreak depends on the resolution of the technology used and the samples studied. Isolates of Escherichia coli O104:H4 from the outbreak centered in Germany in May-July 2011, and the much smaller outbreak in southwest France in June 2011, were indistinguishable by standard tests. We report a molecular epidemiological analysis using multiplatform whole-genome sequencing and analysis of multiple isolates from the German and French outbreaks. Isolates from the German outbreak showed remarkably little diversity, with only two single nucleotide polymorphisms (SNPs) found in isolates from four individuals. Surprisingly, we found much greater diversity (19 SNPs) in isolates from seven individuals infected in the French outbreak. The German isolates form a clade within the more diverse French outbreak strains. Moreover, five isolates derived from a single infected individual from the French outbreak had extremely limited diversity. The striking difference in diversity between the German and French outbreak samples is consistent with several hypotheses, including a bottleneck that purged diversity in the German isolates, variation in mutation rates in the two E. coli outbreak populations, or uneven distribution of diversity in the seed populations that led to each outbreak.
Assuntos
Surtos de Doenças/estatística & dados numéricos , Infecções por Escherichia coli/epidemiologia , Infecções por Escherichia coli/microbiologia , Escherichia coli/genética , Escherichia coli/isolamento & purificação , Infecções por Escherichia coli/genética , Europa (Continente)/epidemiologia , Humanos , Modelos Genéticos , Filogenia , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
A complex microbiota inhabits various microenvironments of the gut, with some symbiotic bacteria having evolved traits to invade the epithelial mucus layer and reside deep within the intestinal tissue of animals. Whether these distinct bacterial communities across gut biogeographies exhibit divergent behaviours is largely unknown. Global transcriptomic analysis to investigate microbial physiology in specific mucosal niches has been hampered technically by an overabundance of host RNA. Here, we employed hybrid selection RNA sequencing (hsRNA-Seq) to enable detailed spatial transcriptomic profiling of a prominent human commensal as it colonizes the colonic lumen, mucus or epithelial tissue of mice. Compared to conventional RNA-Seq, hsRNA-Seq increased reads mapping to the Bacteroides fragilis genome by 48- and 154-fold in mucus and tissue, respectively, allowing for high-fidelity comparisons across biogeographic sites. Near the epithelium, B. fragilis upregulated numerous genes involved in protein synthesis, indicating that bacteria inhabiting the mucosal niche are metabolically active. Further, a specific sulfatase (BF3086) and glycosyl hydrolase (BF3134) were highly induced in mucus and tissue compared to bacteria in the lumen. In-frame deletion of these genes impaired in vitro growth on mucus as a carbon source, as well as mucosal colonization of mice. Mutants in either B. fragilis gene displayed a fitness defect in competing for colonization against bacterial challenge, revealing the importance of site-specific gene expression for robust host-microbial symbiosis. As a versatile tool, hsRNA-Seq can be deployed to explore the in vivo spatial physiology of numerous bacterial pathogens or commensals.
Assuntos
Bacteroides fragilis/genética , Bacteroides fragilis/fisiologia , Colo/microbiologia , Animais , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Bacteroides fragilis/crescimento & desenvolvimento , Colite/microbiologia , Feminino , Regulação Bacteriana da Expressão Gênica , Vida Livre de Germes , Humanos , Mucosa Intestinal/microbiologia , Masculino , Camundongos , Ácidos Sulfônicos , Simbiose , TranscriptomaRESUMO
Cercospora arachidicola, causal agent of early leaf spot, is an economically important peanut pathogen. Lack of genetic information about this fungus prevents understanding the role that potentially diverse genotypes may have in peanut breeding programs. Here, we report for the first time a draft genome sequence of C. arachidicola.
RESUMO
In its largest outbreak, Ebola virus disease is spreading through Guinea, Liberia, Sierra Leone, and Nigeria. We sequenced 99 Ebola virus genomes from 78 patients in Sierra Leone to ~2000× coverage. We observed a rapid accumulation of interhost and intrahost genetic variation, allowing us to characterize patterns of viral transmission over the initial weeks of the epidemic. This West African variant likely diverged from central African lineages around 2004, crossed from Guinea to Sierra Leone in May 2014, and has exhibited sustained human-to-human transmission subsequently, with no evidence of additional zoonotic sources. Because many of the mutations alter protein sequences and other biologically meaningful targets, they should be monitored for impact on diagnostics, vaccines, and therapies critical to outbreak response.
Assuntos
Surtos de Doenças , Ebolavirus/genética , Monitoramento Epidemiológico , Doença pelo Vírus Ebola/transmissão , Doença pelo Vírus Ebola/virologia , Sequência de Bases , Ebolavirus/isolamento & purificação , Variação Genética , Genoma Viral/genética , Genômica/métodos , Doença pelo Vírus Ebola/epidemiologia , Humanos , Mutação , Análise de Sequência de DNA , Serra Leoa/epidemiologiaRESUMO
BACKGROUND: DNA replication initiates at distinct origins in eukaryotic genomes, but the genomic features that define these sites are not well understood. RESULTS: We have taken a combined experimental and bioinformatic approach to identify and characterize origins of replication in three distantly related fission yeasts: Schizosaccharomyces pombe, Schizosaccharomyces octosporus and Schizosaccharomyces japonicus. Using single-molecule deep sequencing to construct amplification-free high-resolution replication profiles, we located origins and identified sequence motifs that predict origin function. We then mapped nucleosome occupancy by deep sequencing of mononucleosomal DNA from the corresponding species, finding that origins tend to occupy nucleosome-depleted regions. CONCLUSIONS: The sequences that specify origins are evolutionarily plastic, with low complexity nucleosome-excluding sequences functioning in S. pombe and S. octosporus, and binding sites for trans-acting nucleosome-excluding proteins functioning in S. japonicus. Furthermore, chromosome-scale variation in replication timing is conserved independently of origin location and via a mechanism distinct from known heterochromatic effects on origin function. These results are consistent with a model in which origins are simply the nucleosome-depleted regions of the genome with the highest affinity for the origin recognition complex. This approach provides a general strategy for understanding the mechanisms that define DNA replication origins in eukaryotes.
Assuntos
Genoma Fúngico , Origem de Replicação , Schizosaccharomyces/genética , Análise de Sequência de DNA/métodos , Sítios de Ligação , Mapeamento Cromossômico/métodos , Cromossomos Fúngicos/genética , Cromossomos Fúngicos/metabolismo , Biologia Computacional/métodos , Período de Replicação do DNA , DNA Fúngico/genética , Evolução Molecular , Heterogeneidade Genética , Heterocromatina/genética , Heterocromatina/metabolismo , Nucleossomos/genética , Nucleossomos/metabolismo , Motivos de Nucleotídeos , Schizosaccharomyces/metabolismo , Especificidade da EspécieRESUMO
We have adapted a solution hybrid selection protocol to enrich pathogen DNA in clinical samples dominated by human genetic material. Using mock mixtures of human and Plasmodium falciparum malaria parasite DNA as well as clinical samples from infected patients, we demonstrate an average of approximately 40-fold enrichment of parasite DNA after hybrid selection. This approach will enable efficient genome sequencing of pathogens from clinical samples, as well as sequencing of endosymbiotic organisms such as Wolbachia that live inside diverse metazoan phyla.