Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
bioRxiv ; 2024 Mar 29.
Article in English | MEDLINE | ID: mdl-38585716

ABSTRACT

Immunoglobulin (IGH, IGK, IGL) loci in the human genome are highly polymorphic regions that encode the building blocks of the light and heavy chain IG proteins that dimerize to form antibodies. The processes of V(D)J recombination and somatic hypermutation in B cells are responsible for creating an enormous reservoir of highly specific antibodies capable of binding a vast array of possible antigens. However, the antibody repertoire is fundamentally limited by the set of variable (V), diversity (D), and joining (J) alleles present in the germline IG loci. To better understand how the germline IG haplotypes contribute to the expressed antibody repertoire, we combined genome sequencing of the germline IG loci with single-cell transcriptome sequencing of B cells from the same donor. Sequencing and assembly of the germline IG loci captured the IGH locus in a single fully-phased contig where the maternal and paternal contributions to the germline V, D, and J repertoire can be fully resolved. The B cells were collected following a measles, mumps, and rubella (MMR) vaccination, resulting in a population of cells that were activated in response to this specific immune challenge. Single-cell, full-length transcriptome sequencing of these B cells resulted in whole transcriptome characterization of each cell, as well as highly-accurate consensus sequences for the somatically rearranged and hypermutated light and heavy chain IG transcripts. A subset of antibodies synthesized based on their consensus heavy and light chain transcript sequences demonstrated binding to measles antigens and neutralization of measles live virus.

2.
Cell Stem Cell ; 30(9): 1262-1281.e8, 2023 09 07.
Article in English | MEDLINE | ID: mdl-37582363

ABSTRACT

RNA splicing factors are recurrently mutated in clonal blood disorders, but the impact of dysregulated splicing in hematopoiesis remains unclear. To overcome technical limitations, we integrated genotyping of transcriptomes (GoT) with long-read single-cell transcriptomics and proteogenomics for single-cell profiling of transcriptomes, surface proteins, somatic mutations, and RNA splicing (GoT-Splice). We applied GoT-Splice to hematopoietic progenitors from myelodysplastic syndrome (MDS) patients with mutations in the core splicing factor SF3B1. SF3B1mut cells were enriched in the megakaryocytic-erythroid lineage, with expansion of SF3B1mut erythroid progenitor cells. We uncovered distinct cryptic 3' splice site usage in different progenitor populations and stage-specific aberrant splicing during erythroid differentiation. Profiling SF3B1-mutated clonal hematopoiesis samples revealed that erythroid bias and cell-type-specific cryptic 3' splice site usage in SF3B1mut cells precede overt MDS. Collectively, GoT-Splice defines the cell-type-specific impact of somatic mutations on RNA splicing, from early clonal outgrowths to overt neoplasia, directly in human samples.


Subject(s)
Myelodysplastic Syndromes , RNA Splice Sites , Humans , Multiomics , RNA Splicing/genetics , Myelodysplastic Syndromes/genetics , Myelodysplastic Syndromes/metabolism , RNA Splicing Factors/genetics , RNA Splicing Factors/metabolism , Mutation/genetics , Phosphoproteins/genetics , Phosphoproteins/metabolism
3.
Cell ; 186(1): 47-62.e16, 2023 01 05.
Article in English | MEDLINE | ID: mdl-36608657

ABSTRACT

Horizontal gene transfer accelerates microbial evolution. The marine picocyanobacterium Prochlorococcus exhibits high genomic plasticity, yet the underlying mechanisms are elusive. Here, we report a novel family of DNA transposons-"tycheposons"-some of which are viral satellites while others carry cargo, such as nutrient-acquisition genes, which shape the genetic variability in this globally abundant genus. Tycheposons share distinctive mobile-lifecycle-linked hallmark genes, including a deep-branching site-specific tyrosine recombinase. Their excision and integration at tRNA genes appear to drive the remodeling of genomic islands-key reservoirs for flexible genes in bacteria. In a selection experiment, tycheposons harboring a nitrate assimilation cassette were dynamically gained and lost, thereby promoting chromosomal rearrangements and host adaptation. Vesicles and phage particles harvested from seawater are enriched in tycheposons, providing a means for their dispersal in the wild. Similar elements are found in microbes co-occurring with Prochlorococcus, suggesting a common mechanism for microbial diversification in the vast oligotrophic oceans.


Subject(s)
Ecosystem , Genome, Bacterial , Genome, Bacterial/genetics , Phylogeny , Oceans and Seas , Genomics
4.
Genome Res ; 30(3): 437-446, 2020 03.
Article in English | MEDLINE | ID: mdl-32075851

ABSTRACT

Viruses are the most abundant biological entities on Earth and play key roles in host ecology, evolution, and horizontal gene transfer. Despite recent progress in viral metagenomics, the inherent genetic complexity of virus populations still poses technical difficulties for recovering complete virus genomes from natural assemblages. To address these challenges, we developed an assembly-free, single-molecule nanopore sequencing approach, enabling direct recovery of complete virus genome sequences from environmental samples. Our method yielded thousands of full-length, high-quality draft virus genome sequences that were not recovered using standard short-read assembly approaches. Additionally, our analyses discriminated between populations whose genomes had identical direct terminal repeats versus those with circularly permuted repeats at their termini, thus providing new insight into native virus reproduction and genome packaging. Novel DNA sequences were discovered, whose repeat structures, gene contents, and concatemer lengths suggest they are phage-inducible chromosomal islands, which are packaged as concatemers in phage particles, with lengths that match the size ranges of co-occurring phage genomes. Our new virus sequencing strategy can provide previously unavailable information about the genome structures, population biology, and ecology of naturally occurring viruses and viral parasites.


Subject(s)
Genome, Viral , Nanopore Sequencing/methods , Bacteriophages/genetics , DNA Packaging , Metagenomics , Seawater/virology
5.
Genome Med ; 11(1): 25, 2019 04 23.
Article in English | MEDLINE | ID: mdl-31014393

ABSTRACT

BACKGROUND: Intrachromosomal triplications (TRP) can contribute to disease etiology via gene dosage effects, gene disruption, position effects, or fusion gene formation. Recently, post-zygotic de novo triplications adjacent to copy-number neutral genomic intervals with runs of homozygosity (ROH) have been shown to result in uniparental isodisomy (UPD). The genomic structure of these complex genomic rearrangements (CGRs) shows a consistent pattern of an inverted triplication flanked by duplications (DUP-TRP/INV-DUP) formed by an iterative DNA replisome template-switching mechanism during replicative repair of a single-ended, double-stranded DNA (seDNA), the ROH results from an interhomolog or nonsister chromatid template switch. It has been postulated that these CGRs may lead to genetic abnormalities in carriers due to dosage-sensitive genes mapping within the copy-number variant regions, homozygosity for alleles at a locus causing an autosomal recessive (AR) disease trait within the ROH region, or imprinting-associated diseases. METHODS: Here, we report a family wherein the affected subject carries a de novo 2.2-Mb TRP followed by 42.2 Mb of ROH and manifests clinical features overlapping with those observed in association with chromosome 14 maternal UPD (UPD(14)mat). UPD(14)mat can cause clinical phenotypic features enabling a diagnosis of Temple syndrome. This CGR was then molecularly characterized by high-density custom aCGH, genome-wide single-nucleotide polymorphism (SNP) and methylation arrays, exome sequencing (ES), and the Oxford Nanopore long-read sequencing technology. RESULTS: We confirmed the postulated DUP-TRP/INV-DUP structure by multiple orthogonal genomic technologies in the proband. The methylation status of known differentially methylated regions (DMRs) on chromosome 14 revealed that the subject shows the typical methylation pattern of UPD(14)mat. Consistent with these molecular findings, the clinical features overlap with those observed in Temple syndrome, including speech delay. CONCLUSIONS: These data provide experimental evidence that, in humans, triplication can lead to segmental UPD and imprinting disease. Importantly, genotype/phenotype analyses further reveal how a post-zygotically generated complex structural variant, resulting from a replication-based mutational mechanism, contributes to expanding the clinical phenotype of known genetic syndromes. Mechanistically, such events can distort transmission genetics resulting in homozygosity at a locus for which only one parent is a carrier as well as cause imprinting diseases.


Subject(s)
Chromosome Aberrations , Chromosome Disorders/genetics , Chromosomes, Human, Pair 14/genetics , Genomic Imprinting , Chromosome Disorders/pathology , DNA Methylation , DNA Replication , Humans , Male , Pedigree , Phenotype , Polymorphism, Single Nucleotide , Young Adult
6.
Nat Rev Genet ; 20(3): 157-172, 2019 03.
Article in English | MEDLINE | ID: mdl-30546107

ABSTRACT

Prokaryotic DNA contains three types of methylation: N6-methyladenine, N4-methylcytosine and 5-methylcytosine. The lack of tools to analyse the frequency and distribution of methylated residues in bacterial genomes has prevented a full understanding of their functions. Now, advances in DNA sequencing technology, including single-molecule, real-time sequencing and nanopore-based sequencing, have provided new opportunities for systematic detection of all three forms of methylated DNA at a genome-wide scale and offer unprecedented opportunities for achieving a more complete understanding of bacterial epigenomes. Indeed, as the number of mapped bacterial methylomes approaches 2,000, increasing evidence supports roles for methylation in regulation of gene expression, virulence and pathogen-host interactions.


Subject(s)
Bacteria , DNA Methylation , DNA, Bacterial , Genome, Bacterial , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Bacteria/genetics , Bacteria/metabolism , DNA, Bacterial/genetics , DNA, Bacterial/metabolism
7.
Genome Res ; 28(7): 1067-1078, 2018 07.
Article in English | MEDLINE | ID: mdl-29764913

ABSTRACT

N6-Methyladenine (m6dA) has been discovered as a novel form of DNA methylation prevalent in eukaryotes; however, methods for high-resolution mapping of m6dA events are still lacking. Single-molecule real-time (SMRT) sequencing has enabled the detection of m6dA events at single-nucleotide resolution in prokaryotic genomes, but its application to detecting m6dA in eukaryotic genomes has not been rigorously examined. Herein, we identified unique characteristics of eukaryotic m6dA methylomes that fundamentally differ from those of prokaryotes. Based on these differences, we describe the first approach for mapping m6dA events using SMRT sequencing specifically designed for the study of eukaryotic genomes and provide appropriate strategies for designing experiments and carrying out sequencing in future studies. We apply the novel approach to study two eukaryotic genomes. For green algae, we construct the first complete genome-wide map of m6dA at single-nucleotide and single-molecule resolution. For human lymphoblastoid cells (hLCLs), it was necessary to integrate SMRT sequencing data with independent sequencing data. The joint analyses suggest putative m6dA events are enriched in the promoters of young full-length LINE-1 elements (L1s), but call for validation by additional methods. These analyses demonstrate a general method for rigorous mapping and characterization of m6dA events in eukaryotic genomes.


Subject(s)
Eukaryota/genetics , Genome/genetics , Cell Line , Chromosome Mapping/methods , DNA Methylation/genetics , Humans , Prokaryotic Cells/metabolism , Promoter Regions, Genetic/genetics , Sequence Analysis, DNA/methods
8.
Nat Biotechnol ; 36(1): 61-69, 2018 01.
Article in English | MEDLINE | ID: mdl-29227468

ABSTRACT

Shotgun metagenomics methods enable characterization of microbial communities in human microbiome and environmental samples. Assembly of metagenome sequences does not output whole genomes, so computational binning methods have been developed to cluster sequences into genome 'bins'. These methods exploit sequence composition, species abundance, or chromosome organization but cannot fully distinguish closely related species and strains. We present a binning method that incorporates bacterial DNA methylation signatures, which are detected using single-molecule real-time sequencing. Our method takes advantage of these endogenous epigenetic barcodes to resolve individual reads and assembled contigs into species- and strain-level bins. We validate our method using synthetic and real microbiome sequences. In addition to genome binning, we show that our method links plasmids and other mobile genetic elements to their host species in a real microbiome sample. Incorporation of DNA methylation information into shotgun metagenomics analyses will complement existing methods to enable more accurate sequence binning.


Subject(s)
DNA Methylation/genetics , Metagenome/genetics , Metagenomics , Microbiota/genetics , Algorithms , Bacteria/genetics , Cluster Analysis , Environmental Microbiology , Genome, Bacterial/genetics , Humans , Plasmids/genetics , Sequence Analysis, DNA
9.
Nat Commun ; 6: 7438, 2015 Jun 15.
Article in English | MEDLINE | ID: mdl-26074426

ABSTRACT

Beyond its role in host defense, bacterial DNA methylation also plays important roles in the regulation of gene expression, virulence and antibiotic resistance. Bacterial cells in a clonal population can generate epigenetic heterogeneity to increase population-level phenotypic plasticity. Single molecule, real-time (SMRT) sequencing enables the detection of N6-methyladenine and N4-methylcytosine, two major types of DNA modifications comprising the bacterial methylome. However, existing SMRT sequencing-based methods for studying bacterial methylomes rely on a population-level consensus that lacks the single-cell resolution required to observe epigenetic heterogeneity. Here, we present SMALR (single-molecule modification analysis of long reads), a novel framework for single molecule-level detection and phasing of DNA methylation. Using seven bacterial strains, we show that SMALR yields significantly improved resolution and reveals distinct types of epigenetic heterogeneity. SMALR is a powerful new tool that enables de novo detection of epigenetic heterogeneity and empowers investigation of its functions in bacterial populations.


Subject(s)
DNA Methylation , DNA, Bacterial/metabolism , Epigenesis, Genetic , Base Sequence , Campylobacter jejuni/genetics , Caulobacter crescentus/genetics , Chromohalobacter/genetics , DNA Modification Methylases/genetics , DNA Modification Methylases/metabolism , Escherichia coli/genetics , Genome, Bacterial , Geobacter/genetics , Helicobacter pylori/genetics , Molecular Sequence Data , Sequence Analysis, DNA
10.
mBio ; 4(4)2013 Jul 02.
Article in English | MEDLINE | ID: mdl-23820394

ABSTRACT

UNLABELLED: Prior to the epidemic that emerged in Haiti in October of 2010, cholera had not been documented in this country. After its introduction, a strain of Vibrio cholerae O1 spread rapidly throughout Haiti, where it caused over 600,000 cases of disease and >7,500 deaths in the first two years of the epidemic. We applied whole-genome sequencing to a temporal series of V. cholerae isolates from Haiti to gain insight into the mode and tempo of evolution in this isolated population of V. cholerae O1. Phylogenetic and Bayesian analyses supported the hypothesis that all isolates in the sample set diverged from a common ancestor within a time frame that is consistent with epidemiological observations. A pangenome analysis showed nearly homogeneous genomic content, with no evidence of gene acquisition among Haiti isolates. Nine nearly closed genomes assembled from continuous-long-read data showed evidence of genome rearrangements and supported the observation of no gene acquisition among isolates. Thus, intrinsic mutational processes can account for virtually all of the observed genetic polymorphism, with no demonstrable contribution from horizontal gene transfer (HGT). Consistent with this, the 12 Haiti isolates tested by laboratory HGT assays were severely impaired for transformation, although unlike previously characterized noncompetent V. cholerae isolates, each expressed hapR and possessed a functional quorum-sensing system. Continued monitoring of V. cholerae in Haiti will illuminate the processes influencing the origin and fate of genome variants, which will facilitate interpretation of genetic variation in future epidemics. IMPORTANCE: Vibrio cholerae is the cause of substantial morbidity and mortality worldwide, with over three million cases of disease each year. An understanding of the mode and rate of evolutionary change is critical for proper interpretation of genome sequence data and attribution of outbreak sources. The Haiti epidemic provides an unprecedented opportunity to study an isolated, single-source outbreak of Vibrio cholerae O1 over an established time frame. By using multiple approaches to assay genetic variation, we found no evidence that the Haiti strain has acquired any genes by horizontal gene transfer, an observation that led us to discover that it is also poorly transformable. We have found no evidence that environmental strains have played a role in the evolution of the outbreak strain.


Subject(s)
Cholera/epidemiology , Cholera/microbiology , Epidemics , Evolution, Molecular , Genome, Bacterial , Vibrio cholerae O1/genetics , Vibrio cholerae O1/isolation & purification , DNA, Bacterial/chemistry , DNA, Bacterial/genetics , Gene Order , Haiti/epidemiology , Humans , Mutation , Sequence Analysis, DNA , Vibrio cholerae O1/classification
11.
Genome Res ; 21(7): 1008-16, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21602305

ABSTRACT

To map the genetics of gene expression in metabolically relevant tissues and investigate the diversity of expression SNPs (eSNPs) in multiple tissues from the same individual, we collected four tissues from approximately 1000 patients undergoing Roux-en-Y gastric bypass (RYGB) and clinical traits associated with their weight loss and co-morbidities. We then performed high-throughput genotyping and gene expression profiling and carried out a genome-wide association analyses for more than 100,000 gene expression traits representing four metabolically relevant tissues: liver, omental adipose, subcutaneous adipose, and stomach. We successfully identified 24,531 eSNPs corresponding to about 10,000 distinct genes. This represents the greatest number of eSNPs identified to our knowledge by any study to date and the first study to identify eSNPs from stomach tissue. We then demonstrate how these eSNPs provide a high-quality disease map for each tissue in morbidly obese patients to not only inform genetic associations identified in this cohort, but in previously published genome-wide association studies as well. These data can aid in elucidating the key networks associated with morbid obesity, response to RYGB, and disease as a whole.


Subject(s)
Gastric Mucosa/metabolism , Liver/metabolism , Obesity, Morbid/epidemiology , Obesity, Morbid/genetics , Adiposity/genetics , Adult , Cohort Studies , Comorbidity , Databases, Genetic , Female , Gastric Bypass , Gene Expression Profiling , Genome-Wide Association Study/methods , Genotype , Humans , Male , Middle Aged , Obesity, Morbid/surgery , Polymorphism, Single Nucleotide , Weight Loss
12.
J Clin Invest ; 120(7): 2414-22, 2010 Jul.
Article in English | MEDLINE | ID: mdl-20577049

ABSTRACT

Atherosclerosis represents the most significant risk factor for coronary artery disease (CAD), the leading cause of death in developed countries. To better understand the pathogenesis of atherosclerosis, we applied a likeli-hood-based model selection method to infer gene-disease causality relationships for the aortic lesion trait in a segregating mouse population demonstrating a spectrum of susceptibility to developing atherosclerotic lesions. We identified 292 genes that tested causal for aortic lesions from liver and adipose tissues of these mice, and we experimentally validated one of these candidate causal genes, complement component 3a receptor 1 (C3ar1), using a knockout mouse model. We also found that genes identified by this method overlapped with genes progressively regulated in the aortic arches of 2 mouse models of atherosclerosis during atherosclerotic lesion development. By comparing our gene set with findings from public human genome-wide association studies (GWAS) of CAD and related traits, we found that 5 genes identified by our study overlapped with published studies in humans in which they were identified as risk factors for multiple atherosclerosis-related pathologies, including myocardial infarction, serum uric acid levels, mean platelet volume, aortic root size, and heart failure. Candidate causal genes were also found to be enriched with CAD risk polymorphisms identified by the Wellcome Trust Case Control Consortium (WTCCC). Our findings therefore validate the ability of causality testing procedures to provide insights into the mechanisms underlying atherosclerosis development.


Subject(s)
Atherosclerosis/genetics , Coronary Artery Disease/genetics , Coronary Disease/genetics , Myocardial Infarction/genetics , Adipose Tissue , Animals , Aorta , Genes , Genome-Wide Association Study , Humans , Liver , Mice , Mice, Knockout , Phenotype , Polymorphism, Genetic , Risk Factors
13.
PLoS Genet ; 6(5): e1000932, 2010 May 06.
Article in English | MEDLINE | ID: mdl-20463879

ABSTRACT

Genome-wide association studies (GWAS) have demonstrated the ability to identify the strongest causal common variants in complex human diseases. However, to date, the massive data generated from GWAS have not been maximally explored to identify true associations that fail to meet the stringent level of association required to achieve genome-wide significance. Genetics of gene expression (GGE) studies have shown promise towards identifying DNA variations associated with disease and providing a path to functionally characterize findings from GWAS. Here, we present the first empiric study to systematically characterize the set of single nucleotide polymorphisms associated with expression (eSNPs) in liver, subcutaneous fat, and omental fat tissues, demonstrating these eSNPs are significantly more enriched for SNPs that associate with type 2 diabetes (T2D) in three large-scale GWAS than a matched set of randomly selected SNPs. This enrichment for T2D association increases as we restrict to eSNPs that correspond to genes comprising gene networks constructed from adipose gene expression data isolated from a mouse population segregating a T2D phenotype. Finally, by restricting to eSNPs corresponding to genes comprising an adipose subnetwork strongly predicted as causal for T2D, we dramatically increased the enrichment for SNPs associated with T2D and were able to identify a functionally related set of diabetes susceptibility genes. We identified and validated malic enzyme 1 (Me1) as a key regulator of this T2D subnetwork in mouse and provided support for the association of this gene to T2D in humans. This integration of eSNPs and networks provides a novel approach to identify disease susceptibility networks rather than the single SNPs or genes traditionally identified through GWAS, thereby extracting additional value from the wealth of data currently being generated by GWAS.


Subject(s)
Adipose Tissue/metabolism , Diabetes Mellitus, Type 2/genetics , Gene Expression , Genome-Wide Association Study , Liver/metabolism , Polymorphism, Single Nucleotide , Animals , Cohort Studies , DNA-Binding Proteins/genetics , DNA-Binding Proteins/metabolism , Diabetes Mellitus, Type 2/metabolism , Female , Humans , Male , Mice , Mice, Inbred C57BL , Mice, Knockout , Mice, Obese , Transcription Factors/genetics , Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...