Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
1.
Sci Rep ; 14(1): 7028, 2024 03 25.
Article in English | MEDLINE | ID: mdl-38528062

ABSTRACT

Accurate indel calling plays an important role in precision medicine. A benchmarking indel set is essential for thoroughly evaluating the indel calling performance of bioinformatics pipelines. A reference sample with a set of known-positive variants was developed in the FDA-led Sequencing Quality Control Phase 2 (SEQC2) project, but the known indels in the known-positive set were limited. This project sought to provide an enriched set of known indels that would be more translationally relevant by focusing on additional cancer related regions. A thorough manual review process completed by 42 reviewers, two advisors, and a judging panel of three researchers significantly enriched the known indel set by an additional 516 indels. The extended benchmarking indel set has a large range of variant allele frequencies (VAFs), with 87% of them having a VAF below 20% in reference Sample A. The reference Sample A and the indel set can be used for comprehensive benchmarking of indel calling across a wider range of VAF values in the lower range. Indel length was also variable, but the majority were under 10 base pairs (bps). Most of the indels were within coding regions, with the remainder in the gene regulatory regions. Although high confidence can be derived from the robust study design and meticulous human review, this extensive indel set has not undergone orthogonal validation. The extended benchmarking indel set, along with the indels in the previously published known-positive set, was the truth set used to benchmark indel calling pipelines in a community challenge hosted on the precisionFDA platform. This benchmarking indel set and reference samples can be utilized for a comprehensive evaluation of indel calling pipelines. Additionally, the insights and solutions obtained during the manual review process can aid in improving the performance of these pipelines.


Subject(s)
Benchmarking , High-Throughput Nucleotide Sequencing , Humans , Computational Biology , Quality Control , INDEL Mutation , Polymorphism, Single Nucleotide
2.
Genes (Basel) ; 13(3)2022 03 21.
Article in English | MEDLINE | ID: mdl-35328106

ABSTRACT

Impairing reconsolidation may disrupt drug memories to prevent relapse, meanwhile long-term transcription regulations in the brain regions contribute to the occurrence of emotional memories. The basolateral amygdala (BLA) is involved in the drug-cue association, while the nucleus accumbens (NAc) responds to the drug reward. Here, we assessed whether DNA methyltransferases (Dnmts) in these two brain regions function identically in the reconsolidation of morphine reward memory. We show that Dnmts inhibition in the BLA but not in the NAc after memory retrieval impaired reconsolidation of a morphine reward memory. Moreover, the mRNA levels of Dnmt3a and Dnmt3b, rather than Dnmt1, in the BLA were continuously upregulated after retrieval. We further identified the differentially methylated regions (DMRs) in genes in the BLA after retrieval, and focused on the DMRs located in gene promoter regions. Among them were three genes (Gnas, Sox10, and Pik3r1) involved in memory modulation. Furthermore, Gnas promoter hypermethylation was confirmed to be inversely correlated with the downregulation of Gnas mRNA levels. The findings indicate that the specific transcription regulation mechanism in the BLA and NAc on reconsolidation of opiate-associated memories can be dissociable, and DNA hypermethylation of Gnas in the BLA is necessary for the reconsolidation of morphine reward memories.


Subject(s)
Basolateral Nuclear Complex , Morphine , Amygdala , Animals , DNA , Morphine/pharmacology , Promoter Regions, Genetic , RNA, Messenger , Rats , Rats, Sprague-Dawley , Reward
3.
Brief Bioinform ; 22(6)2021 11 05.
Article in English | MEDLINE | ID: mdl-34058751

ABSTRACT

Genetic and epigenetic contributions to various diseases and biological processes have been well-recognized. However, simultaneous identification of single-nucleotide variants (SNVs) and DNA methylation levels from traditional bisulfite sequencing data is still challenging. Here, we develop double strand bisulfite sequencing (DSBS) for genome-wide accurate identification of SNVs and DNA methylation simultaneously at a single-base resolution by using one dataset. Locking Watson and Crick strand together by hairpin adapter followed by bisulfite treatment and massive parallel sequencing, DSBS simultaneously sequences the bisulfite-converted Watson and Crick strand in one paired-end read, eliminating the strand bias of bisulfite sequencing data. Mutual correction of read1 and read2 can estimate the amplification and sequencing errors, and enables our developed computational pipeline, DSBS Analyzer (https://github.com/tianguolangzi/DSBS), to accurately identify SNV and DNA methylation. Additionally, using DSBS, we provide a genome-wide hemimethylation landscape in the human cells, and reveal that the density of DNA hemimethylation sites in promoter region and CpG island is lower than that in other genomic regions. The cost-effective new approach, which decodes DNA methylome and genomic variants simultaneously, will facilitate more comprehensive studies on numerous diseases and biological processes driven by both genetic and epigenetic variations.


Subject(s)
Computational Biology/methods , DNA Methylation , Epigenomics/methods , Sequence Analysis, DNA , Software , Sulfites , CpG Islands , Epigenesis, Genetic , Genetic Background , Genetics, Population , Genomics , Polymorphism, Single Nucleotide , Whole Genome Sequencing
4.
Genome Biol ; 22(1): 109, 2021 04 16.
Article in English | MEDLINE | ID: mdl-33863344

ABSTRACT

BACKGROUND: Targeted sequencing using oncopanels requires comprehensive assessments of accuracy and detection sensitivity to ensure analytical validity. By employing reference materials characterized by the U.S. Food and Drug Administration-led SEquence Quality Control project phase2 (SEQC2) effort, we perform a cross-platform multi-lab evaluation of eight Pan-Cancer panels to assess best practices for oncopanel sequencing. RESULTS: All panels demonstrate high sensitivity across targeted high-confidence coding regions and variant types for the variants previously verified to have variant allele frequency (VAF) in the 5-20% range. Sensitivity is reduced by utilizing VAF thresholds due to inherent variability in VAF measurements. Enforcing a VAF threshold for reporting has a positive impact on reducing false positive calls. Importantly, the false positive rate is found to be significantly higher outside the high-confidence coding regions, resulting in lower reproducibility. Thus, region restriction and VAF thresholds lead to low relative technical variability in estimating promising biomarkers and tumor mutational burden. CONCLUSION: This comprehensive study provides actionable guidelines for oncopanel sequencing and clear evidence that supports a simplified approach to assess the analytical performance of oncopanels. It will facilitate the rapid implementation, validation, and quality control of oncopanels in clinical use.


Subject(s)
Biomarkers, Tumor , Genetic Testing/methods , Genomics/methods , Neoplasms/genetics , Oncogenes , DNA Copy Number Variations , Genetic Testing/standards , Genomics/standards , Humans , Molecular Diagnostic Techniques/methods , Molecular Diagnostic Techniques/standards , Mutation , Neoplasms/diagnosis , Polymorphism, Single Nucleotide , Reproducibility of Results , Sensitivity and Specificity
5.
Theranostics ; 10(7): 3035-3048, 2020.
Article in English | MEDLINE | ID: mdl-32194853

ABSTRACT

Background: Esophageal squamous cell carcinoma (ESCC), one of the leading causes of cancer mortality worldwide, is a heterogeneous cancer with diverse clinical manifestations. However, little is known about the epigenetic heterogeneity and its clinical relevance for this prevalent cancer. Methods: We generated 7.56 Tb single-base resolution whole-genome bisulfite sequencing data for 84 ESCC and paired paraneoplastic tissues. The analysis identified inter- and intratumor DNA methylation (DNAm) heterogeneity, epigenome-wide DNAm alterations together with the functional regulators involved in the hyper- or hypomethylated regions, and their association with clinical features. We then validated the correlation between the methylation level of specific regions and clinical outcomes of 96 ESCC patients in an independent cohort. Results: ESCC manifested substantial inter- and intratumor DNAm heterogeneity. The high intratumor DNAm heterogeneity was associated with lymph node metastasis and worse overall survival. Interestingly, hypermethylated regions in ESCC were enriched in promoters of numerous transcription factors, and demethylated noncoding regions related to RXR transcription factor binding appeared to contribute to the development of ESCC. Furthermore, we identified numerous DNAm alterations associated with carcinogenesis and lymph node metastasis of ESCC. We also validated three novel prognostic markers for ESCC, including one each in the promoter of CLK1, the 3' untranslated region of ZEB2, and the intergenic locus surrounded by several lncRNAs. Conclusions: This study presents the first population-level resource for dissecting base-resolution DNAm variation in ESCC and provides novel insights into the ESCC pathogenesis and progression, which might facilitate diagnosis and prognosis for this prevalent malignancy.


Subject(s)
Carcinoma, Squamous Cell/secondary , DNA Methylation , DNA, Neoplasm/genetics , Esophageal Neoplasms/genetics , Esophageal Squamous Cell Carcinoma/genetics , Gene Expression Regulation, Neoplastic , Lymphatic Metastasis/genetics , Carcinoma, Squamous Cell/genetics , Disease Progression , Esophageal Squamous Cell Carcinoma/pathology , Gene Dosage , Genetic Heterogeneity , Humans , Kaplan-Meier Estimate , Precision Medicine/methods , Prognosis , Treatment Outcome , Whole Genome Sequencing
7.
Nucleic Acids Res ; 47(W1): W610-W613, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31066442

ABSTRACT

Quality control (QC) for lab-designed primers is crucial for the success of a polymerase chain reaction (PCR). Here, we present MFEprimer-3.0, a functional primer quality control program for checking non-specific amplicons, dimers, hairpins and other parameters. The new features of the current version include: (i) more sensitive binding site search using the updated k-mer algorithm that allows mismatches within the k-mer, except for the first base at the 3' end. The binding sites of each primer with a stable 3' end are listed in the output; (ii) new algorithms for rapidly identifying self-dimers, cross-dimers and hairpins; (iii) the command-line version, which has an added option of JSON output to enhance the versatility of MFEprimer by acting as a QC step in the 'primer design → quality control → redesign' pipeline; (iv) a function for checking whether the binding sites contain single nucleotide polymorphisms (SNPs), which will affect the consistency of binding efficiency among different samples. In summary, MFEprimer-3.0 is updated with the well-tested PCR primer QC program and it can be integrated into various PCR primer design applications as a QC module. The MFEprimer-3.0 server is freely accessible without any login requirement at: https://mfeprimer3.igenetech.com/ and https://www.mfeprimer.com/. The source code for the command-line version is available upon request.


Subject(s)
DNA Primers/standards , Polymerase Chain Reaction/standards , Software , Algorithms , Base Pair Mismatch , Binding Sites , DNA Primers/chemistry , Genome, Human , Humans , Multiplex Polymerase Chain Reaction/standards , Quality Control , Sequence Analysis
8.
J Genet Genomics ; 45(10): 527-538, 2018 10 20.
Article in English | MEDLINE | ID: mdl-30392784

ABSTRACT

Autism spectrum disorder (ASD) is a neurodevelopmental disorder with considerable clinical and genetic heterogeneity. In this study, we identified all classes of genomic variants from whole-genome sequencing (WGS) dataset of 32 Chinese trios with ASD, including de novo mutations, inherited variants, copy number variants (CNVs) and genomic structural variants. A higher mutation rate (Poisson test, P < 2.2 × 10-16) in exonic (1.37 × 10-8) and 3'-UTR regions (1.42 × 10-8) was revealed in comparison with that of whole genome (1.05 × 10-8). Using an integrated model, we identified 87 potentially risk genes (P < 0.01) from 4832 genes harboring various rare deleterious variants, including CHD8 and NRXN2, implying that the disorders may be in favor to multiple-hit. In particular, frequent rare inherited mutations of several microcephaly-associated genes (ASPM, WDR62, and ZNF335) were found in ASD. In chromosomal structure analyses, we found four de novo CNVs and one de novo chromosomal rearrangement event, including a de novo duplication of UBE3A-containing region at 15q11.2-q13.1, which causes Angelman syndrome and microcephaly, and a disrupted TNR due to de novo chromosomal translocation t(1; 5)(q25.1; q33.2). Taken together, our results suggest that abnormalities of centrosomal function and chromatin remodeling of the microcephaly-associated genes may be implicated in pathogenesis of ASD. Adoption of WGS as a new yet efficient technique to illustrate the full genetic spectrum in complex disorders, such as ASD, could provide novel insights into pathogenesis, diagnosis and treatment.


Subject(s)
Asian People/genetics , Autism Spectrum Disorder/genetics , 3' Untranslated Regions , Adolescent , Adult , Cell Cycle Proteins , Child , Child, Preschool , China , DNA Copy Number Variations , DNA-Binding Proteins/genetics , Female , Genetic Predisposition to Disease , Humans , Male , Mutation , Nerve Tissue Proteins/genetics , Transcription Factors/genetics , Whole Genome Sequencing , Young Adult
9.
Article in English | MEDLINE | ID: mdl-29411640

ABSTRACT

We investigated all coding regions of 17 known amyotrophic lateral sclerosis (ALS)-related genes in 311 sporadic ALS patients who were of Chinese ancestry using next-generation sequencing technology. All nonsynonymous variants identified were confirmed by Sanger sequencing. 29 (9.32%) patients harbored at least one pathogenic or likely pathogenic variants. Nine (2.8%) patients harbored two or three variants which frequency <1% in population databases that may be related to oligogenic pathogenesis. A higher allele frequency was observed in East Asian than in European patients for the majority variants identified in this screening, which may indicate that genetic factors are responsible for the different clinical characteristics between Chinese and European ALS patients. Our study reports the results of extensive genetic screening and is the first to investigate the possible oligogenic pathogenesis in Chinese sporadic ALS patients. These findings are useful for exploring ALS pathogenesis and treatment strategies.


Subject(s)
Amyotrophic Lateral Sclerosis/diagnosis , Amyotrophic Lateral Sclerosis/genetics , Genetic Predisposition to Disease , Multifactorial Inheritance/genetics , Adolescent , Adult , Age of Onset , Aged , Aged, 80 and over , Amyotrophic Lateral Sclerosis/epidemiology , Asian People/genetics , Chi-Square Distribution , Female , Gene Frequency , Genetic Association Studies , Humans , Male , Middle Aged , Retrospective Studies , Young Adult
10.
Nucleic Acids Res ; 46(D1): D64-D70, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29059379

ABSTRACT

Circadian rhythms govern various kinds of physiological and behavioral functions of the living organisms, and disruptions of the rhythms are highly detrimental to health. Although several databases have been built for circadian genes, a resource for comprehensive post-transcriptional regulatory information of circadian RNAs and expression patterns of disease-related circadian RNAs is still lacking. Here, we developed CirGRDB (http://cirgrdb.biols.ac.cn) by integrating more than 4936 genome-wide assays, with the aim of fulfilling the growing need to understand the rhythms of life. CirGRDB presents a friendly web interface that allows users to search and browse temporal expression patterns of interested genes in 37 human/mouse tissues or cell lines, and three clinical disorders including sleep disorder, aging and tumor. More importantly, eight kinds of potential transcriptional and post-transcriptional regulators involved in the rhythmic expression of the specific genes, including transcription factors, histone modifications, chromatin accessibility, enhancer RNAs, miRNAs, RNA-binding proteins, RNA editing and RNA methylation, can also be retrieved. Furthermore, a regulatory network could be generated based on the regulatory information. In summary, CirGRDB offers a useful repository for exploring disease-related circadian RNAs, and deciphering the transcriptional and post-transcriptional regulation of circadian rhythms.


Subject(s)
Circadian Rhythm/genetics , Databases, Genetic , Animals , CLOCK Proteins/genetics , Circadian Clocks/genetics , Gene Expression Regulation , Gene Regulatory Networks , Genome , Genome-Wide Association Study , Histone Code , Humans , Internet , Mice , RNA/genetics , RNA/metabolism , RNA Editing , RNA Processing, Post-Transcriptional , User-Computer Interface
11.
J Pathol ; 244(2): 215-226, 2018 02.
Article in English | MEDLINE | ID: mdl-29144541

ABSTRACT

Improvement in the clinical outcome of human cancers requires characterization of the genetic alterations underlying their pathogenesis. Large-scale genomic and transcriptomic characterization of papillary thyroid carcinomas (PTCs) in Western populations has revealed multiple oncogenic drivers which are essential for understanding pathogenic mechanisms of this disease, while, so far, the genetic landscape in Chinese patients with PTC remains uncharacterized. Here, we conducted a large-scale genetic analysis of PTCs from patients in China to determine the mutational landscape of this cancer. By performing targeted DNA amplicon and targeted RNA deep-sequencing, we elucidated the landscape of somatic genetic alterations in 355 Chinese patients with PTC. A total of 88.7% of PTCs were found to harbor at least one candidate oncogenic driver genetic alteration. Among them, around 72.4% of the cases carried BRAF mutations; 2.8% of cases harbored RAS mutations; and 13.8% of cases were characterized with in-frame gene fusions, including seven newly identified kinase gene fusions. TERT promoter mutations were likely to occur in a sub-clonal manner in our PTC cohort. The prevalence of somatic genetic alterations in PTC was significantly different between our Chinese cohort and TCGA datasets for American patients. Additionally, combined analyses of genetic alterations and clinicopathologic features demonstrated that kinase gene fusion was associated with younger age at diagnosis, larger tumor size, and lymph node metastasis in PTC. With the analyses of DNA rearrangement sites of RET gene fusions in PTC, signatures of chromosome translocations related to RET fusion events were also depicted. Collectively, our results provide fundamental insight into the pathogenesis of PTC in the Chinese population. Copyright © 2017 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.


Subject(s)
Asian People/genetics , Biomarkers, Tumor/genetics , Genetic Heterogeneity , Thyroid Cancer, Papillary/genetics , Thyroid Neoplasms/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Child , China/epidemiology , Female , Gene Fusion , Gene Rearrangement , Genes, ras , Genetic Predisposition to Disease , Humans , Male , Middle Aged , Mutation , Phenotype , Proto-Oncogene Proteins B-raf/genetics , Proto-Oncogene Proteins c-ret/genetics , Telomerase/genetics , Thyroid Cancer, Papillary/ethnology , Thyroid Cancer, Papillary/pathology , Thyroid Neoplasms/ethnology , Thyroid Neoplasms/pathology , Young Adult
12.
Int J Biol Sci ; 13(7): 923-934, 2017.
Article in English | MEDLINE | ID: mdl-28808424

ABSTRACT

Oxidative stress is considered to be a key risk state for a variety of human diseases. In response to oxidative stress, the regulation of transcriptional expression of DNA repair genes would be important to DNA repair and genomic stability. However, the overall pattern of transcriptional expression of DNA repair genes and the underlying molecular response mechanism to oxidative stress remain unclear. Here, by employing colorectal cancer cell lines following exposure to hydrogen peroxide, we generated expression profiles of DNA repair genes via RNA-seq and identified gene subsets that are induced or repressed following oxidative stress exposure. RRBS-seq analyses further indicated that transcriptional regulation of most of the DNA repair genes that were induced or repressed is independent of their DNA methylation status. Our analyses also indicate that hydrogen peroxide induces deacetylase SIRT1 which decreases chromatin affinity and the activity of histone acetyltransferase hMOF toward H4K16ac and results in decreased transcriptional expression of DNA repair genes. Taken together, our findings provide a potential mechanism by which oxidative stress suppresses DNA repair genes which is independent of the DNA methylation status of their promoters.


Subject(s)
DNA Repair , Histone Acetyltransferases/metabolism , Histones/metabolism , Oxidative Stress/physiology , RNA/chemistry , Colorectal Neoplasms , Gene Expression Regulation/drug effects , Gene Expression Regulation/physiology , HCT116 Cells , Histone Acetyltransferases/genetics , Histones/genetics , Humans , Hydrogen Peroxide/toxicity , Nucleic Acid Amplification Techniques , Promoter Regions, Genetic , RNA/metabolism , Sirtuin 1/genetics , Sirtuin 1/metabolism
13.
Mol Biol Evol ; 34(9): 2214-2228, 2017 09 01.
Article in English | MEDLINE | ID: mdl-28482038

ABSTRACT

Murine rodents are excellent models for study of adaptive radiations and speciation. Brown Norway rats (Rattus norvegicus) are successful global colonizers and the contributions of their domesticated laboratory strains to biomedical research are well established. To identify nucleotide-based speciation timing of the rat and genomic information contributing to its colonization capabilities, we analyzed 51 whole-genome sequences of wild-derived Brown Norway rats and their sibling species, R. nitidus, and identified over 20 million genetic variants in the wild Brown Norway rats that were absent in the laboratory strains, which substantially expand the reservoir of rat genetic diversity. We showed that divergence of the rat and its siblings coincided with drastic climatic changes that occurred during the Middle Pleistocene. Further, we revealed that there was a geographically widespread influx of genes between Brown Norway rats and the sibling species following the divergence, resulting in numerous introgressed regions in the genomes of admixed Brown Norway rats. Intriguing, genes related to chemical communications among these introgressed regions appeared to contribute to the population-specific adaptations of the admixed Brown Norway rats. Our data reveals evolutionary history of the Brown Norway rat, and offers new insights into the role of climatic changes in speciation of animals and the effect of interspecies introgression on animal adaptation.


Subject(s)
Metagenomics/methods , Rats/genetics , Animals , Biological Evolution , Evolution, Molecular , Genetic Speciation , Genetic Variation , Genome , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Phylogeny , Phylogeography/methods , Polymorphism, Single Nucleotide , Sequence Analysis, DNA
14.
Nucleic Acids Res ; 45(2): 672-684, 2017 01 25.
Article in English | MEDLINE | ID: mdl-27733505

ABSTRACT

The Ten Eleven Translocation 1 (TET1) protein is a DNA demethylase that regulates gene expression through altering statue of DNA methylation. However, recent studies have demonstrated that TET1 could modulate transcriptional expression independent of its DNA demethylation activity; yet, the detailed mechanisms underlying TET1's role in such transcriptional regulation remain not well understood. Here, we uncovered that Tet1 formed a chromatin complex with histone acetyltransferase Mof and scaffold protein Sin3a in mouse embryonic stem cells by integrative genomic analysis using publicly available ChIP-seq data sets and a series of in vitro biochemical studies in human cell lines. Mechanistically, the TET1 facilitated chromatin affinity and enzymatic activity of hMOF against acetylation of histone H4 at lysine 16 via preventing auto-acetylation of hMOF, to regulate expression of the downstream genes, including DNA repair genes. We found that Tet1 knockout MEF cells exhibited an accumulation of DNA damage and genomic instability and Tet1 deficient mice were more sensitive to x-ray exposure. Taken together, our findings reveal that TET1 forms a complex with hMOF to modulate its function and the level of H4K16Ac ultimately affect gene expression and DNA repair.


Subject(s)
DNA Repair , DNA-Binding Proteins/metabolism , Gene Expression Regulation , Histone Acetyltransferases/metabolism , Histones/metabolism , Proto-Oncogene Proteins/metabolism , Acetylation , Animals , Cell Line , Chromatin Immunoprecipitation , Genomic Instability , Genomics/methods , High-Throughput Nucleotide Sequencing , Humans , Mice , Mice, Knockout , Mouse Embryonic Stem Cells/metabolism , Repressor Proteins/metabolism , Sin3 Histone Deacetylase and Corepressor Complex
15.
Nucleic Acids Res ; 44(D1): D154-63, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26635394

ABSTRACT

Transcription factors bind to the genome by forming specific contacts with the primary DNA sequence; however, RNA-binding proteins (RBPs) have greater scope to achieve binding specificity through the RNA secondary structure. It has been revealed that single nucleotide variants (SNVs) that alter RNA structure, also known as RiboSNitches, exhibit 3-fold greater local structure changes than replicates of the same DNA sequence, demonstrated by the fact that depletion of RiboSNitches could result in the alteration of specific RNA shapes at thousands of sites, including 3' UTRs, binding sites of microRNAs and RBPs. However, the network between SNVs and post-transcriptional regulation remains unclear. Here, we developed RBP-Var, a database freely available at http://www.rbp-var.biols.ac.cn/, which provides annotation of functional variants involved in post-transcriptional interaction and regulation. RBP-Var provides an easy-to-use web interface that allows users to rapidly find whether SNVs of interest can transform the secondary structure of RNA and identify RBPs whose binding may be subsequently disrupted. RBP-Var integrates DNA and RNA biology to understand how various genetic variants and post-transcriptional mechanisms cooperate to orchestrate gene expression. In summary, RBP-Var is useful in selecting candidate SNVs for further functional studies and exploring causal SNVs underlying human diseases.


Subject(s)
Databases, Genetic , RNA-Binding Proteins/metabolism , RNA/chemistry , RNA/metabolism , Gene Expression Regulation , Genetic Variation , Humans , Internet , Molecular Sequence Annotation , Nucleic Acid Conformation , Protein Binding
16.
Epigenetics ; 10(9): 775-83, 2015.
Article in English | MEDLINE | ID: mdl-26213102

ABSTRACT

Reduced representation bisulfite sequencing (RRBS) is a powerful method of DNA methylome profiling that can be applied to single cells. However, no previous report has described how PCR-based duplication-induced artifacts affect the accuracy of this method when measuring DNA methylation levels. For quantifying the effects of duplication-induced artifacts on methylome profiling when using ultra-trace amounts of starting material, we developed a novel method, namely quantitative RRBS (Q-RRBS), in which PCR-induced duplication is excluded through the use of unique molecular identifiers (UMIs). By performing Q-RRBS on varying amounts of starting material, we determined that duplication-induced artifacts were more severe when small quantities of the starting material were used. However, through using the UMIs, we successfully eliminated these artifacts. In addition, Q-RRBS could accurately detect allele-specific methylation in absence of allele-specific genetic variants. Our results demonstrate that Q-RRBS is an optimal strategy for DNA methylation profiling of single cells or samples containing ultra-trace amounts of cells.


Subject(s)
DNA Methylation , High-Throughput Nucleotide Sequencing/methods , Single-Cell Analysis/methods , Animals , CpG Islands , Humans , Polymerase Chain Reaction/methods
17.
J Mol Cell Biol ; 7(4): 299-313, 2015 Aug.
Article in English | MEDLINE | ID: mdl-26078362

ABSTRACT

Organisms and cells, in response to environmental influences or during development, undergo considerable changes in DNA methylation on a genome-wide scale, which are linked to a variety of biological processes. Using MethylC-seq to decipher DNA methylome at single-base resolution is prohibitively costly. In this study, we develop a novel approach, named MBRidge, to detect the methylation levels of repertoire CpGs, by innovatively introducing C-hydroxylmethylated adapters and bisulfate treatment into the MeDIP-seq protocol and employing ridge regression in data analysis. A systematic evaluation of DNA methylome in a human ovarian cell line T29 showed that MBRidge achieved high correlation (R > 0.90) with much less cost (∼10%) in comparison with MethylC-seq. We further applied MBRidge to profiling DNA methylome in T29H, an oncogenic counterpart of T29's. By comparing methylomes of T29H and T29, we identified 131790 differential methylation regions (DMRs), which are mainly enriched in carcinogenesis-related pathways. These are substantially different from 7567 DMRs that were obtained by RRBS and related with cell development or differentiation. The integrated analysis of DMRs in the promoter and expression of DMR-corresponding genes revealed that DNA methylation enforced reverse regulation of gene expression, depending on the distance from the proximal DMR to transcription starting sites in both mRNA and lncRNA. Taken together, our results demonstrate that MBRidge is an efficient and cost-effective method that can be widely applied to profiling DNA methylomes.


Subject(s)
DNA Methylation/genetics , Sequence Analysis, DNA/economics , Sequence Analysis, DNA/methods , Software , Base Sequence , Calibration , Cell Line, Tumor , Cost-Benefit Analysis , Female , Gene Expression Regulation, Neoplastic , Humans , Models, Biological , Ovarian Neoplasms/genetics , Regression Analysis , Reproducibility of Results
18.
Proc Natl Acad Sci U S A ; 112(26): 8064-9, 2015 Jun 30.
Article in English | MEDLINE | ID: mdl-26077908

ABSTRACT

Spondylolysis is a fracture in part of the vertebra with a reported prevalence of about 3-6% in the general population. Genetic etiology of this disorder remains unknown. The present study was aimed at identifying genomic mutations in patients with dysplastic spondylolysis as well as the potential pathogenesis of the abnormalities. Whole-exome sequencing and functional analysis were performed for patients with spondylolysis. We identified a novel heterozygous mutation (c.2286A > T; p.D673V) in the sulfate transporter gene SLC26A2 in five affected subjects of a Chinese family. Two additional mutations (e.g., c.1922A > G; p.H641R and g.18654T > C in the intron 1) in the gene were identified by screening a cohort of 30 unrelated patients with the disease. In situ hybridization analysis showed that SLC26A2 is abundantly expressed in the lumbosacral spine of the mouse embryo at day 14.5. Sulfate uptake activities in CHO cells transfected with mutant SLC26A2 were dramatically reduced compared with the wild type, confirming the pathogenicity of the two missense mutations. Further analysis of the gene-disease network revealed a convergent pathogenic network for the development of lumbosacral spine. To our knowledge, our findings provide the first identification of autosomal dominant SLC26A2 mutations in patients with dysplastic spondylolysis, suggesting a new clinical entity in the pathogenesis of chondrodysplasia involving lumbosacral spine. The analysis of the gene-disease network may shed new light on the study of patients with dysplastic spondylolysis and spondylolisthesis as well as high-risk individuals who are asymptomatic.


Subject(s)
Anion Transport Proteins/genetics , Mutation , Spondylolysis/genetics , Adult , Aged , Amino Acid Sequence , Animals , Anion Transport Proteins/chemistry , Female , Humans , In Situ Hybridization , Male , Middle Aged , Molecular Sequence Data , Pedigree , Sequence Homology, Amino Acid , Spondylolysis/physiopathology , Sulfate Transporters
19.
Genet Med ; 17(12): 971-9, 2015 Dec.
Article in English | MEDLINE | ID: mdl-25741867

ABSTRACT

PURPOSE: Genetic etiology of congenital/infantile nystagmus remains largely unknown. This study aimed to identify genomic mutations in patients with infantile nystagmus and an associated disease network. METHODS: Patients with inherited and sporadic infantile nystagmus were recruited for whole-exome and Sanger sequencing. ß-Mannosidase activities were measured. Gene expression, protein-protein interaction, and nystagmus-associated lysosomal storage disease (LSD) genes were analyzed. RESULTS: A novel heterozygous mutation (c.2013G>A; p.R638H) of MANBA, which encodes lysosomal ß-mannosidase, was identified in patients with autosomal-dominant nystagmus. An additional mutation (c.2346T>A; p.L749H) in MANBA was found by screening patients with sporadic nystagmus. MANBA was expressed in the pretectal nucleus of the developing midbrain, known to be involved in oculomotor and optokinetic nystagmus. Functional validation of these mutations demonstrated a significant decrease of ß-mannosidase activities in the patients as well as in mutant-transfected HEK293T cells. Further analysis revealed that nystagmus is present in at least 24 different LSDs involving the brain. CONCLUSION: This is the first identification of MANBA mutations in patients with autosomal-dominant nystagmus, suggesting a new clinical entity. Because ß-mannosidase activities are required for development of the oculomotor nervous system, our findings shed new light on the role of LSD-associated genes in the pathogenesis of infantile nystagmus.


Subject(s)
Mutation , Nystagmus, Congenital/genetics , beta-Mannosidase/genetics , High-Throughput Nucleotide Sequencing , Humans , Lysosomal Storage Diseases/genetics , Nystagmus, Congenital/enzymology , Nystagmus, Congenital/physiopathology
20.
J Genet Genomics ; 41(10): 513-28, 2014 Oct 20.
Article in English | MEDLINE | ID: mdl-25438696

ABSTRACT

Intensively developed in the last few years, single-cell sequencing technologies now present numerous advantages over traditional sequencing methods for solving the problems of biological heterogeneity and low quantities of available biological materials. The application of single-cell sequencing technologies has profoundly changed our understanding of a series of biological phenomena, including gene transcription, embryo development, and carcinogenesis. However, before single-cell sequencing technologies can be used extensively, researchers face the serious challenge of overcoming inherent issues of high amplification bias, low accuracy and reproducibility. Here, we simply summarize the techniques used for single-cell isolation, and review the current technologies used in single-cell genomic, transcriptomic, and epigenomic sequencing. We discuss the merits, defects, and scope of application of single-cell sequencing technologies and then speculate on the direction of future developments.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA , Sequence Analysis, RNA , Single-Cell Analysis/methods , Cell Line , Epigenesis, Genetic , Humans , Reproducibility of Results , Transcriptome
SELECTION OF CITATIONS
SEARCH DETAIL
...