Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 69
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Brief Bioinform ; 24(2)2023 03 19.
Article in English | MEDLINE | ID: mdl-36917170

ABSTRACT

Metagenomic sequencing (mNGS) is a powerful diagnostic tool to detect causative pathogens in clinical microbiological testing owing to its unbiasedness and substantially reduced costs. Rapid and accurate classification of metagenomic sequences is a critical procedure for pathogen identification in dry-lab step of mNGS test. However, clinical practices of the testing technology are hampered by the challenge of classifying sequences within a clinically relevant timeframe. Here, we present GPMeta, a novel GPU-accelerated approach to ultrarapid pathogen identification from complex mNGS data, allowing users to bypass this limitation. Using mock microbial community datasets and public real metagenomic sequencing datasets from clinical samples, we show that GPMeta has not only higher accuracy but also significantly higher speed than existing state-of-the-art tools such as Bowtie2, Bwa, Kraken2 and Centrifuge. Furthermore, GPMeta offers GPMetaC clustering algorithm, a statistical model for clustering and rescoring ambiguous alignments to improve the discrimination of highly homologous sequences from microbial genomes with average nucleotide identity >95%. GPMetaC exhibits higher precision and recall rate than others. GPMeta underlines its key role in the development of the mNGS test in infectious diseases that require rapid turnaround times. Further study will discern how to best and easily integrate GPMeta into routine clinical practices. GPMeta is freely accessible to non-commercial users at https://github.com/Bgi-LUSH/GPMeta.


Subject(s)
Metagenome , Microbiota , High-Throughput Nucleotide Sequencing/methods , Metagenomics/methods , Sensitivity and Specificity
2.
Lancet ; 395(10224): 565-574, 2020 02 22.
Article in English | MEDLINE | ID: mdl-32007145

ABSTRACT

BACKGROUND: In late December, 2019, patients presenting with viral pneumonia due to an unidentified microbial agent were reported in Wuhan, China. A novel coronavirus was subsequently identified as the causative pathogen, provisionally named 2019 novel coronavirus (2019-nCoV). As of Jan 26, 2020, more than 2000 cases of 2019-nCoV infection have been confirmed, most of which involved people living in or visiting Wuhan, and human-to-human transmission has been confirmed. METHODS: We did next-generation sequencing of samples from bronchoalveolar lavage fluid and cultured isolates from nine inpatients, eight of whom had visited the Huanan seafood market in Wuhan. Complete and partial 2019-nCoV genome sequences were obtained from these individuals. Viral contigs were connected using Sanger sequencing to obtain the full-length genomes, with the terminal regions determined by rapid amplification of cDNA ends. Phylogenetic analysis of these 2019-nCoV genomes and those of other coronaviruses was used to determine the evolutionary history of the virus and help infer its likely origin. Homology modelling was done to explore the likely receptor-binding properties of the virus. FINDINGS: The ten genome sequences of 2019-nCoV obtained from the nine patients were extremely similar, exhibiting more than 99·98% sequence identity. Notably, 2019-nCoV was closely related (with 88% identity) to two bat-derived severe acute respiratory syndrome (SARS)-like coronaviruses, bat-SL-CoVZC45 and bat-SL-CoVZXC21, collected in 2018 in Zhoushan, eastern China, but were more distant from SARS-CoV (about 79%) and MERS-CoV (about 50%). Phylogenetic analysis revealed that 2019-nCoV fell within the subgenus Sarbecovirus of the genus Betacoronavirus, with a relatively long branch length to its closest relatives bat-SL-CoVZC45 and bat-SL-CoVZXC21, and was genetically distinct from SARS-CoV. Notably, homology modelling revealed that 2019-nCoV had a similar receptor-binding domain structure to that of SARS-CoV, despite amino acid variation at some key residues. INTERPRETATION: 2019-nCoV is sufficiently divergent from SARS-CoV to be considered a new human-infecting betacoronavirus. Although our phylogenetic analysis suggests that bats might be the original host of this virus, an animal sold at the seafood market in Wuhan might represent an intermediate host facilitating the emergence of the virus in humans. Importantly, structural analysis suggests that 2019-nCoV might be able to bind to the angiotensin-converting enzyme 2 receptor in humans. The future evolution, adaptation, and spread of this virus warrant urgent investigation. FUNDING: National Key Research and Development Program of China, National Major Project for Control and Prevention of Infectious Disease in China, Chinese Academy of Sciences, Shandong First Medical University.


Subject(s)
Betacoronavirus/genetics , Coronavirus Infections/epidemiology , Coronavirus Infections/virology , Genome, Viral , Pneumonia, Viral/epidemiology , Pneumonia, Viral/virology , Receptors, Virus/metabolism , Betacoronavirus/metabolism , Bronchoalveolar Lavage Fluid/virology , COVID-19 , China/epidemiology , Coronavirus Infections/diagnosis , Coronavirus Infections/transmission , DNA, Viral/genetics , Disease Reservoirs/virology , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Humans , Phylogeny , Pneumonia, Viral/diagnosis , Pneumonia, Viral/transmission , SARS-CoV-2 , Sequence Alignment
3.
RNA Biol ; 18(sup1): 172-181, 2021 10 15.
Article in English | MEDLINE | ID: mdl-34459719

ABSTRACT

The high-resolution feature of single-cell transcriptome sequencing technology allows researchers to observe cellular gene expression profiles at the single-cell level, offering numerous possibilities for subsequent biomedical investigation. However, the unavoidable technical impact of high missing values in the gene-cell expression matrices generated by insufficient RNA input severely hampers the accuracy of downstream analysis. To address this problem, it is essential to develop a more rapid and stable imputation method with greater accuracy, which should not only be able to recover the missing data, but also effectively facilitate the following biological mechanism analysis. The existing imputation methods all have their drawbacks and limitations, some require pre-assumed data distribution, some cannot distinguish between technical and biological zeros, and some have poor computational performance. In this paper, we presented a novel imputation software FRMC for single-cell RNA-Seq data, which innovates a fast and accurate singular value thresholding approximation method. The experiments demonstrated that FRMC can not only precisely distinguish 'true zeros' from dropout events and correctly impute missing values attributed to technical noises, but also effectively enhance intracellular and intergenic connections and achieve accurate clustering of cells in biological applications. In summary, FRMC can be a powerful tool for analysing single-cell data because it ensures biological significance, accuracy, and rapidity simultaneously. FRMC is implemented in Python and is freely accessible to non-commercial users on GitHub: https://github.com/HUST-DataMan/FRMC.


Subject(s)
Exome Sequencing/methods , Gene Expression Profiling , RNA-Seq/methods , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Software , Humans
4.
J Transl Med ; 18(1): 199, 2020 05 13.
Article in English | MEDLINE | ID: mdl-32404108

ABSTRACT

BACKGROUND: Accurate etiology diagnosis is crucial for central nervous system infections (CNS infections). The diagnostic value of metagenomic next-generation sequencing (mNGS), an emerging powerful platform, remains to be studied in CNS infections. METHODS: We conducted a single-center prospective cohort study to compare mNGS with conventional methods including culture, smear and etc. 248 suspected CNS infectious patients were enrolled and clinical data were recorded. RESULTS: mNGS reported a 90.00% (9/10) sensitivity in culture-positive patients without empirical treatment and 66.67% (6/9) in empirically-treated patients. Detected an extra of 48 bacteria and fungi in culture-negative patients, mNGS provided a higher detection rate compared to culture in patients with (34.45% vs. 7.56%, McNemar test, p < 0.0083) or without empirical therapy (50.00% vs. 25.00%, McNemar test, p > 0.0083). Compared to conventional methods, positive percent agreement and negative percent agreement was 75.00% and 69.11% separately. mNGS detection rate was significantly higher in patients with cerebrospinal fluid (CSF) WBC > 300 * 106/L, CSF protein > 500 mg/L or glucose ratio ≤ 0.3. mNGS sequencing read is correlated with CSF WBC, glucose ratio levels and clinical disease progression. CONCLUSION: mNGS showed a satisfying diagnostic performance in CNS infections and had an overall superior detection rate to culture. mNGS may held diagnostic advantages especially in empirically treated patients. CSF laboratory results were statistically relevant to mNGS detection rate, and mNGS could dynamically monitor disease progression.


Subject(s)
Central Nervous System Infections , Metagenomics , Adult , Central Nervous System Infections/diagnosis , High-Throughput Nucleotide Sequencing , Humans , Prospective Studies , Sensitivity and Specificity
5.
J Neurovirol ; 26(4): 556-564, 2020 08.
Article in English | MEDLINE | ID: mdl-32572833

ABSTRACT

Pseudorabies virus (PRV) is known to cause severe encephalitis in juvenile pigs and various non-native hosts; recent evidences suggest that PRV might cause encephalitis in humans. In a multicenter cohort study in China, next-generation sequencing of cerebrospinal fluid (CSF) was performed to detect pathogens in all patients with clinically suspected central nervous system infections. This study involved all the patients whose CSF samples were positive for PRV-DNA; their clinical features were evaluated, and species-specific PCR and serological tests were sequentially applied for validation. Among the 472 patients tested from June 1, 2016, to December 1, 2018, six were positive for PRV-DNA, which were partially validated by PCR and serological tests. Additionally, we retrospectively examined another case with similar clinical and neuroimaging appearance and detected the presence of PRV-DNA. These patients had similar clinical manifestations, including a rapid progression of panencephalitis, and similar neuroimaging features of symmetric lesions in the basal ganglia and bilateral hemispheres. Six of the patients were engaged in occupations connected with swine production. PRV infection should be suspected in patients with rapidly progressive panencephalitis and characteristic neuroimaging features, especially with exposure to swine.


Subject(s)
Basal Ganglia/pathology , Cerebrum/pathology , DNA, Viral/genetics , Encephalitis, Viral/pathology , Herpesvirus 1, Suid/genetics , Meat/virology , Pseudorabies/pathology , Adult , Animals , Antibodies, Viral/cerebrospinal fluid , Basal Ganglia/diagnostic imaging , Basal Ganglia/virology , Cerebrum/diagnostic imaging , Cerebrum/virology , China , DNA, Viral/cerebrospinal fluid , Encephalitis, Viral/cerebrospinal fluid , Encephalitis, Viral/diagnosis , Encephalitis, Viral/virology , Female , Herpesvirus 1, Suid/growth & development , Herpesvirus 1, Suid/pathogenicity , High-Throughput Nucleotide Sequencing , Humans , Magnetic Resonance Imaging , Male , Middle Aged , Polymerase Chain Reaction , Pseudorabies/cerebrospinal fluid , Pseudorabies/diagnosis , Pseudorabies/virology , Swine
6.
Emerg Infect Dis ; 24(6): 1087-1090, 2018 06.
Article in English | MEDLINE | ID: mdl-29774834

ABSTRACT

We report human endophthalmitis caused by pseudorabies virus infection after exposure to sewage on a hog farm in China. High-throughput sequencing and real-time PCR of vitreous humor showed pseudorabies virus sequences. This case showed that pseudorabies virus might infect humans after direct contact with contaminants.


Subject(s)
Endophthalmitis/epidemiology , Endophthalmitis/virology , Herpesvirus 1, Suid , Pseudorabies/epidemiology , Pseudorabies/virology , Animals , China/epidemiology , Endophthalmitis/diagnosis , Endophthalmitis/history , Evolution, Molecular , Female , Genes, Viral , History, 21st Century , Humans , Middle Aged , Phylogeny , Pseudorabies/diagnosis , Pseudorabies/history
7.
BMC Infect Dis ; 18(1): 113, 2018 03 06.
Article in English | MEDLINE | ID: mdl-29510659

ABSTRACT

BACKGROUND: Accurate and early diagnosis of neurocysticercosis (NCC) remains a challenge due to the heterogeneity of its clinical, immunological and imaging characteristics. The presence of cysticercus DNA in cerebrospinal fluid (CSF) of NCC patients has been previously detected via conventional PCR assays. To the best of our knowledge, the use of CSF Next-Generation Sequencing (NGS) based pathogen analysis in patients with NCC infection has never been reported. CASE PRESENTATION: This study reports the clinical, imaging, and immunological features of a patient initially presenting with several months of headache who further developed a pure sensory stroke. NGS was used to detect the pathogen, and her CSF demonstrated the presence of Taenia solium-DNA. This finding was confirmed by a positive reaction to CSF cysticercosis antibodies. After antiparasitic treatment, secondary CSF NGS revealed the DNA index have dropped considerably compared to the initial NGS readings. CONCLUSIONS: NGS is a promising tool for the early and accurate diagnosis of central nervous system (CNS) infection, especially in the setting of atypical clinical manifestations. Further studies are required to evaluate the persistence of DNA in the CSF of patients.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Neurocysticercosis/diagnosis , Neurocysticercosis/etiology , Taenia solium/genetics , Adult , Animals , Anthelmintics/administration & dosage , Anthelmintics/therapeutic use , Brain/diagnostic imaging , DNA, Helminth/cerebrospinal fluid , Female , Headache/parasitology , Humans , Magnetic Resonance Imaging , Neurocysticercosis/drug therapy , Praziquantel/administration & dosage , Praziquantel/therapeutic use , Stroke/etiology , Stroke/parasitology , Taenia solium/pathogenicity
8.
BMC Infect Dis ; 18(1): 379, 2018 08 07.
Article in English | MEDLINE | ID: mdl-30086724

ABSTRACT

BACKGROUND: Talaromyces marneffei, is an opportunistic pathogenic fungus that is most commonly reported in Southeast Asia and disseminated T.marneffei infection predominantly occurs in patients with immunodeficiency. With a potential to invade multiple organs, it can be fatal for patients if diagnosis and treatment are delayed. In current clinical practice, the diagnosis of T.marneffei infection relies heavily on tissue culture and histologic analysis, which may suffer from limited positive rate and is sometimes time consuming. The rapid and accurate diagnosis of disseminated T.marneffei infection remains challenging. CASE PRESENTATION: A 22-year-old man gradually developed fever, cough, lower extremities weakness, jaundice and rash, for which a 3-month extensive investigation failed to reach a diagnosis. After admitted into our hospital, laboratory and radiological tests revealed multiple lesions in the patient's brain, spinal cord, and lungs. We performed next generation sequencing on the patient's skin tissue, bone marrow, blood and cerebrospinal fluid, which all identified numerous Talaromyces marneffei nucleotide sequences and leaded to the rapid diagnosis and treatment of disseminated T.marneffei infection. CONCLUSIONS: This case underline the clinical significance of T.marneffei as a possible pathogen in immune-competent patients. This successful application of the next generation sequencing assisting the rapid diagnosis of disseminated T.marneffei infection provides a new perspective in the clinical approach to the systematic fungi infections and highlights the potential of this technique in rapid etiological diagnosis.


Subject(s)
High-Throughput Screening Assays , Invasive Fungal Infections/diagnosis , Talaromyces/genetics , Talaromyces/isolation & purification , Early Diagnosis , HIV Seronegativity , High-Throughput Screening Assays/methods , Humans , Invasive Fungal Infections/microbiology , Male , Penicillium/genetics , Penicillium/isolation & purification , Penicillium/pathogenicity , Sensitivity and Specificity , Talaromyces/pathogenicity , Time Factors , Young Adult
9.
BMC Infect Dis ; 17(1): 508, 2017 07 21.
Article in English | MEDLINE | ID: mdl-28732529

ABSTRACT

BACKGROUND: Bacterial species belonging to the genus Exiguobacterium are facultative anaerobic, non-spore-forming, Gram-positive bacilli, and rarely associated with human infections. Herein, we reported the first case of community-acquired pneumonia (CAP) and bacteremia due to Exiguobacterium spp. in China. CASE PRESENTATION: An adult male with severe CAP was hospitalized. The pathogen was isolated from his bloodstream and broncho-alveolar lavage fluid. The correct identification of the micro-organism was achieved using 16S rRNA sequencing, and its antibiotic susceptibility test was performed by microdilution method. The Whole Genome Sequencing (WGS) was used to characterize its genetic features and to elucidate its potential pathogenic mechanisms. Furthermore, its genome sequence was also compared with those of 3 publicly-available Exiguobacterium strains. A PubMed search was performed for further understanding the features of Exiguobacterium infections. Phylogenetic analysis of the 16S rRNA gene sequence showed that the strain GX59 was most closely related to Exiguobacterium AT1b (99.7%). The genome of GX59 was 2,727,929 bp in size, harbouring 2855 putative protein-coding genes, 5 rRNA operons, 37 tRNA genes and 1 tmRNA. The multiple genome comparison of 4 Exiguobacterium strains demonstrated that Exiguobacterium contained 37 genes of secretion systems, including sec, tat, FEA, Type IV Pili and competence-related DNA transformation transporter (Com). Virulence factors of the micro-organism included tlyC, NprR, MCP, Dam, which might play a critical role in causing lethal infection. CONCLUSIONS: The study highlighted the potential pathogenicity of the genus Exiguobacterium for its unique genes encoding various virulence factors and those associated with antibiotic resistance, therefore, its clinical significance should be valued.


Subject(s)
Bacillaceae/genetics , Bacteremia/microbiology , Community-Acquired Infections/microbiology , Phylogeny , Pneumonia, Bacterial/microbiology , Bacillaceae/isolation & purification , Bacillaceae/pathogenicity , China , DNA, Bacterial/genetics , Diabetes Mellitus, Type 2/microbiology , Genome, Bacterial , Humans , Male , Middle Aged , RNA, Ribosomal, 16S/genetics
10.
J Neurovirol ; 22(2): 240-5, 2016 Apr.
Article in English | MEDLINE | ID: mdl-26506841

ABSTRACT

We screened for viral DNA in cerebrospinal fluid samples using next-generation sequencing (NGS) technology to diagnose CNS viral infections. We collected CSF samples from four cases with clinically suspected viral meningoencephalitis. DNA extracted from the samples was analyzed with NGS, and the results were further validated using PCR. Herpes simplex virus 1 (HSV-1) was detected in the CSF of two patients, HSV-2 and human herpes virus type 3 (HHV-3, VZV) in the CSF of two other patients separately. The number of unique reads of the identified viral genes ranged from 144 to 44205 (93.51 to 99.57%). The coverage of identified viral genes ranged from 12 to 98% with a depth value of 1.1 to 35, respectively. The results were further confirmed using PCR in three cases. The clinical presentation and outcomes of these four cases were consistent with the diagnostic results of NGS. NGS of CSF samples can be used as a diagnostic assay for CNS viral infection. Its further application for "pan-viral" or even "pan-microbial" screening of CSF might influence the diagnosis of CNS infectious diseases.


Subject(s)
DNA, Viral/cerebrospinal fluid , Herpesviridae Infections/diagnosis , Herpesvirus 1, Human/genetics , Herpesvirus 2, Human/genetics , Herpesvirus 3, Human/genetics , Meningoencephalitis/diagnosis , Adult , DNA, Viral/genetics , Electroencephalography , Female , Gene Library , Herpesviridae Infections/cerebrospinal fluid , Herpesviridae Infections/physiopathology , Herpesviridae Infections/virology , Herpesvirus 1, Human/isolation & purification , Herpesvirus 2, Human/isolation & purification , Herpesvirus 3, Human/isolation & purification , High-Throughput Nucleotide Sequencing , Humans , Magnetic Resonance Imaging , Male , Meningoencephalitis/cerebrospinal fluid , Meningoencephalitis/physiopathology , Meningoencephalitis/virology , Middle Aged
11.
Proc Natl Acad Sci U S A ; 110(2): 577-82, 2013 Jan 08.
Article in English | MEDLINE | ID: mdl-23271803

ABSTRACT

The genetic diversity of Yersinia pestis, the etiologic agent of plague, is extremely limited because of its recent origin coupled with a slow clock rate. Here we identified 2,326 SNPs from 133 genomes of Y. pestis strains that were isolated in China and elsewhere. These SNPs define the genealogy of Y. pestis since its most recent common ancestor. All but 28 of these SNPs represented mutations that happened only once within the genealogy, and they were distributed essentially at random among individual genes. Only seven genes contained a significant excess of nonsynonymous SNP, suggesting that the fixation of SNPs mainly arises via neutral processes, such as genetic drift, rather than Darwinian selection. However, the rate of fixation varies dramatically over the genealogy: the number of SNPs accumulated by different lineages was highly variable and the genealogy contains multiple polytomies, one of which resulted in four branches near the time of the Black Death. We suggest that demographic changes can affect the speed of evolution in epidemic pathogens even in the absence of natural selection, and hypothesize that neutral SNPs are fixed rapidly during intermittent epidemics and outbreaks.


Subject(s)
Evolution, Molecular , Genetic Drift , Genetic Variation , Mutation Rate , Yersinia pestis/genetics , Base Sequence , China , Genetics, Population , Likelihood Functions , Models, Genetic , Molecular Epidemiology , Molecular Sequence Data , Phylogeny , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA
12.
J Autoimmun ; 54: 127-36, 2014 Nov.
Article in English | MEDLINE | ID: mdl-25091625

ABSTRACT

Systemic lupus erythematosus (SLE) is an autoimmune disease well known for its clinical heterogeneity, and its etiology secondary to a cross-talk involving genetic predisposition and environmental stimuli. Although genome-wide analysis has contributed greatly to our understanding of the genetic basis of SLE, there is increasing evidence for a role of epigenetics. Indeed, recent data have demonstrated that in patients with SLE, there are striking alterations of DNA methylation, histone modifications, and deregulated microRNA expression, the sum of which contribute to over-expression of select autoimmune-related genes and loss of tolerance. To address this issue at the level of clinical phenotype, we performed DNA methylation, mRNA and microRNA expression screening using high-throughput sequencing of purified CD4+ T cells from patients with SLE, compared to age and sex matched controls. In particular, we studied 42 patients with SLE and divided this group into three clinical phenotypes: a) the presence of skin lesions without signs of systemic pathology; b) skin lesions but also chronic renal pathology; and c) skin lesions, chronic renal pathology and polyarticular disease. Interestingly, and as expected, sequencing data revealed changes in DNA methylation in SLE compared to controls. However, and more importantly, although there were common methylation changes found in all groups of SLE compared to controls, there was specific DNA methylation changes that correlated with clinical phenotype. These included changes in the novel key target genes NLRP2, CD300LB and S1PR3, as well as changes in the critical pathways, including the adherens junction and leukocyte transendothelial migration. We also noted that a significant proportion of genes undergoing DNA methylation changes were inversely correlated with gene expression and that miRNA screening revealed the existence of subsets with changes in expression. Integrated analysis of this data highlights specific sets of miRNAs controlled by DNA methylation, and genes that are altered by methylation and targeted by miRNAs. In conclusion, our findings suggest select epigenetic mechanisms that contribute to clinical phenotypes and further shed light on a new venue for basic SLE research.


Subject(s)
CD4-Positive T-Lymphocytes/immunology , DNA Methylation/immunology , Gene Expression Regulation/immunology , Lupus Erythematosus, Systemic/immunology , MicroRNAs/immunology , RNA, Messenger/immunology , Adaptor Proteins, Signal Transducing/immunology , Adult , Apoptosis Regulatory Proteins , CD4-Positive T-Lymphocytes/pathology , Female , Humans , Lupus Erythematosus, Systemic/pathology , Male , Receptors, Immunologic/immunology , Receptors, Lysosphingolipid/immunology , Sphingosine-1-Phosphate Receptors
13.
BMC Genet ; 15 Suppl 1: S9, 2014.
Article in English | MEDLINE | ID: mdl-25080211

ABSTRACT

BACKGROUND: DNA methylation is an important biological form of epigenetic modification, playing key roles in plant development and environmental responses. RESULTS: In this study, we examined single-base resolution methylomes of Populus under control and drought stress conditions using high-throughput bisulfite sequencing for the first time. Our data showed methylation levels of methylated cytosines, upstream 2 kp, downstream 2kb, and repeatitive sequences significantly increased after drought treatment in Populus. Interestingly, methylation in 100 bp upstream of the transcriptional start site (TSS) repressed gene expression, while methylations in 100-2000 bp upstream of TSS and within the gene body were positively associated with gene expression. Integrated with the transcriptomic data, we found that all cis-splicing genes were non-methylated, suggesting that DNA methylation may not associate with cis-splicing. However, our results showed that 80% of trans-splicing genes were methylated. Moreover, we found 1156 transcription factors (TFs) with reduced methylation and expression levels and 690 TFs with increased methylation and expression levels after drought treatment. These TFs may play important roles in Populus drought stress responses through the changes of DNA methylation. CONCLUSIONS: These findings may provide valuable new insight into our understanding of the interaction between gene expression and methylation of drought responses in Populus.


Subject(s)
DNA Methylation , Droughts , Populus/genetics , Stress, Physiological/genetics , DNA, Plant/genetics , Epigenesis, Genetic , RNA Splicing , Transcription Factors/genetics , Transcriptome
14.
BMC Genomics ; 14: 11, 2013 Jan 16.
Article in English | MEDLINE | ID: mdl-23324053

ABSTRACT

BACKGROUND: Reduced representation bisulfite sequencing (RRBS) was developed to measure DNA methylation of high-CG regions at single base-pair resolution, and has been widely used because of its minimal DNA requirements and cost efficacy; however, the CpG coverage of genomic regions is restricted and important regions with low-CG will be ignored in DNA methylation profiling. This method could be improved to generate a more comprehensive representation. RESULTS: Based on in silico simulation of enzyme digestion of human and mouse genomes, we have optimized the current single-enzyme RRBS by applying double enzyme digestion in the library construction to interrogate more representative regions. CpG coverage of genomic regions was considerably increased in both high-CG and low-CG regions using the double-enzyme RRBS method, leading to more accurate detection of their average methylation levels and identification of differential methylation regions between samples. We also applied this double-enzyme RRBS method to comprehensively analyze the CpG methylation profiles of two colorectal cancer cell lines. CONCLUSION: The double-enzyme RRBS increases the CpG coverage of genomic regions considerably over the previous single-enzyme RRBS method, leading to more accurate detection of their average methylation levels. It will facilitate genome-wide DNA methylation studies in multiple and complex clinical samples.


Subject(s)
CpG Islands/genetics , DNA Methylation/drug effects , Genomics/methods , Restriction Mapping/methods , Sequence Analysis, DNA/methods , Sulfites/pharmacology , Animals , DNA (Cytosine-5-)-Methyltransferase 1 , DNA (Cytosine-5-)-Methyltransferases/deficiency , DNA (Cytosine-5-)-Methyltransferases/genetics , Gene Knockout Techniques , HCT116 Cells , Humans , Mice , DNA Methyltransferase 3B
15.
PLoS Biol ; 8(11): e1000533, 2010 Nov 09.
Article in English | MEDLINE | ID: mdl-21085693

ABSTRACT

DNA methylation plays an important role in biological processes in human health and disease. Recent technological advances allow unbiased whole-genome DNA methylation (methylome) analysis to be carried out on human cells. Using whole-genome bisulfite sequencing at 24.7-fold coverage (12.3-fold per strand), we report a comprehensive (92.62%) methylome and analysis of the unique sequences in human peripheral blood mononuclear cells (PBMC) from the same Asian individual whose genome was deciphered in the YH project. PBMC constitute an important source for clinical blood tests world-wide. We found that 68.4% of CpG sites and <0.2% of non-CpG sites were methylated, demonstrating that non-CpG cytosine methylation is minor in human PBMC. Analysis of the PBMC methylome revealed a rich epigenomic landscape for 20 distinct genomic features, including regulatory, protein-coding, non-coding, RNA-coding, and repeat sequences. Integration of our methylome data with the YH genome sequence enabled a first comprehensive assessment of allele-specific methylation (ASM) between the two haploid methylomes of any individual and allowed the identification of 599 haploid differentially methylated regions (hDMRs) covering 287 genes. Of these, 76 genes had hDMRs within 2 kb of their transcriptional start sites of which >80% displayed allele-specific expression (ASE). These data demonstrate that ASM is a recurrent phenomenon and is highly correlated with ASE in human PBMCs. Together with recently reported similar studies, our study provides a comprehensive resource for future epigenomic research and confirms new sequencing technology as a paradigm for large-scale epigenomics studies.


Subject(s)
DNA Methylation , Leukocytes, Mononuclear/metabolism , Alleles , CpG Islands , Haploidy , Humans , RNA, Untranslated/genetics , Sequence Alignment
16.
Microbiol Spectr ; 11(6): e0113923, 2023 Dec 12.
Article in English | MEDLINE | ID: mdl-37982612

ABSTRACT

IMPORTANCE: Metagenomic next-generation sequencing (mNGS) has been used broadly for pathogens detection of infectious diseases. However, there is a lack of method for the absolute quantitation of pathogens by mNGS. We compared the quantitative efficiency of three mNGS internal controls (ICs) Thermus thermophilus, T1 phages, and artificial DNA sequence and developed the most applicable strategies for pathogen quantitation via mNGS in central nervous system infection. The IC application strategy we developed will enable mNGS analysis to assess the pathogen load simultaneously with the detection of pathogens, which should provide critical information for quick decision-making of treatment as well as clinical prognosis.


Subject(s)
Bacteriophages , Central Nervous System Infections , Humans , High-Throughput Nucleotide Sequencing , Metagenome , Metagenomics
17.
Int J Mol Sci ; 13(6): 7098-7108, 2012.
Article in English | MEDLINE | ID: mdl-22837681

ABSTRACT

Adipose tissue is not only a storage organ involved in fuel metabolism, but also an endocrine organ involved in the regulation of insulin sensitivity, thermogenesis, immunity, and inflammation. There are anatomical, cellular, molecular and physiological differences among adipose tissues deposited in different body sites. However, current understanding of the intrinsic differences between the sub-compartments of the subcutaneous adipose tissue remains rudimentary. Here, we analyzed the genome-wide DNA methylation differences between the porcine superficial and deep backfat tissues using methylated DNA immunoprecipitation combined with high-throughput sequencing. We show that the genes with differentially methylated regions in their promoter are mainly involved in the processes of "lipid metabolism" and "regulation of immune-related cytokines". Compared with the deep backfat tissue, the promoters of genes related to the 'positive regulation of cytokine production' were significantly hypermethylated in the superficial backfat tissue, which reflects the intrinsic functional and metabolic differences between the sub-compartments of the subcutaneous adipose tissue. This study provides epigenetic evidence for functionally relevant methylation differences between different layers of porcine backfat tissues.


Subject(s)
Adipocytes/cytology , Adipose Tissue/chemistry , DNA Methylation , DNA/genetics , Animals , Cytokines/metabolism , Epigenesis, Genetic , Fatty Acids/chemistry , Female , Gene Expression Profiling , Genome , High-Throughput Nucleotide Sequencing , Lipid Metabolism , Lipids/chemistry , Oligonucleotide Array Sequence Analysis , Phenotype , Promoter Regions, Genetic , RNA, Messenger/metabolism , Swine
18.
PLoS One ; 16(2): e0246668, 2021.
Article in English | MEDLINE | ID: mdl-33571282

ABSTRACT

Identifying biomarkers that are associated with different types of cancer is an important goal in the field of bioinformatics. Different researcher groups have analyzed the expression profiles of many genes and found some certain genetic patterns that can promote the improvement of targeted therapies, but the significance of some genes is still ambiguous. More reliable and effective biomarkers identification methods are then needed to detect candidate cancer-related genes. In this paper, we proposed a novel method that combines the infinite latent feature selection (ILFS) method with the functional interaction (FIs) network to rank the biomarkers. We applied the proposed method to the expression data of five cancer types. The experiments indicated that our network-constrained ILFS (NCILFS) provides an improved prediction of the diagnosis of the samples and locates many more known oncogenes than the original ILFS and some other existing methods. We also performed functional enrichment analysis by inspecting the over-represented gene ontology (GO) biological process (BP) terms and applying the gene set enrichment analysis (GSEA) method on selected biomarkers for each feature selection method. The enrichments analysis reports show that our network-constraint ILFS can produce more biologically significant gene sets than other methods. The results suggest that network-constrained ILFS can identify cancer-related genes with a higher discriminative power and biological significance.


Subject(s)
Biomarkers, Tumor/genetics , Gene Expression Profiling/methods , Gene Regulatory Networks , Neoplasms/genetics , Algorithms , Female , Gene Ontology , Humans , Male , Oligonucleotide Array Sequence Analysis , Support Vector Machine
19.
Comput Struct Biotechnol J ; 19: 2637-2645, 2021.
Article in English | MEDLINE | ID: mdl-34025950

ABSTRACT

The high-throughput genome-wide chromosome conformation capture (Hi-C) method has recently become an important tool to study chromosomal interactions where one can extract meaningful biological information including P(s) curve, topologically associated domains, A/B compartments, and other biologically relevant signals. Normalization is a critical pre-processing step of downstream analyses for the elimination of systematic and technical biases from chromatin contact matrices due to different mappability, GC content, and restriction fragment lengths. Especially, the problem of high sparsity puts forward a huge challenge on the correction, indicating the urgent need for a stable and efficient method for Hi-C data normalization. Recently, some matrix balancing methods have been developed to normalize Hi-C data, such as the Knight-Ruiz (KR) algorithm, but it failed to normalize contact matrices with high sparsity. Here, we presented an algorithm, Hi-C Matrix Balancing (HCMB), based on an iterative solution of equations, combining with linear search and projection strategy to normalize the Hi-C original interaction data. Both the simulated and experimental data demonstrated that HCMB is robust and efficient in normalizing Hi-C data and preserving the biologically relevant Hi-C features even facing very high sparsity. HCMB is implemented in Python and is freely accessible to non-commercial users at GitHub: https://github.com/HUST-DataMan/HCMB.

20.
Front Cell Infect Microbiol ; 11: 745156, 2021.
Article in English | MEDLINE | ID: mdl-35127548

ABSTRACT

INTRODUCTION: The diagnosis of infection-caused fever of unknown origin (FUO) is still challenging, making it difficult for physicians to provide an early effective therapy. Therefore, a novel pathogen detection platform is needed. Metagenomic next-generation sequencing (mNGS) provides an unbiased, comprehensive technique for the sequence-based identification of pathogenic microbes, but the study of the diagnostic values of mNGS in FUO is still limited. METHODS: In a single-center retrospective cohort study, 175 FUO patients were enrolled, and clinical data were recorded and analyzed to compare mNGS with culture or traditional methods including as smears, serological tests, and nucleic acid amplification testing (NAAT) (traditional PCR, Xpert MTB/RIF, and Xpert MTB/RIF Ultra). RESULTS: The blood mNGS could increase the overall rate of new organisms detected in infection-caused FUO by roughly 22.9% and 19.79% in comparison to culture (22/96 vs. 0/96; OR, ∞; p = 0.000) and conventional methods (19/96 vs. 3/96; OR, 6.333; p = 0.001), respectively. Bloodstream infection was among the largest group of those identified, and the blood mNGS could have a 38% improvement in the diagnosis rate compared to culture (19/50 vs. 0/50; OR, ∞; p = 0.000) and 32.0% compared to conventional methods (16/50 vs. 3/50; OR, 5.333; p = 0.004). Among the non-blood samples in infection-caused FUO, we observed that the overall diagnostic performance of mNGS in infectious disease was better than that of conventional methods by 20% (9/45 vs. 2/45; OR, 4.5; p = 0.065), and expectedly, the use of non-blood mNGS in non-bloodstream infection increased the diagnostic rate by 26.2% (8/32 vs. 0/32; OR, ∞; p = 0.008). According to 175 patients' clinical decision-making, we found that the use of blood mNGS as the first-line investigation could effectively increase 10.9% of diagnosis rate of FUO compared to culture, and the strategy that the mNGS of suspected parts as the second-line test could further benefit infectious patients, improving the diagnosis rate of concurrent infection by 66.7% and 12.5% in non-bloodstream infection, respectively. CONCLUSION: The application of mNGS in the FUO had significantly higher diagnostic efficacy than culture or other conventional methods. In infection-caused FUO patients, application of blood mNGS as the first-line investigation and identification of samples from suspected infection sites as the second-line test could enhance the overall FUO diagnosis rate and serve as a promising optimized diagnostic protocol in the future.


Subject(s)
Fever of Unknown Origin , Adult , Fever of Unknown Origin/diagnosis , High-Throughput Nucleotide Sequencing/methods , Humans , Metagenome , Metagenomics/methods , Retrospective Studies , Sensitivity and Specificity
SELECTION OF CITATIONS
SEARCH DETAIL