Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters










Database
Language
Publication year range
1.
Nat Methods ; 20(9): 1368-1378, 2023 09.
Article in English | MEDLINE | ID: mdl-37537351

ABSTRACT

Gene regulatory networks (GRNs) are key determinants of cell function and identity and are dynamically rewired during development and disease. Despite decades of advancement, challenges remain in GRN inference, including dynamic rewiring, causal inference, feedback loop modeling and context specificity. To address these challenges, we develop Dictys, a dynamic GRN inference and analysis method that leverages multiomic single-cell assays of chromatin accessibility and gene expression, context-specific transcription factor footprinting, stochastic process network and efficient probabilistic modeling of single-cell RNA-sequencing read counts. Dictys improves GRN reconstruction accuracy and reproducibility and enables the inference and comparative analysis of context-specific and dynamic GRNs across developmental contexts. Dictys' network analyses recover unique insights in human blood and mouse skin development with cell-type-specific and dynamic GRNs. Its dynamic network visualizations enable time-resolved discovery and investigation of developmental driver transcription factors and their regulated targets. Dictys is available as a free, open-source and user-friendly Python package.


Subject(s)
Gene Regulatory Networks , Multiomics , Animals , Mice , Humans , Reproducibility of Results , Transcription Factors/genetics , Algorithms
2.
Pac Symp Biocomput ; 26: 172-183, 2021.
Article in English | MEDLINE | ID: mdl-33691015

ABSTRACT

Concurrently available genomic and transcriptomic data from large cohorts provide opportunities to discover expression quantitative trait loci (eQTLs)-genetic variants associated with gene expression changes. However, the statistical power of detecting rare variant eQTLs is often limited and most existing eQTL tools are not compatible with sequence variant file formats. We have developed AeQTL (Aggregated eQTL), a software tool that performs eQTL analysis on variants aggregated according to user-specified regions and is designed to accommodate standard genomic files. AeQTL consistently yielded similar or higher powers for identifying rare variant eQTLs than single-variant tests. Using AeQTL, we discovered that aggregated rare germline truncations in cis exomic regions are significantly associated with the expression of BRCA1 and SLC25A39 in breast tumors. In a somatic mutation pan-cancer analysis, aggregated mutations of those predicted to be missense versus truncations were differentially associated with gene expressions of cancer drivers, and somatic truncation eQTLs were further identified as a new multi-omic classifier of oncogenes versus tumor-suppressor genes. AeQTL is easy to use and customize, allowing a broad application for discovering rare variants, including coding and noncoding variants, associated with gene expression. AeQTL is implemented in Python and the source code is freely available at https://github.com/Huan-glab/AeQTL under the MIT license.


Subject(s)
Polymorphism, Single Nucleotide , Quantitative Trait Loci , Computational Biology , Genome-Wide Association Study , Genomics , Humans , Oncogenes
4.
PLoS One ; 15(11): e0234669, 2020.
Article in English | MEDLINE | ID: mdl-33137091

ABSTRACT

SUMMARY: Large-scale sequencing projects, such as The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC), have generated high throughput sequencing and molecular profiling data sets, but it is still challenging to identify potentially causal changes in cellular processes in cancer as well as in other diseases in an automated fashion. We developed the netboxr package written in the R programming language, which makes use of the NetBox algorithm to identify candidate cancer-related functional modules. The algorithm makes use of a data-driven, network-based approach that combines prior knowledge with a network clustering algorithm, obviating the need for and the limitation of independently curated functionally labeled gene sets. The method can combine multiple data types, such as mutations and copy number alterations, leading to more reliable identification of functional modules. We make the tool available in the Bioconductor R ecosystem for applications in cancer research and cell biology. AVAILABILITY AND IMPLEMENTATION: The netboxr package is free and open-sourced under the GNU GPL-3 license R package available at https://www.bioconductor.org/packages/release/bioc/html/netboxr.html.


Subject(s)
Algorithms , Biomarkers, Tumor/genetics , Gene Regulatory Networks , Genome, Human , Genomics/methods , Neoplasms/genetics , Software , Humans , Metabolic Networks and Pathways , Programming Languages
5.
Cell ; 183(1): 269-283.e19, 2020 10 01.
Article in English | MEDLINE | ID: mdl-32916130

ABSTRACT

Determining protein levels in each tissue and how they compare with RNA levels is important for understanding human biology and disease as well as regulatory processes that control protein levels. We quantified the relative protein levels from over 12,000 genes across 32 normal human tissues. Tissue-specific or tissue-enriched proteins were identified and compared to transcriptome data. Many ubiquitous transcripts are found to encode tissue-specific proteins. Discordance of RNA and protein enrichment revealed potential sites of synthesis and action of secreted proteins. The tissue-specific distribution of proteins also provides an in-depth view of complex biological events that require the interplay of multiple tissues. Most importantly, our study demonstrated that protein tissue-enrichment information can explain phenotypes of genetic diseases, which cannot be obtained by transcript information alone. Overall, our results demonstrate how understanding protein levels can provide insights into regulation, secretome, metabolism, and human diseases.


Subject(s)
Proteome/genetics , Proteomics/methods , Transcriptome/genetics , Gene Expression/genetics , Gene Expression Profiling/methods , Humans , Proteome/physiology , RNA/genetics , RNA, Messenger/metabolism , Transcriptome/physiology
6.
Nat Commun ; 11(1): 4748, 2020 09 21.
Article in English | MEDLINE | ID: mdl-32958763

ABSTRACT

The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts.


Subject(s)
Genome, Human/genetics , Mutation , Neoplasms/genetics , Base Composition , DNA, Intergenic , Databases, Genetic , Exome/genetics , Exons , Humans , Retrospective Studies , Exome Sequencing , Whole Genome Sequencing
7.
J Biosci Bioeng ; 120(4): 450-5, 2015 Oct.
Article in English | MEDLINE | ID: mdl-25910962

ABSTRACT

Anammox is an environmental-friendly and cost-effective technology for nitrogen removal. This study provides the nitrogen removal profiles, physiological traits of anammox bacteria culture under the substrate deficiency conditions at the optimal cultivation temperature 35°C. The determined period of starvation tolerance was 4 weeks in the absence of nitrite, 5 weeks in the absence of ammonium, as well as 7 weeks for the absence of these two substrates at 36°C, pH 7-8 and anaerobic conditions. The physiological traits of bacteria consortium were identified through flow cytometry (FCM) analysis, and the ordinal change of increased RNA synthesizing amounts, phosphatidylserine exposure and bacteria death occurred under starvation stress. In addition, the starvation induced the increased protein content in extracellular polymeric substances and the poorer bacteria settling capacity. This study helps to develop a better understanding of anammox process in engineering environment.


Subject(s)
Bacteria/metabolism , Chemoautotrophic Growth , Nitrogen/isolation & purification , Nitrogen/metabolism , Temperature , Ammonium Compounds/metabolism , Anaerobiosis , Bacteria/growth & development , Hydrogen-Ion Concentration , Nitrites/metabolism , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL