Results 1 - 20 of 78
1.
Genome Res ; 34(1): 119-133, 2024 02 07.
Article in English | MEDLINE | ID: mdl-38190633

ABSTRACT

Single-cell technologies offer unprecedented opportunities to dissect gene regulatory mechanisms in context-specific ways. Although there are computational methods for extracting gene regulatory relationships from scRNA-seq and scATAC-seq data, the data integration problem, essential for accurate cell type identification, has been mostly treated as a standalone challenge. Here we present scTIE, a unified method that integrates temporal multimodal data and infers regulatory relationships predictive of cellular state changes. scTIE uses an autoencoder to embed cells from all time points into a common space by using iterative optimal transport, followed by extracting interpretable information to predict cell trajectories. Using a variety of synthetic and real temporal multimodal data sets, we show scTIE achieves effective data integration while preserving more biological signals than existing methods, particularly in the presence of batch effects and noise. Furthermore, on the exemplar multiome data set we generated from differentiating mouse embryonic stem cells over time, we show scTIE captures regulatory elements highly predictive of cell transition probabilities, providing new potentials to understand the regulatory landscape driving developmental processes.


Subjects
Gene Expression Profiling , Single-Cell Analysis , Animals , Mice , Gene Expression Profiling/methods , Single-Cell Analysis/methods , Gene Expression Regulation
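
The scTIE abstract above mentions aligning cells across time points with optimal transport. As a rough illustration of that general idea only, the sketch below runs entropy-regularised optimal transport (Sinkhorn iterations) between two small sets of cells. It is not the scTIE implementation: the one-dimensional embeddings, the squared-distance cost, and the `eps` and `n_iter` values are all assumptions chosen for the example.

```python
import math

def sinkhorn(cost, a, b, eps=0.5, n_iter=200):
    """Entropy-regularised transport plan between histograms a and b."""
    n, m = len(a), len(b)
    # Gibbs kernel: cheap moves get large weights, expensive moves small ones
    K = [[math.exp(-cost[i][j] / eps) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iter):
        # Alternately rescale rows and columns to match the two marginals
        u = [a[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]

# Hypothetical 1-D embeddings: three cells at an early time point, two at a later one
early = [0.0, 0.1, 1.0]
late = [0.05, 0.9]
cost = [[(x - y) ** 2 for y in late] for x in early]
plan = sinkhorn(cost, a=[1 / 3] * 3, b=[0.5, 0.5])
```

Each row of `plan` describes how the mass of one early cell is spread over the later cells. In a temporal setting, rows like these could be read as soft transition weights.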
2.
Brief Bioinform ; 24(2)2023 03 19.
Article in English | MEDLINE | ID: mdl-36813563

ABSTRACT

Cell-state transition can reveal additional information from single-cell ribonucleic acid (RNA)-sequencing data in time-resolved biological phenomena. However, most of the current methods are based on the time derivative of the gene expression state, which restricts them to the short-term evolution of cell states. Here, we present single-cell State Transition Across-samples of RNA-seq data (scSTAR), which overcomes this limitation by constructing a paired-cell projection between biological conditions with an arbitrary time span by maximizing the covariance between two feature spaces using partial least squares and minimum squared error methods. In mouse ageing data, the response to stress in CD4+ memory T cell subtypes was found to be associated with ageing. A novel Treg subtype characterized by mTORC activation was identified to be associated with antitumour immune suppression, which was confirmed by immunofluorescence microscopy and survival analysis in 11 cancers from The Cancer Genome Atlas Program. On melanoma data, scSTAR improved immunotherapy-response prediction accuracy from 0.8 to 0.96.


Subjects
Gene Expression Profiling , RNA , Animals , Mice , RNA/genetics , Gene Expression Profiling/methods , Sequence Analysis, RNA/methods , Genome
3.
PLoS Biol ; 19(10): e3001419, 2021 10.
Article in English | MEDLINE | ID: mdl-34618807

ABSTRACT

Evolving in sync with the computation revolution over the past 30 years, computational biology has emerged as a mature scientific field. While the field has made major contributions toward improving scientific knowledge and human health, individual computational biology practitioners at various institutions often languish in career development. As optimistic biologists passionate about the future of our field, we propose solutions for both eager and reluctant individual scientists, institutions, publishers, funding agencies, and educators to fully embrace computational biology. We believe that in order to pave the way for the next generation of discoveries, we need to improve recognition for computational biologists and better align pathways of career success with pathways of scientific progress. With 10 outlined steps, we call on all adjacent fields to move away from the traditional individual, single-discipline investigator research model and embrace multidisciplinary, data-driven, team science.


Subjects
Computational Biology , Budgets , Cooperative Behavior , Humans , Interdisciplinary Research , Mentoring , Motivation , Publications , Reward , Software
4.
PLoS Comput Biol ; 18(10): e1010495, 2022 10.
Article in English | MEDLINE | ID: mdl-36197936

ABSTRACT

COVID-19 patients display a wide range of disease severity, ranging from asymptomatic to critical symptoms with high mortality risk. Our ability to understand the interaction of SARS-CoV-2 infected cells within the lung, and of protective or dysfunctional immune responses to the virus, is critical to effectively treat these patients. Currently, our understanding of cell-cell interactions across different disease states, and how such interactions may drive pathogenic outcomes, is incomplete. Here, we developed a generalizable and scalable workflow for identifying cells that are differentially interacting across COVID-19 patients with distinct disease outcomes and used it to examine eight public single-cell RNA-seq datasets (six from peripheral blood mononuclear cells, one from bronchoalveolar lavage and one from nasopharyngeal), with a total of 211 individual samples. By characterizing the cell-cell interaction patterns across epithelial and immune cells in lung tissues for patients with varying disease severity, we illustrate diverse communication patterns across individuals, and discover heterogeneous communication patterns among moderate and severe patients. We further illustrate that patterns derived from cell-cell interactions are potential signatures for discriminating between moderate and severe patients. Overall, this workflow can be generalized and scaled to combine multiple scRNA-seq datasets to uncover cell-cell interactions.


Subjects
COVID-19 , Cell Communication , Humans , Leukocytes, Mononuclear , SARS-CoV-2 , Workflow
5.
Nature ; 545(7653): 175-180, 2017 05 11.
Article in English | MEDLINE | ID: mdl-28467829

ABSTRACT

Melanoma of the skin is a common cancer only in Europeans, whereas it arises in internal body surfaces (mucosal sites) and on the hands and feet (acral sites) in people throughout the world. Here we report analysis of whole-genome sequences from cutaneous, acral and mucosal subtypes of melanoma. The heavily mutated landscape of coding and non-coding mutations in cutaneous melanoma resolved novel signatures of mutagenesis attributable to ultraviolet radiation. However, acral and mucosal melanomas were dominated by structural changes and mutation signatures of unknown aetiology, not previously identified in melanoma. The number of genes affected by recurrent mutations disrupting non-coding sequences was similar to that affected by recurrent mutations to coding sequences. Significantly mutated genes included BRAF, CDKN2A, NRAS and TP53 in cutaneous melanoma, BRAF, NRAS and NF1 in acral melanoma and SF3B1 in mucosal melanoma. Mutations affecting the TERT promoter were the most frequent of all; however, neither they nor ATRX mutations, which correlate with alternative telomere lengthening, were associated with greater telomere length. Most melanomas had potentially actionable mutations, most in components of the mitogen-activated protein kinase and phosphoinositol kinase pathways. The whole-genome mutation landscape of melanoma reveals diverse carcinogenic processes across its subtypes, some unrelated to sun exposure, and extends potential involvement of the non-coding genome in its pathogenesis.


Subjects
Genome, Human/genetics , Melanoma/genetics , Mutation/genetics , DNA Helicases/genetics , GTP Phosphohydrolases/genetics , Genes, p16 , Humans , Melanoma/classification , Membrane Proteins/genetics , Mitogen-Activated Protein Kinases/genetics , Neurofibromatosis 1/genetics , Nuclear Proteins/genetics , Phosphoproteins/genetics , Proto-Oncogene Proteins B-raf/genetics , RNA Splicing Factors/genetics , Signal Transduction/drug effects , Telomerase/genetics , Telomere/genetics , Tumor Suppressor Protein p53/genetics , Ultraviolet Rays/adverse effects , X-linked Nuclear Protein
6.
Genes Chromosomes Cancer ; 61(9): 561-571, 2022 09.
Article in English | MEDLINE | ID: mdl-35670448

ABSTRACT

INTRODUCTION: Oral squamous cell carcinoma (OSCC) in the young (<50 years), without known carcinogenic risk factors, is on the rise globally. Whole genome duplication (WGD) has been shown to occur at higher rates in cancers without an identifiable carcinogenic agent. We aimed to evaluate the prevalence of WGD in a cohort of OSCC patients under the age of 50 years. METHODS: Whole genome sequencing (WGS) was performed on 28 OSCC patients from the Sydney Head and Neck Cancer Institute (SHNCI) biobank. An additional nine cases were obtained from The Cancer Genome Atlas (TCGA). RESULTS: WGD was seen in 27 of 37 (73%) cases. Non-synonymous, somatic TP53 mutations occurred in 25 of 27 (93%) cases of WGD and were predicted to precede WGD in 21 (77%). WGD was significantly associated with larger tumor size (p = 0.01) and was frequent in patients with recurrences (87%, p = 0.36). Overall survival was significantly worse in those with WGD (p = 0.05). CONCLUSIONS: Our data, based on one of the largest WGS datasets of young patients with OSCC, demonstrate a high frequency of WGD and its association with adverse pathologic characteristics and clinical outcomes. TP53 mutations also preceded WGD, as has been described in other tumors without a clear mutagenic driver.


Subjects
Carcinoma, Squamous Cell , Head and Neck Neoplasms , Mouth Neoplasms , Carcinoma, Squamous Cell/genetics , Gene Duplication , Head and Neck Neoplasms/genetics , Humans , Middle Aged , Mouth Neoplasms/genetics , Squamous Cell Carcinoma of Head and Neck/genetics
7.
Am J Kidney Dis ; 79(4): 549-560, 2022 Apr.
Article in English | MEDLINE | ID: mdl-34461168

ABSTRACT

RATIONALE & OBJECTIVE: The risk of developing colorectal cancer in patients with chronic kidney disease (CKD) is twice that of the general population, but the factors associated with colorectal cancer are poorly understood. The aim of this study was to identify factors associated with advanced colorectal neoplasia in patients with CKD. STUDY DESIGN: Prospective cohort study. SETTING & PARTICIPANTS: Patients with CKD stages 3-5, including those treated with maintenance dialysis or transplantation across 11 sites in Australia, New Zealand, Canada, and Spain, were screened for colorectal neoplasia using a fecal immunochemical test (FIT) as part of the Detecting Bowel Cancer in CKD (DETECT) Study. EXPOSURE: Baseline characteristics for patients at the time of study enrollment were ascertained, including duration of CKD, comorbidities, and medications. OUTCOME: Advanced colorectal neoplasia was identified through a 2-step verification process with colonoscopy following positive FIT and 2-year clinical follow-up for all patients. ANALYTICAL APPROACH: Potential factors associated with advanced colorectal neoplasia were explored using multivariable logistic regression. Sensitivity analyses were performed using grouped LASSO (least absolute shrinkage and selection operator) logistic regression. RESULTS: Among 1,706 patients who received FIT-based screening (791 with CKD stages 3-5 not receiving kidney replacement therapy [KRT], 418 receiving dialysis, and 497 with a functioning kidney transplant), 117 patients (6.9%) were found to have advanced colorectal neoplasia (54 with CKD stages 3-5 without KRT, 34 receiving dialysis, and 29 transplant recipients), including 9 colorectal cancers. 
The factors found to be associated with advanced colorectal neoplasia included older age (OR per year older, 1.05 [95% CI, 1.03-1.07], P<0.001), male sex (OR, 2.27 [95% CI, 1.45-3.54], P<0.001), azathioprine use (OR, 2.99 [95% CI, 1.40-6.37], P=0.005), and erythropoiesis-stimulating agent use (OR, 1.92 [95% CI, 1.22-3.03], P=0.005). Grouped LASSO logistic regression revealed similar associations between these factors and advanced colorectal neoplasia. LIMITATIONS: Unmeasured confounding factors. CONCLUSIONS: Older age, male sex, erythropoiesis-stimulating agents, and azathioprine were found to be significantly associated with advanced colorectal neoplasia in patients with CKD.


Subjects
Colorectal Neoplasms , Renal Insufficiency, Chronic , Colonoscopy , Colorectal Neoplasms/diagnosis , Colorectal Neoplasms/epidemiology , Feces , Humans , Male , Occult Blood , Prospective Studies , Renal Insufficiency, Chronic/complications , Renal Insufficiency, Chronic/epidemiology , Renal Insufficiency, Chronic/therapy , Risk Factors
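
The odds ratios in the abstract above come from logistic regression, where an odds ratio is the exponential of a fitted coefficient. As a worked check, the sketch below recovers an approximate coefficient and standard error from the reported interval for male sex (OR 2.27, 95% CI 1.45-3.54). It assumes the interval is a Wald interval, symmetric on the log scale, which the abstract does not state. The function name is hypothetical.

```python
import math

def log_odds_from_ci(lower, upper, z=1.96):
    """Recover a log-odds coefficient and its standard error from a 95% CI,
    assuming the interval is symmetric on the log scale (Wald interval)."""
    beta = (math.log(lower) + math.log(upper)) / 2      # midpoint on the log scale
    se = (math.log(upper) - math.log(lower)) / (2 * z)  # half-width divided by z
    return beta, se

# Male sex: OR 2.27 (95% CI 1.45-3.54), as reported in the abstract
beta, se = log_odds_from_ci(1.45, 3.54)
odds_ratio = math.exp(beta)
ci = (math.exp(beta - 1.96 * se), math.exp(beta + 1.96 * se))
```

If the Wald assumption holds, `odds_ratio` should land close to the reported point estimate, which gives a quick consistency check on a published interval.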
8.
BMC Bioinformatics ; 21(1): 530, 2020 Nov 17.
Article in English | MEDLINE | ID: mdl-33203358

ABSTRACT

BACKGROUND: Nutrigenomics aims at understanding the interaction between nutrition and gene information. Due to the complex interactions of nutrients and genes, their relationship exhibits non-linearity. One of the most effective and efficient methods to explore their relationship is the nutritional geometry framework, which fits a response surface for the gene expression over two prespecified nutrition variables. However, when the number of nutrients involved is large, it is challenging to find combinations of informative nutrients with respect to a certain gene and to test whether the relationship is stronger than chance. Methods for identifying informative combinations are essential to understanding the relationship between nutrients and genes. RESULTS: We introduce Local Consistency Nutrition to Graphics (LC-N2G), a novel approach for ranking and identifying combinations of nutrients with gene expression. In LC-N2G, we first propose a model-free quantity called the Local Consistency (LC) statistic to measure whether there is a non-random relationship between combinations of nutrients and gene expression measurements based on (1) the similarity between samples in the nutrient space and (2) their difference in gene expression. Then combinations with small LC are selected and a permutation test is performed to evaluate their significance. Finally, the response surfaces are generated for the subset of significant relationships. Evaluation on simulated data and real data shows that LC-N2G can accurately find combinations that are correlated with gene expression. CONCLUSION: LC-N2G is practically powerful for identifying the informative nutrition variables correlated with gene expression. Therefore, LC-N2G is important in the area of nutrigenomics for understanding the relationship between nutrition and gene expression information.


Subjects
Algorithms , Data Analysis , Nutrigenomics , Animal Nutritional Physiological Phenomena , Animals , Computer Simulation , Gene Expression Regulation , Mice , Nonlinear Dynamics
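
The LC-N2G abstract above describes a statistic built from nutrient-space similarity and expression differences, followed by a permutation test. The sketch below illustrates that two-stage pattern. The statistic here is a stand-in, not the published Local Consistency definition (which the abstract does not give): it averages the absolute expression difference between each sample and its `k` nearest neighbours in nutrient space, so smaller values mean more consistency. All data are simulated.

```python
import random

def local_consistency(nutrients, expression, k=3):
    """Stand-in statistic: mean absolute expression difference between each
    sample and its k nearest neighbours in nutrient space (smaller = more consistent)."""
    n = len(expression)
    total = 0.0
    for i in range(n):
        dists = sorted(
            (sum((a - b) ** 2 for a, b in zip(nutrients[i], nutrients[j])), j)
            for j in range(n) if j != i
        )
        total += sum(abs(expression[i] - expression[j]) for _, j in dists[:k]) / k
    return total / n

def permutation_p_value(nutrients, expression, n_perm=500, seed=0):
    """Share of label permutations whose statistic is at least as small as observed."""
    rng = random.Random(seed)
    observed = local_consistency(nutrients, expression)
    shuffled = list(expression)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any link between nutrients and expression
        if local_consistency(nutrients, shuffled) <= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Simulated data: expression depends smoothly on two hypothetical nutrient variables
rng = random.Random(1)
nutrients = [(rng.random(), rng.random()) for _ in range(30)]
expression = [x + y + rng.gauss(0, 0.05) for x, y in nutrients]
p = permutation_p_value(nutrients, expression)
```

Adding one to both the hit count and the number of permutations keeps the estimated p-value away from zero, a common convention for permutation tests.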
9.
Prostate ; 80(6): 508-517, 2020 05.
Article in English | MEDLINE | ID: mdl-32119131

ABSTRACT

BACKGROUND: As a rare subtype of prostate carcinoma, basal cell carcinoma (BCC) has not been studied extensively and thus lacks systematic molecular characterization. METHODS: Here, we applied single-cell genomic amplification and RNA-Seq to a specimen of human prostate BCC (CK34βE12+/P63+/PAP-/PSA-). The mutational landscape was obtained via whole exome sequencing of the amplification mixture of 49 single cells, and the transcriptomes of 69 single cells were also obtained. RESULTS: The five putative driver genes mutated in BCC are CASC5, NUTM1, PTPRC, KMT2C, and TBX3, and the top three nucleotide substitutions are C>T, T>C, and C>A, similar to common prostate cancer. The distribution of the variant allele frequency values indicated that these single cells are from the same tumor clone. The 69 single cells were clustered into tumor, stromal, and immune cells based on their global transcriptomic profiles. The tumor cells specifically express basal cell markers like KRT5, KRT14, and KRT23 and epithelial markers EPCAM, CDH1, and CD24. The transcription factor covariance network analysis showed that the BCC tumor cells have distinct regulatory networks. By comparison with current prostate cancer datasets, we found that some of the bulk samples exhibit basal cell signatures. Interestingly, at single-cell resolution the gene expression patterns of prostate BCC tumor cells show uniqueness compared with that of common prostate cancer-derived circulating tumor cells. CONCLUSIONS: This study, for the first time, discloses the comprehensive mutational and transcriptomic landscapes of prostate BCC, which lays a foundation for the understanding of its tumorigenesis mechanism and provides new insights into prostate cancers in general.


Subjects
Carcinoma, Basal Cell/genetics , Prostatic Neoplasms/genetics , Biopsy, Needle , Carcinoma, Basal Cell/pathology , Gene Amplification , Gene Expression Profiling/methods , Gene Frequency , Humans , Immunohistochemistry , Male , Middle Aged , Mutation , Prostatic Neoplasms/pathology , Single-Cell Analysis/methods , Stromal Cells/pathology , Transcriptome , Exome Sequencing
11.
Bioinformatics ; 35(5): 823-829, 2019 03 01.
Article in English | MEDLINE | ID: mdl-30102408

ABSTRACT

MOTIVATION: Genes act as a system and not in isolation. Thus, it is important to consider coordinated changes of gene expression rather than single genes when investigating biological phenomena such as the aetiology of cancer. We have developed an approach, Differential Correlation across Ranked Samples (DCARS), for quantifying how changes in the association between pairs of genes may inform the outcome of interest. Modelling gene correlation across a continuous sample ranking does not require the dichotomisation of samples into two distinct classes and can identify differences in gene correlation across early, mid or late stages of the outcome of interest. RESULTS: When we evaluated DCARS against the typical Fisher Z-transformation test for differential correlation, as well as a typical approach testing for interaction within a linear model, on real TCGA data, DCARS significantly ranked gene pairs containing known cancer genes more highly across several cancers. Similar results are found with our simulation study. DCARS was applied to 13 cancer datasets in TCGA, revealing several distinct relationships for which survival ranking was found to be associated with a change in correlation between genes. Furthermore, we demonstrated that DCARS can be used in conjunction with network analysis techniques to extract biological meaning from multi-layered and complex data. AVAILABILITY AND IMPLEMENTATION: The DCARS R package, sample data, and supplementary files are available at https://github.com/shazanfar/DCARS. Publicly available data from The Cancer Genome Atlas (TCGA) were accessed using the TCGABiolinks R package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Neoplasms , Genome , Humans , Software
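
The DCARS abstract above describes modelling gene-gene correlation along a continuous sample ranking rather than between two classes. One way to picture this is a weighted correlation computed around each rank position, sketched below. This is an illustration under assumptions, not the DCARS package: the triangular weights, the `span` parameter, and the toy data are all hypothetical choices.

```python
import math

def weighted_corr(x, y, w):
    """Pearson correlation of x and y with non-negative sample weights w."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    cov = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    vx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    vy = sum(wi * (yi - my) ** 2 for wi, yi in zip(w, y))
    return cov / math.sqrt(vx * vy)

def local_correlations(x, y, span=0.5):
    """Weighted correlation of x and y around each position of a ranked sample list."""
    n = len(x)
    half = max(1, int(span * n / 2))
    out = []
    for centre in range(n):
        # Triangular weights: 1 at the centre, reaching 0 beyond `half` ranks away
        w = [max(0.0, 1 - abs(i - centre) / (half + 1)) for i in range(n)]
        out.append(weighted_corr(x, y, w))
    return out

# Toy gene pair, samples already ordered by a ranking such as survival time:
# perfectly correlated in the first half, perfectly anti-correlated in the second
x = [math.sin(1.7 * i) for i in range(20)]
y = [xi if i < 10 else -xi for i, xi in enumerate(x)]
corrs = local_correlations(x, y)
spread = max(corrs) - min(corrs)
```

A large `spread` indicates that the correlation changes along the ranking. A two-class test would have to pick a cut point to detect the same change.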
12.
Bioinformatics ; 35(8): 1350-1357, 2019 04 15.
Article in English | MEDLINE | ID: mdl-30215668

ABSTRACT

MOTIVATION: Gene annotation and pathway databases such as Gene Ontology and Kyoto Encyclopaedia of Genes and Genomes are important tools in Gene-Set Test (GST) that describe gene biological functions and associated pathways. GST aims to establish an association between a gene-set of interest and an annotation. Importantly, GST tests for over-representation of genes in an annotation term. One implicit assumption of GST is that the gene expression platform captures the complete genome or a very large proportion of it. However, this assumption is satisfied neither for the increasingly popular boutique array nor for the custom-designed gene expression profiling platform. Specifically, conventional GST is no longer appropriate due to the gene-set selection bias induced during the construction of these platforms. RESULTS: We propose bcGST, a bias-corrected GST that introduces bias-correction terms in the contingency table needed for calculating Fisher's Exact Test. The adjustment method works by estimating the proportion of genes captured on the array with respect to the genome in order to assist filtration of annotation terms that would otherwise be falsely included or excluded. We illustrate the practicality of bcGST and its stability through multiple differential gene expression analyses in melanoma and the Cancer Genome Atlas cancer studies. AVAILABILITY AND IMPLEMENTATION: The bcGST method is made available as a Shiny web application at http://shiny.maths.usyd.edu.au/bcGST/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Gene Expression Profiling , Software , Computational Biology , Gene Ontology , Genome , Molecular Sequence Annotation
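
The bcGST abstract above turns on how the background gene universe enters the over-representation test. The sketch below shows the conventional one-sided Fisher's exact test (a hypergeometric tail probability) and how the same gene list can look enriched against a genome-wide background but unremarkable against a targeted array. It does not implement the bcGST bias-correction terms, and every count is hypothetical.

```python
from math import comb

def over_representation_p(k, n, K, N):
    """One-sided Fisher's exact test: P(X >= k) when n genes are drawn from a
    background of N genes, K of which carry the annotation term."""
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return tail / comb(N, n)

# Hypothetical gene list: 8 of 40 selected genes carry the annotation term
p_genome = over_representation_p(k=8, n=40, K=200, N=20000)  # genome-wide background
p_array = over_representation_p(k=8, n=40, K=100, N=500)     # targeted-array background
```

In this example the array is assumed to be heavily enriched for the annotation by design (100 of 500 genes), so finding 8 annotated genes among 40 is roughly what chance alone would produce.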
13.
J Am Soc Nephrol ; 30(6): 1061-1072, 2019 06.
Article in English | MEDLINE | ID: mdl-31040191

ABSTRACT

BACKGROUND: In patients with CKD, the risk of developing colorectal cancer is high and outcomes are poor. Screening using fecal immunochemical testing (FIT) is effective in reducing mortality from colorectal cancer, but performance characteristics of FIT in CKD are unknown. METHODS: To determine the detection rates and performance characteristics of FIT for advanced colorectal neoplasia (ACN) in patients with CKD, we used FIT to prospectively screen patients aged 35-74 years with CKD (stages 3-5 CKD, dialysis, and renal transplant) from 11 sites in Australia, New Zealand, Canada, and Spain. All participants received clinical follow-up at 2 years. We used a two-step reference standard approach to estimate disease status. RESULTS: Overall, 369 out of 1706 patients who completed FIT (21.6%) tested positive; 323 (87.5%) underwent colonoscopies. A total of 1553 (91.0%) completed follow-up; 82 (4.8%) had died and 71 (4.2%) were lost. The detection rate of ACN using FIT was 6.0% (5.6%, 7.4%, and 5.6% for stages 3-5 CKD, dialysis, and transplant). Sensitivity, specificity, and positive and negative predictive values of FIT for ACN were 0.90, 0.83, 0.30, and 0.99, respectively. Of participants who underwent colonoscopy, five (1.5%) experienced major colonoscopy-related complications, including bowel perforation and major bleeding. CONCLUSIONS: FIT appears to be an accurate screening test for patients with CKD, such that a negative test may rule out the diagnosis of colorectal cancer within 2 years. However, the risk of major complications from work-up colonoscopy is at least ten-fold higher than in the general population.


Subjects
Cause of Death , Colorectal Neoplasms/epidemiology , Colorectal Neoplasms/pathology , Early Detection of Cancer/methods , Renal Insufficiency, Chronic/epidemiology , Renal Insufficiency, Chronic/therapy , Adult , Aged , Australia , Canada , Cohort Studies , Colonoscopy/methods , Colorectal Neoplasms/diagnosis , Comorbidity , Female , Humans , Immunohistochemistry , Internationality , Male , Mass Screening/methods , Middle Aged , New Zealand , Occult Blood , Prevalence , Renal Insufficiency, Chronic/diagnosis , Retrospective Studies , Risk Assessment , Spain , Survival Analysis
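
The abstract above reports sensitivity, specificity, and positive and negative predictive values for FIT. For readers less familiar with how these four quantities relate to a 2x2 table, the sketch below computes them from true/false positive and negative counts. The counts are hypothetical and are not taken from the study; they only show how a high negative predictive value can sit alongside a modest positive predictive value when the condition is uncommon.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard screening-test metrics from a 2x2 confusion table."""
    return {
        "sensitivity": tp / (tp + fn),  # diseased patients who test positive
        "specificity": tn / (tn + fp),  # disease-free patients who test negative
        "ppv": tp / (tp + fp),          # positive tests that are true positives
        "npv": tn / (tn + fn),          # negative tests that are true negatives
    }

# Hypothetical counts for 1,100 screened patients, 100 of whom have the condition
m = diagnostic_metrics(tp=90, fp=170, fn=10, tn=830)
```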
14.
BMC Bioinformatics ; 20(Suppl 19): 660, 2019 Dec 24.
Article in English | MEDLINE | ID: mdl-31870278

ABSTRACT

BACKGROUND: Single-cell RNA-sequencing (scRNA-seq) is a transformative technology, allowing global transcriptomes of individual cells to be profiled with high accuracy. An essential task in scRNA-seq data analysis is the identification of cell types from complex samples or tissues profiled in an experiment. To this end, clustering has become a key computational technique for grouping cells based on their transcriptome profiles, enabling subsequent cell type identification from each cluster of cells. Due to the high feature-dimensionality of the transcriptome (i.e. the large number of measured genes in each cell) and because only a small fraction of genes are cell type-specific and therefore informative for generating cell type-specific clusters, clustering directly on the original feature/gene dimension may lead to uninformative clusters and hinder correct cell type identification. RESULTS: Here, we propose an autoencoder-based cluster ensemble framework in which we first take random subspace projections from the data, then compress each random projection to a low-dimensional space using an autoencoder artificial neural network, and finally apply ensemble clustering across all encoded datasets to generate clusters of cells. We employ four evaluation metrics to benchmark clustering performance and our experiments demonstrate that the proposed autoencoder-based cluster ensemble can lead to substantially improved cell type-specific clusters when applied with both the standard k-means clustering algorithm and a state-of-the-art kernel-based clustering algorithm (SIMLR) designed specifically for scRNA-seq data. Compared to directly using these clustering algorithms on the original datasets, the performance improvement in some cases is up to 100%, depending on the evaluation metric used. CONCLUSIONS: Our results suggest that the proposed framework can facilitate more accurate cell type identification as well as other downstream analyses. 
The code for creating the proposed autoencoder-based cluster ensemble framework is freely available from https://github.com/gedcom/scCCESS.


Subjects
Sequence Analysis, RNA , Algorithms , Cluster Analysis , Data Analysis , Humans , Neural Networks, Computer , RNA-Seq , Single-Cell Analysis , Transcriptome
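
The final stage of the framework in the abstract above is ensemble clustering across several encoded versions of the data. The sketch below illustrates one common way to combine base clusterings, a co-association matrix followed by thresholded grouping. It is not the scCCESS code: the random-projection and autoencoder stages are omitted, the base clusterings are hard-coded hypothetical label vectors, and the consensus rule is an assumption.

```python
def co_association(labelings):
    """Fraction of base clusterings in which each pair of cells shares a cluster."""
    n = len(labelings[0])
    runs = len(labelings)
    return [
        [sum(lab[i] == lab[j] for lab in labelings) / runs for j in range(n)]
        for i in range(n)
    ]

def consensus_clusters(labelings, threshold=0.5):
    """Group cells connected by co-association above `threshold`
    (connected components of the thresholded co-association graph)."""
    co = co_association(labelings)
    n = len(co)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and co[i][j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

# Three hypothetical base clusterings of six cells, e.g. from different random subspaces
runs = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
]
labels = consensus_clusters(runs)
```

Cluster label numbers differ between runs, which is why the comparison is made on pairs of cells rather than on the labels themselves.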
15.
BMC Bioinformatics ; 20(Suppl 19): 721, 2019 Dec 24.
Article in English | MEDLINE | ID: mdl-31870280

ABSTRACT

BACKGROUND: Differences in cell-type composition across subjects and conditions often carry biological significance. Recent advancements in single cell sequencing technologies enable cell-types to be identified at the single cell level, and as a result, cell-type composition of tissues can now be studied in exquisite detail. However, a number of challenges remain with cell-type composition analysis: none of the existing methods can identify cell-types perfectly, and variability related to cell sampling exists in any single cell experiment. This necessitates the development of methods for estimating uncertainty in cell-type composition. RESULTS: We developed a novel single cell differential composition (scDC) analysis method that performs differential cell-type composition analysis via bootstrap resampling. scDC captures the uncertainty associated with cell-type proportions of each subject via bias-corrected and accelerated bootstrap confidence intervals. We assessed the performance of our method using a number of simulated datasets and synthetic datasets curated from publicly available single cell datasets. In simulated datasets, scDC correctly recovered the true cell-type proportions. In synthetic datasets, the cell-type compositions returned by scDC were highly concordant with reference cell-type compositions from the original data. Since the majority of datasets tested in this study have only 2 to 5 subjects per condition, the addition of confidence intervals enabled better comparisons of compositional differences between subjects and across conditions. CONCLUSIONS: scDC is a novel statistical method for performing differential cell-type composition analysis for scRNA-seq data. It uses bootstrap resampling to estimate the standard errors associated with cell-type proportion estimates and performs significance testing through GLM and GLMM models. 
We have made this method available to the scientific community as part of the scdney (Single Cell Data Integrative Analysis) R package, available from https://github.com/SydneyBioX/scdney.


Subjects
Single-Cell Analysis/methods , Humans
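
The scDC abstract above relies on bootstrap resampling of cells to quantify uncertainty in cell-type proportions. The sketch below shows the basic resampling idea for a single subject. It uses a simple percentile interval, not the bias-corrected and accelerated (BCa) interval the abstract names, and the cell labels, counts, and function name are hypothetical.

```python
import random

def bootstrap_proportion_ci(cell_types, target, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the proportion of `target` cells."""
    rng = random.Random(seed)
    n = len(cell_types)
    props = []
    for _ in range(n_boot):
        sample = rng.choices(cell_types, k=n)  # resample cells with replacement
        props.append(sample.count(target) / n)
    props.sort()
    lo = props[int((alpha / 2) * n_boot)]
    hi = props[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi

# Hypothetical subject with 200 cells, 30% of them labelled as T cells
cells = ["T"] * 60 + ["B"] * 80 + ["Mono"] * 60
lo, hi = bootstrap_proportion_ci(cells, "T")
```

With only a few subjects per condition, intervals like this give some sense of whether a difference in proportions exceeds what cell sampling alone could produce.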
16.
Microcirculation ; 26(2): e12488, 2019 02.
Article in English | MEDLINE | ID: mdl-29956866

ABSTRACT

Identification of the four standard modifiable cardiovascular risk factors (SMuRFs), namely diabetes mellitus, hyperlipidaemia, hypertension, and cigarette smoking, has allowed the development of risk scores. These have been used in conjunction with primary and secondary prevention strategies targeting SMuRFs to reduce the burden of coronary artery disease (CAD). Recent studies show that up to 25% of acute coronary syndrome (ACS) patients do not have any SMuRFs. Thus, SMuRFs do not explain the entire burden of CAD. There appears to be variation at the individual level rendering some individuals relatively susceptible or resilient to developing atherosclerosis. Important disease pathways remain to be discovered, and there is renewed enthusiasm to discover novel biomarkers, biological mechanisms, and therapeutic targets for atherosclerosis. Two broad approaches are being taken: traditional approaches investigating known candidate pathways and unbiased omics approaches. We review recent progress in the field and discuss opportunities made possible by technological and data science advances. Developments in network analytics and machine learning algorithms used in conjunction with large-scale multi-omic platforms have the potential to uncover biological networks that may not have been identifiable using traditional approaches. These approaches are useful for both biomedical research and precision medicine strategies.


Subjects
Biomedical Technology/methods , Computational Biology/methods , Coronary Artery Disease , Animals , Atherosclerosis , Biomarkers , Coronary Artery Disease/diagnosis , Coronary Artery Disease/therapy , Humans , Precision Medicine
17.
Nucleic Acids Res ; 44(13): e119, 2016 07 27.
Article in English | MEDLINE | ID: mdl-27190235

ABSTRACT

A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expression variability (DV) of target genes is frequently observed. This suggests that assessing the importance of genes by either differential expression or variability alone potentially misses sets of important biomarkers that could lead to improved predictions and treatments. Here, we describe a new approach for assessing the importance of genes based on differential distribution (DD), which combines information from differential expression and differential variability into a unified metric. We show that feature ranking and selection stability based on DD can perform two to three times better than DE or DV alone, and that DD yields equivalent error rates to DE and DV. Finally, assessing genes via differential distribution produces a complementary set of selected genes to DE and DV, potentially opening up new categories of biomarkers.


Subjects
Biomarkers, Tumor/genetics , Gene Expression Regulation, Neoplastic/genetics , Melanoma/genetics , Oligonucleotide Array Sequence Analysis/methods , Adenocarcinoma/genetics , Adenocarcinoma/pathology , Adenocarcinoma of Lung , Algorithms , Biomarkers, Tumor/biosynthesis , Female , Gene Expression Profiling/methods , Humans , Lung Neoplasms/genetics , Lung Neoplasms/pathology , Melanoma/pathology , Ovarian Neoplasms/genetics , Ovarian Neoplasms/pathology
18.
J Proteome Res ; 16(7): 2359-2369, 2017 07 07.
Article in English | MEDLINE | ID: mdl-28580786

ABSTRACT

Tandem mass spectrometry is one of the most popular techniques for quantitation of proteomes. There exists a large variety of options in each stage of data preprocessing that impact the bias and variance of the summarized protein-level values. Using a newly released data set satisfying a replicated Latin squares design, a diverse set of performance metrics has been developed and implemented in a web-based application, Quantitative Performance Evaluator for Proteomics (QPEP). QPEP has the flexibility to allow users to apply their own method to preprocess this data set and share the results, allowing direct and straightforward comparison of new methodologies. Application of these new metrics to three case studies highlights that (i) the summarization of peptides to proteins is robust to the choice of peptide summary used, (ii) the differences between iTRAQ labels are stronger than the differences between experimental runs, and (iii) the commercial software ProteinPilot performs equivalently well at between-sample normalization to more complicated methods developed by academics. Importantly, finding (ii) underscores the benefits of using the principles of randomization and blocking to avoid the experimental measurements being confounded by technical factors. Data are available via ProteomeXchange with identifier PXD003608.


Subjects
Peptides/analysis , Proteome/analysis , Proteomics/statistics & numerical data , Saccharomyces cerevisiae Proteins/isolation & purification , Software , Tandem Mass Spectrometry/standards , Benchmarking , Internet , Reproducibility of Results , Saccharomyces cerevisiae/chemistry
19.
J Biol Chem ; 291(49): 25629-25640, 2016 Dec 02.
Article in English | MEDLINE | ID: mdl-27738101

ABSTRACT

Hyperinsulinemia, which is associated with aging and metabolic disease, may lead to defective protein homeostasis (proteostasis) due to hyperactivation of insulin-sensitive pathways such as protein synthesis. We investigated the effect of chronic hyperinsulinemia on proteostasis by generating a time-resolved map of insulin-regulated protein turnover in adipocytes using metabolic pulse-chase labeling and high-resolution mass spectrometry. Hyperinsulinemia increased the synthesis of nearly half of all detected proteins and did not affect protein degradation despite suppressing autophagy. Unexpectedly, this marked elevation in protein synthesis was accompanied by enhanced protein stability and folding and not by markers of proteostasis stress such as protein carbonylation and aggregation. The improvement in proteostasis was attributed to a coordinate up-regulation of proteins in the global proteostasis network, including ribosomal, proteasomal, chaperone, and endoplasmic reticulum/mitochondrial unfolded protein response proteins. We conclude that defects associated with hyperactivation of the insulin signaling pathway are unlikely to be attributable to defective proteostasis because up-regulation of protein synthesis by insulin is accompanied by up-regulation of proteostatic machinery.


Subjects
Adipocytes/metabolism , Insulin/metabolism , Protein Biosynthesis , Protein Carbonylation , Proteolysis , Signal Transduction , Unfolded Protein Response , 3T3-L1 Cells , Adipocytes/pathology , Animals , Hyperinsulinism/metabolism , Hyperinsulinism/pathology , Mice
20.
BMC Genomics ; 18(1): 946, 2017 Dec 04.
Article in English | MEDLINE | ID: mdl-29202695

ABSTRACT

BACKGROUND: The differentiation and maturation trajectories of fetal liver stem/progenitor cells (LSPCs) are not fully understood at single-cell resolution, and a priori knowledge of limited biomarkers could restrict trajectory tracking. RESULTS: We employed marker-free single-cell RNA-Seq to characterize comprehensive transcriptional profiles of 507 cells randomly selected from seven stages between embryonic day 11.5 and postnatal day 2.5 during mouse liver development, and also 52 Epcam-positive cholangiocytes from postnatal day 3.25 mouse livers. LSPCs in developing mouse livers were identified via marker-free transcriptomic profiling. Single-cell resolution dynamic developmental trajectories of LSPCs exhibited contiguous but discrete genetic control through transcription factors and signaling pathways. The gene expression profiles of cholangiocytes were closer to those of embryonic day 11.5 LSPCs than to those of later-stage LSPCs, cueing the fate decision stage of LSPCs. Our marker-free approach also allows systematic assessment and prediction of isolation biomarkers for LSPCs. CONCLUSIONS: Our data provide not only a valuable resource but also novel insights into the fate decision and transcriptional control of self-renewal, differentiation and maturation of LSPCs.


Subjects
Embryonic Stem Cells/metabolism , Gene Expression Profiling/methods , Gene Expression Regulation, Developmental , Liver/metabolism , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Animals , Biomarkers/metabolism , Cells, Cultured , Embryonic Stem Cells/cytology , Liver/embryology , Mice , Mice, Inbred C57BL