Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
EMBO J ; 42(11): e112590, 2023 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-36912146

RESUMO

During development, the lymphatic vasculature forms as a second network derived chiefly from blood vessels. The transdifferentiation of embryonic venous endothelial cells (VECs) into lymphatic endothelial cells (LECs) is a key step in this process. Specification, differentiation and maintenance of LEC fate are all driven by the transcription factor Prox1, yet the downstream mechanisms remain to be elucidated. We here present a single-cell transcriptomic atlas of lymphangiogenesis in zebrafish, revealing new markers and hallmarks of LEC differentiation over four developmental stages. We further profile single-cell transcriptomic and chromatin accessibility changes in zygotic prox1a mutants that are undergoing a LEC-VEC fate shift. Using maternal and zygotic prox1a/prox1b mutants, we determine the earliest transcriptomic changes directed by Prox1 during LEC specification. This work altogether reveals new downstream targets and regulatory regions of the genome controlled by Prox1 and presents evidence that Prox1 specifies LEC fate primarily by limiting blood vascular and haematopoietic fate. This extensive single-cell resource provides new mechanistic insights into the enigmatic role of Prox1 and the control of LEC differentiation in development.


Assuntos
Vasos Linfáticos , Peixe-Zebra , Animais , Peixe-Zebra/genética , Proteínas de Homeodomínio/genética , Proteínas Supressoras de Tumor/genética , Células Endoteliais , Células Cultivadas , Diferenciação Celular , Linfangiogênese/genética , Fatores de Transcrição/genética , Análise de Célula Única
2.
Noncoding RNA ; 7(2)2021 Jun 08.
Artigo em Inglês | MEDLINE | ID: mdl-34201302

RESUMO

Phenotypes are driven by regulated gene expression, which in turn are mediated by complex interactions between diverse biological molecules. Protein-DNA interactions such as histone and transcription factor binding are well studied, along with RNA-RNA interactions in short RNA silencing of genes. In contrast, lncRNA-protein interaction (LPI) mechanisms are comparatively unknown, likely directed by the difficulties in studying LPI. However, LPI are emerging as key interactions in epigenetic mechanisms, playing a role in development and disease. Their importance is further highlighted by their conservation across kingdoms. Hence, interest in LPI research is increasing. We therefore review the current state of the art in lncRNA-protein interactions. We specifically surveyed recent computational methods and databases which researchers can exploit for LPI investigation. We discovered that algorithm development is heavily reliant on a few generic databases containing curated LPI information. Additionally, these databases house information at gene-level as opposed to transcript-level annotations. We show that early methods predict LPI using molecular docking, have limited scope and are slow, creating a data processing bottleneck. Recently, machine learning has become the strategy of choice in LPI prediction, likely due to the rapid growth in machine learning infrastructure and expertise. While many of these methods have notable limitations, machine learning is expected to be the basis of modern LPI prediction algorithms.

3.
Brief Bioinform ; 22(6)2021 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-34036326

RESUMO

Despite the volume of experiments performed and data available, the complex biology of coronavirus SARS-COV-2 is not yet fully understood. Existing molecular profiling studies have focused on analysing functional omics data of a single type, which captures changes in a small subset of the molecular perturbations caused by the virus. As the logical next step, results from multiple such omics analysis may be aggregated to comprehensively interpret the molecular mechanisms of SARS-CoV-2. An alternative approach is to integrate data simultaneously in a parallel fashion to highlight the inter-relationships of disease-driving biomolecules, in contrast to comparing processed information from each omics level separately. We demonstrate that valuable information may be masked by using the former fragmented views in analysis, and biomarkers resulting from such an approach cannot provide a systematic understanding of the disease aetiology. Hence, we present a generic, reproducible and flexible open-access data harmonisation framework that can be scaled out to future multi-omics analysis to study a phenotype in a holistic manner. The pipeline source code, detailed documentation and automated version as a R package are accessible. To demonstrate the effectiveness of our pipeline, we applied it to a drug screening task. We integrated multi-omics data to find the lowest level of statistical associations between data features in two case studies. Strongly correlated features within each of these two datasets were used for drug-target analysis, resulting in a list of 84 drug-target candidates. Further computational docking and toxicity analyses revealed seven high-confidence targets, amsacrine, bosutinib, ceritinib, crizotinib, nintedanib and sunitinib as potential starting points for drug therapy and development.


Assuntos
Tratamento Farmacológico da COVID-19 , Genômica , Terapia de Alvo Molecular , SARS-CoV-2/efeitos dos fármacos , Algoritmos , Biomarcadores/química , COVID-19/genética , COVID-19/patologia , COVID-19/virologia , Biologia Computacional , Bases de Dados Genéticas , Humanos , SARS-CoV-2/química , SARS-CoV-2/genética , Software
4.
PLoS Comput Biol ; 16(9): e1008219, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32986694

RESUMO

Gene expression atlases have transformed our understanding of the development, composition and function of human tissues. New technologies promise improved cellular or molecular resolution, and have led to the identification of new cell types, or better defined cell states. But as new technologies emerge, information derived on old platforms becomes obsolete. We demonstrate that it is possible to combine a large number of different profiling experiments summarised from dozens of laboratories and representing hundreds of donors, to create an integrated molecular map of human tissue. As an example, we combine 850 samples from 38 platforms to build an integrated atlas of human blood cells. We achieve robust and unbiased cell type clustering using a variance partitioning method, selecting genes with low platform bias relative to biological variation. Other than an initial rescaling, no other transformation to the primary data is applied through batch correction or renormalisation. Additional data, including single-cell datasets, can be projected for comparison, classification and annotation. The resulting atlas provides a multi-scaled approach to visualise and analyse the relationships between sets of genes and blood cell lineages, including the maturation and activation of leukocytes in vivo and in vitro. In allowing for data integration across hundreds of studies, we address a key reproduciblity challenge which is faced by any new technology. This allows us to draw on the deep phenotypes and functional annotations that accompany traditional profiling methods, and provide important context to the high cellular resolution of single cell profiling. Here, we have implemented the blood atlas in the open access Stemformatics.org platform, drawing on its extensive collection of curated transcriptome data. The method is simple, scalable and amenable for rapid deployment in other biological systems or computational workflows.


Assuntos
Transcriptoma , Análise por Conglomerados , Curadoria de Dados , Perfilação da Expressão Gênica , Humanos
5.
Gigascience ; 9(6)2020 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-32543653

RESUMO

BACKGROUND: Diseases are complex phenotypes often arising as an emergent property of a non-linear network of genetic and epigenetic interactions. To translate this resulting state into a causal relationship with a subset of regulatory features, many experiments deploy an array of laboratory assays from multiple modalities. Often, each of these resulting datasets is large, heterogeneous, and noisy. Thus, it is non-trivial to unify these complex datasets into an interpretable phenotype. Although recent methods address this problem with varying degrees of success, they are constrained by their scopes or limitations. Therefore, an important gap in the field is the lack of a universal data harmonizer with the capability to arbitrarily integrate multi-modal datasets. RESULTS: In this review, we perform a critical analysis of methods with the explicit aim of harmonizing data, as opposed to case-specific integration. This revealed that matrix factorization, latent variable analysis, and deep learning are potent strategies. Finally, we describe the properties of an ideal universal data harmonization framework. CONCLUSIONS: A sufficiently advanced universal harmonizer has major medical implications, such as (i) identifying dysregulated biological pathways responsible for a disease is a powerful diagnostic tool; (2) investigating these pathways further allows the biological community to better understand a disease's mechanisms; and (3) precision medicine also benefits from developments in this area, particularly in the context of the growing field of selective epigenome editing, which can suppress or induce a desired phenotype.


Assuntos
Biologia Computacional/métodos , Epigênese Genética , Epigenômica/métodos , Regulação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Aprendizado de Máquina , Software
6.
Nat Cell Biol ; 22(1): 60-73, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31907413

RESUMO

Defining the ontogeny of the human adaptive immune system during embryogenesis has implications for understanding childhood diseases including leukaemias and autoimmune conditions. Using RAG1:GFP human pluripotent stem cell reporter lines, we examined human T-cell genesis from pluripotent-stem-cell-derived haematopoietic organoids. Under conditions favouring T-cell development, RAG1+ cells progressively upregulated a cohort of recognized T-cell-associated genes, arresting development at the CD4+CD8+ stage. Sort and re-culture experiments showed that early RAG1+ cells also possessed B-cell, myeloid and erythroid potential. Flow cytometry and single-cell-RNA-sequencing data showed that early RAG1+ cells co-expressed the endothelial/haematopoietic progenitor markers CD34, VECAD and CD90, whereas imaging studies identified RAG1+ cells within CD31+ endothelial structures that co-expressed SOX17+ or the endothelial marker CAV1. Collectively, these observations provide evidence for a wave of human T-cell development that originates directly from haemogenic endothelium via a RAG1+ intermediate with multilineage potential.


Assuntos
Endotélio/citologia , Hemangioblastos/citologia , Células-Tronco Hematopoéticas/citologia , Proteínas de Homeodomínio/metabolismo , Células-Tronco Pluripotentes/citologia , Diferenciação Celular/fisiologia , Linhagem Celular , Desenvolvimento Embrionário/fisiologia , Transplante de Células-Tronco Hematopoéticas/métodos , Humanos , Organoides/citologia
7.
Nucleic Acids Res ; 47(D1): D841-D846, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30407577

RESUMO

Stemformatics is an established gene expression data portal containing over 420 public gene expression datasets derived from microarray, RNA sequencing and single cell profiling technologies. Developed for the stem cell community, it has a major focus on pluripotency, tissue stem cells, and staged differentiation. Stemformatics includes curated 'collections' of data relevant to cell reprogramming, as well as hematopoiesis and leukaemia. Rather than simply rehosting datasets as they appear in public repositories, Stemformatics uses a stringent set of quality control metrics and its own pipelines to process handpicked datasets from raw files. This means that about 30% of datasets processed by Stemformatics fail the quality control metrics and never make it to the portal, ensuring that Stemformatics data are of high quality and have been processed in a consistent manner. Stemformatics provides easy-to-use and intuitive tools for biologists to visually explore the data, including interactive gene expression profiles, principal component analysis plots and hierarchical clusters, among others. The addition of tools that facilitate cross-dataset comparisons provides users with snapshots of gene expression in multiple cell and tissues, assisting the identification of cell-type restricted genes, or potential housekeeping genes. Stemformatics is freely available at stemformatics.org.


Assuntos
Bases de Dados Genéticas , Células-Tronco , Transcriptoma/genética , Animais , Diferenciação Celular/genética , Curadoria de Dados , Genes Essenciais/genética , Humanos , Análise de Sequência de RNA , Software
8.
PeerJ ; 4: e1845, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27042394

RESUMO

Mesenchymal stromal cells (MSC) are widely used for the study of mesenchymal tissue repair, and increasingly adopted for cell therapy, despite the lack of consensus on the identity of these cells. In part this is due to the lack of specificity of MSC markers. Distinguishing MSC from other stromal cells such as fibroblasts is particularly difficult using standard analysis of surface proteins, and there is an urgent need for improved classification approaches. Transcriptome profiling is commonly used to describe and compare different cell types; however, efforts to identify specific markers of rare cellular subsets may be confounded by the small sample sizes of most studies. Consequently, it is difficult to derive reproducible, and therefore useful markers. We addressed the question of MSC classification with a large integrative analysis of many public MSC datasets. We derived a sparse classifier (The Rohart MSC test) that accurately distinguished MSC from non-MSC samples with >97% accuracy on an internal training set of 635 samples from 41 studies derived on 10 different microarray platforms. The classifier was validated on an external test set of 1,291 samples from 65 studies derived on 15 different platforms, with >95% accuracy. The genes that contribute to the MSC classifier formed a protein-interaction network that included known MSC markers. Further evidence of the relevance of this new MSC panel came from the high number of Mendelian disorders associated with mutations in more than 65% of the network. These result in mesenchymal defects, particularly impacting on skeletal growth and function. The Rohart MSC test is a simple in silico test that accurately discriminates MSC from fibroblasts, other adult stem/progenitor cell types or differentiated stromal cells. It has been implemented in the www.stemformatics.org resource, to assist researchers wishing to benchmark their own MSC datasets or data from the public domain. The code is available from the CRAN repository and all data used to generate the MSC test is available to download via the Gene Expression Omnibus or the Stemformatics resource.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...