Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 23
Filtrar
1.
J Invest Dermatol ; 144(2): 252-262.e4, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-37598867

RESUMEN

Tissue transcriptomics is used to uncover molecular dysregulations underlying diseases. However, the majority of transcriptomics studies focus on single diseases with limited relevance for understanding the molecular relationship between diseases or for identifying disease-specific markers. In this study, we used a normalization approach to compare gene expression across nine inflammatory skin diseases. The normalized datasets were found to retain differential expression signals that allowed unsupervised disease clustering and identification of disease-specific gene signatures. Using the NS-Forest algorithm, we identified a minimal set of biomarkers and validated their use as diagnostic disease classifier. Among them, PTEN was identified as being a specific marker for cutaneous lupus erythematosus and found to be strongly expressed by lesional keratinocytes in association with pathogenic type I IFNs. In fact, PTEN facilitated the expression of IFN-ß and IFN-κ in keratinocytes by promoting activation and nuclear translocation of IRF3. Thus, cross-comparison of tissue transcriptomics is a valid strategy to establish a molecular disease classification and to identify pathogenic disease biomarkers.


Asunto(s)
Dermatitis , Lupus Eritematoso Cutáneo , Lupus Eritematoso Sistémico , Humanos , Biomarcadores/metabolismo , Dermatitis/patología , Perfilación de la Expresión Génica , Queratinocitos/metabolismo , Lupus Eritematoso Cutáneo/diagnóstico , Lupus Eritematoso Cutáneo/genética , Lupus Eritematoso Cutáneo/metabolismo , Lupus Eritematoso Sistémico/genética , Fosfohidrolasa PTEN/genética , Piel/patología
2.
Sci Rep ; 13(1): 9567, 2023 06 13.
Artículo en Inglés | MEDLINE | ID: mdl-37311768

RESUMEN

With the advent of multiplex fluorescence in situ hybridization (FISH) and in situ RNA sequencing technologies, spatial transcriptomics analysis is advancing rapidly, providing spatial location and gene expression information about cells in tissue sections at single cell resolution. Cell type classification of these spatially-resolved cells can be inferred by matching the spatial transcriptomics data to reference atlases derived from single cell RNA-sequencing (scRNA-seq) in which cell types are defined by differences in their gene expression profiles. However, robust cell type matching of the spatially-resolved cells to reference scRNA-seq atlases is challenging due to the intrinsic differences in resolution between the spatial and scRNA-seq data. In this study, we systematically evaluated six computational algorithms for cell type matching across four image-based spatial transcriptomics experimental protocols (MERFISH, smFISH, BaristaSeq, and ExSeq) conducted on the same mouse primary visual cortex (VISp) brain region. We find that many cells are assigned as the same type by multiple cell type matching algorithms and are present in spatial patterns previously reported from scRNA-seq studies in VISp. Furthermore, by combining the results of individual matching strategies into consensus cell type assignments, we see even greater alignment with biological expectations. We present two ensemble meta-analysis strategies used in this study and share the consensus cell type matching results in the Cytosplore Viewer ( https://viewer.cytosplore.org ) for interactive visualization and data exploration. The consensus matching can also guide spatial data analysis using SSAM, allowing segmentation-free cell type assignment.


Asunto(s)
Corteza Visual Primaria , Transcriptoma , Animales , Ratones , Hibridación Fluorescente in Situ , Perfilación de la Expresión Génica , Algoritmos
4.
Sci Data ; 10(1): 50, 2023 01 24.
Artículo en Inglés | MEDLINE | ID: mdl-36693887

RESUMEN

Large-scale single-cell 'omics profiling is being used to define a complete catalogue of brain cell types, something that traditional methods struggle with due to the diversity and complexity of the brain. But this poses a problem: How do we organise such a catalogue - providing a standard way to refer to the cell types discovered, linking their classification and properties to supporting data? Cell ontologies provide a partial solution to these problems, but no existing ontology schemas support the definition of cell types by direct reference to supporting data, classification of cell types using classifications derived directly from data, or links from cell types to marker sets along with confidence scores. Here we describe a generally applicable schema that solves these problems and its application in a semi-automated pipeline to build a data-linked extension to the Cell Ontology representing cell types in the Primary Motor Cortex of humans, mice and marmosets. The methods and resulting ontology are designed to be scalable and applicable to similar whole-brain atlases currently in preparation.


Asunto(s)
Ontologías Biológicas , Encéfalo , Animales , Humanos , Ratones , Callithrix , Recolección de Datos/normas
5.
PLoS One ; 17(9): e0275070, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36149937

RESUMEN

With the advent of single cell/nucleus RNA sequencing (sc/snRNA-seq), the field of cell phenotyping is now a data-driven exercise providing statistical evidence to support cell type/state categorization. However, the task of classifying cells into specific, well-defined categories with the empirical data provided by sc/snRNA-seq remains nontrivial due to the difficulty in determining specific differences between related cell types with close transcriptional similarities, resulting in challenges with matching cell types identified in separate experiments. To investigate possible approaches to overcome these obstacles, we explored the use of supervised machine learning methods-logistic regression, support vector machines, random forests, neural networks, and light gradient boosting machine (LightGBM)-as approaches to classify cell types using snRNA-seq datasets from human brain middle temporal gyrus (MTG) and human kidney. Classification accuracy was evaluated using an F-beta score weighted in favor of precision to account for technical artifacts of gene expression dropout. We examined the impact of hyperparameter optimization and feature selection methods on F-beta score performance. We found that the best performing model for granular cell type classification in both datasets is a multinomial logistic regression classifier and that an effective feature selection step was the most influential factor in optimizing the performance of the machine learning pipelines.


Asunto(s)
Aprendizaje Automático , ARN , Humanos , Modelos Logísticos , ARN Nuclear Pequeño , Análisis de Secuencia de ARN/métodos , Máquina de Vectores de Soporte
6.
Bioinformatics ; 38(20): 4735-4744, 2022 10 14.
Artículo en Inglés | MEDLINE | ID: mdl-36018232

RESUMEN

MOTIVATION: Flow cytometry (FCM) and transcription profiling are the two widely used assays in translational immunology research. However, there is no data integration pipeline for analyzing these two types of assays together with experiment variables for biomarker inference. Current FCM data analysis mainly relies on subjective manual gating analysis, which is difficult to be directly integrated with other automated computational methods. Existing deconvolutional analysis of bulk transcriptomics relies on predefined marker genes in the transcriptomics data, which are unavailable for novel cell types and does not utilize the FCM data that provide canonical phenotypic definitions of the cell types. RESULTS: We developed a novel analytics pipeline-FastMix-for computational immunology, which integrates flow cytometry, bulk transcriptomics and clinical covariates for identifying cell type-specific gene expression signatures and biomarker genes. FastMix addresses the 'large p, small n' problem in the gene expression and flow cytometry integration analysis via a linear mixed effects model (LMER) for both cross-sectional and longitudinal studies. Its novel moment-based estimator not only reduces bias in parameter estimation but also is more efficient than iterative optimization. The FastMix pipeline also includes a cutting-edge flow cytometry data analysis method-DAFi-for identifying cell populations of interest and their characteristics. Simulation studies showed that FastMix produced smaller type I/II errors than competing methods. Validation using real data of two vaccine studies showed that FastMix identified a consistent set of signature genes as in independent single-cell RNA-seq analysis, producing additional interesting findings. AVAILABILITY AND IMPLEMENTATION: Source code of FastMix is publicly available at https://github.com/terrysun0302/FastMix. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Transcriptoma , Biomarcadores , Estudios Transversales , Análisis de Datos
8.
Plant Physiol ; 188(2): 879-897, 2022 02 04.
Artículo en Inglés | MEDLINE | ID: mdl-34893913

RESUMEN

The ability to trace every cell in some model organisms has led to the fundamental understanding of development and cellular function. However, in plants the complexity of cell number, organ size, and developmental time makes this a challenge even in the diminutive model plant Arabidopsis (Arabidopsis thaliana). Duckweed, basal nongrass aquatic monocots, provide an opportunity to follow every cell of an entire plant due to their small size, reduced body plan, and fast clonal growth habit. Here we present a chromosome-resolved genome for the highly invasive Lesser Duckweed (Lemna minuta) and generate a preliminary cell atlas leveraging low cell coverage single-nuclei sequencing. We resolved the 360 megabase genome into 21 chromosomes, revealing a core nonredundant gene set with only the ancient tau whole-genome duplication shared with all monocots, and paralog expansion as a result of tandem duplications related to phytoremediation. Leveraging SMARTseq2 single-nuclei sequencing, which provided higher gene coverage yet lower cell count, we profiled 269 nuclei covering 36.9% (8,457) of the L. minuta transcriptome. Since molecular validation was not possible in this nonmodel plant, we leveraged gene orthology with model organism single-cell expression datasets, gene ontology, and cell trajectory analysis to define putative cell types. We found that the tissue that we computationally defined as mesophyll expressed high levels of elemental transport genes consistent with this tissue playing a role in L. minuta wastewater detoxification. The L. minuta genome and preliminary cell map provide a paradigm to decipher developmental genes and pathways for an entire plant.


Asunto(s)
Araceae/genética , Especies Introducidas , Dispersión de las Plantas/genética , Transcriptoma , Genoma de Planta
9.
Front Immunol ; 12: 690470, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34777332

RESUMEN

Vaccination to prevent infectious disease is one of the most successful public health interventions ever developed. And yet, variability in individual vaccine effectiveness suggests that a better mechanistic understanding of vaccine-induced immune responses could improve vaccine design and efficacy. We have previously shown that protective antibody levels could be elicited in a subset of recipients with only a single dose of the hepatitis B virus (HBV) vaccine and that a wide range of antibody levels were elicited after three doses. The immune mechanisms responsible for this vaccine response variability is unclear. Using single cell RNA sequencing of sorted innate immune cell subsets, we identified two distinct myeloid dendritic cell subsets (NDRG1-expressing mDC2 and CDKN1C-expressing mDC4), the ratio of which at baseline (pre-vaccination) correlated with the immune response to a single dose of HBV vaccine. Our results suggest that the participants in our vaccine study were in one of two different dendritic cell dispositional states at baseline - an NDRG2-mDC2 state in which the vaccine elicited an antibody response after a single immunization or a CDKN1C-mDC4 state in which the vaccine required two or three doses for induction of antibody responses. To explore this correlation further, genes expressed in these mDC subsets were used for feature selection prior to the construction of predictive models using supervised canonical correlation machine learning. The resulting models showed an improved correlation with serum antibody titers in response to full vaccination. Taken together, these results suggest that the propensity of circulating dendritic cells toward either activation or suppression, their "dispositional endotype" at pre-vaccination baseline, could dictate response to vaccination.


Asunto(s)
Células Dendríticas/inmunología , Anticuerpos contra la Hepatitis B/inmunología , Vacunas contra Hepatitis B/inmunología , Hepatitis B/prevención & control , Aprendizaje Automático , Análisis de la Célula Individual , Adulto , Anciano , Análisis de Correlación Canónica , Células Dendríticas/metabolismo , Femenino , Perfilación de la Expresión Génica , Hepatitis B/epidemiología , Secuenciación de Nucleótidos de Alto Rendimiento , Interacciones Huésped-Patógeno , Humanos , Masculino , Persona de Mediana Edad , Análisis de la Célula Individual/métodos , Vacunación , Eficacia de las Vacunas
10.
Nature ; 598(7879): 111-119, 2021 10.
Artículo en Inglés | MEDLINE | ID: mdl-34616062

RESUMEN

The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals1. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species. Despite the overall conservation, however, many species-dependent specializations are apparent, including differences in cell-type proportions, gene expression, DNA methylation and chromatin state. Few cell-type marker genes are conserved across species, revealing a short list of candidate genes and regulatory mechanisms that are responsible for conserved features of homologous cell types, such as the GABAergic chandelier cells. This consensus transcriptomic classification allows us to use patch-seq (a combination of whole-cell patch-clamp recordings, RNA sequencing and morphological characterization) to identify corticospinal Betz cells from layer 5 in non-human primates and humans, and to characterize their highly specialized physiology and anatomy. These findings highlight the robust molecular underpinnings of cell-type diversity in M1 across mammals, and point to the genes and regulatory pathways responsible for the functional identity of cell types and their species-specific adaptations.


Asunto(s)
Corteza Motora/citología , Neuronas/clasificación , Análisis de la Célula Individual , Animales , Atlas como Asunto , Callithrix/genética , Epigénesis Genética , Epigenómica , Femenino , Neuronas GABAérgicas/citología , Neuronas GABAérgicas/metabolismo , Perfilación de la Expresión Génica , Glutamatos/metabolismo , Humanos , Hibridación Fluorescente in Situ , Masculino , Ratones , Persona de Mediana Edad , Corteza Motora/anatomía & histología , Neuronas/citología , Neuronas/metabolismo , Especificidad de Órganos , Filogenia , Especificidad de la Especie , Transcriptoma
11.
Nat Med ; 27(5): 892-903, 2021 05.
Artículo en Inglés | MEDLINE | ID: mdl-33767405

RESUMEN

Despite signs of infection-including taste loss, dry mouth and mucosal lesions such as ulcerations, enanthema and macules-the involvement of the oral cavity in coronavirus disease 2019 (COVID-19) is poorly understood. To address this, we generated and analyzed two single-cell RNA sequencing datasets of the human minor salivary glands and gingiva (9 samples, 13,824 cells), identifying 50 cell clusters. Using integrated cell normalization and annotation, we classified 34 unique cell subpopulations between glands and gingiva. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) viral entry factors such as ACE2 and TMPRSS members were broadly enriched in epithelial cells of the glands and oral mucosae. Using orthogonal RNA and protein expression assessments, we confirmed SARS-CoV-2 infection in the glands and mucosae. Saliva from SARS-CoV-2-infected individuals harbored epithelial cells exhibiting ACE2 and TMPRSS expression and sustained SARS-CoV-2 infection. Acellular and cellular salivary fractions from asymptomatic individuals were found to transmit SARS-CoV-2 ex vivo. Matched nasopharyngeal and saliva samples displayed distinct viral shedding dynamics, and salivary viral burden correlated with COVID-19 symptoms, including taste loss. Upon recovery, this asymptomatic cohort exhibited sustained salivary IgG antibodies against SARS-CoV-2. Collectively, these data show that the oral cavity is an important site for SARS-CoV-2 infection and implicate saliva as a potential route of SARS-CoV-2 transmission.


Asunto(s)
COVID-19/virología , Boca/virología , SARS-CoV-2/aislamiento & purificación , Saliva/virología , Enzima Convertidora de Angiotensina 2/análisis , Infecciones Asintomáticas , COVID-19/etiología , Humanos , Serina Endopeptidasas/análisis , Trastornos del Gusto/etiología , Trastornos del Gusto/virología , Replicación Viral
12.
Brief Bioinform ; 22(4)2021 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-33249453

RESUMEN

Single cell/nucleus RNA sequencing (scRNAseq) is emerging as an essential tool to unravel the phenotypic heterogeneity of cells in complex biological systems. While computational methods for scRNAseq cell type clustering have advanced, the ability to integrate datasets to identify common and novel cell types across experiments remains a challenge. Here, we introduce a cluster-to-cluster cell type matching method-FR-Match-that utilizes supervised feature selection for dimensionality reduction and incorporates shared information among cells to determine whether two cell type clusters share the same underlying multivariate gene expression distribution. FR-Match is benchmarked with existing cell-to-cell and cell-to-cluster cell type matching methods using both simulated and real scRNAseq data. FR-Match proved to be a stringent method that produced fewer erroneous matches of distinct cell subtypes and had the unique ability to identify novel cell phenotypes in new datasets. In silico validation demonstrated that the proposed workflow is the only self-contained algorithm that was robust to increasing numbers of true negatives (i.e. non-represented cell types). FR-Match was applied to two human brain scRNAseq datasets sampled from cortical layer 1 and full thickness middle temporal gyrus. When mapping cell types identified in specimens isolated from these overlapping human brain regions, FR-Match precisely recapitulated the laminar characteristics of matched cell type clusters, reflecting their distinct neuroanatomical distributions. An R package and Shiny application are provided at https://github.com/JCVenterInstitute/FRmatch for users to interactively explore and match scRNAseq cell type clusters with complementary visualization tools.


Asunto(s)
Algoritmos , Corteza Cerebral/metabolismo , Bases de Datos de Ácidos Nucleicos , RNA-Seq , ARN , Humanos , ARN/biosíntesis , ARN/genética , Análisis de la Célula Individual
13.
medRxiv ; 2020 Oct 27.
Artículo en Inglés | MEDLINE | ID: mdl-33140061

RESUMEN

Despite signs of infection, the involvement of the oral cavity in COVID-19 is poorly understood. To address this, single-cell RNA sequencing data-sets were integrated from human minor salivary glands and gingiva to identify 11 epithelial, 7 mesenchymal, and 15 immune cell clusters. Analysis of SARS-CoV-2 viral entry factor expression showed enrichment in epithelia including the ducts and acini of the salivary glands and the suprabasal cells of the mucosae. COVID-19 autopsy tissues confirmed in vivo SARS-CoV-2 infection in the salivary glands and mucosa. Saliva from SARS-CoV-2-infected individuals harbored epithelial cells exhibiting ACE2 expression and SARS-CoV-2 RNA. Matched nasopharyngeal and saliva samples found distinct viral shedding dynamics and viral burden in saliva correlated with COVID-19 symptoms including taste loss. Upon recovery, this cohort exhibited salivary antibodies against SARS-CoV-2 proteins. Collectively, the oral cavity represents a robust site for COVID-19 infection and implicates saliva in viral transmission.

14.
Nat Commun ; 11(1): 1172, 2020 03 03.
Artículo en Inglés | MEDLINE | ID: mdl-32127543

RESUMEN

von Economo neurons (VENs) are bipolar, spindle-shaped neurons restricted to layer 5 of human frontoinsula and anterior cingulate cortex that appear to be selectively vulnerable to neuropsychiatric and neurodegenerative diseases, although little is known about other VEN cellular phenotypes. Single nucleus RNA-sequencing of frontoinsula layer 5 identifies a transcriptomically-defined cell cluster that contained VENs, but also fork cells and a subset of pyramidal neurons. Cross-species alignment of this cell cluster with a well-annotated mouse classification shows strong homology to extratelencephalic (ET) excitatory neurons that project to subcerebral targets. This cluster also shows strong homology to a putative ET cluster in human temporal cortex, but with a strikingly specific regional signature. Together these results suggest that VENs are a regionally distinctive type of ET neuron. Additionally, we describe the first patch clamp recordings of VENs from neurosurgically-resected tissue that show distinctive intrinsic membrane properties relative to neighboring pyramidal neurons.


Asunto(s)
Neuronas/fisiología , Lóbulo Temporal/citología , Transcriptoma , Animales , Encéfalo/citología , Encéfalo/fisiología , Electrofisiología/métodos , Perfilación de la Expresión Génica , Humanos , Hibridación Fluorescente in Situ , Ratones , Neuronas/citología , Células Piramidales/fisiología , Telencéfalo/citología , Lóbulo Temporal/fisiología
15.
Nat Neurosci ; 21(9): 1185-1195, 2018 09.
Artículo en Inglés | MEDLINE | ID: mdl-30150662

RESUMEN

We describe convergent evidence from transcriptomics, morphology, and physiology for a specialized GABAergic neuron subtype in human cortex. Using unbiased single-nucleus RNA sequencing, we identify ten GABAergic interneuron subtypes with combinatorial gene signatures in human cortical layer 1 and characterize a group of human interneurons with anatomical features never described in rodents, having large 'rosehip'-like axonal boutons and compact arborization. These rosehip cells show an immunohistochemical profile (GAD1+CCK+, CNR1-SST-CALB2-PVALB-) matching a single transcriptomically defined cell type whose specific molecular marker signature is not seen in mouse cortex. Rosehip cells in layer 1 make homotypic gap junctions, predominantly target apical dendritic shafts of layer 3 pyramidal neurons, and inhibit backpropagating pyramidal action potentials in microdomains of the dendritic tuft. These cells are therefore positioned for potent local control of distal dendritic computation in cortical pyramidal neurons.


Asunto(s)
Corteza Cerebral/metabolismo , Corteza Cerebral/ultraestructura , Neuronas GABAérgicas/metabolismo , Neuronas GABAérgicas/ultraestructura , Transcriptoma , Adulto , Anciano , Axones/ultraestructura , Espinas Dendríticas/metabolismo , Espinas Dendríticas/ultraestructura , Uniones Comunicantes/metabolismo , Uniones Comunicantes/ultraestructura , Biblioteca de Genes , Humanos , Masculino , Reacción en Cadena de la Polimerasa , Terminales Presinápticos/metabolismo , Terminales Presinápticos/ultraestructura , Células Piramidales/metabolismo , Células Piramidales/ultraestructura , ARN/análisis , ARN/genética , Análisis de Secuencia de ARN
16.
Hum Mol Genet ; 27(R1): R40-R47, 2018 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-29590361

RESUMEN

Cells are fundamental function units of multicellular organisms, with different cell types playing distinct physiological roles in the body. The recent advent of single-cell transcriptional profiling using RNA sequencing is producing 'big data', enabling the identification of novel human cell types at an unprecedented rate. In this review, we summarize recent work characterizing cell types in the human central nervous and immune systems using single-cell and single-nuclei RNA sequencing, and discuss the implications that these discoveries are having on the representation of cell types in the reference Cell Ontology (CL). We propose a method, based on random forest machine learning, for identifying sets of necessary and sufficient marker genes, which can be used to assemble consistent and reproducible cell type definitions for incorporation into the CL. The representation of defined cell type classes and their relationships in the CL using this strategy will make the cell type classes being identified by high-throughput/high-content technologies findable, accessible, interoperable and reusable (FAIR), allowing the CL to serve as a reference knowledgebase of information about the role that distinct cellular phenotypes play in human health and disease.


Asunto(s)
Macrodatos , Perfilación de la Expresión Génica/tendencias , Análisis de Secuencia de ARN/tendencias , Análisis de la Célula Individual/tendencias , Linaje de la Célula/genética , Humanos , Transcriptoma/genética
17.
Nucleic Acids Res ; 45(D1): D466-D474, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27679478

RESUMEN

The Influenza Research Database (IRD) is a U.S. National Institute of Allergy and Infectious Diseases (NIAID)-sponsored Bioinformatics Resource Center dedicated to providing bioinformatics support for influenza virus research. IRD facilitates the research and development of vaccines, diagnostics and therapeutics against influenza virus by providing a comprehensive collection of influenza-related data integrated from various sources, a growing suite of analysis and visualization tools for data mining and hypothesis generation, personal workbench spaces for data storage and sharing, and active user community support. Here, we describe the recent improvements in IRD including the use of cloud and high performance computing resources, analysis and visualization of user-provided sequence data with associated metadata, predictions of novel variant proteins, annotations of phenotype-associated sequence markers and their predicted phenotypic effects, hemagglutinin (HA) clade classifications, an automated tool for HA subtype numbering conversion, linkouts to disease event data and the addition of host factor and antiviral drug components. All data and tools are freely available without restriction from the IRD website at https://www.fludb.org.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Factuales , Virus de la Influenza A , Investigación , Programas Informáticos , Virus de la Influenza A/clasificación , Virus de la Influenza A/fisiología , Tipificación Molecular/métodos , Fenotipo , Filogenia , Proteínas Virales/genética , Virulencia
18.
BMC Bioinformatics ; 18(Suppl 17): 559, 2017 12 21.
Artículo en Inglés | MEDLINE | ID: mdl-29322913

RESUMEN

BACKGROUND: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. RESULTS: In this paper, we provide examples of state-of-the-art cellular biomarker characterization using high-content cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including "context annotations" in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. CONCLUSION: The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges.


Asunto(s)
Ontologías Biológicas , Biomarcadores/metabolismo , Células/clasificación , Células/metabolismo , Biología Computacional/métodos , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Humanos
19.
Nat Protoc ; 11(3): 499-524, 2016 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-26890679

RESUMEN

A protocol is described for sequencing the transcriptome of a cell nucleus. Nuclei are isolated from specimens and sorted by FACS, cDNA libraries are constructed and RNA-seq is performed, followed by data analysis. Some steps follow published methods (Smart-seq2 for cDNA synthesis and Nextera XT barcoded library preparation) and are not described in detail here. Previous single-cell approaches for RNA-seq from tissues include cell dissociation using protease treatment at 30 °C, which is known to alter the transcriptome. We isolate nuclei at 4 °C from tissue homogenates, which cause minimal damage. Nuclear transcriptomes can be obtained from postmortem human brain tissue stored at -80 °C, making brain archives accessible for RNA-seq from individual neurons. The method also allows investigation of biological features unique to nuclei, such as enrichment of certain transcripts and precursors of some noncoding RNAs. By following this procedure, it takes about 4 d to construct cDNA libraries that are ready for sequencing.


Asunto(s)
Encéfalo/citología , Nucléolo Celular/genética , Neuronas/metabolismo , ARN/genética , Análisis de Secuencia de ARN/métodos , Transcriptoma , Autopsia/métodos , Fraccionamiento Celular/métodos , ADN Complementario/genética , Perfilación de la Expresión Génica/métodos , Biblioteca de Genes , Humanos , Análisis de la Célula Individual/métodos , Conservación de Tejido
20.
J Virol ; 89(10): 5427-40, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25741011

RESUMEN

UNLABELLED: Although a large number of immune epitopes have been identified in the influenza A virus (IAV) hemagglutinin (HA) protein using various experimental systems, it is unclear which are involved in protective immunity to natural infection in humans. We developed a data mining approach analyzing natural H1N1 human isolates to identify HA protein regions that may be targeted by the human immune system and can predict the evolution of IAV. We identified 16 amino acid sites experiencing diversifying selection during the evolution of prepandemic seasonal H1N1 strains and found that 11 sites were located in experimentally determined B-cell/antibody (Ab) epitopes, including three distinct neutralizing Caton epitopes: Sa, Sb, and Ca2 [A. J. Caton, G. G. Brownlee, J. W. Yewdell, and W. Gerhard, Cell 31:417-427, 1982, http://dx.doi.org/10.1016/0092-8674(82)90135-0]. We predicted that these diversified epitope regions would be the targets of mutation as the 2009 H1N1 pandemic (pH1N1) lineage evolves in response to the development of population-level protective immunity in humans. Using a chi-squared goodness-of-fit test, we identified 10 amino acid sites that significantly differed between the pH1N1 isolates and isolates from the recent 2012-2013 and 2013-2014 influenza seasons. Three of these sites were located in the same diversified B-cell/Ab epitope regions as identified in the analysis of prepandemic sequences, including Sa and Sb. As predicted, hemagglutination inhibition (HI) assays using human sera from subjects vaccinated with the initial pH1N1 isolate demonstrated reduced reactivity against 2013-2014 isolates. Taken together, these results suggest that diversifying selection analysis can identify key immune epitopes responsible for protective immunity to influenza virus in humans and thereby predict virus evolution. IMPORTANCE: The WHO estimates that approximately 5 to 10% of adults and 20 to 30% of children in the world are infected by influenza virus each year. While an adaptive immune response helps eliminate the virus following acute infection, the virus rapidly evolves to evade the established protective memory immune response, thus allowing for the regular seasonal cycles of influenza virus infection. The analytical approach described here, which combines an analysis of diversifying selection with an integration of immune epitope data, has allowed us to identify antigenic regions that contribute to protective immunity and are therefore the key targets of immune evasion by the virus. This information can be used to determine when sequence variations in seasonal influenza virus strains have affected regions responsible for protective immunity in order to decide when new vaccine formulations are warranted.


Asunto(s)
Evolución Molecular , Subtipo H1N1 del Virus de la Influenza A/genética , Subtipo H1N1 del Virus de la Influenza A/inmunología , Gripe Humana/inmunología , Gripe Humana/virología , Adulto , Anciano , Antígenos Virales/química , Antígenos Virales/genética , Epítopos de Linfocito B/química , Epítopos de Linfocito B/genética , Femenino , Glicoproteínas Hemaglutininas del Virus de la Influenza/química , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Glicoproteínas Hemaglutininas del Virus de la Influenza/inmunología , Humanos , Subtipo H1N1 del Virus de la Influenza A/aislamiento & purificación , Gripe Humana/epidemiología , Masculino , Persona de Mediana Edad , Modelos Moleculares , Mutación , Pandemias , Filogenia , Selección Genética , Adulto Joven
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA