ABSTRACT
We sequenced and assembled using multiple long-read sequencing technologies the genomes of chimpanzee, bonobo, gorilla, orangutan, gibbon, macaque, owl monkey, and marmoset. We identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. We estimate that 819.47 Mbp or â¼27% of the genome has been affected by SVs across primate evolution. We identify 1,607 structurally divergent regions wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost (e.g., CARD, C4, and OLAH gene families) and additional lineage-specific genes are generated (e.g., CKAP2, VPS36, ACBD7, and NEK5 paralogs), becoming targets of rapid chromosomal diversification and positive selection (e.g., RGPD gene family). High-fidelity long-read sequencing has made these dynamic regions of the genome accessible for sequence-level analyses within and between primate species.
Subject(s)
Genome , Primates , Animals , Humans , Base Sequence , Primates/classification , Primates/genetics , Biological Evolution , Sequence Analysis, DNA , Genomic Structural VariationABSTRACT
Autism spectrum disorder (ASD) is a heterogeneous disease in which efforts to define subtypes behaviorally have met with limited success. Hypothesizing that genetically based subtype identification may prove more productive, we resequenced the ASD-associated gene CHD8 in 3,730 children with developmental delay or ASD. We identified a total of 15 independent mutations; no truncating events were identified in 8,792 controls, including 2,289 unaffected siblings. In addition to a high likelihood of an ASD diagnosis among patients bearing CHD8 mutations, characteristics enriched in this group included macrocephaly, distinct faces, and gastrointestinal complaints. chd8 disruption in zebrafish recapitulates features of the human phenotype, including increased head size as a result of expansion of the forebrain/midbrain and impairment of gastrointestinal motility due to a reduction in postmitotic enteric neurons. Our findings indicate that CHD8 disruptions define a distinct ASD subtype and reveal unexpected comorbidities between brain development and enteric innervation.
Subject(s)
Child Development Disorders, Pervasive/genetics , Child Development Disorders, Pervasive/physiopathology , DNA-Binding Proteins/genetics , Transcription Factors/genetics , Adolescent , Amino Acid Sequence , Animals , Brain/growth & development , Brain/pathology , Child , Child Development Disorders, Pervasive/classification , Child Development Disorders, Pervasive/pathology , Child, Preschool , DNA-Binding Proteins/metabolism , Female , Gastrointestinal Tract/innervation , Gastrointestinal Tract/physiopathology , Humans , Macaca mulatta , Male , Megalencephaly/pathology , Molecular Sequence Data , Mutation , Sequence Alignment , Transcription Factors/metabolism , Zebrafish , Zebrafish Proteins/genetics , Zebrafish Proteins/metabolismABSTRACT
The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals1. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species. Despite the overall conservation, however, many species-dependent specializations are apparent, including differences in cell-type proportions, gene expression, DNA methylation and chromatin state. Few cell-type marker genes are conserved across species, revealing a short list of candidate genes and regulatory mechanisms that are responsible for conserved features of homologous cell types, such as the GABAergic chandelier cells. This consensus transcriptomic classification allows us to use patch-seq (a combination of whole-cell patch-clamp recordings, RNA sequencing and morphological characterization) to identify corticospinal Betz cells from layer 5 in non-human primates and humans, and to characterize their highly specialized physiology and anatomy. These findings highlight the robust molecular underpinnings of cell-type diversity in M1 across mammals, and point to the genes and regulatory pathways responsible for the functional identity of cell types and their species-specific adaptations.
Subject(s)
Motor Cortex/cytology , Neurons/classification , Single-Cell Analysis , Animals , Atlases as Topic , Callithrix/genetics , Epigenesis, Genetic , Epigenomics , Female , GABAergic Neurons/cytology , GABAergic Neurons/metabolism , Gene Expression Profiling , Glutamates/metabolism , Humans , In Situ Hybridization, Fluorescence , Male , Mice , Middle Aged , Motor Cortex/anatomy & histology , Neurons/cytology , Neurons/metabolism , Organ Specificity , Phylogeny , Species Specificity , TranscriptomeABSTRACT
The neocortex is disproportionately expanded in human compared with mouse1,2, both in its total volume relative to subcortical structures and in the proportion occupied by supragranular layers composed of neurons that selectively make connections within the neocortex and with other telencephalic structures. Single-cell transcriptomic analyses of human and mouse neocortex show an increased diversity of glutamatergic neuron types in supragranular layers in human neocortex and pronounced gradients as a function of cortical depth3. Here, to probe the functional and anatomical correlates of this transcriptomic diversity, we developed a robust platform combining patch clamp recording, biocytin staining and single-cell RNA-sequencing (Patch-seq) to examine neurosurgically resected human tissues. We demonstrate a strong correspondence between morphological, physiological and transcriptomic phenotypes of five human glutamatergic supragranular neuron types. These were enriched in but not restricted to layers, with one type varying continuously in all phenotypes across layers 2 and 3. The deep portion of layer 3 contained highly distinctive cell types, two of which express a neurofilament protein that labels long-range projection neurons in primates that are selectively depleted in Alzheimer's disease4,5. Together, these results demonstrate the explanatory power of transcriptomic cell-type classification, provide a structural underpinning for increased complexity of cortical function in humans, and implicate discrete transcriptomic neuron types as selectively vulnerable in disease.
Subject(s)
Glutamic Acid/metabolism , Neocortex/cytology , Neocortex/growth & development , Neurons/cytology , Neurons/metabolism , Alzheimer Disease , Animals , Cell Shape , Collagen/metabolism , Electrophysiology , Extracellular Matrix Proteins/metabolism , Female , Humans , Lysine/analogs & derivatives , Male , Mice , Neocortex/anatomy & histology , Neurons/classification , Patch-Clamp Techniques , TranscriptomeABSTRACT
Genes associated with risk for brain disease exhibit characteristic expression patterns that reflect both anatomical and cell type relationships. Brain-wide transcriptomic patterns of disease risk genes provide a molecular-based signature, based on differential co-expression, that is often unique to that disease. Brain diseases can be compared and aggregated based on the similarity of their signatures which often associates diseases from diverse phenotypic classes. Analysis of 40 common human brain diseases identifies 5 major transcriptional patterns, representing tumor-related, neurodegenerative, psychiatric and substance abuse, and 2 mixed groups of diseases affecting basal ganglia and hypothalamus. Further, for diseases with enriched expression in cortex, single-nucleus data in the middle temporal gyrus (MTG) exhibits a cell type expression gradient separating neurodegenerative, psychiatric, and substance abuse diseases, with unique excitatory cell type expression differentiating psychiatric diseases. Through mapping of homologous cell types between mouse and human, most disease risk genes are found to act in common cell types, while having species-specific expression in those types and preserving similar phenotypic classification within species. These results describe structural and cellular transcriptomic relationships of disease risk genes in the adult brain and provide a molecular-based strategy for classifying and comparing diseases, potentially identifying novel disease relationships.
Subject(s)
Brain Diseases , Transcriptome , Adult , Animals , Humans , Mice , Basal Ganglia , Brain/metabolism , Brain Diseases/genetics , Brain Diseases/metabolism , Gene Expression Profiling/methods , Transcriptome/genetics , Transcriptome/physiology , Risk FactorsABSTRACT
Elucidating the cellular architecture of the human cerebral cortex is central to understanding our cognitive abilities and susceptibility to disease. Here we used single-nucleus RNA-sequencing analysis to perform a comprehensive study of cell types in the middle temporal gyrus of human cortex. We identified a highly diverse set of excitatory and inhibitory neuron types that are mostly sparse, with excitatory types being less layer-restricted than expected. Comparison to similar mouse cortex single-cell RNA-sequencing datasets revealed a surprisingly well-conserved cellular architecture that enables matching of homologous types and predictions of properties of human cell types. Despite this general conservation, we also found extensive differences between homologous human and mouse cell types, including marked alterations in proportions, laminar distributions, gene expression and morphology. These species-specific features emphasize the importance of directly studying human brain.
Subject(s)
Astrocytes/classification , Biological Evolution , Cerebral Cortex/cytology , Cerebral Cortex/metabolism , Neurons/classification , Adolescent , Adult , Aged , Animals , Astrocytes/cytology , Female , Humans , Male , Mice , Middle Aged , Neural Inhibition , Neurons/cytology , Principal Component Analysis , RNA-Seq , Single-Cell Analysis , Species Specificity , Transcriptome/genetics , Young AdultABSTRACT
Most genetic studies consider autism spectrum disorder (ASD) and developmental disorder (DD) separately despite overwhelming comorbidity and shared genetic etiology. Here, we analyzed de novo variants (DNVs) from 15,560 ASD (6,557 from SPARK) and 31,052 DD trios independently and also combined as broader neurodevelopmental disorders (NDDs) using three models. We identify 615 NDD candidate genes (false discovery rate [FDR] < 0.05) supported by ≥1 models, including 138 reaching Bonferroni exome-wide significance (P < 3.64e-7) in all models. The genes group into five functional networks associating with different brain developmental lineages based on single-cell nuclei transcriptomic data. We find no evidence for ASD-specific genes in contrast to 18 genes significantly enriched for DD. There are 53 genes that show mutational bias, including enrichments for missense (n = 41) or truncating (n = 12) DNVs. We also find 10 genes with evidence of male- or female-bias enrichment, including 4 X chromosome genes with significant female burden (DDX3X, MECP2, WDR45, and HDAC8). This large-scale integrative analysis identifies candidates and functional subsets of NDD genes.
Subject(s)
Autism Spectrum Disorder , Autistic Disorder , Child , Male , Female , Humans , Autistic Disorder/genetics , Autism Spectrum Disorder/genetics , Developmental Disabilities/genetics , Genetic Predisposition to Disease , Exome , Histone Deacetylases/genetics , Repressor Proteins/genetics , Carrier Proteins/geneticsABSTRACT
Neurons in the developing brain undergo extensive structural refinement as nascent circuits adopt their mature form. This physical transformation of neurons is facilitated by the engulfment and degradation of axonal branches and synapses by surrounding glial cells, including microglia and astrocytes. However, the small size of phagocytic organelles and the complex, highly ramified morphology of glia have made it difficult to define the contribution of these and other glial cell types to this crucial process. Here, we used large-scale, serial section transmission electron microscopy (TEM) with computational volume segmentation to reconstruct the complete 3D morphologies of distinct glial types in the mouse visual cortex, providing unprecedented resolution of their morphology and composition. Unexpectedly, we discovered that the fine processes of oligodendrocyte precursor cells (OPCs), a population of abundant, highly dynamic glial progenitors, frequently surrounded small branches of axons. Numerous phagosomes and phagolysosomes (PLs) containing fragments of axons and vesicular structures were present inside their processes, suggesting that OPCs engage in axon pruning. Single-nucleus RNA sequencing from the developing mouse cortex revealed that OPCs express key phagocytic genes at this stage, as well as neuronal transcripts, consistent with active axon engulfment. Although microglia are thought to be responsible for the majority of synaptic pruning and structural refinement, PLs were ten times more abundant in OPCs than in microglia at this stage, and these structures were markedly less abundant in newly generated oligodendrocytes, suggesting that OPCs contribute substantially to the refinement of neuronal circuits during cortical development.
Subject(s)
Neocortex , Oligodendrocyte Precursor Cells , Animals , Mice , Axons/metabolism , Oligodendroglia/metabolism , Neurons/metabolismABSTRACT
Single-cell genomics is rapidly advancing our knowledge of the diversity of cell phenotypes, including both cell types and cell states. Driven by single-cell/-nucleus RNA sequencing (scRNA-seq), comprehensive cell atlas projects characterizing a wide range of organisms and tissues are currently underway. As a result, it is critical that the transcriptional phenotypes discovered are defined and disseminated in a consistent and concise manner. Molecular biomarkers have historically played an important role in biological research, from defining immune cell types by surface protein expression to defining diseases by their molecular drivers. Here, we describe a machine learning-based marker gene selection algorithm, NS-Forest version 2.0, which leverages the nonlinear attributes of random forest feature selection and a binary expression scoring approach to discover the minimal marker gene expression combinations that optimally capture the cell type identity represented in complete scRNA-seq transcriptional profiles. The marker genes selected provide an expression barcode that serves as both a useful tool for downstream biological investigation and the necessary and sufficient characteristics for semantic cell type definition. The use of NS-Forest to identify marker genes for human brain middle temporal gyrus cell types reveals the importance of cell signaling and noncoding RNAs in neuronal cell type identity.
Subject(s)
Gene Expression Profiling , Single-Cell Analysis , Biomarkers , Gene Expression Profiling/methods , Machine Learning , Sequence Analysis, RNA/methods , Single-Cell Analysis/methodsABSTRACT
The neocortex contains a multitude of cell types that are segregated into layers and functionally distinct areas. To investigate the diversity of cell types across the mouse neocortex, here we analysed 23,822 cells from two areas at distant poles of the mouse neocortex: the primary visual cortex and the anterior lateral motor cortex. We define 133 transcriptomic cell types by deep, single-cell RNA sequencing. Nearly all types of GABA (γ-aminobutyric acid)-containing neurons are shared across both areas, whereas most types of glutamatergic neurons were found in one of the two areas. By combining single-cell RNA sequencing and retrograde labelling, we match transcriptomic types of glutamatergic neurons to their long-range projection specificity. Our study establishes a combined transcriptomic and projectional taxonomy of cortical cell types from functionally distinct areas of the adult mouse cortex.
Subject(s)
Gene Expression Profiling , Neocortex/cytology , Neocortex/metabolism , Animals , Biomarkers/analysis , Female , GABAergic Neurons/metabolism , Glutamic Acid/metabolism , Male , Mice , Motor Cortex/anatomy & histology , Motor Cortex/cytology , Motor Cortex/metabolism , Neocortex/anatomy & histology , Organ Specificity , Sequence Analysis, RNA , Single-Cell Analysis , Visual Cortex/anatomy & histology , Visual Cortex/cytology , Visual Cortex/metabolismABSTRACT
Single cell/nucleus RNA sequencing (scRNAseq) is emerging as an essential tool to unravel the phenotypic heterogeneity of cells in complex biological systems. While computational methods for scRNAseq cell type clustering have advanced, the ability to integrate datasets to identify common and novel cell types across experiments remains a challenge. Here, we introduce a cluster-to-cluster cell type matching method-FR-Match-that utilizes supervised feature selection for dimensionality reduction and incorporates shared information among cells to determine whether two cell type clusters share the same underlying multivariate gene expression distribution. FR-Match is benchmarked with existing cell-to-cell and cell-to-cluster cell type matching methods using both simulated and real scRNAseq data. FR-Match proved to be a stringent method that produced fewer erroneous matches of distinct cell subtypes and had the unique ability to identify novel cell phenotypes in new datasets. In silico validation demonstrated that the proposed workflow is the only self-contained algorithm that was robust to increasing numbers of true negatives (i.e. non-represented cell types). FR-Match was applied to two human brain scRNAseq datasets sampled from cortical layer 1 and full thickness middle temporal gyrus. When mapping cell types identified in specimens isolated from these overlapping human brain regions, FR-Match precisely recapitulated the laminar characteristics of matched cell type clusters, reflecting their distinct neuroanatomical distributions. An R package and Shiny application are provided at https://github.com/JCVenterInstitute/FRmatch for users to interactively explore and match scRNAseq cell type clusters with complementary visualization tools.
Subject(s)
Algorithms , Cerebral Cortex/metabolism , Databases, Nucleic Acid , RNA-Seq , RNA , Humans , RNA/biosynthesis , RNA/genetics , Single-Cell AnalysisABSTRACT
Elucidating the lineage relationships among different cell types is key to understanding human brain development. Here we developed parallel RNA and DNA analysis after deep sequencing (PRDD-seq), which combines RNA analysis of neuronal cell types with analysis of nested spontaneous DNA somatic mutations as cell lineage markers, identified from joint analysis of single-cell and bulk DNA sequencing by single-cell MosaicHunter (scMH). PRDD-seq enables simultaneous reconstruction of neuronal cell type, cell lineage, and sequential neuronal formation ("birthdate") in postmortem human cerebral cortex. Analysis of two human brains showed remarkable quantitative details that relate mutation mosaic frequency to clonal patterns, confirming an early divergence of precursors for excitatory and inhibitory neurons, and an "inside-out" layer formation of excitatory neurons as seen in other species. In addition our analysis allows an estimate of excitatory neuron-restricted precursors (about 10) that generate the excitatory neurons within a cortical column. Inhibitory neurons showed complex, subtype-specific patterns of neurogenesis, including some patterns of development conserved relative to mouse, but also some aspects of primate cortical interneuron development not seen in mouse. PRDD-seq can be broadly applied to characterize cell identity and lineage from diverse archival samples with single-cell resolution and in potentially any developmental or disease condition.
Subject(s)
Cell Lineage , Cerebral Cortex/cytology , Neurogenesis , Cerebral Cortex/growth & development , Cerebral Cortex/metabolism , High-Throughput Nucleotide Sequencing , Humans , Mutation Accumulation , Neural Stem Cells/cytology , Neural Stem Cells/metabolism , Sequence Analysis, DNA , Single-Cell AnalysisABSTRACT
While genes with an excess of de novo mutations (DNMs) have been identified in children with neurodevelopmental disorders (NDDs), few studies focus on DNM patterns where the sex of affected children is examined separately. We considered â¼8,825 sequenced parent-child trios (n â¼26,475 individuals) and identify 54 genes with a DNM enrichment in males (n = 18), females (n = 17), or overlapping in both the male and female subsets (n = 19). A replication cohort of 18,778 sequenced parent-child trios (n = 56,334 individuals) confirms 25 genes (n = 3 in males, n = 7 in females, n = 15 in both male and female subsets). As expected, we observe significant enrichment on the X chromosome for females but also find autosomal genes with potential sex bias (females, CDK13, ITPR1; males, CHD8, MBD5, SYNGAP1); 6.5% of females harbor a DNM in a female-enriched gene, whereas 2.7% of males have a DNM in a male-enriched gene. Sex-biased genes are enriched in transcriptional processes and chromatin binding, primarily reside in the nucleus of cells, and have brain expression. By downsampling, we find that DNM gene discovery is greatest when studying affected females. Finally, directly comparing de novo allele counts in NDD-affected males and females identifies one replicated genome-wide significant gene (DDX3X) with locus-specific enrichment in females. Our sex-based DNM enrichment analysis identifies candidate NDD genes differentially affecting males and females and indicates that the study of females with NDDs leads to greater gene discovery consistent with the female-protective effect.
Subject(s)
Exome/genetics , Genetic Markers , Mutation , Neurodevelopmental Disorders/genetics , Child , Cohort Studies , Female , Gene Regulatory Networks , Genome-Wide Association Study , Humans , Male , Neurodevelopmental Disorders/pathology , Phenotype , Sex FactorsABSTRACT
The transcriptional underpinnings of brain development remain poorly understood, particularly in humans and closely related non-human primates. We describe a high-resolution transcriptional atlas of rhesus monkey (Macaca mulatta) brain development that combines dense temporal sampling of prenatal and postnatal periods with fine anatomical division of cortical and subcortical regions associated with human neuropsychiatric disease. Gene expression changes more rapidly before birth, both in progenitor cells and maturing neurons. Cortical layers and areas acquire adult-like molecular profiles surprisingly late in postnatal development. Disparate cell populations exhibit distinct developmental timing of gene expression, but also unexpected synchrony of processes underlying neural circuit construction including cell projection and adhesion. Candidate risk genes for neurodevelopmental disorders including primary microcephaly, autism spectrum disorder, intellectual disability, and schizophrenia show disease-specific spatiotemporal enrichment within developing neocortex. Human developmental expression trajectories are more similar to monkey than rodent, although approximately 9% of genes show human-specific regulation with evidence for prolonged maturation or neoteny compared to monkey.
Subject(s)
Brain/growth & development , Brain/metabolism , Macaca mulatta/genetics , Transcriptome , Aging/genetics , Animals , Autism Spectrum Disorder/genetics , Brain/cytology , Brain/embryology , Cell Adhesion , Conserved Sequence , Female , Humans , Intellectual Disability/genetics , Male , Microcephaly/genetics , Neocortex/embryology , Neocortex/growth & development , Neocortex/metabolism , Neurodevelopmental Disorders/genetics , Neurogenesis/genetics , Risk Factors , Schizophrenia/genetics , Spatio-Temporal Analysis , Species Specificity , Transcription, Genetic/geneticsABSTRACT
Cells are fundamental function units of multicellular organisms, with different cell types playing distinct physiological roles in the body. The recent advent of single-cell transcriptional profiling using RNA sequencing is producing 'big data', enabling the identification of novel human cell types at an unprecedented rate. In this review, we summarize recent work characterizing cell types in the human central nervous and immune systems using single-cell and single-nuclei RNA sequencing, and discuss the implications that these discoveries are having on the representation of cell types in the reference Cell Ontology (CL). We propose a method, based on random forest machine learning, for identifying sets of necessary and sufficient marker genes, which can be used to assemble consistent and reproducible cell type definitions for incorporation into the CL. The representation of defined cell type classes and their relationships in the CL using this strategy will make the cell type classes being identified by high-throughput/high-content technologies findable, accessible, interoperable and reusable (FAIR), allowing the CL to serve as a reference knowledgebase of information about the role that distinct cellular phenotypes play in human health and disease.
Subject(s)
Big Data , Gene Expression Profiling/trends , Sequence Analysis, RNA/trends , Single-Cell Analysis/trends , Cell Lineage/genetics , Humans , Transcriptome/geneticsABSTRACT
We present a high-quality genome sequence of a Neanderthal woman from Siberia. We show that her parents were related at the level of half-siblings and that mating among close relatives was common among her recent ancestors. We also sequenced the genome of a Neanderthal from the Caucasus to low coverage. An analysis of the relationships and population history of available archaic genomes and 25 present-day human genomes shows that several gene flow events occurred among Neanderthals, Denisovans and early modern humans, possibly including gene flow into Denisovans from an unknown archaic group. Thus, interbreeding, albeit of low magnitude, occurred among many hominin groups in the Late Pleistocene. In addition, the high-quality Neanderthal genome allows us to establish a definitive list of substitutions that became fixed in modern humans after their separation from the ancestors of Neanderthals and Denisovans.
Subject(s)
Fossils , Genome/genetics , Neanderthals/genetics , Africa , Animals , Caves , DNA Copy Number Variations/genetics , Female , Gene Flow/genetics , Gene Frequency , Heterozygote , Humans , Inbreeding , Models, Genetic , Neanderthals/classification , Phylogeny , Population Density , Siberia/ethnology , Toe Phalanges/anatomy & histologyABSTRACT
The cerebral cortex constitutes more than half the volume of the human brain and is presumed to be responsible for the neuronal computations underlying complex phenomena, such as perception, thought, language, attention, episodic memory and voluntary movement. Rodent models are extremely valuable for the investigation of brain development, but cannot provide insight into aspects that are unique or highly derived in humans. Many human psychiatric and neurological conditions have developmental origins but cannot be studied adequately in animal models. The human cerebral cortex has some unique genetic, molecular, cellular and anatomical features, which need to be further explored. The Anatomical Society devoted its summer meeting to the topic of Human Brain Development in June 2018 to tackle these important issues. The meeting was organized by Gavin Clowry (Newcastle University) and Zoltán Molnár (University of Oxford), and held at St John's College, Oxford. The participants provided a broad overview of the structure of the human brain in the context of scaling relationships across the brains of mammals, conserved principles and recent changes in the human lineage. Speakers considered how neuronal progenitors diversified in human to generate an increasing variety of cortical neurons. The formation of the earliest cortical circuits of the earliest generated neurons in the subplate was discussed together with their involvement in neurodevelopmental pathologies. Gene expression networks and susceptibility genes associated to neurodevelopmental diseases were discussed and compared with the networks that can be identified in organoids developed from induced pluripotent stem cells that recapitulate some aspects of in vivo development. New views were discussed on the specification of glutamatergic pyramidal and γ-aminobutyric acid (GABA)ergic interneurons. With the advancement of various in vivo imaging methods, the histopathological observations can be now linked to in vivo normal conditions and to various diseases. Our review gives a general evaluation of the exciting new developments in these areas. The human cortex has a much enlarged association cortex with greater interconnectivity of cortical areas with each other and with an expanded thalamus. The human cortex has relative enlargement of the upper layers, enhanced diversity and function of inhibitory interneurons and a highly expanded transient subplate layer during development. Here we highlight recent studies that address how these differences emerge during development focusing on diverse facets of our evolution.
Subject(s)
Cerebral Cortex/embryology , Animals , Gene Regulatory Networks , Humans , Interneurons , Neurodevelopmental Disorders/genetics , Neurogenesis , Pyramidal CellsABSTRACT
BACKGROUND: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. RESULTS: In this paper, we provide examples of state-of-the-art cellular biomarker characterization using high-content cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including "context annotations" in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. CONCLUSION: The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges.