RESUMO
During postnatal life, thymopoiesis depends on the continuous colonization of the thymus by bone-marrow-derived hematopoietic progenitors that migrate through the bloodstream. The current understanding of the nature of thymic immigrants is largely based on data from pre-clinical models. Here, we employed single-cell RNA sequencing (scRNA-seq) to examine the immature postnatal thymocyte population in humans. Integration of bone marrow and peripheral blood precursor datasets identified two putative thymus seeding progenitors that varied in expression of CD7; CD10; and the homing receptors CCR7, CCR9, and ITGB7. Whereas both precursors supported T cell development, only one contributed to intrathymic dendritic cell (DC) differentiation, predominantly of plasmacytoid dendritic cells. Trajectory inference delineated the transcriptional dynamics underlying early human T lineage development, enabling prediction of transcription factor (TF) modules that drive stage-specific steps of human T cell development. This comprehensive dataset defines the expression signature of immature human thymocytes and provides a resource for the further study of human thymopoiesis.
Assuntos
Diferenciação Celular , Regulação da Expressão Gênica no Desenvolvimento , Células Progenitoras Linfoides/citologia , Células Progenitoras Linfoides/metabolismo , RNA Citoplasmático Pequeno/genética , Timócitos/citologia , Timócitos/metabolismo , Biomarcadores , Diferenciação Celular/genética , Diferenciação Celular/imunologia , Linhagem da Célula/genética , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Imunofenotipagem , Análise de Célula Única , Timócitos/imunologia , TranscriptomaRESUMO
Definitive haematopoiesis in the fetal liver supports self-renewal and differentiation of haematopoietic stem cells and multipotent progenitors (HSC/MPPs) but remains poorly defined in humans. Here, using single-cell transcriptome profiling of approximately 140,000 liver and 74,000 skin, kidney and yolk sac cells, we identify the repertoire of human blood and immune cells during development. We infer differentiation trajectories from HSC/MPPs and evaluate the influence of the tissue microenvironment on blood and immune cell development. We reveal physiological erythropoiesis in fetal skin and the presence of mast cells, natural killer and innate lymphoid cell precursors in the yolk sac. We demonstrate a shift in the haemopoietic composition of fetal liver during gestation away from being predominantly erythroid, accompanied by a parallel change in differentiation potential of HSC/MPPs, which we functionally validate. Our integrated map of fetal liver haematopoiesis provides a blueprint for the study of paediatric blood and immune disorders, and a reference for harnessing the therapeutic potential of HSC/MPPs.
Assuntos
Feto/citologia , Hematopoese , Fígado/citologia , Fígado/embriologia , Células Sanguíneas/citologia , Microambiente Celular , Feminino , Feto/metabolismo , Citometria de Fluxo , Perfilação da Expressão Gênica , Humanos , Fígado/metabolismo , Tecido Linfoide/citologia , Análise de Célula Única , Células-Tronco/metabolismoRESUMO
Scalable technologies to sequence the transcriptomes and epigenomes of single cells are transforming our understanding of cell types and cell states. The Brain Research through Advancing Innovative Neurotechnologies (BRAIN) Initiative Cell Census Network (BICCN) is applying these technologies at unprecedented scale to map the cell types in the mammalian brain. In an effort to increase data FAIRness (Findable, Accessible, Interoperable, Reusable), the NIH has established repositories to make data generated by the BICCN and related BRAIN Initiative projects accessible to the broader research community. Here, we describe the Neuroscience Multi-Omic Archive (NeMO Archive; nemoarchive.org), which serves as the primary repository for genomics data from the BRAIN Initiative. Working closely with other BRAIN Initiative researchers, we have organized these data into a continually expanding, curated repository, which contains transcriptomic and epigenomic data from over 50 million brain cells, including single-cell genomic data from all of the major regions of the adult and prenatal human and mouse brains, as well as substantial single-cell genomic data from non-human primates. We make available several tools for accessing these data, including a searchable web portal, a cloud-computing interface for large-scale data processing (implemented on Terra, terra.bio), and a visualization and analysis platform, NeMO Analytics (nemoanalytics.org).
Assuntos
Encéfalo , Bases de Dados Genéticas , Epigenômica , Multiômica , Transcriptoma , Animais , Camundongos , Genômica , Mamíferos , Primatas , Encéfalo/citologia , Encéfalo/metabolismoRESUMO
Type 2 innate lymphoid cells (ILC2s) both contribute to mucosal homeostasis and initiate pathologic inflammation in allergic asthma. However, the signals that direct ILC2s to promote homeostasis versus inflammation are unclear. To identify such molecular cues, we profiled mouse lung-resident ILCs using single-cell RNA sequencing at steady state and after in vivo stimulation with the alarmin cytokines IL-25 and IL-33. ILC2s were transcriptionally heterogeneous after activation, with subpopulations distinguished by expression of proliferative, homeostatic and effector genes. The neuropeptide receptor Nmur1 was preferentially expressed by ILC2s at steady state and after IL-25 stimulation. Neuromedin U (NMU), the ligand of NMUR1, activated ILC2s in vitro, and in vivo co-administration of NMU with IL-25 strongly amplified allergic inflammation. Loss of NMU-NMUR1 signalling reduced ILC2 frequency and effector function, and altered transcriptional programs following allergen challenge in vivo. Thus, NMUR1 signalling promotes inflammatory ILC2 responses, highlighting the importance of neuro-immune crosstalk in allergic inflammation at mucosal surfaces.
Assuntos
Hipersensibilidade/imunologia , Hipersensibilidade/patologia , Inflamação/imunologia , Inflamação/patologia , Pulmão/patologia , Linfócitos/imunologia , Neuropeptídeos/metabolismo , Animais , Feminino , Regulação da Expressão Gênica , Imunidade Inata/imunologia , Interleucina-17/imunologia , Interleucina-33/imunologia , Ligantes , Pulmão/imunologia , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Receptores de Neurotransmissores/biossíntese , Receptores de Neurotransmissores/genética , Receptores de Neurotransmissores/metabolismo , Mucosa Respiratória/imunologia , Mucosa Respiratória/patologia , Transdução de Sinais , Transcrição GênicaRESUMO
This corrects the article DOI: 10.1038/nature24029.
RESUMO
Many methods have been developed for statistical analysis of microbial community profiles, but due to the complex nature of typical microbiome measurements (e.g. sparsity, zero-inflation, non-independence, and compositionality) and of the associated underlying biology, it is difficult to compare or evaluate such methods within a single systematic framework. To address this challenge, we developed SparseDOSSA (Sparse Data Observations for the Simulation of Synthetic Abundances): a statistical model of microbial ecological population structure, which can be used to parameterize real-world microbial community profiles and to simulate new, realistic profiles of known structure for methods evaluation. Specifically, SparseDOSSA's model captures marginal microbial feature abundances as a zero-inflated log-normal distribution, with additional model components for absolute cell counts and the sequence read generation process, microbe-microbe, and microbe-environment interactions. Together, these allow fully known covariance structure between synthetic features (i.e. "taxa") or between features and "phenotypes" to be simulated for method benchmarking. Here, we demonstrate SparseDOSSA's performance for 1) accurately modeling human-associated microbial population profiles; 2) generating synthetic communities with controlled population and ecological structures; 3) spiking-in true positive synthetic associations to benchmark analysis methods; and 4) recapitulating an end-to-end mouse microbiome feeding experiment. Together, these represent the most common analysis types in assessment of real microbial community environmental and epidemiological statistics, thus demonstrating SparseDOSSA's utility as a general-purpose aid for modeling communities and evaluating quantitative methods. An open-source implementation is available at http://huttenhower.sph.harvard.edu/sparsedossa2.
Assuntos
Microbiota , Modelos Estatísticos , Algoritmos , Benchmarking , Biologia Computacional/métodos , Simulação por ComputadorRESUMO
It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2's linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles.
Assuntos
Biologia Computacional , Microbioma Gastrointestinal , Análise Multivariada , Simulação por Computador , Humanos , Doenças Inflamatórias Intestinais/genética , Doenças Inflamatórias Intestinais/metabolismo , Doenças Inflamatórias Intestinais/patologiaRESUMO
Complex microbial communities are an integral part of the Earth's ecosystem and of our bodies in health and disease. In the last two decades, culture-independent approaches have provided new insights into their structure and function, with the exponentially decreasing cost of high-throughput sequencing resulting in broadly available tools for microbial surveys. However, the field remains far from reaching a technological plateau, as both computational techniques and nucleotide sequencing platforms for microbial genomic and transcriptional content continue to improve. Current microbiome analyses are thus starting to adopt multiple and complementary meta'omic approaches, leading to unprecedented opportunities to comprehensively and accurately characterize microbial communities and their interactions with their environments and hosts. This diversity of available assays, analysis methods, and public data is in turn beginning to enable microbiome-based predictive and modeling tools. We thus review here the technological and computational meta'omics approaches that are already available, those that are under active development, their success in biological discovery, and several outstanding challenges.
Assuntos
Bactérias/genética , Regulação Bacteriana da Expressão Gênica , Metagenoma , Metagenômica/métodos , Consórcios Microbianos/genética , Software , Algoritmos , Bactérias/classificação , Simulação por Computador , Perfilação da Expressão Gênica , Humanos , Modelos Genéticos , FilogeniaRESUMO
Accurate cell type identification is a key and rate-limiting step in single-cell data analysis. Single-cell references with comprehensive cell types, reproducible and functionally validated cell identities, and common nomenclatures are much needed by the research community for automated cell type annotation, data integration, and data sharing. Here, we develop a computational pipeline utilizing the LungMAP CellCards as a dictionary to consolidate single-cell transcriptomic datasets of 104 human lungs and 17 mouse lung samples to construct LungMAP single-cell reference (CellRef) for both normal human and mouse lungs. CellRefs define 48 human and 40 mouse lung cell types catalogued from diverse anatomic locations and developmental time points. We demonstrate the accuracy and stability of LungMAP CellRefs and their utility for automated cell type annotation of both normal and diseased lungs using multiple independent methods and testing data. We develop user-friendly web interfaces for easy access and maximal utilization of the LungMAP CellRefs.
Assuntos
Perfilação da Expressão Gênica , Disseminação de Informação , Animais , Camundongos , Humanos , Análise de Célula Única , TranscriptomaRESUMO
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMO
Single-cell genomics is essential to chart tumor ecosystems. Although single-cell RNA-Seq (scRNA-Seq) profiles RNA from cells dissociated from fresh tumors, single-nucleus RNA-Seq (snRNA-Seq) is needed to profile frozen or hard-to-dissociate tumors. Each requires customization to different tissue and tumor types, posing a barrier to adoption. Here, we have developed a systematic toolbox for profiling fresh and frozen clinical tumor samples using scRNA-Seq and snRNA-Seq, respectively. We analyzed 216,490 cells and nuclei from 40 samples across 23 specimens spanning eight tumor types of varying tissue and sample characteristics. We evaluated protocols by cell and nucleus quality, recovery rate and cellular composition. scRNA-Seq and snRNA-Seq from matched samples recovered the same cell types, but at different proportions. Our work provides guidance for studies in a broad range of tumors, including criteria for testing and selecting methods from the toolbox for other tumors, thus paving the way for charting tumor atlases.
Assuntos
Algoritmos , Núcleo Celular/genética , Genômica/métodos , Neoplasias/genética , RNA-Seq/métodos , Análise de Célula Única/métodos , Adulto , Animais , Núcleo Celular/química , Núcleo Celular/metabolismo , Criança , Biologia Computacional/métodos , Feminino , Congelamento , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Humanos , Camundongos , Camundongos Knockout , Camundongos Nus , Neoplasias/metabolismo , Neoplasias/patologia , Análise de Sequência de RNA/métodos , Células Tumorais Cultivadas , Sequenciamento do Exoma/métodosRESUMO
Single-cell gene expression analyses of mammalian tissues have uncovered profound stage-specific molecular regulatory phenomena that have changed the understanding of unique cell types and signaling pathways critical for lineage determination, morphogenesis, and growth. We discuss here the case for a Pediatric Cell Atlas as part of the Human Cell Atlas consortium to provide single-cell profiles and spatial characterization of gene expression across human tissues and organs. Such data will complement adult and developmentally focused HCA projects to provide a rich cytogenomic framework for understanding not only pediatric health and disease but also environmental and genetic impacts across the human lifespan.
Assuntos
Desenvolvimento Embrionário/genética , Redes Reguladoras de Genes/genética , Pediatria/tendências , Análise de Célula Única/métodos , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento/genética , Humanos , Distribuição Tecidual/genéticaRESUMO
Utilizing microarray gene expression data in cancer research possesses the ability to identify deregulated cellular pathways involved in malignant development. This study investigated the relationships of three gene families, HOX, ErbB and IGFBP, with regard to the development of ovarian cancer. These families were of interest because of similar chromosomal locations and their deregulated expression in ovarian cancer. Higher level statistics were used to differentially analyze microarray data in 65 ovarian samples to assess correlation and relationships among the gene families of interest. Fifteen genes in the three families were found to be significantly deregulated. Thirty-eight significant correlations were found within and between the genes of interest. Our data indicates that the significantly modeled relationships between HOX, ErbB and IGFBP gene pairs could provide insight into the underlying biological mechanisms in ovarian cancer.
Assuntos
Receptores ErbB/genética , Regulação Neoplásica da Expressão Gênica , Proteínas de Homeodomínio/genética , Proteínas de Ligação a Fator de Crescimento Semelhante a Insulina/genética , Neoplasias Ovarianas/genética , Feminino , Humanos , Família Multigênica , Análise de Sequência com Séries de Oligonucleotídeos , RNA Neoplásico/genética , RNA Neoplásico/isolamento & purificação , Reação em Cadeia da Polimerase Via Transcriptase ReversaRESUMO
BACKGROUND: Obesity and type 2 diabetes (T2D) are linked both with host genetics and with environmental factors, including dysbioses of the gut microbiota. However, it is unclear whether these microbial changes precede disease onset. Twin cohorts present a unique genetically-controlled opportunity to study the relationships between lifestyle factors and the microbiome. In particular, we hypothesized that family-independent changes in microbial composition and metabolic function during the sub-clinical state of T2D could be either causal or early biomarkers of progression. METHODS: We collected fecal samples and clinical metadata from 20 monozygotic Korean twins at up to two time points, resulting in 36 stool shotgun metagenomes. While the participants were neither obese nor diabetic, they spanned the entire range of healthy to near-clinical values and thus enabled the study of microbial associations during sub-clinical disease while accounting for genetic background. RESULTS: We found changes both in composition and in function of the sub-clinical gut microbiome, including a decrease in Akkermansia muciniphila suggesting a role prior to the onset of disease, and functional changes reflecting a response to oxidative stress comparable to that previously observed in chronic T2D and inflammatory bowel diseases. Finally, our unique study design allowed us to examine the strain similarity between twins, and we found that twins demonstrate strain-level differences in composition despite species-level similarities. CONCLUSIONS: These changes in the microbiome might be used for the early diagnosis of an inflamed gut and T2D prior to clinical onset of the disease and will help to advance toward microbial interventions.
Assuntos
Diabetes Mellitus Tipo 2/microbiologia , Disbiose/microbiologia , Microbioma Gastrointestinal , Obesidade/microbiologia , Verrucomicrobia/classificação , Verrucomicrobia/isolamento & purificação , Adulto , Fezes/microbiologia , Feminino , Voluntários Saudáveis , Humanos , Masculino , Pessoa de Meia-Idade , Filogenia , Fatores de Risco , Gêmeos Monozigóticos , Verrucomicrobia/genéticaRESUMO
The increased availability of genomic and metagenomic data poses challenges at multiple analysis levels, including visualization of very large-scale microbial and microbial community data paired with rich metadata. We developed GraPhlAn (Graphical Phylogenetic Analysis), a computational tool that produces high-quality, compact visualizations of microbial genomes and metagenomes. This includes phylogenies spanning up to thousands of taxa, annotated with metadata ranging from microbial community abundances to microbial physiology or host and environmental phenotypes. GraPhlAn has been developed as an open-source command-driven tool in order to be easily integrated into complex, publication-quality bioinformatics pipelines. It can be executed either locally or through an online Galaxy web application. We present several examples including taxonomic and phylogenetic visualization of microbial communities, metabolic functions, and biomarker discovery that illustrate GraPhlAn's potential for modern microbial and community genomics.
RESUMO
BACKGROUND: Pouchitis is common after ileal pouch-anal anastomosis (IPAA) surgery for ulcerative colitis (UC). Similar to inflammatory bowel disease (IBD), both host genetics and the microbiota are implicated in its pathogenesis. We use the IPAA model of IBD to associate mucosal host gene expression with mucosal microbiomes and clinical outcomes. We analyze host transcriptomic data and 16S rRNA gene sequencing data from paired biopsies from IPAA patients with UC and familial adenomatous polyposis. To achieve power for a genome-wide microbiome-transcriptome association study, we use principal component analysis for transcript and clade reduction, and identify significant co-variation between clades and transcripts. RESULTS: Host transcripts co-vary primarily with biopsy location and inflammation, while microbes co-vary primarily with antibiotic use. Transcript-microbe associations are surprisingly modest, but the most strongly microbially-associated host transcript pattern is enriched for complement cascade genes and for the interleukin-12 pathway. Activation of these host processes is inversely correlated with Sutterella, Akkermansia, Bifidobacteria, and Roseburia abundance, and positively correlated with Escherichia abundance. CONCLUSIONS: This study quantifies the effects of inflammation, antibiotic use, and biopsy location upon the microbiome and host transcriptome during pouchitis. Understanding these effects is essential for basic biological insights as well as for well-designed and adequately-powered studies. Additionally, our study provides a method for profiling host-microbe interactions with appropriate statistical power using high-throughput sequencing, and suggests that cross-sectional changes in gut epithelial transcription are not a major component of the host-microbiome regulatory interface during pouchitis.
Assuntos
Bolsas Cólicas/microbiologia , Microbioma Gastrointestinal , Expressão Gênica , Interações Hospedeiro-Patógeno/genética , Doenças Inflamatórias Intestinais/microbiologia , Mucosa/microbiologia , Adolescente , Adulto , Idoso , Estudos de Coortes , Colite Ulcerativa/genética , Colite Ulcerativa/microbiologia , Bolsas Cólicas/patologia , Feminino , Trato Gastrointestinal/microbiologia , Perfilação da Expressão Gênica , Humanos , Doenças Inflamatórias Intestinais/cirurgia , Masculino , Análise em Microsséries , Pessoa de Meia-Idade , Análise Multivariada , Pouchite/genética , Pouchite/microbiologia , RNA Ribossômico 16S/genética , RNA Ribossômico 16S/isolamento & purificação , Análise de Sequência de DNA , Adulto JovemRESUMO
The gut microbiome is widely studied by fecal sampling, but the extent to which stool reflects the commensal composition at intestinal sites is poorly understood. We investigated this relationship in rhesus macaques by 16S sequencing feces and paired lumenal and mucosal samples from ten sites distal to the jejunum. Stool composition correlated highly with the colonic lumen and mucosa and moderately with the distal small intestine. The mucosal microbiota varied most based on location and was enriched in oxygen-tolerant taxa (e.g., Helicobacter and Treponema), while the lumenal microbiota showed inter-individual variation and obligate anaerobe enrichment (e.g., Firmicutes). This mucosal and lumenal community variability corresponded to functional differences, such as nutrient availability. Additionally, Helicobacter, Faecalibacterium, and Lactobacillus levels in stool were highly predictive of their abundance at most other gut sites. These results quantify the composition and biogeographic relationships between gut microbial communities in macaques and support fecal sampling for translational studies.
Assuntos
Microbioma Gastrointestinal , Animais , Análise por Conglomerados , DNA Bacteriano/química , DNA Bacteriano/genética , DNA Ribossômico/química , DNA Ribossômico/genética , Fezes/microbiologia , Mucosa Intestinal/microbiologia , Macaca mulatta , Dados de Sequência Molecular , Filogenia , RNA Ribossômico 16S/genética , Análise de Sequência de DNARESUMO
Interactions between the host and gut microbial community likely contribute to Crohn disease (CD) pathogenesis; however, direct evidence for these interactions at the onset of disease is lacking. Here, we characterized the global pattern of ileal gene expression and the ileal microbial community in 359 treatment-naive pediatric patients with CD, patients with ulcerative colitis (UC), and control individuals. We identified core gene expression profiles and microbial communities in the affected CD ilea that are preserved in the unaffected ilea of patients with colon-only CD but not present in those with UC or control individuals; therefore, this signature is specific to CD and independent of clinical inflammation. An abnormal increase of antimicrobial dual oxidase (DUOX2) expression was detected in association with an expansion of Proteobacteria in both UC and CD, while expression of lipoprotein APOA1 gene was downregulated and associated with CD-specific alterations in Firmicutes. The increased DUOX2 and decreased APOA1 gene expression signature favored oxidative stress and Th1 polarization and was maximally altered in patients with more severe mucosal injury. A regression model that included APOA1 gene expression and microbial abundance more accurately predicted month 6 steroid-free remission than a model using clinical factors alone. These CD-specific host and microbe profiles identify the ileum as the primary inductive site for all forms of CD and may direct prognostic and therapeutic approaches.
Assuntos
Apolipoproteína A-I/genética , Doença de Crohn/genética , Doença de Crohn/microbiologia , Íleo/metabolismo , Íleo/microbiologia , Microbiota , NADPH Oxidases/genética , Transcriptoma , Adolescente , Estudos de Casos e Controles , Criança , Estudos de Coortes , Colite Ulcerativa/genética , Colite Ulcerativa/microbiologia , Oxidases Duais , Feminino , Perfilação da Expressão Gênica , Humanos , Masculino , Proteobactérias/isolamento & purificaçãoRESUMO
Microbial community samples can be efficiently surveyed in high throughput by sequencing markers such as the 16S ribosomal RNA gene. Often, a collection of samples is then selected for subsequent metagenomic, metabolomic or other follow-up. Two-stage study design has long been used in ecology but has not yet been studied in-depth for high-throughput microbial community investigations. To avoid ad hoc sample selection, we developed and validated several purposive sample selection methods for two-stage studies (that is, biological criteria) targeting differing types of microbial communities. These methods select follow-up samples from large community surveys, with criteria including samples typical of the initially surveyed population, targeting specific microbial clades or rare species, maximizing diversity, representing extreme or deviant communities, or identifying communities distinct or discriminating among environment or host phenotypes. The accuracies of each sampling technique and their influences on the characteristics of the resulting selected microbial community were evaluated using both simulated and experimental data. Specifically, all criteria were able to identify samples whose properties were accurately retained in 318 paired 16S amplicon and whole-community metagenomic (follow-up) samples from the Human Microbiome Project. Some selection criteria resulted in follow-up samples that were strongly non-representative of the original survey population; diversity maximization particularly undersampled community configurations. Only selection of intentionally representative samples minimized differences in the selected sample set from the original microbial survey. An implementation is provided as the microPITA (Microbiomes: Picking Interesting Taxa for Analysis) software for two-stage study design of microbial communities.