Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Nat Genet ; 2024 Sep 26.
Artigo em Inglês | MEDLINE | ID: mdl-39327486

RESUMO

Disease risk alleles influence the composition of cells present in the body, but modeling genetic effects on the cell states revealed by single-cell profiling is difficult because variant-associated states may reflect diverse combinations of the profiled cell features that are challenging to predefine. We introduce Genotype-Neighborhood Associations (GeNA), a statistical tool to identify cell-state abundance quantitative trait loci (csaQTLs) in high-dimensional single-cell datasets. Instead of testing associations to predefined cell states, GeNA flexibly identifies the cell states whose abundance is most associated with genetic variants. In a genome-wide survey of single-cell RNA sequencing peripheral blood profiling from 969 individuals, GeNA identifies five independent loci associated with shifts in the relative abundance of immune cell states. For example, rs3003-T (P = 1.96 × 10-11) associates with increased abundance of natural killer cells expressing tumor necrosis factor response programs. This csaQTL colocalizes with increased risk for psoriasis, an autoimmune disease that responds to anti-tumor necrosis factor treatments. Flexibly characterizing csaQTLs for granular cell states may help illuminate how genetic background alters cellular composition to confer disease risk.

3.
Nat Genet ; 55(12): 2255-2268, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38036787

RESUMO

The human leukocyte antigen (HLA) locus plays a critical role in complex traits spanning autoimmune and infectious diseases, transplantation and cancer. While coding variation in HLA genes has been extensively documented, regulatory genetic variation modulating HLA expression levels has not been comprehensively investigated. Here we mapped expression quantitative trait loci (eQTLs) for classical HLA genes across 1,073 individuals and 1,131,414 single cells from three tissues. To mitigate technical confounding, we developed scHLApers, a pipeline to accurately quantify single-cell HLA expression using personalized reference genomes. We identified cell-type-specific cis-eQTLs for every classical HLA gene. Modeling eQTLs at single-cell resolution revealed that many eQTL effects are dynamic across cell states even within a cell type. HLA-DQ genes exhibit particularly cell-state-dependent effects within myeloid, B and T cells. For example, a T cell HLA-DQA1 eQTL ( rs3104371 ) is strongest in cytotoxic cells. Dynamic HLA regulation may underlie important interindividual variability in immune responses.


Assuntos
Regulação da Expressão Gênica , Locos de Características Quantitativas , Humanos , Regulação da Expressão Gênica/genética , Locos de Características Quantitativas/genética , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único
4.
medRxiv ; 2023 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-36993194

RESUMO

The human leukocyte antigen (HLA) locus plays a critical role in complex traits spanning autoimmune and infectious diseases, transplantation, and cancer. While coding variation in HLA genes has been extensively documented, regulatory genetic variation modulating HLA expression levels has not been comprehensively investigated. Here, we mapped expression quantitative trait loci (eQTLs) for classical HLA genes across 1,073 individuals and 1,131,414 single cells from three tissues, using personalized reference genomes to mitigate technical confounding. We identified cell-type-specific cis-eQTLs for every classical HLA gene. Modeling eQTLs at single-cell resolution revealed that many eQTL effects are dynamic across cell states even within a cell type. HLA-DQ genes exhibit particularly cell-state-dependent effects within myeloid, B, and T cells. Dynamic HLA regulation may underlie important interindividual variability in immune responses.

5.
Clin Immunol ; 246: 109209, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36539107

RESUMO

Children infected with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) develop less severe coronavirus disease 2019 (COVID-19) than adults. The mechanisms for the age-specific differences and the implications for infection-induced immunity are beginning to be uncovered. We show by longitudinal multimodal analysis that SARS-CoV-2 leaves a small footprint in the circulating T cell compartment in children with mild/asymptomatic COVID-19 compared to adult household contacts with the same disease severity who had more evidence of systemic T cell interferon activation, cytotoxicity and exhaustion. Children harbored diverse polyclonal SARS-CoV-2-specific naïve T cells whereas adults harbored clonally expanded SARS-CoV-2-specific memory T cells. A novel population of naïve interferon-activated T cells is expanded in acute COVID-19 and is recruited into the memory compartment during convalescence in adults but not children. This was associated with the development of robust CD4+ memory T cell responses in adults but not children. These data suggest that rapid clearance of SARS-CoV-2 in children may compromise their cellular immunity and ability to resist reinfection.


Assuntos
COVID-19 , Humanos , Adulto , SARS-CoV-2 , Linfócitos T CD4-Positivos , Imunidade Celular , Ativação Linfocitária , Anticorpos Antivirais
6.
Science ; 376(6589): eabf3041, 2022 04 08.
Artigo em Inglês | MEDLINE | ID: mdl-35389779

RESUMO

The human immune system displays substantial variation between individuals, leading to differences in susceptibility to autoimmune disease. We present single-cell RNA sequencing (scRNA-seq) data from 1,267,758 peripheral blood mononuclear cells from 982 healthy human subjects. For 14 cell types, we identified 26,597 independent cis-expression quantitative trait loci (eQTLs) and 990 trans-eQTLs, with most showing cell type-specific effects on gene expression. We subsequently show how eQTLs have dynamic allelic effects in B cells that are transitioning from naïve to memory states and demonstrate how commonly segregating alleles lead to interindividual variation in immune function. Finally, using a Mendelian randomization approach, we identify the causal route by which 305 risk loci contribute to autoimmune disease at the cellular level. This work brings together genetic epidemiology with scRNA-seq to uncover drivers of interindividual variation in the immune system.


Assuntos
Doenças Autoimunes , Leucócitos Mononucleares , Alelos , Doenças Autoimunes/genética , Regulação da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único , Células Precursoras de Linfócitos B , Locos de Características Quantitativas , Análise de Sequência de RNA
7.
Genome Med ; 14(1): 19, 2022 02 22.
Artigo em Inglês | MEDLINE | ID: mdl-35189942

RESUMO

BACKGROUND: While single-cell transcriptional profiling has greatly increased our capacity to interrogate biology, accurate cell classification within and between datasets is a key challenge. This is particularly so in pluripotent stem cell-derived organoids which represent a model of a developmental system. Here, clustering algorithms and selected marker genes can fail to accurately classify cellular identity while variation in analyses makes it difficult to meaningfully compare datasets. Kidney organoids provide a valuable resource to understand kidney development and disease. However, direct comparison of relative cellular composition between protocols has proved challenging. Hence, an unbiased approach for classifying cell identity is required. METHODS: The R package, scPred, was trained on multiple single cell RNA-seq datasets of human fetal kidney. A hierarchical model classified cellular subtypes into nephron, stroma and ureteric epithelial elements. This model, provided in the R package DevKidCC ( github.com/KidneyRegeneration/DevKidCC ), was then used to predict relative cell identity within published kidney organoid datasets generated using distinct cell lines and differentiation protocols, interrogating the impact of such variations. The package contains custom functions for the display of differential gene expression within cellular subtypes. RESULTS: DevKidCC was used to directly compare between distinct kidney organoid protocols, identifying differences in relative proportions of cell types at all hierarchical levels of the model and highlighting variations in stromal and unassigned cell types, nephron progenitor prevalence and relative maturation of individual epithelial segments. Of note, DevKidCC was able to distinguish distal nephron from ureteric epithelium, cell types with overlapping profiles that have previously confounded analyses. When applied to a variation in protocol via the addition of retinoic acid, DevKidCC identified a consequential depletion of nephron progenitors. CONCLUSIONS: The application of DevKidCC to kidney organoids reproducibly classifies component cellular identity within distinct single-cell datasets. The application of the tool is summarised in an interactive Shiny application, as are examples of the utility of in-built functions for data presentation. This tool will enable the consistent and rapid comparison of kidney organoid protocols, driving improvements in patterning to kidney endpoints and validating new approaches.


Assuntos
Organoides , Células-Tronco Pluripotentes , Diferenciação Celular/genética , Humanos , Rim , Organogênese/genética , Células-Tronco Pluripotentes/metabolismo
8.
Clin Transl Immunology ; 10(7): e1308, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34221402

RESUMO

OBJECTIVES: A recent single-cell RNA sequencing study by Wilk et al. suggested that plasmablasts can transdifferentiate into 'developing neutrophils' in patients with severe COVID-19 disease. We explore the evidence for this. METHODS: We downloaded the original data and code used by the authors in their study to replicate their findings and explore the possibility that regressing out variables may have led the authors to overfit their data. RESULTS: The lineage relationship between plasmablasts and developing neutrophils breaks down when key features are not regressed out, and the data are not overfitted during the analysis. CONCLUSION: Plasmablasts do not transdifferentiate into developing neutrophils. The single-cell RNA sequencing is a powerful technique for biological discovery and hypothesis generation. However, caution should be exercised in the bioinformatic analysis and interpretation of the data and findings cross-validated by orthogonal techniques.

9.
Bioinformatics ; 37(16): 2485-2487, 2021 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-33459785

RESUMO

SUMMARY: Data sparsity in single-cell experiments prevents an accurate assessment of gene expression when visualized in a low-dimensional space. Here, we introduce Nebulosa, an R package that uses weighted kernel density estimation to recover signals lost through drop-out or low expression. AVAILABILITY AND IMPLEMENTATION: Nebulosa can be easily installed from www.github.com/powellgenomicslab/Nebulosa. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

10.
Nat Commun ; 11(1): 6291, 2020 Dec 02.
Artigo em Inglês | MEDLINE | ID: mdl-33268785

RESUMO

A Correction to this paper has been published: https://doi.org/10.1038/s41467-020-20288-9.

11.
Nat Commun ; 11(1): 5650, 2020 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-33159064

RESUMO

Many computational methods have been developed to infer cell type proportions from bulk transcriptomics data. However, an evaluation of the impact of data transformation, pre-processing, marker selection, cell type composition and choice of methodology on the deconvolution results is still lacking. Using five single-cell RNA-sequencing (scRNA-seq) datasets, we generate pseudo-bulk mixtures to evaluate the combined impact of these factors. Both bulk deconvolution methodologies and those that use scRNA-seq data as reference perform best when applied to data in linear scale and the choice of normalization has a dramatic impact on some, but not all methods. Overall, methods that use scRNA-seq data have comparable performance to the best performing bulk methods whereas semi-supervised approaches show higher error values. Moreover, failure to include cell types in the reference that are present in a mixture leads to substantially worse results, regardless of the previous choices. Altogether, we evaluate the combined impact of factors affecting the deconvolution task across different datasets and propose general guidelines to maximize its performance.


Assuntos
Biologia Computacional/normas , Perfilação da Expressão Gênica/normas , Transcriptoma , Animais , Células/metabolismo , Biologia Computacional/métodos , Mineração de Dados , Perfilação da Expressão Gênica/métodos , Humanos , Camundongos , Análise de Sequência de RNA , Análise de Célula Única , Especificidade da Espécie
12.
Bioinformatics ; 36(16): 4532-4534, 2020 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-32573705

RESUMO

SUMMARY: RegulonDB has collected, harmonized and centralized data from hundreds of experiments for nearly two decades and is considered a point of reference for transcriptional regulation in Escherichia coli K12. Here, we present the regutools R package to facilitate programmatic access to RegulonDB data in computational biology. regutools gives researchers the possibility of writing reproducible workflows with automated queries to RegulonDB. The regutools package serves as a bridge between RegulonDB data and the Bioconductor ecosystem by reusing the data structures and statistical methods powered by other Bioconductor packages. We demonstrate the integration of regutools with Bioconductor by analyzing transcription factor DNA binding sites and transcriptional regulatory networks from RegulonDB. We anticipate that regutools will serve as a useful building block in our progress to further our understanding of gene regulatory networks. AVAILABILITY AND IMPLEMENTATION: regutools is an R package available through Bioconductor at bioconductor.org/packages/regutools.


Assuntos
Ecossistema , Escherichia coli K12 , Biologia Computacional , Escherichia coli K12/genética , Redes Reguladoras de Genes , Software
13.
Genome Biol ; 20(1): 264, 2019 12 12.
Artigo em Inglês | MEDLINE | ID: mdl-31829268

RESUMO

Single-cell RNA sequencing has enabled the characterization of highly specific cell types in many tissues, as well as both primary and stem cell-derived cell lines. An important facet of these studies is the ability to identify the transcriptional signatures that define a cell type or state. In theory, this information can be used to classify an individual cell based on its transcriptional profile. Here, we present scPred, a new generalizable method that is able to provide highly accurate classification of single cells, using a combination of unbiased feature selection from a reduced-dimension space, and machine-learning probability-based prediction method. We apply scPred to scRNA-seq data from pancreatic tissue, mononuclear cells, colorectal tumor biopsies, and circulating dendritic cells and show that scPred is able to classify individual cells with high accuracy. The generalized method is available at https://github.com/powellgenomicslab/scPred/.


Assuntos
Análise de Sequência de RNA , Análise de Célula Única/métodos , Classificação/métodos , Células Dendríticas/química , Neoplasias Gástricas/química , Neoplasias Gástricas/patologia
14.
Bioinformatics ; 33(24): 4033-4040, 2017 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-27592709

RESUMO

MOTIVATION: RNA sequencing (RNA-seq) experiments now span hundreds to thousands of samples. Current spliced alignment software is designed to analyze each sample separately. Consequently, no information is gained from analyzing multiple samples together, and it requires extra work to obtain analysis products that incorporate data from across samples. RESULTS: We describe Rail-RNA, a cloud-enabled spliced aligner that analyzes many samples at once. Rail-RNA eliminates redundant work across samples, making it more efficient as samples are added. For many samples, Rail-RNA is more accurate than annotation-assisted aligners. We use Rail-RNA to align 667 RNA-seq samples from the GEUVADIS project on Amazon Web Services in under 16 h for US$0.91 per sample. Rail-RNA outputs alignments in SAM/BAM format; but it also outputs (i) base-level coverage bigWigs for each sample; (ii) coverage bigWigs encoding normalized mean and median coverages at each base across samples analyzed; and (iii) exon-exon splice junctions and indels (features) in columnar formats that juxtapose coverages in samples in which a given feature is found. Supplementary outputs are ready for use with downstream packages for reproducible statistical analysis. We use Rail-RNA to identify expressed regions in the GEUVADIS samples and show that both annotated and unannotated (novel) expressed regions exhibit consistent patterns of variation across populations and with respect to known confounding variables. AVAILABILITY AND IMPLEMENTATION: Rail-RNA is open-source software available at http://rail.bio. CONTACTS: anellore@gmail.com or langmea@cs.jhu.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Splicing de RNA , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos , Software , Éxons , Perfilação da Expressão Gênica
15.
Genome Biol ; 17(1): 266, 2016 12 30.
Artigo em Inglês | MEDLINE | ID: mdl-28038678

RESUMO

BACKGROUND: Gene annotations, such as those in GENCODE, are derived primarily from alignments of spliced cDNA sequences and protein sequences. The impact of RNA-seq data on annotation has been confined to major projects like ENCODE and Illumina Body Map 2.0. RESULTS: We aligned 21,504 Illumina-sequenced human RNA-seq samples from the Sequence Read Archive (SRA) to the human genome and compared detected exon-exon junctions with junctions in several recent gene annotations. We found 56,861 junctions (18.6%) in at least 1000 samples that were not annotated, and their expression associated with tissue type. Junctions well expressed in individual samples tended to be annotated. Newer samples contributed few novel well-supported junctions, with the vast majority of detected junctions present in samples before 2013. We compiled junction data into a resource called intropolis available at http://intropolis.rail.bio . We used this resource to search for a recently validated isoform of the ALK gene and characterized the potential functional implications of unannotated junctions with publicly available TRAP-seq data. CONCLUSIONS: Considering only the variation contained in annotation may suffice if an investigator is interested only in well-expressed transcript isoforms. However, genes that are not generally well expressed and nonetheless present in a small but significant number of samples in the SRA are likelier to be incompletely annotated. The rate at which evidence for novel junctions has been added to the SRA has tapered dramatically, even to the point of an asymptote. Now is perhaps an appropriate time to update incomplete annotations to include splicing present in the now-stable snapshot provided by the SRA.


Assuntos
Anotação de Sequência Molecular , Sítios de Splice de RNA , Splicing de RNA/genética , Processamento Alternativo , Biologia Computacional/métodos , Éxons , Regulação da Expressão Gênica , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Reprodutibilidade dos Testes , Análise de Sequência de RNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA