Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Cell Rep ; 35(8): 109174, 2021 05 25.
Artigo em Inglês | MEDLINE | ID: mdl-34038736

RESUMO

The CD8+ T cell response to an antigen is composed of many T cell clones with unique T cell receptors, together forming a heterogeneous repertoire of effector and memory cells. How individual T cell clones contribute to this heterogeneity throughout immune responses remains largely unknown. In this study, we longitudinally track human CD8+ T cell clones expanding in response to yellow fever virus (YFV) vaccination at the single-cell level. We observed a drop in clonal diversity in blood from the acute to memory phase, suggesting that clonal selection shapes the circulating memory repertoire. Clones in the memory phase display biased differentiation trajectories along a gradient from stem cell to terminally differentiated effector memory fates. In secondary responses, YFV- and influenza-specific CD8+ T cell clones are poised to recapitulate skewed differentiation trajectories. Collectively, we show that the sum of distinct clonal phenotypes results in the multifaceted human T cell response to acute viral infections.


Assuntos
Linfócitos T CD8-Positivos/imunologia , Viroses/virologia , Febre Amarela/virologia , Doença Aguda , Diferenciação Celular , Células Cultivadas , Humanos
2.
Mol Ecol Resour ; 21(8): 2598-2613, 2021 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-33950563

RESUMO

Simulation-based methods such as approximate Bayesian computation (ABC) are well-adapted to the analysis of complex scenarios of populations and species genetic history. In this context, supervised machine learning (SML) methods provide attractive statistical solutions to conduct efficient inferences about scenario choice and parameter estimation. The Random Forest methodology (RF) is a powerful ensemble of SML algorithms used for classification or regression problems. Random Forest allows conducting inferences at a low computational cost, without preliminary selection of the relevant components of the ABC summary statistics, and bypassing the derivation of ABC tolerance levels. We have implemented a set of RF algorithms to process inferences using simulated data sets generated from an extended version of the population genetic simulator implemented in DIYABC v2.1.0. The resulting computer package, named DIYABC Random Forest v1.0, integrates two functionalities into a user-friendly interface: the simulation under custom evolutionary scenarios of different types of molecular data (microsatellites, DNA sequences or SNPs) and RF treatments including statistical tools to evaluate the power and accuracy of inferences. We illustrate the functionalities of DIYABC Random Forest v1.0 for both scenario choice and parameter estimation through the analysis of pseudo-observed and real data sets corresponding to pool-sequencing and individual-sequencing SNP data sets. Because of the properties inherent to the implemented RF methods and the large feature vector (including various summary statistics and their linear combinations) available for SNP data, DIYABC Random Forest v1.0 can efficiently contribute to the analysis of large SNP data sets to make inferences about complex population genetic histories.


Assuntos
Algoritmos , Genética Populacional , Teorema de Bayes , Simulação por Computador , Demografia , Polimorfismo de Nucleotídeo Único , Aprendizado de Máquina Supervisionado
3.
Bioinformatics ; 35(20): 4011-4019, 2019 10 15.
Artigo em Inglês | MEDLINE | ID: mdl-30865271

RESUMO

MOTIVATION: The development of high-throughput single-cell sequencing technologies now allows the investigation of the population diversity of cellular transcriptomes. The expression dynamics (gene-to-gene variability) can be quantified more accurately, thanks to the measurement of lowly expressed genes. In addition, the cell-to-cell variability is high, with a low proportion of cells expressing the same genes at the same time/level. Those emerging patterns appear to be very challenging from the statistical point of view, especially to represent a summarized view of single-cell expression data. Principal component analysis (PCA) is a most powerful tool for high dimensional data representation, by searching for latent directions catching the most variability in the data. Unfortunately, classical PCA is based on Euclidean distance and projections that poorly work in presence of over-dispersed count data with dropout events like single-cell expression data. RESULTS: We propose a probabilistic Count Matrix Factorization (pCMF) approach for single-cell expression data analysis that relies on a sparse Gamma-Poisson factor model. This hierarchical model is inferred using a variational EM algorithm. It is able to jointly build a low dimensional representation of cells and genes. We show how this probabilistic framework induces a geometry that is suitable for single-cell data visualization, and produces a compression of the data that is very powerful for clustering purposes. Our method is competed against other standard representation methods like t-SNE, and we illustrate its performance for the representation of single-cell expression data. AVAILABILITY AND IMPLEMENTATION: Our work is implemented in the pCMF R-package (https://github.com/gdurif/pCMF). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Análise de Dados , Software , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Célula Única
4.
Bioinformatics ; 34(3): 485-493, 2018 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-28968879

RESUMO

Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection, which combined constitute a powerful framework for classification, as well as data visualization and interpretation. However, current proposed combinations lead to unstable and non convergent methods due to inappropriate computational frameworks. We hereby propose a computationally stable and convergent approach for classification in high dimensional based on sparse Partial Least Squares (sparse PLS). Results: We start by proposing a new solution for the sparse PLS problem that is based on proximal operators for the case of univariate responses. Then we develop an adaptive version of the sparse PLS for classification, called logit-SPLS, which combines iterative optimization of logistic regression and sparse PLS to ensure computational convergence and stability. Our results are confirmed on synthetic and experimental data. In particular, we show how crucial convergence and stability can be when cross-validation is involved for calibration purposes. Using gene expression data, we explore the prediction of breast cancer relapse. We also propose a multicategorial version of our method, used to predict cell-types based on single-cell expression data. Availability and implementation: Our approach is implemented in the plsgenomics R-package. Contact: ghislain.durif@inria.fr. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Modelos Logísticos , Análise de Sequência de DNA/métodos , Software , Calibragem , Genômica/métodos , Genômica/normas , Análise dos Mínimos Quadrados , Análise de Sequência de DNA/normas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...