Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
bioRxiv ; 2024 Sep 08.
Artigo em Inglês | MEDLINE | ID: mdl-39282381

RESUMO

Systems vaccinology studies have been used to build computational models that predict individual vaccine responses and identify the factors contributing to differences in outcome. Comparing such models is challenging due to variability in study designs. To address this, we established a community resource to compare models predicting B. pertussis booster responses and generate experimental data for the explicit purpose of model evaluation. We here describe our second computational prediction challenge using this resource, where we benchmarked 49 algorithms from 53 scientists. We found that the most successful models stood out in their handling of nonlinearities, reducing large feature sets to representative subsets, and advanced data preprocessing. In contrast, we found that models adopted from literature that were developed to predict vaccine antibody responses in other settings performed poorly, reinforcing the need for purpose-built models. Overall, this demonstrates the value of purpose-generated datasets for rigorous and open model evaluations to identify features that improve the reliability and applicability of computational models in vaccine response prediction.

2.
bioRxiv ; 2024 Jun 10.
Artigo em Inglês | MEDLINE | ID: mdl-38915554

RESUMO

Motivation: With the increased reliance on multi-omics data for bulk and single cell analyses, the availability of robust approaches to perform unsupervised analysis for clustering, visualization, and feature selection is imperative. Joint dimensionality reduction methods can be applied to multi-omics datasets to derive a global sample embedding analogous to single-omic techniques such as Principal Components Analysis (PCA). Multiple co-inertia analysis (MCIA) is a method for joint dimensionality reduction that maximizes the covariance between block- and global-level embeddings. Current implementations for MCIA are not optimized for large datasets such such as those arising from single cell studies, and lack capabilities with respect to embedding new data. Results: We introduce nipalsMCIA, an MCIA implementation that solves the objective function using an extension to Non-linear Iterative Partial Least Squares (NIPALS), and shows significant speed-up over earlier implementations that rely on eigendecompositions for single cell multi-omics data. It also removes the dependence on an eigendecomposition for calculating the variance explained, and allows users to perform out-of-sample embedding for new data. nipalsMCIA provides users with a variety of pre-processing and parameter options, as well as ease of functionality for down-stream analysis of single-omic and global-embedding factors. Availability: nipalsMCIA is available as a BioConductor package at https://bioconductor.org/packages/release/bioc/html/nipalsMCIA.html, and includes detailed documentation and application vignettes. Supplementary Materials are available online.

3.
bioRxiv ; 2024 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-38746164

RESUMO

HiChIP enables cost-effective and high-resolution profiling of regulatory and structural loops. To leverage the increasing number of publicly available HiChIP datasets from diverse cell lines and primary cells, we developed the Loop Catalog (https://loopcatalog.lji.org), a web-based database featuring HiChIP loop calls for 1319 samples across 133 studies and 44 high-resolution Hi-C loop calls. We demonstrate its utility in interpreting fine-mapped GWAS variants (SNP-to-gene linking), in identifying enriched sequence motifs and motif pairs at loop anchors, and in network-level analysis of loops connecting regulatory elements (community detection). Our comprehensive catalog, spanning over 4M unique 5kb loops, along with the accompanying analysis modalities constitutes an important resource for studies in gene regulation and genome organization.

4.
Cell Rep Methods ; 4(3): 100731, 2024 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-38490204

RESUMO

Systems vaccinology studies have identified factors affecting individual vaccine responses, but comparing these findings is challenging due to varying study designs. To address this lack of reproducibility, we established a community resource for comparing Bordetella pertussis booster responses and to host annual contests for predicting patients' vaccination outcomes. We report here on our experiences with the "dry-run" prediction contest. We found that, among 20+ models adopted from the literature, the most successful model predicting vaccination outcome was based on age alone. This confirms our concerns about the reproducibility of conclusions between different vaccinology studies. Further, we found that, for newly trained models, handling of baseline information on the target variables was crucial. Overall, multiple co-inertia analysis gave the best results of the tested modeling approaches. Our goal is to engage community in these prediction challenges by making data and models available and opening a public contest in August 2024.


Assuntos
Multiômica , Vacinas , Humanos , Vacinologia/métodos , Reprodutibilidade dos Testes , Simulação por Computador
5.
bioRxiv ; 2023 Aug 29.
Artigo em Inglês | MEDLINE | ID: mdl-37693565

RESUMO

Computational models that predict an individual's response to a vaccine offer the potential for mechanistic insights and personalized vaccination strategies. These models are increasingly derived from systems vaccinology studies that generate immune profiles from human cohorts pre- and post-vaccination. Most of these studies involve relatively small cohorts and profile the response to a single vaccine. The ability to assess the performance of the resulting models would be improved by comparing their performance on independent datasets, as has been done with great success in other areas of biology such as protein structure predictions. To transfer this approach to system vaccinology studies, we established a prototype platform that focuses on the evaluation of Computational Models of Immunity to Pertussis Booster vaccinations (CMI-PB). A community resource, CMI-PB generates experimental data for the explicit purpose of model evaluation, which is performed through a series of annual data releases and associated contests. We here report on our experience with the first such 'dry run' for a contest where the goal was to predict individual immune responses based on pre-vaccination multi-omic profiles. Over 30 models adopted from the literature were tested, but only one was predictive, and was based on age alone. The performance of new models built using CMI-PB training data was much better, but varied significantly based on the choice of pre-vaccination features used and the model building strategy. This suggests that previously published models developed for other vaccines do not generalize well to Pertussis Booster vaccination. Overall, these results reinforced the need for comparative analysis across models and datasets that CMI-PB aims to achieve. We are seeking wider community engagement for our first public prediction contest, which will open in early 2024.

7.
Sci Rep ; 13(1): 5420, 2023 04 03.
Artigo em Inglês | MEDLINE | ID: mdl-37012431

RESUMO

Changes in the three-dimensional (3D) structure of the genome are an emerging hallmark of cancer. Cancer-associated copy number variants and single nucleotide polymorphisms promote rewiring of chromatin loops, disruption of topologically associating domains (TADs), active/inactive chromatin state switching, leading to oncogene expression and silencing of tumor suppressors. However, little is known about 3D changes during cancer progression to a chemotherapy-resistant state. We integrated chromatin conformation capture (Hi-C), RNA-seq, and whole-genome sequencing obtained from triple-negative breast cancer patient-derived xenograft primary tumors (UCD52) and carboplatin-resistant samples and found increased short-range (< 2 Mb) interactions, chromatin looping, formation of TAD, chromatin state switching into a more active state, and amplification of ATP-binding cassette transporters. Transcriptome changes suggested the role of long-noncoding RNAs in carboplatin resistance. Rewiring of the 3D genome was associated with TP53, TP63, BATF, FOS-JUN family of transcription factors and led to activation of aggressiveness-, metastasis- and other cancer-related pathways. Integrative analysis highlighted increased ribosome biogenesis and oxidative phosphorylation, suggesting the role of mitochondrial energy metabolism. Our results suggest that 3D genome remodeling may be a key mechanism underlying carboplatin resistance.


Assuntos
Neoplasias de Mama Triplo Negativas , Humanos , Carboplatina/farmacologia , Carboplatina/uso terapêutico , Neoplasias de Mama Triplo Negativas/tratamento farmacológico , Neoplasias de Mama Triplo Negativas/genética , Xenoenxertos , Genoma , Cromatina
8.
Elife ; 82019 11 20.
Artigo em Inglês | MEDLINE | ID: mdl-31746734

RESUMO

The MHC region is highly associated with autoimmune and infectious diseases. Here we conduct an in-depth interrogation of associations between genetic variation, gene expression and disease. We create a comprehensive map of regulatory variation in the MHC region using WGS from 419 individuals to call eight-digit HLA types and RNA-seq data from matched iPSCs. Building on this regulatory map, we explored GWAS signals for 4083 traits, detecting colocalization for 180 disease loci with eQTLs. We show that eQTL analyses taking HLA type haplotypes into account have substantially greater power compared with only using single variants. We examined the association between the 8.1 ancestral haplotype and delayed colonization in Cystic Fibrosis, postulating that downregulation of RNF5 expression is the likely causal mechanism. Our study provides insights into the genetic architecture of the MHC region and pinpoints disease associations that are due to differential expression of HLA genes and non-HLA genes.


Assuntos
Fibrose Cística/genética , Predisposição Genética para Doença , Complexo Principal de Histocompatibilidade/genética , Locos de Características Quantitativas/genética , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Alelos , Mapeamento Cromossômico , Fibrose Cística/patologia , Feminino , Estudo de Associação Genômica Ampla , Antígenos HLA/genética , Haplótipos , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genética , RNA-Seq , Adulto Jovem
9.
Cell Stem Cell ; 20(4): 533-546.e7, 2017 04 06.
Artigo em Inglês | MEDLINE | ID: mdl-28388430

RESUMO

In this study, we used whole-genome sequencing and gene expression profiling of 215 human induced pluripotent stem cell (iPSC) lines from different donors to identify genetic variants associated with RNA expression for 5,746 genes. We were able to predict causal variants for these expression quantitative trait loci (eQTLs) that disrupt transcription factor binding and validated a subset of them experimentally. We also identified copy-number variant (CNV) eQTLs, including some that appear to affect gene expression by altering the copy number of intergenic regulatory regions. In addition, we were able to identify effects on gene expression of rare genic CNVs and regulatory single-nucleotide variants and found that reactivation of gene expression on the X chromosome depends on gene chromosomal position. Our work highlights the value of iPSCs for genetic association analyses and provides a unique resource for investigating the genetic regulation of gene expression in pluripotent cells.


Assuntos
Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica , Variação Genética , Células-Tronco Pluripotentes Induzidas/metabolismo , Sítios de Ligação/genética , Reprogramação Celular/genética , Cromossomos Humanos X/genética , Variações do Número de Cópias de DNA/genética , Heterogeneidade Genética , Humanos , Anotação de Sequência Molecular , Locos de Características Quantitativas/genética , Sequências Reguladoras de Ácido Nucleico/genética , Fatores de Transcrição/metabolismo
10.
Stem Cell Reports ; 8(4): 1086-1100, 2017 04 11.
Artigo em Inglês | MEDLINE | ID: mdl-28410642

RESUMO

Large-scale collections of induced pluripotent stem cells (iPSCs) could serve as powerful model systems for examining how genetic variation affects biology and disease. Here we describe the iPSCORE resource: a collection of systematically derived and characterized iPSC lines from 222 ethnically diverse individuals that allows for both familial and association-based genetic studies. iPSCORE lines are pluripotent with high genomic integrity (no or low numbers of somatic copy-number variants) as determined using high-throughput RNA-sequencing and genotyping arrays, respectively. Using iPSCs from a family of individuals, we show that iPSC-derived cardiomyocytes demonstrate gene expression patterns that cluster by genetic background, and can be used to examine variants associated with physiological and disease phenotypes. The iPSCORE collection contains representative individuals for risk and non-risk alleles for 95% of SNPs associated with human phenotypes through genome-wide association studies. Our study demonstrates the utility of iPSCORE for examining how genetic variants influence molecular and physiological traits in iPSCs and derived cell lines.


Assuntos
Arritmias Cardíacas/genética , Bases de Dados Factuais , Estudos de Associação Genética , Variação Genética , Células-Tronco Pluripotentes Induzidas/metabolismo , Miócitos Cardíacos/metabolismo , Arritmias Cardíacas/etnologia , Arritmias Cardíacas/metabolismo , Arritmias Cardíacas/fisiopatologia , Diferenciação Celular , Linhagem Celular , Reprogramação Celular/genética , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Células-Tronco Pluripotentes Induzidas/citologia , Família Multigênica , Miócitos Cardíacos/citologia , Análise de Sequência com Séries de Oligonucleotídeos , Fenótipo , Polimorfismo de Nucleotídeo Único , Grupos Raciais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA