Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 80
Filtrar
1.
Elife ; 122024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38635416

RESUMO

Transposable elements (TEs) are repetitive sequences representing ~45% of the human and mouse genomes and are highly expressed by medullary thymic epithelial cells (mTECs). In this study, we investigated the role of TEs on T-cell development in the thymus. We performed multiomic analyses of TEs in human and mouse thymic cells to elucidate their role in T-cell development. We report that TE expression in the human thymus is high and shows extensive age- and cell lineage-related variations. TE expression correlates with multiple transcription factors in all cell types of the human thymus. Two cell types express particularly broad TE repertoires: mTECs and plasmacytoid dendritic cells (pDCs). In mTECs, transcriptomic data suggest that TEs interact with transcription factors essential for mTEC development and function (e.g., PAX1 and REL), and immunopeptidomic data showed that TEs generate MHC-I-associated peptides implicated in thymocyte education. Notably, AIRE, FEZF2, and CHD4 regulate small yet non-redundant sets of TEs in murine mTECs. Human thymic pDCs homogenously express large numbers of TEs that likely form dsRNA, which can activate innate immune receptors, potentially explaining why thymic pDCs constitutively secrete IFN ɑ/ß. This study highlights the diversity of interactions between TEs and the adaptive immune system. TEs are genetic parasites, and the two thymic cell types most affected by TEs (mTEcs and pDCs) are essential to establishing central T-cell tolerance. Therefore, we propose that orchestrating TE expression in thymic cells is critical to prevent autoimmunity in vertebrates.


Assuntos
Proteína AIRE , Elementos de DNA Transponíveis , Camundongos , Humanos , Animais , Timo/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Timócitos/metabolismo , Células Epiteliais/metabolismo , Diferenciação Celular/genética , Camundongos Endogâmicos C57BL
2.
Bioinform Adv ; 3(1): vbad097, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37720006

RESUMO

Summary: We describe the problem of computing local feature attributions for dimensionality reduction methods. We use one such method that is well established within the context of supervised classification-using the gradients of target outputs with respect to the inputs-on the popular dimensionality reduction technique t-SNE, widely used in analyses of biological data. We provide an efficient implementation for the gradient computation for this dimensionality reduction technique. We show that our explanations identify significant features using novel validation methodology; using synthetic datasets and the popular MNIST benchmark dataset. We then demonstrate the practical utility of our algorithm by showing that it can produce explanations that agree with domain knowledge on a SARS-CoV-2 sequence dataset. Throughout, we provide a road map so that similar explanation methods could be applied to other dimensionality reduction techniques to rigorously analyze biological datasets. Availability and implementation: We have created a Python package that can be installed using the following command: pip install interpretable_tsne. All code used can be found at github.com/MattScicluna/interpretable_tsne.

3.
Genome Biol ; 24(1): 188, 2023 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-37582761

RESUMO

MHC-I-associated peptides deriving from non-coding genomic regions and mutations can generate tumor-specific antigens, including neoantigens. Quantifying tumor-specific antigens' RNA expression in malignant and benign tissues is critical for discriminating actionable targets. We present BamQuery, a tool attributing an exhaustive RNA expression to MHC-I-associated peptides of any origin from bulk and single-cell RNA-sequencing data. We show that many cryptic and mutated tumor-specific antigens can derive from multiple discrete genomic regions, abundantly expressed in normal tissues. BamQuery can also be used to predict MHC-I-associated peptides immunogenicity and identify actionable tumor-specific antigens de novo.


Assuntos
Neoplasias , Proteogenômica , Humanos , Antígenos de Neoplasias/genética , Antígenos de Histocompatibilidade Classe I , Neoplasias/genética , Peptídeos/genética , RNA
4.
iScience ; 25(9): 104968, 2022 Sep 16.
Artigo em Inglês | MEDLINE | ID: mdl-36111255

RESUMO

Based on analyses of TCR sequences from over 1,000 individuals, we report that the TCR repertoire is composed of two ontogenically and functionally distinct types of TCRs. Their production is regulated by variations in thymic output and terminal deoxynucleotidyl transferase (TDT) activity. Neonatal TCRs derived from TDT-negative progenitors persist throughout life, are highly shared among subjects, and are reported as disease-associated. Thus, 10%-30% of most frequent cord blood TCRs are associated with common pathogens and autoantigens. TDT-dependent TCRs present distinct structural features and are less shared among subjects. TDT-dependent TCRs are produced in maximal numbers during infancy when thymic output and TDT activity reach a summit, are more abundant in subjects with AIRE mutations, and seem to play a dominant role in graft-versus-host disease. Factors decreasing thymic output (age, male sex) negatively impact TCR diversity. Males compensate for their lower repertoire diversity via hyperexpansion of selected TCR clonotypes.

5.
Anal Chem ; 94(35): 12086-12094, 2022 09 06.
Artigo em Inglês | MEDLINE | ID: mdl-35995421

RESUMO

The sensitivity and depth of proteomic analyses are limited by isobaric ions and interferences that preclude the identification of low abundance peptides. Extensive sample fractionation is often required to extend proteome coverage when sample amount is not a limitation. Ion mobility devices provide a viable alternate approach to resolve confounding ions and improve peak capacity and mass spectrometry (MS) sensitivity. Here, we report the integration of differential ion mobility with segmented ion fractionation (SIFT) to enhance the comprehensiveness of proteomic analyses. The combination of differential ion mobility and SIFT, where narrow windows of ∼m/z 100 are acquired in turn, is found particularly advantageous in the analysis of protein digests and typically provided more than 60% gain in identification compared to conventional single-shot LC-MS/MS. The application of this approach is further demonstrated for the analysis of tryptic digests from different colorectal cancer cell lines where the enhanced sensitivity enabled the identification of single amino acid variants that were correlated with the corresponding transcriptomic data sets.


Assuntos
Neoplasias do Colo , Proteogenômica , Cromatografia Líquida/métodos , Neoplasias do Colo/genética , Humanos , Íons , Proteoma , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos
6.
Cell Rep ; 40(7): 111241, 2022 08 16.
Artigo em Inglês | MEDLINE | ID: mdl-35977509

RESUMO

Previous reports showed that mouse vaccination with pluripotent stem cells (PSCs) induces durable anti-tumor immune responses via T cell recognition of some elusive oncofetal epitopes. We characterize the MHC I-associated peptide (MAP) repertoire of human induced PSCs (iPSCs) using proteogenomics. Our analyses reveal a set of 46 pluripotency-associated MAPs (paMAPs) absent from the transcriptome of normal tissues and adult stem cells but expressed in PSCs and multiple adult cancers. These paMAPs derive from coding and allegedly non-coding (48%) transcripts involved in pluripotency maintenance, and their expression in The Cancer Genome Atlas samples correlates with source gene hypomethylation and genomic aberrations common across cancer types. We find that several of these paMAPs were immunogenic. However, paMAP expression in tumors coincides with activation of pathways instrumental in immune evasion (WNT, TGF-ß, and CDK4/6). We propose that currently available inhibitors of these pathways could synergize with immune targeting of paMAPs for the treatment of poorly differentiated cancers.


Assuntos
Células-Tronco Pluripotentes Induzidas , Neoplasias , Células-Tronco Pluripotentes , Animais , Antígenos de Histocompatibilidade Classe I/metabolismo , Humanos , Camundongos , Neoplasias/metabolismo , Peptídeos/metabolismo , Células-Tronco Pluripotentes/metabolismo
8.
Front Immunol ; 13: 867443, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35401501

RESUMO

Early T-cell development is precisely controlled by E proteins, that indistinguishably include HEB/TCF12 and E2A/TCF3 transcription factors, together with NOTCH1 and pre-T cell receptor (TCR) signalling. Importantly, perturbations of early T-cell regulatory networks are implicated in leukemogenesis. NOTCH1 gain of function mutations invariably lead to T-cell acute lymphoblastic leukemia (T-ALL), whereas inhibition of E proteins accelerates leukemogenesis. Thus, NOTCH1, pre-TCR, E2A and HEB functions are intertwined, but how these pathways contribute individually or synergistically to leukemogenesis remain to be documented. To directly address these questions, we leveraged Cd3e-deficient mice in which pre-TCR signaling and progression through ß-selection is abrogated to dissect and decouple the roles of pre-TCR, NOTCH1, E2A and HEB in SCL/TAL1-induced T-ALL, via the use of Notch1 gain of function transgenic (Notch1ICtg) and Tcf12+/- or Tcf3+/- heterozygote mice. As a result, we now provide evidence that both HEB and E2A restrain cell proliferation at the ß-selection checkpoint while the clonal expansion of SCL-LMO1-induced pre-leukemic stem cells in T-ALL is uniquely dependent on Tcf12 gene dosage. At the molecular level, HEB protein levels are decreased via proteasomal degradation at the leukemic stage, pointing to a reversible loss of function mechanism. Moreover, in SCL-LMO1-induced T-ALL, loss of one Tcf12 allele is sufficient to bypass pre-TCR signaling which is required for Notch1 gain of function mutations and for progression to T-ALL. In contrast, Tcf12 monoallelic deletion does not accelerate Notch1IC-induced T-ALL, indicating that Tcf12 and Notch1 operate in the same pathway. Finally, we identify a tumor suppressor gene set downstream of HEB, exhibiting significantly lower expression levels in pediatric T-ALL compared to B-ALL and brain cancer samples, the three most frequent pediatric cancers. In summary, our results indicate a tumor suppressor function of HEB/TCF12 in T-ALL to mitigate cell proliferation controlled by NOTCH1 in pre-leukemic stem cells and prevent NOTCH1-driven progression to T-ALL.


Assuntos
Leucemia-Linfoma Linfoblástico de Células T Precursoras , Animais , Fatores de Transcrição Hélice-Alça-Hélice Básicos/metabolismo , Humanos , Camundongos , Leucemia-Linfoma Linfoblástico de Células T Precursoras/genética , Proteínas Proto-Oncogênicas/metabolismo , Receptor Notch1/genética , Receptor Notch1/metabolismo , Receptores de Antígenos de Linfócitos T , Proteína 1 de Leucemia Linfocítica Aguda de Células T , Linfócitos T/metabolismo , Fatores de Transcrição/metabolismo
9.
Blood Adv ; 6(2): 509-514, 2022 01 25.
Artigo em Inglês | MEDLINE | ID: mdl-34731885

RESUMO

Cholesterol homeostasis has been proposed as one mechanism contributing to chemoresistance in AML and hence, inclusion of statins in therapeutic regimens as part of clinical trials in AML has shown encouraging results. Chemical screening of primary human AML specimens by our group led to the identification of lipophilic statins as potent inhibitors of AMLs from a wide range of cytogenetic groups. Genetic screening to identify modulators of the statin response uncovered the role of protein geranylgeranylation and of RAB proteins, coordinating various aspect of vesicular trafficking, in mediating the effects of statins on AML cell viability. We further show that statins can inhibit vesicle-mediated transport in primary human specimens, and that statins sensitive samples show expression signatures reminiscent of enhanced vesicular trafficking. Overall, this study sheds light into the mechanism of action of statins in AML and identifies a novel vulnerability for cytogenetically diverse AML.


Assuntos
Inibidores de Hidroximetilglutaril-CoA Redutases , Leucemia Mieloide Aguda , Humanos , Inibidores de Hidroximetilglutaril-CoA Redutases/farmacologia , Inibidores de Hidroximetilglutaril-CoA Redutases/uso terapêutico , Leucemia Mieloide Aguda/tratamento farmacológico , Leucemia Mieloide Aguda/genética
10.
J Med Internet Res ; 23(11): e26450, 2021 11 11.
Artigo em Inglês | MEDLINE | ID: mdl-34762055

RESUMO

BACKGROUND: This study aims to identify a novel potential use for web portals in health care and health research: their adoption for the purposes of rapidly sharing health research findings with clinicians, scientists, and patients. In the era of precision medicine and learning health systems, the translation of research findings into targeted therapies depends on the availability of big data and emerging research results. Web portals may work to promote the availability of novel research, working in tandem with traditional scientific publications and conference proceedings. OBJECTIVE: This study aims to assess the potential use of web portals, which facilitate the sharing of health research findings among researchers, clinicians, patients, and the public. It also summarizes the potential legal, ethical, and policy implications associated with such tools for public use and in the management of patient care for complex diseases. METHODS: This study broadly adopts the methods for scoping literature reviews outlined by Arskey and O'Malley in 2005. Raised by the integration of web portals into patient care for complex diseases, we systematically searched 3 databases, PubMed, Scopus, and WestLaw Next, for sources describing web portals for sharing health research findings among clinicians, researchers, and patients and their associated legal, ethical, and policy challenges. Of the 719 candidate source citations, 22 were retained for the review. RESULTS: We found varied and inconsistent treatment of web portals for sharing health research findings among clinicians, researchers, and patients. Although the literature supports the view that portals of this kind are potentially highly promising, they remain novel and are not yet widely adopted. We also found a wide range of discussions on the legal, ethical, and policy issues related to the use of web portals to share research data. CONCLUSIONS: We identified 5 important legal and ethical challenges: privacy and confidentiality, patient health literacy, equity, training, and decision-making. We contend that each of these has meaningful implications for the increased integration of web portals into clinical care.


Assuntos
Letramento em Saúde , Portais do Paciente , Bibliometria , Big Data , Humanos
11.
PLoS Comput Biol ; 17(10): e1009482, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34679099

RESUMO

MHC-I associated peptides (MAPs) play a central role in the elimination of virus-infected and neoplastic cells by CD8 T cells. However, accurately predicting the MAP repertoire remains difficult, because only a fraction of the transcriptome generates MAPs. In this study, we investigated whether codon arrangement (usage and placement) regulates MAP biogenesis. We developed an artificial neural network called Codon Arrangement MAP Predictor (CAMAP), predicting MAP presentation solely from mRNA sequences flanking the MAP-coding codons (MCCs), while excluding the MCC per se. CAMAP predictions were significantly more accurate when using original codon sequences than shuffled codon sequences which reflect amino acid usage. Furthermore, predictions were independent of mRNA expression and MAP binding affinity to MHC-I molecules and applied to several cell types and species. Combining MAP ligand scores, transcript expression level and CAMAP scores was particularly useful to increase MAP prediction accuracy. Using an in vitro assay, we showed that varying the synonymous codons in the regions flanking the MCCs (without changing the amino acid sequence) resulted in significant modulation of MAP presentation at the cell surface. Taken together, our results demonstrate the role of codon arrangement in the regulation of MAP presentation and support integration of both translational and post-translational events in predictive algorithms to ameliorate modeling of the immunopeptidome.


Assuntos
Códon , Biologia Computacional/métodos , Antígenos de Histocompatibilidade Classe I , Redes Neurais de Computação , Algoritmos , Sequência de Aminoácidos , Códon/química , Códon/genética , Códon/metabolismo , Antígenos de Histocompatibilidade Classe I/química , Antígenos de Histocompatibilidade Classe I/genética , Antígenos de Histocompatibilidade Classe I/metabolismo , Humanos
12.
Front Immunol ; 12: 698565, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34434190

RESUMO

T-cell dysfunction arising upon repeated antigen exposure prevents effective immunity and immunotherapy. Using various clinically and physiologically relevant systems, we show that a prominent feature of PD-1-expressing exhausted T cells is the development of cellular senescence features both in vivo and ex vivo. This is associated with p16INK4a expression and an impaired cell cycle G1 to S-phase transition in repeatedly stimulated T cells. We show that these T cells accumulate DNA damage and activate the p38MAPK signaling pathway, which preferentially leads to p16INK4a upregulation. However, in highly dysfunctional T cells, p38MAPK inhibition does not restore functionality despite attenuating senescence features. In contrast, p16INK4a targeting can improve T-cell functionality in exhausted CAR T cells. Collectively, this work provides insights into the development of T-cell dysfunction and identifies T-cell senescence as a potential target in immunotherapy.


Assuntos
Linfócitos T CD8-Positivos/imunologia , Senescência Celular/imunologia , Inibidor p16 de Quinase Dependente de Ciclina/imunologia , Ativação Linfocitária/imunologia , Receptor de Morte Celular Programada 1/imunologia , Animais , Humanos , Camundongos , Camundongos Endogâmicos C57BL
13.
Immunity ; 54(4): 737-752.e10, 2021 04 13.
Artigo em Inglês | MEDLINE | ID: mdl-33740418

RESUMO

Acute myeloid leukemia (AML) has not benefited from innovative immunotherapies, mainly because of the lack of actionable immune targets. Using an original proteogenomic approach, we analyzed the major histocompatibility complex class I (MHC class I)-associated immunopeptidome of 19 primary AML samples and identified 58 tumor-specific antigens (TSAs). These TSAs bore no mutations and derived mainly (86%) from supposedly non-coding genomic regions. Two AML-specific aberrations were instrumental in the biogenesis of TSAs, intron retention, and epigenetic changes. Indeed, 48% of TSAs resulted from intron retention and translation, and their RNA expression correlated with mutations of epigenetic modifiers (e.g., DNMT3A). AML TSA-coding transcripts were highly shared among patients and were expressed in both blasts and leukemic stem cells. In AML patients, the predicted number of TSAs correlated with spontaneous expansion of cognate T cell receptor clonotypes, accumulation of activated cytotoxic T cells, immunoediting, and improved survival. These TSAs represent attractive targets for AML immunotherapy.


Assuntos
Epitopos/genética , Antígenos de Histocompatibilidade Classe I/genética , Leucemia Mieloide Aguda/genética , Animais , Antígenos de Neoplasias/genética , Antígenos de Neoplasias/imunologia , Linhagem Celular , Epigênese Genética/genética , Epigênese Genética/imunologia , Epitopos/imunologia , Antígenos de Histocompatibilidade Classe I/imunologia , Humanos , Imunoterapia/métodos , Leucemia Mieloide Aguda/imunologia , Camundongos , Camundongos Endogâmicos NOD , Camundongos SCID , Mutação/genética , Mutação/imunologia , Células-Tronco Neoplásicas/imunologia , Receptores de Antígenos de Linfócitos T/genética , Receptores de Antígenos de Linfócitos T/imunologia , Linfócitos T Citotóxicos/imunologia
14.
Cell Rep ; 34(10): 108815, 2021 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-33691108

RESUMO

Combining RNA sequencing, ribosome profiling, and mass spectrometry, we elucidate the contribution of non-canonical translation to the proteome and major histocompatibility complex (MHC) class I immunopeptidome. Remarkably, of 14,498 proteins identified in three human B cell lymphomas, 2,503 are non-canonical proteins. Of these, 28% are novel isoforms and 72% are cryptic proteins encoded by ostensibly non-coding regions (60%) or frameshifted canonical genes (12%). Cryptic proteins are translated as efficiently as canonical proteins, have more predicted disordered residues and lower stability, and critically generate MHC-I peptides 5-fold more efficiently per translation event. Translating 5' "untranslated" regions hinders downstream translation of genes involved in transcription, translation, and antiviral responses. Novel protein isoforms show strong enrichment for signaling pathways deregulated in cancer. Only a small fraction of cryptic proteins detected in the proteome contribute to the MHC-I immunopeptidome, demonstrating the high preferential access of cryptic defective ribosomal products to the class I pathway.


Assuntos
Proteoma/metabolismo , Linhagem Celular Tumoral , Cromatografia Líquida de Alta Pressão , Antígenos de Histocompatibilidade Classe I/genética , Antígenos de Histocompatibilidade Classe I/metabolismo , Humanos , Linfoma de Células B/metabolismo , Linfoma de Células B/patologia , Fases de Leitura Aberta/genética , Isoformas de Proteínas/metabolismo , Proteoma/análise , Ribossomos/metabolismo , Análise de Sequência de RNA , Transdução de Sinais/genética , Espectrometria de Massas em Tandem
15.
Transplant Cell Ther ; 27(1): 76.e1-76.e9, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-33022376

RESUMO

Rapid T cell reconstitution following hematopoietic stem cell transplantation (HSCT) is essential for protection against infections and has been associated with lower incidence of chronic graft-versus-host disease (cGVHD), relapse, and transplant-related mortality (TRM). While cord blood (CB) transplants are associated with lower rates of cGVHD and relapse, their low stem cell content results in slower immune reconstitution and higher risk of graft failure, severe infections, and TRM. Recently, results of a phase I/II trial revealed that single UM171-expanded CB transplant allowed the use of smaller CB units without compromising engraftment (www.clinicaltrials.gov, NCT02668315). We assessed T cell reconstitution in patients who underwent transplantation with UM171-expanded CB grafts and retrospectively compared it to that of patients receiving unmanipulated CB transplants. While median T cell dose infused was at least 2 to 3 times lower than that of unmanipulated CB, numbers and phenotype of T cells at 3, 6, and 12 months post-transplant were similar between the 2 cohorts. T cell receptor sequencing analyses revealed that UM171 patients had greater T cell diversity and higher numbers of clonotypes at 12 months post-transplant. This was associated with higher counts of naive T cells and recent thymic emigrants, suggesting active thymopoiesis and correlating with the demonstration that UM171 expands common lymphoid progenitors in vitro. UM171 patients also showed rapid virus-specific T cell reactivity and significantly reduced incidence of severe infections. These results suggest that UM171 patients benefit from rapid T cell reconstitution, which likely contributes to the absence of moderate/severe cGVHD, infection-related mortality, and late TRM observed in this cohort.


Assuntos
Transplante de Células-Tronco de Sangue do Cordão Umbilical , Doença Enxerto-Hospedeiro , Transplante de Células-Tronco de Sangue do Cordão Umbilical/efeitos adversos , Sangue Fetal , Humanos , Estudos Retrospectivos , Linfócitos T
16.
J Cell Biol ; 219(11)2020 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-32960945

RESUMO

Proteins of the ezrin, radixin, and moesin (ERM) family control cell and tissue morphogenesis. We previously reported that moesin, the only ERM in Drosophila, controls mitotic morphogenesis and epithelial integrity. We also found that the Pp1-87B phosphatase dephosphorylates moesin, counteracting its activation by the Ste20-like kinase Slik. To understand how this signaling pathway is itself regulated, we conducted a genome-wide RNAi screen, looking for new regulators of moesin activity. We identified that Slik is a new member of the striatin-interacting phosphatase and kinase complex (STRIPAK). We discovered that the phosphatase activity of STRIPAK reduces Slik phosphorylation to promote its cortical association and proper activation of moesin. Consistent with this finding, inhibition of STRIPAK phosphatase activity causes cell morphology defects in mitosis and impairs epithelial tissue integrity. Our results implicate the Slik-STRIPAK complex in the control of multiple morphogenetic processes.


Assuntos
Proteínas de Drosophila/metabolismo , Drosophila melanogaster/crescimento & desenvolvimento , Células Epiteliais/fisiologia , Mitose , Morfogênese , Proteínas Serina-Treonina Quinases/metabolismo , RNA Interferente Pequeno/genética , Animais , Proteínas de Drosophila/antagonistas & inibidores , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Células Epiteliais/citologia , Ensaios de Triagem em Larga Escala , Proteínas dos Microfilamentos/genética , Proteínas dos Microfilamentos/metabolismo , Complexos Multiproteicos/metabolismo , Fosforilação , Fosfotransferases/genética , Fosfotransferases/metabolismo , Proteínas Serina-Treonina Quinases/genética
17.
Bioinformatics ; 36(Suppl_1): i417-i426, 2020 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-32657403

RESUMO

MOTIVATION: The recent development of sequencing technologies revolutionized our understanding of the inner workings of the cell as well as the way disease is treated. A single RNA sequencing (RNA-Seq) experiment, however, measures tens of thousands of parameters simultaneously. While the results are information rich, data analysis provides a challenge. Dimensionality reduction methods help with this task by extracting patterns from the data by compressing it into compact vector representations. RESULTS: We present the factorized embeddings (FE) model, a self-supervised deep learning algorithm that learns simultaneously, by tensor factorization, gene and sample representation spaces. We ran the model on RNA-Seq data from two large-scale cohorts and observed that the sample representation captures information on single gene and global gene expression patterns. Moreover, we found that the gene representation space was organized such that tissue-specific genes, highly correlated genes as well as genes participating in the same GO terms were grouped. Finally, we compared the vector representation of samples learned by the FE model to other similar models on 49 regression tasks. We report that the representations trained with FE rank first or second in all of the tasks, surpassing, sometimes by a considerable margin, other representations. AVAILABILITY AND IMPLEMENTATION: A toy example in the form of a Jupyter Notebook as well as the code and trained embeddings for this project can be found at: https://github.com/TrofimovAssya/FactorizedEmbeddings. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , RNA , Análise de Sequência de RNA
18.
Blood ; 135(21): 1882-1886, 2020 05 21.
Artigo em Inglês | MEDLINE | ID: mdl-32315381

RESUMO

RUNX1 is mutated in ∼10% of adult acute myeloid leukemia (AML). Although most RUNX1 mutations in this disease are believed to be acquired, they can also be germline. Indeed, germline RUNX1 mutations result in the well-described autosomal-dominant familial platelet disorder with predisposition to hematologic malignancies (RUNX1-FPD, FPD/AML, FPDMM); ∼44% of affected individuals progress to AML or myelodysplastic syndromes. Using the Leucegene RUNX1 AML patient group, we sought to investigate the proportion of germline vs acquired RUNX1 mutations in this cohort. Our results showed that 30% of RUNX1 mutations in our AML cohort are germline. Molecular profiling revealed higher frequencies of NRAS mutations and other mutations known to activate various signaling pathways in these patients with RUNX1 germline-mutated AML. Moreover, 2 patients (mother and son) had co-occurrence of RUNX1 and CEBPA germline mutations, with variable AML disease onset at 59 and 27 years, respectively. Together, these data suggest a higher than anticipated frequency of germline RUNX1 mutations in the Leucegene cohort and further highlight the importance of testing for RUNX1 mutations in instances in which allogeneic stem cell transplantation using a related donor is envisioned.


Assuntos
Biomarcadores Tumorais/genética , Proteínas Estimuladoras de Ligação a CCAAT/genética , Subunidade alfa 2 de Fator de Ligação ao Core/genética , Fator de Transcrição GATA2/genética , Mutação em Linhagem Germinativa , Leucemia Mieloide Aguda/genética , Mutação , Adulto , Idoso , Feminino , Seguimentos , Humanos , Leucemia Mieloide Aguda/patologia , Masculino , Pessoa de Meia-Idade , Prognóstico
19.
Genome Med ; 12(1): 40, 2020 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-32345368

RESUMO

BACKGROUND: Endogenous retroelements (EREs) constitute about 42% of the human genome and have been implicated in common human diseases such as autoimmunity and cancer. The dominant paradigm holds that EREs are expressed in embryonic stem cells (ESCs) and germline cells but are repressed in differentiated somatic cells. Despite evidence that some EREs can be expressed at the RNA and protein levels in specific contexts, a system-level evaluation of their expression in human tissues is lacking. METHODS: Using RNA sequencing data, we analyzed ERE expression in 32 human tissues and cell types, including medullary thymic epithelial cells (mTECs). A tissue specificity index was computed to identify tissue-restricted ERE families. We also analyzed the transcriptome of mTECs in wild-type and autoimmune regulator (AIRE)-deficient mice. Finally, we developed a proteogenomic workflow combining RNA sequencing and mass spectrometry (MS) in order to evaluate whether EREs might be translated and generate MHC I-associated peptides (MAP) in B-lymphoblastoid cell lines (B-LCL) from 16 individuals. RESULTS: We report that all human tissues express EREs, but the breadth and magnitude of ERE expression are very heterogeneous from one tissue to another. ERE expression was particularly high in two MHC I-deficient tissues (ESCs and testis) and one MHC I-expressing tissue, mTECs. In mutant mice, we report that the exceptional expression of EREs in mTECs was AIRE-independent. MS analyses identified 103 non-redundant ERE-derived MAPs (ereMAPs) in B-LCLs. These ereMAPs preferentially derived from sense translation of intronic EREs. Notably, detailed analyses of their amino acid composition revealed that ERE-derived MAPs presented homology to viral MAPs. CONCLUSIONS: This study shows that ERE expression in somatic tissues is more pervasive and heterogeneous than anticipated. The high and diversified expression of EREs in mTECs and their ability to generate MAPs suggest that EREs may play an important role in the establishment of self-tolerance. The viral-like properties of ERE-derived MAPs suggest that those not expressed in mTECs can be highly immunogenic.


Assuntos
Retroelementos , Sequência de Aminoácidos , Animais , Linfócitos T CD8-Positivos/efeitos dos fármacos , Citocinas/farmacologia , Células Dendríticas , Células Epiteliais/metabolismo , Humanos , Espectrometria de Massas , Camundongos Knockout , Análise de Sequência de RNA , Timo/citologia , Fatores de Transcrição/genética , Proteína AIRE
20.
J Proteome Res ; 19(4): 1873-1881, 2020 04 03.
Artigo em Inglês | MEDLINE | ID: mdl-32108478

RESUMO

The immunopeptidome corresponds to the repertoire of peptides presented at the cell surface by the major histocompatibility complex (MHC) molecules. Cytotoxic T cells scan this repertoire to identify nonself antigens that can arise from tumors or infected cells. The identification of actionable antigenic targets is key to the development of therapeutic cancer vaccines, T-cell therapy, and other T-cell receptor-based biologics. The growing clinical interest for immunopeptidomics has accelerated the development of high throughput proteogenomic platforms that provide a system-level analysis of MHC-associated peptides. Improvement in sensitivity and throughput of mass spectrometers now allows the detection of a few thousands of peptides from less than 100 million cells. To manage the amount of data generated by these instruments, we have developed the MHC-associated peptide discovery platform (MAPDP), a novel open-source cloud-based computational platform for immunopeptidomic analyses. It provides convenient access from a web portal to immunopeptidomes stored in the database, filtering tools, various visualizations, annotations (e.g., IEDB, dbSNP, gnomAD), peptide-binding affinity prediction (mhcflurry, NetMHC), HLA genotyping, and the generation of personalized proteome databases. MAPDP functionalities are demonstrated here by the discovery of MHC peptides featuring new genetic variants identified in two previously published ovarian carcinoma data sets.


Assuntos
Computação em Nuvem , Neoplasias , Humanos , Espectrometria de Massas , Peptídeos , Proteoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...