Búsqueda | Portal de Búsqueda de la BVS Ecuador

1.

An aptamer-mediated base editing platform for simultaneous knock-in and multiple gene knockout for allogeneic CAR-T cells generation.

Porreca, Immacolata; Blassberg, Robert; Harbottle, Jennifer; Joubert, Bronwyn; Mielczarek, Olga; Stombaugh, Jesse; Hemphill, Kevin; Sumner, Jonathan; Pazeraitis, Deividas; Touza, Julia Liz; Francescatto, Margherita; Firth, Mike; Selmi, Tommaso; Collantes, Juan Carlos; Strezoska, Zaklina; Taylor, Benjamin; Jin, Shengkan; Wiggins, Ceri M; van Brabant Smith, Anja; Lambourne, John J.

Mol Ther ; 2024 Jun 26.

Artículo en Inglés | MEDLINE | ID: mdl-38937969

RESUMEN

Gene editing technologies hold promise for enabling the next generation of adoptive cellular therapies. Conventional gene editing platforms that rely on nuclease activity, such as Clustered regularly interspaced short palindromic repeats-CRISPR associated protein 9 (CRISPR-Cas9), allow efficient introduction of genetic modifications; however, these modifications occur via the generation of DNA double-strand breaks (DSBs) and can lead to unwanted genomic alterations and genotoxicity. Here, we apply a novel modular RNA aptamer-mediated Pin-point™ base editing platform to simultaneously introduce multiple gene knockouts and site-specific integration of a transgene in human primary T cells. We demonstrate high editing efficiency and purity at all target sites and significantly reduced frequency of chromosomal translocations compared to the conventional CRISPR-Cas9 system. Site-specific knock-in of a chimeric antigen receptor (CAR) and multiplex gene knockout are achieved within a single intervention and without the requirement for additional sequence-targeting components. The ability to perform complex genome editing efficiently and precisely highlights the potential of the Pin-point platform for application in a range of advanced cell therapies.

2.

Evaluating reproducibility of AI algorithms in digital pathology with DAPPER.

Bizzego, Andrea; Bussola, Nicole; Chierici, Marco; Maggio, Valerio; Francescatto, Margherita; Cima, Luca; Cristoforetti, Marco; Jurman, Giuseppe; Furlanello, Cesare.

PLoS Comput Biol ; 15(3): e1006269, 2019 03.

Artículo en Inglés | MEDLINE | ID: mdl-30917113

RESUMEN

Artificial Intelligence is exponentially increasing its impact on healthcare. As deep learning is mastering computer vision tasks, its application to digital pathology is natural, with the promise of aiding in routine reporting and standardizing results across trials. Deep learning features inferred from digital pathology scans can improve validity and robustness of current clinico-pathological features, up to identifying novel histological patterns, e.g., from tumor infiltrating lymphocytes. In this study, we examine the issue of evaluating accuracy of predictive models from deep learning features in digital pathology, as an hallmark of reproducibility. We introduce the DAPPER framework for validation based on a rigorous Data Analysis Plan derived from the FDA's MAQC project, designed to analyze causes of variability in predictive biomarkers. We apply the framework on models that identify tissue of origin on 787 Whole Slide Images from the Genotype-Tissue Expression (GTEx) project. We test three different deep learning architectures (VGG, ResNet, Inception) as feature extractors and three classifiers (a fully connected multilayer, Support Vector Machine and Random Forests) and work with four datasets (5, 10, 20 or 30 classes), for a total of 53, 000 tiles at 512 × 512 resolution. We analyze accuracy and feature stability of the machine learning classifiers, also demonstrating the need for diagnostic tests (e.g., random labels) to identify selection bias and risks for reproducibility. Further, we use the deep features from the VGG model from GTEx on the KIMIA24 dataset for identification of slide of origin (24 classes) to train a classifier on 1, 060 annotated tiles and validated on 265 unseen ones. The DAPPER software, including its deep learning pipeline and the Histological Imaging-Newsy Tiles (HINT) benchmark dataset derived from GTEx, is released as a basis for standardization and validation initiatives in AI for digital pathology.

Asunto(s)

Algoritmos , Inteligencia Artificial , Técnicas Histológicas/métodos , Interpretación de Imagen Asistida por Computador/métodos , Programas Informáticos , Humanos , Reproducibilidad de los Resultados

3.

A promoter-level mammalian expression atlas.

Forrest, Alistair R R; Kawaji, Hideya; Rehli, Michael; Baillie, J Kenneth; de Hoon, Michiel J L; Haberle, Vanja; Lassmann, Timo; Kulakovskiy, Ivan V; Lizio, Marina; Itoh, Masayoshi; Andersson, Robin; Mungall, Christopher J; Meehan, Terrence F; Schmeier, Sebastian; Bertin, Nicolas; Jørgensen, Mette; Dimont, Emmanuel; Arner, Erik; Schmidl, Christian; Schaefer, Ulf; Medvedeva, Yulia A; Plessy, Charles; Vitezic, Morana; Severin, Jessica; Semple, Colin A; Ishizu, Yuri; Young, Robert S; Francescatto, Margherita; Alam, Intikhab; Albanese, Davide; Altschuler, Gabriel M; Arakawa, Takahiro; Archer, John A C; Arner, Peter; Babina, Magda; Rennie, Sarah; Balwierz, Piotr J; Beckhouse, Anthony G; Pradhan-Bhatt, Swati; Blake, Judith A; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Burroughs, A Maxwell; Califano, Andrea; Cannistraci, Carlo V; Carbajo, Daniel; Chen, Yun.

Nature ; 507(7493): 462-70, 2014 Mar 27.

Artículo en Inglés | MEDLINE | ID: mdl-24670764

RESUMEN

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.

Asunto(s)

Atlas como Asunto , Anotación de Secuencia Molecular , Regiones Promotoras Genéticas/genética , Transcriptoma/genética , Animales , Línea Celular , Células Cultivadas , Análisis por Conglomerados , Secuencia Conservada/genética , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Genes Esenciales/genética , Genoma/genética , Humanos , Ratones , Sistemas de Lectura Abierta/genética , Especificidad de Órganos , ARN Mensajero/análisis , ARN Mensajero/genética , Factores de Transcripción/metabolismo , Sitio de Iniciación de la Transcripción , Transcripción Genética/genética

4.

The ISCB Student Council Internship Program: Expanding computational biology capacity worldwide.

Anupama, Jigisha; Francescatto, Margherita; Rahman, Farzana; Fatima, Nazeefa; DeBlasio, Dan; Shanmugam, Avinash Kumar; Satagopam, Venkata; Santos, Alberto; Kolekar, Pandurang; Michaut, Magali; Guney, Emre.

PLoS Comput Biol ; 14(1): e1005802, 2018 01.

Artículo en Inglés | MEDLINE | ID: mdl-29346365

RESUMEN

Education and training are two essential ingredients for a successful career. On one hand, universities provide students a curriculum for specializing in one's field of study, and on the other, internships complement coursework and provide invaluable training experience for a fruitful career. Consequently, undergraduates and graduates are encouraged to undertake an internship during the course of their degree. The opportunity to explore one's research interests in the early stages of their education is important for students because it improves their skill set and gives their career a boost. In the long term, this helps to close the gap between skills and employability among students across the globe and balance the research capacity in the field of computational biology. However, training opportunities are often scarce for computational biology students, particularly for those who reside in less-privileged regions. Aimed at helping students develop research and academic skills in computational biology and alleviating the divide across countries, the Student Council of the International Society for Computational Biology introduced its Internship Program in 2009. The Internship Program is committed to providing access to computational biology training, especially for students from developing regions, and improving competencies in the field. Here, we present how the Internship Program works and the impact of the internship opportunities so far, along with the challenges associated with this program.

Asunto(s)

Biología Computacional/educación , Internado y Residencia , Algoritmos , Australia , Curriculum , Países en Desarrollo , Europa (Continente) , Geografía , Humanos , Desarrollo de Programa , Estudiantes , Universidades

5.

Highlights from the Third International Society for Computational Biology (ISCB) European Student Council Symposium 2014.

Francescatto, Margherita; Hermans, Susanne M A; Babaei, Sepideh; Vicedo, Esmeralda; Borrel, Alexandre; Meysman, Pieter.

BMC Bioinformatics ; 16 Suppl 3: A1-9, 2015.

Artículo en Inglés | MEDLINE | ID: mdl-25708611

RESUMEN

In this meeting report, we give an overview of the talks, presentations and posters presented at the third European Symposium of the International Society for Computational Biology (ISCB) Student Council. The event was organized as a satellite meeting of the 13th European Conference for Computational Biology (ECCB) and took place in Strasbourg, France on September 6th, 2014.

Asunto(s)

Biología Computacional , Distinciones y Premios , Bases de Datos Factuales , Redes Reguladoras de Genes , Modelos Estadísticos , Revisión de la Investigación por Pares

6.

A multi-omics dataset for the analysis of frontotemporal dementia genetic subtypes.

Menden, Kevin; Francescatto, Margherita; Nyima, Tenzin; Blauwendraat, Cornelis; Dhingra, Ashutosh; Castillo-Lizardo, Melissa; Fernandes, Noémia; Kaurani, Lalit; Kronenberg-Versteeg, Deborah; Atasu, Burcu; Sadikoglou, Eldem; Borroni, Barbara; Rodriguez-Nieto, Salvador; Simon-Sanchez, Javier; Fischer, Andre; Craig, David Wesley; Neumann, Manuela; Bonn, Stefan; Rizzu, Patrizia; Heutink, Peter.

Sci Data ; 10(1): 849, 2023 12 01.

Artículo en Inglés | MEDLINE | ID: mdl-38040703

RESUMEN

Understanding the molecular mechanisms underlying frontotemporal dementia (FTD) is essential for the development of successful therapies. Systematic studies on human post-mortem brain tissue of patients with genetic subtypes of FTD are currently lacking. The Risk and Modyfing Factors of Frontotemporal Dementia (RiMod-FTD) consortium therefore has generated a multi-omics dataset for genetic subtypes of FTD to identify common and distinct molecular mechanisms disturbed in disease. Here, we present multi-omics datasets generated from the frontal lobe of post-mortem human brain tissue from patients with mutations in MAPT, GRN and C9orf72 and healthy controls. This data resource consists of four datasets generated with different technologies to capture the transcriptome by RNA-seq, small RNA-seq, CAGE-seq, and methylation profiling. We show concrete examples on how to use the resulting data and confirm current knowledge about FTD and identify new processes for further investigation. This extensive multi-omics dataset holds great value to reveal new research avenues for this devastating disease.

Asunto(s)

Demencia Frontotemporal , Multiómica , Humanos , Lóbulo Frontal , Demencia Frontotemporal/genética , Mutación

7.

A Type II-B Cas9 nuclease with minimized off-targets and reduced chromosomal translocations in vivo.

Bestas, Burcu; Wimberger, Sandra; Degtev, Dmitrii; Madsen, Alexandra; Rottner, Antje K; Karlsson, Fredrik; Naumenko, Sergey; Callahan, Megan; Touza, Julia Liz; Francescatto, Margherita; Möller, Carl Ivar; Badertscher, Lukas; Li, Songyuan; Cerboni, Silvia; Selfjord, Niklas; Ericson, Elke; Gordon, Euan; Firth, Mike; Chylinski, Krzysztof; Taheri-Ghahfarokhi, Amir; Bohlooly-Y, Mohammad; Snowden, Mike; Pangalos, Menelaos; Nuttall, Barrett; Akcakaya, Pinar; Sienski, Grzegorz; Maresca, Marcello.

Nat Commun ; 14(1): 5474, 2023 09 06.

Artículo en Inglés | MEDLINE | ID: mdl-37673883

RESUMEN

Streptococcus pyogenes Cas9 (SpCas9) and derived enzymes are widely used as genome editors, but their promiscuous nuclease activity often induces undesired mutations and chromosomal rearrangements. Several strategies for mapping off-target effects have emerged, but they suffer from limited sensitivity. To increase the detection sensitivity, we develop an off-target assessment workflow that uses Duplex Sequencing. The strategy increases sensitivity by one order of magnitude, identifying previously unknown SpCas9's off-target mutations in the humanized PCSK9 mouse model. To reduce off-target risks, we perform a bioinformatic search and identify a high-fidelity Cas9 variant of the II-B subfamily from Parasutterella secunda (PsCas9). PsCas9 shows improved specificity as compared to SpCas9 across multiple tested sites, both in vitro and in vivo, including the PCSK9 site. In the future, while PsCas9 will offer an alternative to SpCas9 for research and clinical use, the Duplex Sequencing workflow will enable a more sensitive assessment of Cas9 editing outcomes.

Asunto(s)

Proproteína Convertasa 9 , Translocación Genética , Animales , Ratones , Proproteína Convertasa 9/genética , Sistemas CRISPR-Cas/genética , Mutación , Endonucleasas/genética , Streptococcus pyogenes/genética

8.

The Role of Knockout Olfactory Receptor Genes in Odor Discrimination.

Concas, Maria Pina; Cocca, Massimiliano; Francescatto, Margherita; Battistuzzi, Thomas; Spedicati, Beatrice; Feresin, Agnese; Morgan, Anna; Gasparini, Paolo; Girotto, Giorgia.

Genes (Basel) ; 12(5)2021 04 23.

Artículo en Inglés | MEDLINE | ID: mdl-33922566

RESUMEN

To date, little is known about the role of olfactory receptor (OR) genes on smell performance. Thanks to the availability of whole-genome sequencing data of 802 samples, we identified 41 knockout (KO) OR genes (i.e., carriers of Loss of Function variants) and evaluated their effect on odor discrimination in 218 Italian individuals through recursive partitioning analysis. Furthermore, we checked the expression of these genes in human and mouse tissues using publicly available data and the presence of organ-related diseases in human KO (HKO) individuals for OR expressed in non-olfactory tissues (Fisher test). The recursive partitioning analysis showed that age and the high number (burden) of OR-KO genes impact the worsening of odor discrimination (p-value < 0.05). Human expression data showed that 33/41 OR genes are expressed in the olfactory system (OS) and 27 in other tissues. Sixty putative mouse homologs of the 41 humans ORs have been identified, 58 of which are expressed in the OS and 37 in other tissues. No association between OR-KO individuals and pathologies has been detected. In conclusion, our work highlights the role of the burden of OR-KO genes in worse odor discrimination.

Asunto(s)

Percepción Olfatoria , Receptores Odorantes/genética , Olfato , Anciano , Animales , Femenino , Humanos , Mutación con Pérdida de Función , Masculino , Ratones , Persona de Mediana Edad , Receptores Odorantes/metabolismo

9.

Natural human knockouts and Mendelian disorders: deep phenotyping in Italian isolates.

Spedicati, Beatrice; Cocca, Massimiliano; Palmisano, Roberto; Faletra, Flavio; Barbieri, Caterina; Francescatto, Margherita; Mezzavilla, Massimo; Morgan, Anna; Pelliccione, Giulia; Gasparini, Paolo; Girotto, Giorgia.

Eur J Hum Genet ; 29(8): 1272-1281, 2021 08.

Artículo en Inglés | MEDLINE | ID: mdl-33727708

RESUMEN

Whole genome sequencing (WGS) allows the identification of human knockouts (HKOs), individuals in whom loss of function (LoF) variants disrupt both alleles of a given gene. HKOs are a valuable model for understanding the consequences of genes function loss. Naturally occurring biallelic LoF variants tend to be significantly enriched in "genetic isolates," making these populations specifically suited for HKO studies. In this work, a meticulous WGS data analysis combined with an in-depth phenotypic assessment of 947 individuals from three Italian genetic isolates led to the identification of ten biallelic LoF variants in ten OMIM genes associated with known autosomal recessive diseases. Notably, only a minority of the identified HKOs (C7, F12, and GPR68 genes) displayed the expected phenotype. For most of the genes, instead, (ACADSB, FANCL, GRK1, LGI4, MPO, PGAM2, and RP1L1), the carriers showed none or few of the signs and symptoms typically associated with the related diseases. Of particular interest is a case presenting with a FANCL biallelic LoF variant and a positive diepoxybutane test but lacking a full Fanconi anemia phenotypic spectrum. Identifying KO subjects displaying expected phenotypes suggests that the lack of correct genetic diagnoses may lead to inappropriate and delayed treatment. In contrast, the presence of HKOs with phenotypes deviating from the expected patterns underlines how LoF variants may be responsible for broader phenotypic spectra. Overall, these results highlight the importance of in-depth phenotypical characterization to understand the role of LoF variants and the advantage of studying these variants in genetic isolates.

Asunto(s)

Frecuencia de los Genes , Enfermedades Genéticas Congénitas/genética , Mutación con Pérdida de Función , Población/genética , Humanos , Italia , Aislamiento Reproductivo , Secuenciación Completa del Genoma/estadística & datos numéricos

10.

Hearing Function: Identification of New Candidate Genes Further Explaining the Complexity of This Sensory Ability.

Concas, Maria Pina; Morgan, Anna; Serra, Fabrizio; Nagtegaal, Andries Paul; Oosterloo, Berthe C; Seshadri, Sudha; Heard-Costa, Nancy; Van Camp, Guy; Fransen, Erik; Francescatto, Margherita; Logroscino, Giancarlo; Sardone, Rodolfo; Quaranta, Nicola; Gasparini, Paolo; Girotto, Giorgia.

Genes (Basel) ; 12(8)2021 08 10.

Artículo en Inglés | MEDLINE | ID: mdl-34440402

RESUMEN

To date, the knowledge of the genetic determinants behind the modulation of hearing ability is relatively limited. To investigate this trait, we performed Genome-Wide Association Study (GWAS) meta-analysis using genotype and audiometric data (hearing thresholds at 0.25, 0.5, 1, 2, 4, and 8 kHz, and pure-tone averages of thresholds at low, medium, and high frequencies) collected in nine cohorts from Europe, South-Eastern USA, Caucasus, and Central Asia, for an overall number of ~9000 subjects. Three hundred seventy-five genes across all nine analyses were tagged by single nucleotide polymorphisms (SNPs) reaching a suggestive p-value (p < 10-5). Amongst these, 15 were successfully replicated using a gene-based approach in the independent Italian Salus in the Apulia cohort (n = 1774) at the nominal significance threshold (p < 0.05). In addition, the expression level of the replicated genes was assessed in published human and mouse inner ear datasets. Considering expression patterns in humans and mice, eleven genes were considered particularly promising candidates for the hearing function: BNIP3L, ELP5, MAP3K20, MATN2, MTMR7, MYO1E, PCNT, R3HDM1, SLC9A9, TGFB2, and YTHDC2. These findings represent a further contribution to our understanding of the genetic basis of hearing function and its related diseases.

Asunto(s)

Audición/genética , Animales , Estudios de Cohortes , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Italia , Masculino , Ratones , Umbral Sensorial

11.

A verified genomic reference sample for assessing performance of cancer panels detecting small variants of low allele frequency.

Jones, Wendell; Gong, Binsheng; Novoradovskaya, Natalia; Li, Dan; Kusko, Rebecca; Richmond, Todd A; Johann, Donald J; Bisgin, Halil; Sahraeian, Sayed Mohammad Ebrahim; Bushel, Pierre R; Pirooznia, Mehdi; Wilkins, Katherine; Chierici, Marco; Bao, Wenjun; Basehore, Lee Scott; Lucas, Anne Bergstrom; Burgess, Daniel; Butler, Daniel J; Cawley, Simon; Chang, Chia-Jung; Chen, Guangchun; Chen, Tao; Chen, Yun-Ching; Craig, Daniel J; Del Pozo, Angela; Foox, Jonathan; Francescatto, Margherita; Fu, Yutao; Furlanello, Cesare; Giorda, Kristina; Grist, Kira P; Guan, Meijian; Hao, Yingyi; Happe, Scott; Hariani, Gunjan; Haseley, Nathan; Jasper, Jeff; Jurman, Giuseppe; Kreil, David Philip; Labaj, Pawel; Lai, Kevin; Li, Jianying; Li, Quan-Zhen; Li, Yulong; Li, Zhiguang; Liu, Zhichao; López, Mario Solís; Miclaus, Kelci; Miller, Raymond; Mittal, Vinay K.

Genome Biol ; 22(1): 111, 2021 04 16.

Artículo en Inglés | MEDLINE | ID: mdl-33863366

RESUMEN

BACKGROUND: Oncopanel genomic testing, which identifies important somatic variants, is increasingly common in medical practice and especially in clinical trials. Currently, there is a paucity of reliable genomic reference samples having a suitably large number of pre-identified variants for properly assessing oncopanel assay analytical quality and performance. The FDA-led Sequencing and Quality Control Phase 2 (SEQC2) consortium analyze ten diverse cancer cell lines individually and their pool, termed Sample A, to develop a reference sample with suitably large numbers of coding positions with known (variant) positives and negatives for properly evaluating oncopanel analytical performance. RESULTS: In reference Sample A, we identify more than 40,000 variants down to 1% allele frequency with more than 25,000 variants having less than 20% allele frequency with 1653 variants in COSMIC-related genes. This is 5-100× more than existing commercially available samples. We also identify an unprecedented number of negative positions in coding regions, allowing statistical rigor in assessing limit-of-detection, sensitivity, and precision. Over 300 loci are randomly selected and independently verified via droplet digital PCR with 100% concordance. Agilent normal reference Sample B can be admixed with Sample A to create new samples with a similar number of known variants at much lower allele frequency than what exists in Sample A natively, including known variants having allele frequency of 0.02%, a range suitable for assessing liquid biopsy panels. CONCLUSION: These new reference samples and their admixtures provide superior capability for performing oncopanel quality control, analytical accuracy, and validation for small to large oncopanels and liquid biopsy assays.

Asunto(s)

Alelos , Biomarcadores de Tumor , Frecuencia de los Genes , Pruebas Genéticas/métodos , Variación Genética , Genómica/métodos , Neoplasias/genética , Línea Celular Tumoral , Variaciones en el Número de Copia de ADN , Heterogeneidad Genética , Pruebas Genéticas/normas , Genómica/normas , Humanos , Neoplasias/diagnóstico , Flujo de Trabajo

12.

Predictability of drug-induced liver injury by machine learning.

Chierici, Marco; Francescatto, Margherita; Bussola, Nicole; Jurman, Giuseppe; Furlanello, Cesare.

Biol Direct ; 15(1): 3, 2020 02 13.

Artículo en Inglés | MEDLINE | ID: mdl-32054490

RESUMEN

BACKGROUND: Drug-induced liver injury (DILI) is a major concern in drug development, as hepatotoxicity may not be apparent at early stages but can lead to life threatening consequences. The ability to predict DILI from in vitro data would be a crucial advantage. In 2018, the Critical Assessment Massive Data Analysis group proposed the CMap Drug Safety challenge focusing on DILI prediction. METHODS AND RESULTS: The challenge data included Affymetrix GeneChip expression profiles for the two cancer cell lines MCF7 and PC3 treated with 276 drug compounds and empty vehicles. Binary DILI labeling and a recommended train/test split for the development of predictive classification approaches were also provided. We devised three deep learning architectures for DILI prediction on the challenge data and compared them to random forest and multi-layer perceptron classifiers. On a subset of the data and for some of the models we additionally tested several strategies for balancing the two DILI classes and to identify alternative informative train/test splits. All the models were trained with the MAQC data analysis protocol (DAP), i.e., 10x5 cross-validation over the training set. In all the experiments, the classification performance in both cross-validation and external validation gave Matthews correlation coefficient (MCC) values below 0.2. We observed minimal differences between the two cell lines. Notably, deep learning approaches did not give an advantage on the classification performance. DISCUSSION: We extensively tested multiple machine learning approaches for the DILI classification task obtaining poor to mediocre performance. The results suggest that the CMap expression data on the two cell lines MCF7 and PC3 are not sufficient for accurate DILI label prediction. REVIEWERS: This article was reviewed by Maciej Kandula and Pawel P. Labaj.

Asunto(s)

Enfermedad Hepática Inducida por Sustancias y Drogas/etiología , Aprendizaje Automático , Humanos , Modelos Biológicos , Medición de Riesgo/métodos

13.

Integrative Network Fusion: A Multi-Omics Approach in Molecular Profiling.

Chierici, Marco; Bussola, Nicole; Marcolini, Alessia; Francescatto, Margherita; Zandonà, Alessandro; Trastulla, Lucia; Agostinelli, Claudio; Jurman, Giuseppe; Furlanello, Cesare.

Front Oncol ; 10: 1065, 2020.

Artículo en Inglés | MEDLINE | ID: mdl-32714870

RESUMEN

Recent technological advances and international efforts, such as The Cancer Genome Atlas (TCGA), have made available several pan-cancer datasets encompassing multiple omics layers with detailed clinical information in large collection of samples. The need has thus arisen for the development of computational methods aimed at improving cancer subtyping and biomarker identification from multi-modal data. Here we apply the Integrative Network Fusion (INF) pipeline, which combines multiple omics layers exploiting Similarity Network Fusion (SNF) within a machine learning predictive framework. INF includes a feature ranking scheme (rSNF) on SNF-integrated features, used by a classifier over juxtaposed multi-omics features (juXT). In particular, we show instances of INF implementing Random Forest (RF) and linear Support Vector Machine (LSVM) as the classifier, and two baseline RF and LSVM models are also trained on juXT. A compact RF model, called rSNFi, trained on the intersection of top-ranked biomarkers from the two approaches juXT and rSNF is finally derived. All the classifiers are run in a 10x5-fold cross-validation schema to warrant reproducibility, following the guidelines for an unbiased Data Analysis Plan by the US FDA-led initiatives MAQC/SEQC. INF is demonstrated on four classification tasks on three multi-modal TCGA oncogenomics datasets. Gene expression, protein expression and copy number variants are used to predict estrogen receptor status (BRCA-ER, N = 381) and breast invasive carcinoma subtypes (BRCA-subtypes, N = 305), while gene expression, miRNA expression and methylation data is used as predictor layers for acute myeloid leukemia and renal clear cell carcinoma survival (AML-OS, N = 157; KIRC-OS, N = 181). In test, INF achieved similar Matthews Correlation Coefficient (MCC) values and 97% to 83% smaller feature sizes (FS), compared with juXT for BRCA-ER (MCC: 0.83 vs. 0.80; FS: 56 vs. 1801) and BRCA-subtypes (0.84 vs. 0.80; 302 vs. 1801), improving KIRC-OS performance (0.38 vs. 0.31; 111 vs. 2319). INF predictions are generally more accurate in test than one-dimensional omics models, with smaller signatures too, where transcriptomics consistently play the leading role. Overall, the INF framework effectively integrates multiple data levels in oncogenomics classification tasks, improving over the performance of single layers alone and naive juxtaposition, and provides compact signature sizes.

14.

Multi-omics integration for neuroblastoma clinical endpoint prediction.

Francescatto, Margherita; Chierici, Marco; Rezvan Dezfooli, Setareh; Zandonà, Alessandro; Jurman, Giuseppe; Furlanello, Cesare.

Biol Direct ; 13(1): 5, 2018 04 03.

Artículo en Inglés | MEDLINE | ID: mdl-29615097

RESUMEN

BACKGROUND: High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology, helping researchers and clinicians to develop personalized therapies. RESULTS: In the context of CAMDA 2017 Neuroblastoma Data Integration challenge, we explore the use of Integrative Network Fusion (INF), a bioinformatics framework combining a similarity network fusion with machine learning for the integration of multiple omics data. We apply the INF framework for the prediction of neuroblastoma patient outcome, integrating RNA-Seq, microarray and array comparative genomic hybridization data. We additionally explore the use of autoencoders as a method to integrate microarray expression and copy number data. CONCLUSIONS: The INF method is effective for the integration of multiple data sources providing compact feature signatures for patient classification with performances comparable to other methods. Latent space representation of the integrated data provided by the autoencoder approach gives promising results, both by improving classification on survival endpoints and by providing means to discover two groups of patients characterized by distinct overall survival (OS) curves. REVIEWERS: This article was reviewed by Djork-Arné Clevert and Tieliu Shi.

Asunto(s)

Genómica/métodos , Neuroblastoma/genética , Neuroblastoma/metabolismo , Animales , Biología Computacional , Perfilación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Neuroblastoma/patología

15.

Author Correction: Transcription start site profiling of 15 anatomical regions of the Macaca mulatta central nervous system.

Francescatto, Margherita; Lizio, Marina; Philippens, Ingrid; Pardo, Luba M; Bontrop, Ronald; Sakai, Mizuho; Watanabe, Shoko; Itoh, Masayoshi; Hasegawa, Akira; Lassmann, Timo; Severin, Jessica; Harshbarger, Jayson; Abugessaisa, Imad; Kasukawa, Takeya; Carninci, Piero; Hayashizaki, Yoshihide; Forrest, Alistair R R; Kawaji, Hideya; Rizzu, Patrizia; Heutink, Peter.

Sci Data ; 5(1): 2, 2018 12 11.

Artículo en Inglés | MEDLINE | ID: mdl-30538238

RESUMEN

The authors regret that Luba M. Pardo was omitted in error from the author list of the original version of this Data Descriptor. This omission has now been corrected in the HTML and PDF versions. The authors also regret that Anemieke Rozemuller was omitted in error from the Acknowledgements of the original version of this Data Descriptor. This omission has now been corrected in the HTML and PDF versions.

16.

Neuronal hemoglobin affects dopaminergic cells' response to stress.

Codrich, Marta; Bertuzzi, Maria; Russo, Roberta; Francescatto, Margherita; Espinoza, Stefano; Zentilin, Lorena; Giacca, Mauro; Cesselli, Daniela; Beltrami, Antonio Paolo; Ascenzi, Paolo; Zucchelli, Silvia; Persichetti, Francesca; Leanza, Giampiero; Gustincich, Stefano.

Cell Death Dis ; 8(1): e2538, 2017 01 05.

Artículo en Inglés | MEDLINE | ID: mdl-28055011

RESUMEN

Hemoglobin (Hb) is the major protein in erythrocytes and carries oxygen (O2) throughout the body. Recently, Hb has been found synthesized in atypical sites, including the brain. Hb is highly expressed in A9 dopaminergic (DA) neurons of the substantia nigra (SN), whose selective degeneration leads to Parkinson's disease (PD). Here we show that Hb confers DA cells' susceptibility to 1-methyl-4-phenylpyridinium (MPP+) and rotenone, neurochemical cellular models of PD. The toxic property of Hb does not depend on O2 binding and is associated with insoluble aggregate formation in the nucleolus. Neurochemical stress induces epigenetic modifications, nucleolar alterations and autophagy inhibition that depend on Hb expression. When adeno-associated viruses carrying α- and ß-chains of Hb are stereotaxically injected into mouse SN, Hb forms aggregates and causes motor learning impairment. These results position Hb as a potential player in DA cells' homeostasis and dysfunction in PD.

Asunto(s)

Neuronas Dopaminérgicas/metabolismo , Hemoglobinas/genética , Enfermedad de Parkinson Secundaria/genética , Enfermedad de Parkinson/genética , 1-Metil-4-fenilpiridinio/toxicidad , Animales , Autofagia/genética , Encéfalo/metabolismo , Encéfalo/patología , Neuronas Dopaminérgicas/patología , Epigénesis Genética/genética , Expresión Génica/efectos de los fármacos , Hemoglobinas/biosíntesis , Hemoglobinas/metabolismo , Humanos , Ratones , Enfermedad de Parkinson/metabolismo , Enfermedad de Parkinson/patología , Enfermedad de Parkinson Secundaria/patología , Rotenona/toxicidad , Sustancia Negra/metabolismo , Sustancia Negra/patología

17.

Transcription start site profiling of 15 anatomical regions of the Macaca mulatta central nervous system.

Francescatto, Margherita; Lizio, Marina; Philippens, Ingrid; Pardo, Luba M; Bontrop, Ronald; Sakai, Mizuho; Watanabe, Shoko; Itoh, Masayoshi; Hasegawa, Akira; Lassmann, Timo; Severin, Jessica; Harshbarger, Jayson; Abugessaisa, Imad; Kasukawa, Takeya; Carninci, Piero; Hayashizaki, Yoshihide; Forrest, Alistair R R; Kawaji, Hideya; Rizzu, Patrizia; Heutink, Peter.

Sci Data ; 4: 170163, 2017 10 31.

Artículo en Inglés | MEDLINE | ID: mdl-29087374

RESUMEN

Rhesus macaque was the second non-human primate whose genome has been fully sequenced and is one of the most used model organisms to study human biology and disease, thanks to the close evolutionary relationship between the two species. But compared to human, where several previously unknown RNAs have been uncovered, the macaque transcriptome is less studied. Publicly available RNA expression resources for macaque are limited, even for brain, which is highly relevant to study human cognitive abilities. In an effort to complement those resources, FANTOM5 profiled 15 distinct anatomical regions of the aged macaque central nervous system using Cap Analysis of Gene Expression, a high-resolution, annotation-independent technology that allows monitoring of transcription initiation events with high accuracy. We identified 25,869 CAGE peaks, representing bona fide promoters. For each peak we provide detailed annotation, expanding the landscape of 'known' macaque genes, and we show concrete examples on how to use the resulting data. We believe this data represents a useful resource to understand the central nervous system in macaque.

Asunto(s)

Sistema Nervioso Central , Macaca mulatta , Sitio de Iniciación de la Transcripción , Animales , Sistema Nervioso Central/anatomía & histología , Transcriptoma

18.

C9orf72 is differentially expressed in the central nervous system and myeloid cells and consistently reduced in C9orf72, MAPT and GRN mutation carriers.

Rizzu, Patrizia; Blauwendraat, Cornelis; Heetveld, Sasja; Lynes, Emily M; Castillo-Lizardo, Melissa; Dhingra, Ashutosh; Pyz, Elwira; Hobert, Markus; Synofzik, Matthis; Simón-Sánchez, Javier; Francescatto, Margherita; Heutink, Peter.

Acta Neuropathol Commun ; 4(1): 37, 2016 Apr 14.

Artículo en Inglés | MEDLINE | ID: mdl-27079381

RESUMEN

A non-coding hexanucleotide repeat expansion (HRE) in C9orf72 is a common cause of amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) acting through a loss of function mechanism due to haploinsufficiency of C9orf72 or a gain of function mediated by aggregates of bidirectionally transcribed HRE-RNAs translated into di-peptide repeat (DPR) proteins. To fully understand regulation of C9orf72 expression we surveyed the C9orf72 locus using Cap Analysis of Gene Expression sequence data (CAGEseq). We observed C9orf72 was generally lowly expressed with the exception of a subset of myeloid cells, particularly CD14+ monocytes that showed up to seven fold higher expression as compared to central nervous system (CNS) and other tissues. The expression profile at the C9orf72 locus showed a complex architecture with differential expression of the transcription start sites (TSSs) for the annotated C9orf72 transcripts between myeloid and CNS tissues suggesting cell and/or tissue specific functions. We further detected novel TSSs in both the sense and antisense strand at the C9orf72 locus and confirmed their existence in brain tissues and CD14+ monocytes. Interestingly, our experiments showed a consistent decrease of C9orf72 coding transcripts not only in brain tissue and monocytes from C9orf72-HRE patients, but also in brains from MAPT and GRN mutation carriers together with an increase in antisense transcripts suggesting these could play a role in regulation of C9orf72. We found that the non-HRE related expression changes cannot be explained by promoter methylation but by the presence of the C9orf72-HRE risk haplotype and unknown functional interactions between C9orf72, MAPT and GRN.

Asunto(s)

Sistema Nervioso Central/metabolismo , Regulación de la Expresión Génica , Péptidos y Proteínas de Señalización Intercelular/genética , Mutación/genética , Células Mieloides/metabolismo , Proteínas/genética , Proteínas/metabolismo , Proteínas tau/genética , Esclerosis Amiotrófica Lateral/metabolismo , Animales , Proteína C9orf72 , Bases de Datos Factuales/normas , Bases de Datos Factuales/estadística & datos numéricos , Demencia Frontotemporal/metabolismo , Humanos , Receptores de Lipopolisacáridos/metabolismo , Progranulinas

19.

Comprehensive promoter level expression quantitative trait loci analysis of the human frontal lobe.

Blauwendraat, Cornelis; Francescatto, Margherita; Gibbs, J Raphael; Jansen, Iris E; Simón-Sánchez, Javier; Hernandez, Dena G; Dillman, Allissa A; Singleton, Andrew B; Cookson, Mark R; Rizzu, Patrizia; Heutink, Peter.

Genome Med ; 8(1): 65, 2016 06 10.

Artículo en Inglés | MEDLINE | ID: mdl-27287230

RESUMEN

BACKGROUND: Expression quantitative trait loci (eQTL) analysis is a powerful method to detect correlations between gene expression and genomic variants and is widely used to interpret the biological mechanism underlying identified genome wide association studies (GWAS) risk loci. Numerous eQTL studies have been performed on different cell types and tissues of which the majority has been based on microarray technology. METHODS: We present here an eQTL analysis based on cap analysis gene expression sequencing (CAGEseq) data created from human postmortem frontal lobe tissue combined with genotypes obtained through genotyping arrays, exome sequencing, and CAGEseq. Using CAGEseq as an expression profiling technique combined with these different genotyping techniques allows measurement of the molecular effect of variants on individual transcription start sites and increases the resolution of eQTL analysis by also including the non-annotated parts of the genome. RESULTS: We identified 2410 eQTLs and show that non-coding transcripts are more likely to contain an eQTL than coding transcripts, in particular antisense transcripts. We provide evidence for how previously identified GWAS loci for schizophrenia (NRGN), Parkinson's disease, and Alzheimer's disease (PARK16 and MAPT loci) could increase the risk for disease at a molecular level. Furthermore, we demonstrate that CAGEseq improves eQTL analysis because variants obtained from CAGEseq are highly enriched for having a functional effect and thus are an efficient method towards the identification of causal variants. CONCLUSION: Our data contain both coding and non-coding transcripts and has the added value that we have identified eQTLs for variants directly adjacent to TSS. Future eQTL studies would benefit from combining CAGEseq with RNA sequencing for a more complete interpretation of the transcriptome and increased understanding of eQTL signals.

Asunto(s)

Lóbulo Frontal/química , Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Sitios de Carácter Cuantitativo , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Niño , Preescolar , Bases de Datos Genéticas , Femenino , Variación Genética , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Regiones Promotoras Genéticas , Adulto Joven

20.

DGEclust: differential expression analysis of clustered count data.

Vavoulis, Dimitrios V; Francescatto, Margherita; Heutink, Peter; Gough, Julian.

Genome Biol ; 16: 39, 2015 Feb 20.

Artículo en Inglés | MEDLINE | ID: mdl-25853652

RESUMEN

We present a statistical methodology, DGEclust, for differential expression analysis of digital expression data. Our method treats differential expression as a form of clustering, thus unifying these two concepts. Furthermore, it simultaneously addresses the problem of how many clusters are supported by the data and uncertainty in parameter estimation. DGEclust successfully identifies differentially expressed genes under a number of different scenarios, maintaining a low error rate and an excellent control of its false discovery rate with reasonable computational requirements. It is formulated to perform particularly well on low-replicated data and be applicable to multi-group data. DGEclust is available at http://dvav.github.io/dgeclust/.

Asunto(s)

Perfilación de la Expresión Génica/métodos , Expresión Génica/genética , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Algoritmos , Análisis por Conglomerados , Humanos , Programas Informáticos

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA