Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 40
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Respir Res ; 23(1): 308, 2022 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-36369209

RESUMO

Idiopathic pulmonary fibrosis, the archetype of pulmonary fibrosis (PF), is a chronic lung disease of a poor prognosis, characterized by progressively worsening of lung function. Although histology is still the gold standard for PF assessment in preclinical practice, histological data typically involve less than 1% of total lung volume and are not amenable to longitudinal studies. A miniaturized version of computed tomography (µCT) has been introduced to radiologically examine lung in preclinical murine models of PF. The linear relationship between X-ray attenuation and tissue density allows lung densitometry on total lung volume. However, the huge density changes caused by PF usually require manual segmentation by trained operators, limiting µCT deployment in preclinical routine. Deep learning approaches have achieved state-of-the-art performance in medical image segmentation. In this work, we propose a fully automated deep learning approach to segment right and left lung on µCT imaging and subsequently derive lung densitometry. Our pipeline first employs a convolutional network (CNN) for pre-processing at low-resolution and then a 2.5D CNN for higher-resolution segmentation, combining computational advantage of 2D and ability to address 3D spatial coherence without compromising accuracy. Finally, lungs are divided into compartments based on air content assessed by density. We validated this pipeline on 72 mice with different grades of PF, achieving a Dice score of 0.967 on test set. Our tests demonstrate that this automated tool allows for rapid and comprehensive analysis of µCT scans of PF murine models, thus laying the ground for its wider exploitation in preclinical settings.


Assuntos
Aprendizado Profundo , Fibrose Pulmonar , Animais , Camundongos , Fibrose Pulmonar/diagnóstico por imagem , Microtomografia por Raio-X , Modelos Animais de Doenças , Densitometria
2.
Neurol Sci ; 41(2): 459-462, 2020 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-31659583

RESUMO

Machine learning (ML) applied to patient-reported (PROs) and clinical-assessed outcomes (CAOs) could favour a more predictive and personalized medicine. Our aim was to confirm the important role of applying ML to PROs and CAOs of people with relapsing-remitting (RR) and secondary progressive (SP) form of multiple sclerosis (MS), to promptly identifying information useful to predict disease progression. For our analysis, a dataset of 3398 evaluations from 810 persons with MS (PwMS) was adopted. Three steps were provided: course classification; extraction of the most relevant predictors at the next time point; prediction if the patient will experience the transition from RR to SP at the next time point. The Current Course Assignment (CCA) step correctly assigned the current MS course with an accuracy of about 86.0%. The MS course at the next time point can be predicted using the predictors selected in CCA. PROs/CAOs Evolution Prediction (PEP) followed by Future Course Assignment (FCA) was able to foresee the course at the next time point with an accuracy of 82.6%. Our results suggest that PROs and CAOs could help the clinician decision-making in their practice.


Assuntos
Progressão da Doença , Aprendizado de Máquina , Esclerose Múltipla/terapia , Avaliação de Resultados em Cuidados de Saúde/métodos , Índice de Gravidade de Doença , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Esclerose Múltipla/diagnóstico , Medidas de Resultados Relatados pelo Paciente , Prognóstico , Estudo de Prova de Conceito
3.
Neural Comput ; 26(12): 2855-95, 2014 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-25248086

RESUMO

We present an algorithm for dictionary learning that is based on the alternating proximal algorithm studied by Attouch, Bolte, Redont, and Soubeyran (2010), coupled with a reliable and efficient dual algorithm for computation of the related proximity operators. This algorithm is suitable for a general dictionary learning model composed of a Bregman-type data fit term that accounts for the goodness of the representation and several convex penalization terms on the coefficients and atoms, explaining the prior knowledge at hand. As Attouch et al. recently proved, an alternating proximal scheme ensures better convergence properties than the simpler alternating minimization. We take care of the issue of inexactness in the computation of the involved proximity operators, giving a sound stopping criterion for the dual inner algorithm, which keeps under control the related errors, unavoidable for such a complex penalty terms, providing ultimately an overall effective procedure. Thanks to the generality of the proposed framework, we give an application in the context of genome-wide data understanding, revising the model proposed by Nowak, Hastie, Pollack, and Tibshirani (2011). The aim is to extract latent features (atoms) and perform segmentation on array-based comparative genomic hybridization (aCGH) data. We improve several important aspects that increase the quality and interpretability of the results. We show the effectiveness of the proposed model with two experiments on synthetic data, which highlight the enhancements over the original model.


Assuntos
Algoritmos , Inteligência Artificial , Aprendizagem/fisiologia , Modelos Teóricos , Hibridização Genômica Comparativa , Simulação por Computador , Bases de Dados Factuais , Humanos
4.
Front Comput Neurosci ; 18: 1360095, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39371524

RESUMO

Introduction: Machine Learning (ML) has emerged as a promising approach in healthcare, outperforming traditional statistical techniques. However, to establish ML as a reliable tool in clinical practice, adherence to best practices in data handling, and modeling design and assessment is crucial. In this work, we summarize and strictly adhere to such practices to ensure reproducible and reliable ML. Specifically, we focus on Alzheimer's Disease (AD) detection, a challenging problem in healthcare. Additionally, we investigate the impact of modeling choices, including different data augmentation techniques and model complexity, on overall performance. Methods: We utilize Magnetic Resonance Imaging (MRI) data from the ADNI corpus to address a binary classification problem using 3D Convolutional Neural Networks (CNNs). Data processing and modeling are specifically tailored to address data scarcity and minimize computational overhead. Within this framework, we train 15 predictive models, considering three different data augmentation strategies and five distinct 3D CNN architectures with varying convolutional layers counts. The augmentation strategies involve affine transformations, such as zoom, shift, and rotation, applied either concurrently or separately. Results: The combined effect of data augmentation and model complexity results in up to 10% variation in prediction accuracy. Notably, when affine transformation are applied separately, the model achieves higher accuracy, regardless the chosen architecture. Across all strategies, the model accuracy exhibits a concave behavior as the number of convolutional layers increases, peaking at an intermediate value. The best model reaches excellent performance both on the internal and additional external testing set. Discussions: Our work underscores the critical importance of adhering to rigorous experimental practices in the field of ML applied to healthcare. The results clearly demonstrate how data augmentation and model depth-often overlooked factors- can dramatically impact final performance if not thoroughly investigated. This highlights both the necessity of exploring neglected modeling aspects and the need to comprehensively report all modeling choices to ensure reproducibility and facilitate meaningful comparisons across studies.

5.
Sci Rep ; 14(1): 2349, 2024 01 29.
Artigo em Inglês | MEDLINE | ID: mdl-38287042

RESUMO

Epilepsy surgery is an option for people with focal onset drug-resistant (DR) seizures but a delayed or incorrect diagnosis of epileptogenic zone (EZ) location limits its efficacy. Seizure semiological manifestations and their chronological appearance contain valuable information on the putative EZ location but their interpretation relies on extensive experience. The aim of our work is to support the localization of EZ in DR patients automatically analyzing the semiological description of seizures contained in video-EEG reports. Our sample is composed of 536 descriptions of seizures extracted from Electronic Medical Records of 122 patients. We devised numerical representations of anamnestic records and seizures descriptions, exploiting Natural Language Processing (NLP) techniques, and used them to feed Machine Learning (ML) models. We performed three binary classification tasks: localizing the EZ in the right or left hemisphere, temporal or extra-temporal, and frontal or posterior regions. Our computational pipeline reached performances above 70% in all tasks. These results show that NLP-based numerical representation combined with ML-based classification models may help in localizing the origin of the seizures relying only on seizures-related semiological text data alone. Accurate early recognition of EZ could enable a more appropriate patient management and a faster access to epilepsy surgery to potential candidates.


Assuntos
Epilepsia Resistente a Medicamentos , Epilepsias Parciais , Epilepsia , Humanos , Processamento de Linguagem Natural , Convulsões , Epilepsia Resistente a Medicamentos/diagnóstico , Epilepsia Resistente a Medicamentos/cirurgia , Eletroencefalografia , Epilepsias Parciais/diagnóstico , Epilepsias Parciais/cirurgia
6.
BMC Cancer ; 13: 387, 2013 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-23947815

RESUMO

BACKGROUND: Paediatric low-grade gliomas (LGGs) encompass a heterogeneous set of tumours of different histologies, site of lesion, age and gender distribution, growth potential, morphological features, tendency to progression and clinical course. Among LGGs, Pilocytic astrocytomas (PAs) are the most common central nervous system (CNS) tumours in children. They are typically well-circumscribed, classified as grade I by the World Health Organization (WHO), but recurrence or progressive disease occurs in about 10-20% of cases. Despite radiological and neuropathological features deemed as classic are acknowledged, PA may present a bewildering variety of microscopic features. Indeed, tumours containing both neoplastic ganglion and astrocytic cells occur at a lower frequency. METHODS: Gene expression profiling on 40 primary LGGs including PAs and mixed glial-neuronal tumours comprising gangliogliomas (GG) and desmoplastic infantile gangliogliomas (DIG) using Affymetrix array platform was performed. A biologically validated machine learning workflow for the identification of microarray-based gene signatures was devised. The method is based on a sparsity inducing regularization algorithm l1l2 that selects relevant variables and takes into account their correlation. The most significant genetic signatures emerging from gene-chip analysis were confirmed and validated by qPCR. RESULTS: We identified an expression signature composed by a biologically validated list of 15 genes, able to distinguish infratentorial from supratentorial LGGs. In addition, a specific molecular fingerprinting distinguishes the supratentorial PAs from those originating in the posterior fossa. Lastly, within supratentorial tumours, we also identified a gene expression pattern composed by neurogenesis, cell motility and cell growth genes which dichotomize mixed glial-neuronal tumours versus PAs. Our results reinforce previous observations about aberrant activation of the mitogen-activated protein kinase (MAPK) pathway in LGGs, but still point to an active involvement of TGF-beta signaling pathway in the PA development and pick out some hitherto unreported genes worthy of further investigation for the mixed glial-neuronal tumours. CONCLUSIONS: The identification of a brain region-specific gene signature suggests that LGGs, with similar pathological features but located at different sites, may be distinguishable on the basis of cancer genetics. Molecular fingerprinting seems to be able to better sub-classify such morphologically heterogeneous tumours and it is remarkable that mixed glial-neuronal tumours are strikingly separated from PAs.


Assuntos
Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Glioma/genética , Glioma/patologia , Transcriptoma , Astrocitoma/genética , Astrocitoma/patologia , Criança , Pré-Escolar , Análise por Conglomerados , Feminino , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Humanos , Lactente , Neoplasias Infratentoriais/genética , Neoplasias Infratentoriais/metabolismo , Masculino , Gradação de Tumores , Reprodutibilidade dos Testes , Neoplasias Supratentoriais/genética , Neoplasias Supratentoriais/metabolismo
7.
Artigo em Inglês | MEDLINE | ID: mdl-37079415

RESUMO

This work represents the first attempt to provide an overview of how to face data integration as the result of a dialogue between neuroscientists and computer scientists. Indeed, data integration is fundamental for studying complex multifactorial diseases, such as the neurodegenerative diseases. This work aims at warning the readers of common pitfalls and critical issues in both medical and data science fields. In this context, we define a road map for data scientists when they first approach the issue of data integration in the biomedical domain, highlighting the challenges that inevitably emerge when dealing with heterogeneous, large-scale and noisy data and proposing possible solutions. Here, we discuss data collection and statistical analysis usually seen as parallel and independent processes, as cross-disciplinary activities. Finally, we provide an exemplary application of data integration to address Alzheimer's Disease (AD), which is the most common multifactorial form of dementia worldwide. We critically discuss the largest and most widely used datasets in AD, and demonstrate how the emergence of machine learning and deep learning methods has had a significant impact on disease's knowledge particularly in the perspective of an early AD diagnosis.

8.
Ophthalmologica ; 227(4): 190-6, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22269846

RESUMO

PURPOSE: To evaluate survival and clinical outcome for patients with a large uveal melanoma treated by either enucleation or proton beam radiotherapy (PBRT). PROCEDURES: This retrospective non-randomized study evaluated 132 consecutive patients with T3 and T4 choroidal melanoma classified according to TNM stage grouping. RESULTS: Cumulative all-cause mortality, melanoma-related mortality and metastasis-free survival were not statistically different between the two groups (log-rank test, p = 0.56, p = 0.99 and p = 0.25, respectively). Eye retention of the tumours treated with PBRT at 5 years was 74% (SD 6.2%). In these patients at diagnosis, 73% of eyes had a best-corrected visual acuity (BCVA) of 0.1 or better. After 12 and 60 months, BCVA of 0.1 or better was observed in 47.5 and 32%, respectively. CONCLUSION AND MESSAGE: Although enucleation is the most common primary treatment for large uveal melanomas, PBRT is an eye-preserving option that may be considered for some patients.


Assuntos
Neoplasias da Coroide/radioterapia , Neoplasias da Coroide/cirurgia , Enucleação Ocular , Melanoma/radioterapia , Melanoma/cirurgia , Radioterapia de Alta Energia , Idoso , Causas de Morte , Neoplasias da Coroide/mortalidade , Neoplasias da Coroide/patologia , Feminino , Seguimentos , Humanos , Masculino , Melanoma/mortalidade , Melanoma/patologia , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Prótons , Estudos Retrospectivos , Taxa de Sobrevida , Resultado do Tratamento , Acuidade Visual/fisiologia
9.
J Comput Biol ; 29(3): 213-232, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-33926217

RESUMO

More and more biologists and bioinformaticians turn to machine learning to analyze large amounts of data. In this context, it is crucial to understand which is the most suitable data analysis pipeline for achieving reliable results. This process may be challenging, due to a variety of factors, the most crucial ones being the data type and the general goal of the analysis (e.g., explorative or predictive). Life science data sets require further consideration as they often contain measures with a low signal-to-noise ratio, high-dimensional observations, and relatively few samples. In this complex setting, regularization, which can be defined as the introduction of additional information to solve an ill-posed problem, is the tool of choice to obtain robust models. Different regularization practices may be used depending both on characteristics of the data and of the question asked, and different choices may lead to different results. In this article, we provide a comprehensive description of the impact and importance of regularization techniques in life science studies. In particular, we provide an intuition of what regularization is and of the different ways it can be implemented and exploited. We propose four general life sciences problems in which regularization is fundamental and should be exploited for robustness. For each of these large families of problems, we enumerate different techniques as well as examples and case studies. Lastly, we provide a unified view of how to approach each data type with various regularization techniques.


Assuntos
Algoritmos , Disciplinas das Ciências Biológicas , Aprendizado de Máquina
10.
Mol Cancer ; 9: 185, 2010 Jul 12.
Artigo em Inglês | MEDLINE | ID: mdl-20624283

RESUMO

BACKGROUND: Hypoxia is a condition of low oxygen tension occurring in the tumor microenvironment and it is related to poor prognosis in human cancer. To examine the relationship between hypoxia and neuroblastoma, we generated and tested an in vitro derived hypoxia gene signature for its ability to predict patients' outcome. RESULTS: We obtained the gene expression profile of 11 hypoxic neuroblastoma cell lines and we derived a robust 62 probesets signature (NB-hypo) taking advantage of the strong discriminating power of the l1-l2 feature selection technique combined with the analysis of differential gene expression. We profiled gene expression of the tumors of 88 neuroblastoma patients and divided them according to the NB-hypo expression values by K-means clustering. The NB-hypo successfully stratifies the neuroblastoma patients into good and poor prognosis groups. Multivariate Cox analysis revealed that the NB-hypo is a significant independent predictor after controlling for commonly used risk factors including the amplification of MYCN oncogene. NB-hypo increases the resolution of the MYCN stratification by dividing patients with MYCN not amplified tumors in good and poor outcome suggesting that hypoxia is associated with the aggressiveness of neuroblastoma tumor independently from MYCN amplification. CONCLUSIONS: Our results demonstrate that the NB-hypo is a novel and independent prognostic factor for neuroblastoma and support the view that hypoxia is negatively correlated with tumors' outcome. We show the power of the biology-driven approach in defining hypoxia as a critical molecular program in neuroblastoma and the potential for improvement in the current criteria for risk stratification.


Assuntos
Hipóxia Celular/genética , Perfilação da Expressão Gênica , Neuroblastoma/genética , Linhagem Celular Tumoral , Genes myc , Humanos , Lactente , Neuroblastoma/patologia , Análise de Sequência com Séries de Oligonucleotídeos , Resultado do Tratamento
11.
Brief Bioinform ; 9(2): 119-28, 2008 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-18310105

RESUMO

The search for predictive biomarkers of disease from high-throughput mass spectrometry (MS) data requires a complex analysis path. Preprocessing and machine-learning modules are pipelined, starting from raw spectra, to set up a predictive classifier based on a shortlist of candidate features. As a machine-learning problem, proteomic profiling on MS data needs caution like the microarray case. The risk of overfitting and of selection bias effects is pervasive: not only potential features easily outnumber samples by 10(3) times, but it is easy to neglect information-leakage effects during preprocessing from spectra to peaks. The aim of this review is to explain how to build a general purpose design analysis protocol (DAP) for predictive proteomic profiling: we show how to limit leakage due to parameter tuning and how to organize classification and ranking on large numbers of replicate versions of the original data to avoid selection bias. The DAP can be used with alternative components, i.e. with different preprocessing methods (peak clustering or wavelet based), classifiers e.g. Support Vector Machine (SVM) or feature ranking methods (recursive feature elimination or I-Relief). A procedure for assessing stability and predictive value of the resulting biomarkers' list is also provided. The approach is exemplified with experiments on synthetic datasets (from the Cromwell MS simulator) and with publicly available datasets from cancer studies.


Assuntos
Inteligência Artificial , Biomarcadores/análise , Espectrometria de Massas , Reconhecimento Automatizado de Padrão , Proteômica , Algoritmos , Animais , Área Sob a Curva , Perfilação da Expressão Gênica , Humanos , Espectrometria de Massas/instrumentação , Espectrometria de Massas/métodos , Análise em Microsséries , Reconhecimento Automatizado de Padrão/métodos , Proteômica/instrumentação , Proteômica/métodos , Processamento de Sinais Assistido por Computador
12.
J Biomed Biotechnol ; 2010: 878709, 2010.
Artigo em Inglês | MEDLINE | ID: mdl-20652058

RESUMO

Hypoxia is a condition of low oxygen tension occurring in the tumor and negatively correlated with the progression of the disease. We studied the gene expression profiles of nine neuroblastoma cell lines grown under hypoxic conditions to define gene signatures that characterize hypoxic neuroblastoma. The l(1)-l(2) regularization applied to the entire transcriptome identified a single signature of 11 probesets discriminating the hypoxic state. We demonstrate that new hypoxia signatures, with similar discriminatory power, can be generated by a prior knowledge-based filtering in which a much smaller number of probesets, characterizing hypoxia-related biochemical pathways, are analyzed. l(1)-l(2) regularization identified novel and robust hypoxia signatures within apoptosis, glycolysis, and oxidative phosphorylation Gene Ontology classes. We conclude that the filtering approach overcomes the noisy nature of the microarray data and allows generating robust signatures suitable for biomarker discovery and patients risk assessment in a fraction of computer time.


Assuntos
Hipóxia Celular , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica/genética , Neuroblastoma/metabolismo , Algoritmos , Hipóxia Celular/genética , Hipóxia Celular/fisiologia , Linhagem Celular Tumoral , Análise por Conglomerados , Biologia Computacional/métodos , Humanos , Análise de Sequência com Séries de Oligonucleotídeos , Análise de Componente Principal , Reprodutibilidade dos Testes
13.
Sci Rep ; 10(1): 12063, 2020 07 21.
Artigo em Inglês | MEDLINE | ID: mdl-32694537

RESUMO

Genome-wide association studies (GWAS) have revealed a plethora of putative susceptibility genes for Alzheimer's disease (AD), with the sole exception of APOE gene unequivocally validated in independent study. Considering that the etiology of complex diseases like AD could depend on functional multiple genes interaction network, here we proposed an alternative GWAS analysis strategy based on (i) multivariate methods and on a (ii) telescope approach, in order to guarantee the identification of correlated variables, and reveal their connections at three biological connected levels. Specifically as multivariate methods, we employed two machine learning algorithms and a genetic association test and we considered SNPs, Genes and Pathways features in the analysis of two public GWAS dataset (ADNI-1 and ADNI-2). For each dataset and for each feature we addressed two binary classifications tasks: cases vs. controls and the low vs. high risk of developing AD considering the allelic status of APOEe4. This complex strategy allowed the identification of SNPs, genes and pathways lists statistically robust and meaningful from the biological viewpoint. Among the results, we confirm the involvement of TOMM40 gene in AD and we propose GRM7 as a novel gene significantly associated with AD.


Assuntos
Doença de Alzheimer/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Algoritmos , Alelos , Doença de Alzheimer/diagnóstico , Doença de Alzheimer/psicologia , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla/métodos , Humanos , Aprendizado de Máquina , Imageamento por Ressonância Magnética , Tomografia por Emissão de Pósitrons
14.
J Clin Endocrinol Metab ; 105(9)2020 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-32692360

RESUMO

AIMS: The purpose of this work is to find the gut microbial fingerprinting of pediatric patients with type 1 diabetes. METHODS: The microbiome of 31 children with type 1 diabetes at onset and of 25 healthy children was determined using multiple polymorphic regions of the 16S ribosomal RNA. We performed machine-learning analyses and metagenome functional analysis to identify significant taxa and their metabolic pathways content. RESULTS: Compared with healthy controls, patients showed a significantly higher relative abundance of the following most important taxa: Bacteroides stercoris, Bacteroides fragilis, Bacteroides intestinalis, Bifidobacterium bifidum, Gammaproteobacteria and its descendants, Holdemania, and Synergistetes and its descendants. On the contrary, the relative abundance of Bacteroides vulgatus, Deltaproteobacteria and its descendants, Parasutterella and the Lactobacillus, Turicibacter genera were significantly lower in patients with respect to healthy controls. The predicted metabolic pathway more associated with type 1 diabetes patients concerns "carbon metabolism," sugar and iron metabolisms in particular. Among the clinical variables considered, standardized body mass index, anti-insulin autoantibodies, glycemia, hemoglobin A1c, Tanner stage, and age at onset emerged as most significant positively or negatively correlated with specific clusters of taxa. CONCLUSIONS: The relative abundance and supervised analyses confirmed the importance of B stercoris in type 1 diabetes patients at onset and showed a relevant role of Synergistetes and its descendants in patients with respect to healthy controls. In general the robustness and coherence of the showed results underline the relevance of studying the microbioma using multiple polymorphic regions, different types of analysis, and different approaches within each analysis.


Assuntos
Algoritmos , Diabetes Mellitus Tipo 1/epidemiologia , Diabetes Mellitus Tipo 1/microbiologia , Microbioma Gastrointestinal/fisiologia , Aprendizado de Máquina , Adolescente , Idade de Início , Criança , Pré-Escolar , Estudos de Coortes , Diabetes Mellitus Tipo 1/etiologia , Fezes/microbiologia , Feminino , Humanos , Masculino , Metagenoma/fisiologia , Fatores de Risco
15.
J Clin Med ; 9(6)2020 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-32492887

RESUMO

During the phase of proliferation needed for hematopoietic reconstitution following transplantation, hematopoietic stem/progenitor cells (HSPC) must express genes involved in stem cell self-renewal. We investigated the expression of genes relevant for self-renewal and expansion of HSPC (operationally defined as CD34+ cells) in steady state and after transplantation. Specifically, we evaluated the expression of ninety-one genes that were analyzed by real-time PCR in CD34+ cells isolated from (i) 12 samples from umbilical cord blood (UCB); (ii) 15 samples from bone marrow healthy donors; (iii) 13 samples from bone marrow after umbilical cord blood transplant (UCBT); and (iv) 29 samples from patients after transplantation with adult hematopoietic cells. The results show that transplanted CD34+ cells from adult cells acquire an asset very different from transplanted CD34+ cells from cord blood. Multivariate machine learning analysis (MMLA) showed that four specific gene signatures can be obtained by comparing the four types of CD34+ cells. In several, but not all cases, transplanted HSPC from UCB overexpress reprogramming genes. However, these remarkable changes do not alter the commitment to hematopoietic lineage. Overall, these results reveal undisclosed aspects of transplantation biology.

16.
BMC Genomics ; 10: 474, 2009 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-19832978

RESUMO

BACKGROUND: Gene expression signatures are clusters of genes discriminating different statuses of the cells and their definition is critical for understanding the molecular bases of diseases. The identification of a gene signature is complicated by the high dimensional nature of the data and by the genetic heterogeneity of the responding cells. The l1-l2 regularization is an embedded feature selection technique that fulfills all the desirable properties of a variable selection algorithm and has the potential to generate a specific signature even in biologically complex settings. We studied the application of this algorithm to detect the signature characterizing the transcriptional response of neuroblastoma tumor cell lines to hypoxia, a condition of low oxygen tension that occurs in the tumor microenvironment. RESULTS: We determined the gene expression profile of 9 neuroblastoma cell lines cultured under normoxic and hypoxic conditions. We studied a heterogeneous set of neuroblastoma cell lines to mimic the in vivo situation and to test the robustness and validity of the l1-l2 regularization with double optimization. Analysis by hierarchical, spectral, and k-means clustering or supervised approach based on t-test analysis divided the cell lines on the bases of genetic differences. However, the disturbance of this strong transcriptional response completely masked the detection of the more subtle response to hypoxia. Different results were obtained when we applied the l1-l2 regularization framework. The algorithm distinguished the normoxic and hypoxic statuses defining signatures comprising 3 to 38 probesets, with a leave-one-out error of 17%. A consensus hypoxia signature was established setting the frequency score at 50% and the correlation parameter epsilon equal to 100. This signature is composed by 11 probesets representing 8 well characterized genes known to be modulated by hypoxia. CONCLUSION: We demonstrate that l1-l2 regularization outperforms more conventional approaches allowing the identification and definition of a gene expression signature under complex experimental conditions. The l1-l2 regularization and the cross validation generates an unbiased and objective output with a low classification error. We feel that the application of this algorithm to tumor biology will be instrumental to analyze gene expression signatures hidden in the transcriptome that, like hypoxia, may be major determinant of the course of the disease.


Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Neuroblastoma/genética , Hipóxia Celular/genética , Linhagem Celular Tumoral , Análise por Conglomerados , Regulação Neoplásica da Expressão Gênica , Humanos , Análise Multivariada , RNA Neoplásico/genética
17.
Bioinformatics ; 24(2): 258-64, 2008 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-18024475

RESUMO

MOTIVATION: We propose a method for studying the stability of biomarker lists obtained from functional genomics studies. It is common to adopt resampling methods to tune and evaluate marker-based diagnostic and prognostic systems in order to prevent selection bias. Such caution promotes honest estimation of class prediction, but leads to alternative sets of solutions. In microarray studies, the difference in lists may be bewildering, also due to the presence of modules of functionally related genes. Methods for assessing stability understand the dependency of the markers on the data or on the predictor's type and help selecting solutions. RESULTS: A computational framework for comparing sets of ranked biomarker lists is presented. Notions and algorithms are based on concepts from permutation group theory. We introduce several algebraic indicators and metric methods for symmetric groups, including the Canberra distance, a weighted version of Spearman's footrule. We also consider distances between partial lists and an aggregation of sets of lists into an optimal list based on voting theory (Borda count). The stability indicators are applied in practical situations to several synthetic, cancer microarray and proteomics datasets. The addressed issues are predictive classification, presence of modules, comparison of alternative biomarker lists, outlier removal, control of selection bias by randomization techniques and enrichment analysis. AVAILABILITY: Supplementary Material and software are available at the address http://biodcv.fbk.eu/listspy.html


Assuntos
Algoritmos , Biomarcadores Tumorais/análise , Perfilação da Expressão Gênica/métodos , Proteínas de Neoplasias/análise , Neoplasias/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Interpretação Estatística de Dados , Humanos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
18.
Eur J Ophthalmol ; 19(4): 654-60, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19551683

RESUMO

PURPOSE: To evaluate the results of 15 years of experience with proton beam radiotherapy in the treatment of intraocular melanoma, and to determine univariate and multivariate risk factors for local failure, eye retention, and survival. METHODS: A total of 368 cases of intraocular melanoma were treated with proton beam radiotherapy at Centre Lacassagne Cyclotron Biomedical of Nice, France, between 1991 and 2006. Actuarial methods were used to evaluate rate of local tumor control, eye retention, and survival after proton beam radiotherapy. Cox regression models were extracted to evaluate univariate risk factors, while regularized least squares algorithm was used to have a multivariate classification model to better discriminate risk factors. RESULTS: Tumor relapse occurred in 8.4% of the eyes, with a median recurrence time of 46 months. Enucleation was performed on 11.7% of the eyes after a median time of 49 months following proton beam; out of these, 29 eyes were enucleated due to relapse and 16 due to other causes. The univariate regression analysis identified tumor height and diameter as primary risk factors for enucleation. Regularized least squares analysis demonstrated the higher effectiveness of a multivariate model of five risk factors (macula distance, optic disc distance, tumor height, maximum diameter, and age) in discriminating relapsed vs nonrelapsed patients. CONCLUSIONS: This data set, which is the largest in Italy with relatively long-term follow-up, demonstrates that a high rate of tumor control, survival, and eye retention were achieved after proton beam irradiation, as in other series.


Assuntos
Melanoma/radioterapia , Prótons , Radioterapia de Alta Energia , Neoplasias Uveais/radioterapia , Enucleação Ocular , Feminino , Seguimentos , França , Humanos , Itália , Masculino , Melanoma/mortalidade , Melanoma/patologia , Pessoa de Meia-Idade , Recidiva Local de Neoplasia , Modelos de Riscos Proporcionais , Fatores de Risco , Taxa de Sobrevida , Resultado do Tratamento , Neoplasias Uveais/mortalidade , Neoplasias Uveais/patologia
19.
PLoS One ; 14(10): e0211844, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31626666

RESUMO

INTRODUCTION: The first line of treatment for people with Diabetes mellitus is metformin. However, over the course of the disease metformin may fail to achieve appropriate glycemic control, and a second-line therapy may become necessary. In this paper we introduce Tangle, a time span-guided neural attention model that can accurately and timely predict the upcoming need for a second-line diabetes therapy from administrative data in the Australian adult population. The method is suitable for designing automatic therapy review recommendations for patients and their providers without the need to collect clinical measures. DATA: We analyzed seven years of de-identified records (2008-2014) of the 10% publicly available linked sample of Medicare Benefits Schedule (MBS) and Pharmaceutical Benefits Scheme (PBS) electronic databases of Australia. METHODS: By design, Tangle inherits the representational power of pre-trained word embedding, such as GloVe, to encode sequences of claims with the related MBS codes. Moreover, the proposed attention mechanism natively exploits the information hidden in the time span between two successive claims (measured in number of days). We compared the proposed method against state-of-the-art sequence classification methods. RESULTS: Tangle outperforms state-of-the-art recurrent neural networks, including attention-based models. In particular, when the proposed time span-guided attention strategy is coupled with pre-trained embedding methods, the model performance reaches an Area Under the ROC Curve of 90%, an improvement of almost 10 percentage points over an attentionless recurrent architecture. IMPLEMENTATION: Tangle is implemented in Python using Keras and it is hosted on GitHub at https://github.com/samuelefiorini/tangle.


Assuntos
Diabetes Mellitus/tratamento farmacológico , Aprendizado de Máquina , Metformina/uso terapêutico , Modelos Biológicos , Redes Neurais de Computação , Austrália , Diabetes Mellitus/epidemiologia , Feminino , Humanos , Masculino , Valor Preditivo dos Testes
20.
Sci Rep ; 9(1): 10347, 2019 07 17.
Artigo em Inglês | MEDLINE | ID: mdl-31316102

RESUMO

Aging is a physiological process in which multifactorial processes determine a progressive decline. Several alterations contribute to the aging process, including telomere shortening, oxidative stress, deregulated autophagy and epigenetic modifications. In some cases, these alterations are so linked with the aging process that it is possible predict the age of a person on the basis of the modification of one specific pathway, as proposed by Horwath and his aging clock based on DNA methylation. Because the energy metabolism changes are involved in the aging process, in this work, we propose a new aging clock based on the modifications of glucose catabolism. The biochemical analyses were performed on mononuclear cells isolated from peripheral blood, obtained from a healthy population with an age between 5 and 106 years. In particular, we have evaluated the oxidative phosphorylation function and efficiency, the ATP/AMP ratio, the lactate dehydrogenase activity and the malondialdehyde content. Further, based on these biochemical markers, we developed a machine learning-based mathematical model able to predict the age of an individual with a mean absolute error of approximately 9.7 years. This mathematical model represents a new non-invasive tool to evaluate and define the age of individuals and could be used to evaluate the effects of drugs or other treatments on the early aging or the rejuvenation.


Assuntos
Envelhecimento/metabolismo , Glucose/metabolismo , Modelos Biológicos , Trifosfato de Adenosina/metabolismo , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Biomarcadores/metabolismo , Criança , Pré-Escolar , Metabolismo Energético , Feminino , Humanos , Leucócitos Mononucleares/metabolismo , Aprendizado de Máquina , Masculino , Malondialdeído/metabolismo , Pessoa de Meia-Idade , Mitocôndrias/metabolismo , Fosforilação Oxidativa , Adulto Jovem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA