RESUMEN
BACKGROUND: With an abundant amount of microarray gene expression data sets available through public repositories, new possibilities lie in combining multiple existing data sets. In this new context, analysis itself is no longer the problem, but retrieving and consistently integrating all this data before delivering it to the wide variety of existing analysis tools becomes the new bottleneck. RESULTS: We present the newly released inSilicoMerging R/Bioconductor package which, together with the earlier released inSilicoDb R/Bioconductor package, allows consistent retrieval, integration and analysis of publicly available microarray gene expression data sets. Inside the inSilicoMerging package a set of five visual and six quantitative validation measures are available as well. CONCLUSIONS: By providing (i) access to uniformly curated and preprocessed data, (ii) a collection of techniques to remove the batch effects between data sets from different sources, and (iii) several validation tools enabling the inspection of the integration process, these packages enable researchers to fully explore the potential of combining gene expression data for downstream analysis. The power of using both packages is demonstrated by programmatically retrieving and integrating gene expression studies from the InSilico DB repository [https://insilicodb.org/app/].
Asunto(s)
Perfilación de la Expresión Génica/estadística & datos numéricos , Análisis de Secuencia por Matrices de Oligonucleótidos/estadística & datos numéricos , Programas Informáticos , Acceso a la Información , HumanosRESUMEN
Microarray technology has become an integral part of biomedical research and increasing amounts of datasets become available through public repositories. However, re-use of these datasets is severely hindered by unstructured, missing or incorrect biological samples information; as well as the wide variety of preprocessing methods in use. The inSilicoDb R/Bioconductor package is a command-line front-end to the InSilico DB, a web-based database currently containing 86 104 expert-curated human Affymetrix expression profiles compiled from 1937 GEO repository series. The use of this package builds on the Bioconductor project's focus on reproducibility by enabling a clear workflow in which not only analysis, but also the retrieval of verified data is supported.
Asunto(s)
Perfilación de la Expresión Génica , Programas Informáticos , Bases de Datos Genéticas , Humanos , Análisis de Secuencia por Matrices de OligonucleótidosRESUMEN
INTRODUCTION: Psoriasis affects around 2% of children in Europe. The majority of cases is readily managed with topical treatments using corticosteroids without or with calcipotriol. More resistant and extensive moderate-to-severe cases require UVA or UVB phototherapies or conventional systemic treatment including ciclosporin, acitretin and methotrexate. However, these therapies are associated with a low tolerability and potential cumulative long-term adverse effects and toxicities. AREAS COVERED: About 15 years ago, the first biological appeared for the treatment of moderate-to-severe plaque type psoriasis in adult patients. Several years later, the first biologic treatment to be approved in children was etanercept, a soluble receptor that binds both tumor necrosis factor (TNF)-α and ß followed by adalimumab, a monoclonal antibody against TNF-α, and currently by ustekinumab, a monoclonal IL12/23 p40 antagonist and, very recently, secukinumab and ixekizumab, both IL17 antagonists. All these biologic treatments brought significantly improved treatment results compared to light-based therapies and conventional treatments and present very good tolerance and safety profiles. EXPERT OPINION: Due to their excellent efficacy and safety profiles ustekinumab, secukinumab and ixekizumab could currently be considered as a first-line treatment options for moderate-to-severe childhood and adolescent psoriasis requiring a systemic treatment.
Asunto(s)
Psoriasis , Adalimumab , Adolescente , Adulto , Anticuerpos Monoclonales/uso terapéutico , Niño , Etanercept , Humanos , Psoriasis/terapia , Índice de Severidad de la Enfermedad , Resultado del Tratamiento , Ustekinumab/uso terapéuticoRESUMEN
A plenitude of feature selection (FS) methods is available in the literature, most of them rising as a need to analyze data of very high dimension, usually hundreds or thousands of variables. Such data sets are now available in various application areas like combinatorial chemistry, text mining, multivariate imaging, or bioinformatics. As a general accepted rule, these methods are grouped in filters, wrappers, and embedded methods. More recently, a new group of methods has been added in the general framework of FS: ensemble techniques. The focus in this survey is on filter feature selection methods for informative feature discovery in gene expression microarray (GEM) analysis, which is also known as differentially expressed genes (DEGs) discovery, gene prioritization, or biomarker discovery. We present them in a unified framework, using standardized notations in order to reveal their technical details and to highlight their common characteristics as well as their particularities.
Asunto(s)
Biología Computacional/métodos , Perfilación de la Expresión Génica , Modelos Estadísticos , Análisis de Secuencia por Matrices de Oligonucleótidos , Análisis de Varianza , Teorema de Bayes , Marcadores Genéticos , Teoría de la Información , Curva ROC , Estadísticas no ParamétricasRESUMEN
Genomics datasets are increasingly useful for gaining biomedical insights, with adoption in the clinic underway. However, multiple hurdles related to data management stand in the way of their efficient large-scale utilization. The solution proposed is a web-based data storage hub. Having clear focus, flexibility and adaptability, InSilico DB seamlessly connects genomics dataset repositories to state-of-the-art and free GUI and command-line data analysis tools. The InSilico DB platform is a powerful collaborative environment, with advanced capabilities for biocuration, dataset sharing, and dataset subsetting and combination. InSilico DB is available from https://insilicodb.org.