MultiDataSet: an R package for encapsulating multiple data sets with application to omic data integration.
Hernandez-Ferrer, Carles; Ruiz-Arenas, Carlos; Beltran-Gomila, Alba; González, Juan R.
Affiliation
  • Hernandez-Ferrer C; Institut de Salut Global de Barcelona (ISGlobal) - Campus Mar, Barcelona Building: Biomedical Research Park, c/Dr. Aiguader, 88, 08003, Barcelona, Spain.
  • Ruiz-Arenas C; Universitat Pompeu Fabra (UPF), Barcelona, Spain.
  • Beltran-Gomila A; CIBER Epidemiología y Salud Pública (CIBERESP), Barcelona, Spain.
  • González JR; Institut de Salut Global de Barcelona (ISGlobal) - Campus Mar, Barcelona Building: Biomedical Research Park, c/Dr. Aiguader, 88, 08003, Barcelona, Spain.
BMC Bioinformatics ; 18(1): 36, 2017 Jan 17.
Article in English | MEDLINE | ID: mdl-28095799
BACKGROUND: The falling cost of genomic assays has generated large amounts of biomedical data. As a result, current studies perform multiple experiments on the same subjects. While Bioconductor's methods and classes, implemented in different packages, manage individual experiments, there is no standard class to properly manage different omic data sets from the same subjects. In addition, most R/Bioconductor packages designed to integrate and visualize biological data use basic data structures that lack clear general methods, such as subsetting or selecting samples.

RESULTS: To cover this need, we have developed MultiDataSet, a new R class based on Bioconductor standards and designed to encapsulate multiple data sets. MultiDataSet deals with the usual difficulties of managing multiple and non-complete data sets while offering a simple and general way of subsetting features and selecting samples. We illustrate the use of MultiDataSet in three common situations: 1) performing integration analysis with third-party packages; 2) creating new methods and functions for omic data integration; 3) encapsulating new unimplemented data from any biological experiment.

CONCLUSIONS: MultiDataSet is a suitable class for data integration within the R and Bioconductor framework.
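
The idea of one container that holds several data sets and selects samples consistently across them can be illustrated with a minimal conceptual sketch. It is written in Python for illustration only and is not the MultiDataSet package or its R API: the class `MultiTable`, its methods, and the toy sample and feature names are all hypothetical. It assumes each data set is a mapping from sample ID to feature values, and shows how non-complete data sets (samples missing from some tables) can be reduced to the samples shared by all.

```python
from dataclasses import dataclass, field


@dataclass
class MultiTable:
    """Hypothetical container: one {sample_id: {feature: value}} table per data set."""
    tables: dict = field(default_factory=dict)

    def add(self, name, table):
        # Refuse to silently overwrite a data set that is already stored.
        if name in self.tables:
            raise ValueError(f"data set {name!r} already present")
        self.tables[name] = table

    def common_samples(self):
        """Return the sample IDs present in every table, sorted."""
        if not self.tables:
            return []
        id_sets = [set(table) for table in self.tables.values()]
        return sorted(set.intersection(*id_sets))

    def select_samples(self, sample_ids):
        """Return a new container keeping only the requested samples in each table."""
        keep = set(sample_ids)
        out = MultiTable()
        for name, table in self.tables.items():
            out.add(name, {s: v for s, v in table.items() if s in keep})
        return out


# Toy data (invented): two assays measured on overlapping sets of subjects.
expression = {"s1": {"geneA": 2.1}, "s2": {"geneA": 3.4}, "s3": {"geneA": 1.8}}
methylation = {"s2": {"cg01": 0.7}, "s3": {"cg01": 0.2}, "s4": {"cg01": 0.9}}

multi = MultiTable()
multi.add("expression", expression)
multi.add("methylation", methylation)

shared = multi.common_samples()          # samples measured in both assays
complete = multi.select_samples(shared)  # every table restricted to those samples
```

Keeping the selection logic in the container, rather than in each analysis script, is what lets downstream integration code assume that all tables refer to the same subjects.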

Full text: 1 | Database: MEDLINE | Main subject: Software / Genomics | Limit: Humans | Language: En | Year of publication: 2017 | Document type: Article
