A primer to frequent itemset mining for bioinformatics.

Naulaerts, Stefan; Meysman, Pieter; Bittremieux, Wout; Vu, Trung Nghia; Vanden Berghe, Wim; Goethals, Bart; Laukens, Kris

Naulaerts, Stefan; Meysman, Pieter; Bittremieux, Wout; Vu, Trung Nghia; Vanden Berghe, Wim; Goethals, Bart; Laukens, Kris.

Brief Bioinform ; 16(2): 216-31, 2015 Mar.

Article en En | MEDLINE | ID: mdl-24162173

ABSTRACT

ABSTRACT

Over the past two decades, pattern mining techniques have become an integral part of many bioinformatics solutions. Frequent itemset mining is a popular group of pattern mining techniques designed to identify elements that frequently co-occur. An archetypical example is the identification of products that often end up together in the same shopping basket in supermarket transactions. A number of algorithms have been developed to address variations of this computationally non-trivial problem. Frequent itemset mining techniques are able to efficiently capture the characteristics of (complex) data and succinctly summarize it. Owing to these and other interesting properties, these techniques have proven their value in biological data analysis. Nevertheless, information about the bioinformatics applications of these techniques remains scattered. In this primer, we introduce frequent itemset mining and their derived association rules for life scientists. We give an overview of various algorithms, and illustrate how they can be used in several real-life bioinformatics application domains. We end with a discussion of the future potential and open challenges for frequent itemset mining in the life sciences.

Asunto(s)

Algoritmos; Minería de Datos/estadística & datos numéricos; Animales; Análisis por Conglomerados; Biología Computacional; Perfilación de la Expresión Génica/estadística & datos numéricos; Redes Reguladoras de Genes; Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos; Humanos; Reconocimiento de Normas Patrones Automatizadas/estadística & datos numéricos; Polimorfismo de Nucleótido Simple; Programas Informáticos

Palabras clave

association rule; biclustering; frequent item set; market basket analysis; pattern mining

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Minería de Datos Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2015 Tipo del documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google