Nearest Consensus Clustering Classification to Identify Subclasses and Predict Disease.

Alyousef, Awad A; Nihtyanova, Svetlana; Denton, Chris; Bosoni, Pietro; Bellazzi, Riccardo; Tucker, Allan

Alyousef, Awad A; Nihtyanova, Svetlana; Denton, Chris; Bosoni, Pietro; Bellazzi, Riccardo; Tucker, Allan.

Afiliação

Alyousef AA; 1Department Computer Science, Brunel University London, Uxbridge, UK.
Nihtyanova S; 2UCL Royal Free Hospital, London, UK.
Denton C; 2UCL Royal Free Hospital, London, UK.
Bosoni P; 3University of Pavia, Pavia, Italy.
Bellazzi R; 3University of Pavia, Pavia, Italy.
Tucker A; 1Department Computer Science, Brunel University London, Uxbridge, UK.

J Healthc Inform Res ; 2(4): 402-422, 2018.

Article em En | MEDLINE | ID: mdl-30533598

RESUMO

Disease subtyping, which helps to develop personalized treatments, remains a challenge in data analysis because of the many different ways to group patients based upon their data. However, if we can identify subclasses of disease, then it will help to develop better models that are more specific to individuals and should therefore improve prediction and understanding of the underlying characteristics of the disease in question. This paper proposes a new algorithm that integrates consensus clustering methods with classification in order to overcome issues with sample bias. The new algorithm combines K-means with consensus clustering in order build cohort-specific decision trees that improve classification as well as aid the understanding of the underlying differences of the discovered groups. The methods are tested on a real-world freely available breast cancer dataset and data from a London hospital on systemic sclerosis, a rare potentially fatal condition. Results show that "nearest consensus clustering classification" improves the accuracy and the prediction significantly when this algorithm has been compared with competitive similar methods.

Palavras-chave

Classification; Consensus clustering; Disease subgroup discovery

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google