Clustering algorithm based on DINNSM and its application in gene expression data analysis.

Li, Zongjin; Song, Changxin; Yang, Jiyu; Jia, Zeyu; Chen, Dongzhen; Yan, Chengying; Tian, Liqin; Wu, Xiaoming

Li, Zongjin; Song, Changxin; Yang, Jiyu; Jia, Zeyu; Chen, Dongzhen; Yan, Chengying; Tian, Liqin; Wu, Xiaoming.

Afiliación

Li Z; Department of Computer, Qinghai Normal University, Xining, China.
Song C; Department of Mechanical Engineering and Information, Shanghai Urban Construction Vocational College, Shanghai, China.
Yang J; Department of Cardiovascular Medicine, Xining First People's Hospital, Xining, China.
Jia Z; Department of Computer, Qinghai Normal University, Xining, China.
Chen D; School of Materials Science and Engineering, Xi'an Polytechnic University, Xi'an, China.
Yan C; Department of Cardiovascular Medicine, Xining First People's Hospital, Xining, China.
Tian L; Department of Computer, Qinghai Normal University, Xining, China.
Wu X; School of Computer, North China Institute of Science and Technology, Langfang, China.

Technol Health Care ; 32(S1): 229-239, 2024.

Article en En | MEDLINE | ID: mdl-38759052

ABSTRACT

ABSTRACT

BACKGROUND:

Selecting an appropriate similarity measurement method is crucial for obtaining biologically meaningful clustering modules. Commonly used measurement methods are insufficient in capturing the complexity of biological systems and fail to accurately represent their intricate interactions.

OBJECTIVE:

This study aimed to obtain biologically meaningful gene modules by using the clustering algorithm based on a similarity measurement method.

METHODS:

A new algorithm called the Dual-Index Nearest Neighbor Similarity Measure (DINNSM) was proposed. This algorithm calculated the similarity matrix between genes using Pearson's or Spearman's correlation. It was then used to construct a nearest-neighbor table based on the similarity matrix. The final similarity matrix was reconstructed using the positions of shared genes in the nearest neighbor table and the number of shared genes.

RESULTS:

Experiments were conducted on five different gene expression datasets and compared with five widely used similarity measurement techniques for gene expression data. The findings demonstrate that when utilizing DINNSM as the similarity measure, the clustering results performed better than using alternative measurement techniques.

CONCLUSIONS:

DINNSM provided more accurate insights into the intricate biological connections among genes, facilitating the identification of more accurate and biological gene co-expression modules.

Asunto(s)

Algoritmos; Perfilación de la Expresión Génica; Análisis por Conglomerados; Humanos; Perfilación de la Expresión Génica/métodos; Biología Computacional/métodos

Palabras clave

Clustering; co-expression module; gene expression data; similarity measure

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Algoritmos / Perfilación de la Expresión Génica Límite: Humans Idioma: En Revista: Technol Health Care Asunto de la revista: ENGENHARIA BIOMEDICA / SERVICOS DE SAUDE Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google