Your browser doesn't support javascript.
loading
Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods.
Jia, Xuan; Yin, ZhiXiang; Peng, Yu.
Afiliación
  • Jia X; School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, China.
  • Yin Z; School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, China.
  • Peng Y; School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, China.
Front Microbiol ; 14: 1092143, 2023.
Article en En | MEDLINE | ID: mdl-36778885
ABSTRACT
Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of male infertility, as well as the diagnosis of genetic testing and the determination of clinical treatment options. While current research has made significant progress in the genes that cause sperm defects in men, genetic studies of sperm content defects are still lacking. This article is based on a dataset of gene expression data on the X chromosome in patients with azoospermia, mild and severe oligospermia. Due to the difference in the degree of disease between patients and the possible difference in genetic causes, common classical clustering methods such as k-means, hierarchical clustering, etc. cannot effectively identify samples (realize simultaneous clustering of samples and features). In this paper, we use machine learning and various statistical methods such as hypergeometric distribution, Gibbs sampling, Fisher test, etc. and genes the interaction network for cluster analysis of gene expression data of male infertility patients has certain advantages compared with existing methods. The cluster results were identified by differential co-expression analysis of gene expression data in male infertility patients, and the model recognition clusters were analyzed by multiple gene enrichment methods, showing different degrees of enrichment in various enzyme activities, cancer, virus-related, ATP and ADP production, and other pathways. At the same time, as this paper is an unsupervised analysis of genetic factors of male infertility patients, we constructed a simulated data set, in which the clustering results have been determined, which can be used to measure the effect of discriminant model recognition. Through comparison, it finds that the proposed model has a better identification effect.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Front Microbiol Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Front Microbiol Año: 2023 Tipo del documento: Article País de afiliación: China