Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 39
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Brief Bioinform ; 24(1)2023 01 19.
Artículo en Inglés | MEDLINE | ID: mdl-36458451

RESUMEN

In epistasis analysis, single-nucleotide polymorphism-single-nucleotide polymorphism interactions (SSIs) among genes may, alongside other environmental factors, influence the risk of multifactorial diseases. To identify SSI between cases and controls (i.e. binary traits), the score for model quality is affected by different objective functions (i.e. measurements) because of potential disease model preferences and disease complexities. Our previous study proposed a multiobjective approach-based multifactor dimensionality reduction (MOMDR), with the results indicating that two objective functions could enhance SSI identification with weak marginal effects. However, SSI identification using MOMDR remains a challenge because the optimal measure combination of objective functions has yet to be investigated. This study extended MOMDR to the many-objective version (i.e. many-objective MDR, MaODR) by integrating various disease probability measures based on a two-way contingency table to improve the identification of SSI between cases and controls. We introduced an objective function selection approach to determine the optimal measure combination in MaODR among 10 well-known measures. In total, 6 disease models with and 40 disease models without marginal effects were used to evaluate the general algorithms, namely those based on multifactor dimensionality reduction, MOMDR and MaODR. Our results revealed that the MaODR-based three objective function model, correct classification rate, likelihood ratio and normalized mutual information (MaODR-CLN) exhibited the higher 6.47% detection success rates (Accuracy) than MOMDR and higher 17.23% detection success rates than MDR through the application of an objective function selection approach. In a Wellcome Trust Case Control Consortium, MaODR-CLN successfully identified the significant SSIs (P < 0.001) associated with coronary artery disease. We performed a systematic analysis to identify the optimal measure combination in MaODR among 10 objective functions. Our combination detected SSIs-based binary traits with weak marginal effects and thus reduced spurious variables in the score model. MOAI is freely available at https://sites.google.com/view/maodr/home.


Asunto(s)
Epistasis Genética , Modelos Genéticos , Algoritmos , Fenotipo , Reducción de Dimensionalidad Multifactorial/métodos , Polimorfismo de Nucleótido Simple
2.
Brief Bioinform ; 23(3)2022 05 13.
Artículo en Inglés | MEDLINE | ID: mdl-35397164

RESUMEN

Primers are critical for polymerase chain reaction (PCR) and influence PCR experimental outcomes. Designing numerous combinations of forward and reverse primers involves various primer constraints, posing a computational challenge. Most PCR primer design methods limit parameters because the available algorithms use general fitness functions. This study designed new fitness functions based on user-specified parameters and used the functions in a primer design approach based on the multiobjective particle swarm optimization (MOPSO) algorithm to address the challenge of primer design with user-specified parameters. Multicriteria evaluation was conducted simultaneously based on primer constraints. The fitness functions were evaluated using 7425 DNA sequences and compared with a predominant primer design approach based on optimization algorithms. Each DNA sequence was run 100 times to calculate the difference between the user-specified parameters and primer constraint values. The algorithms based on fitness functions with user-specified parameters outperformed the algorithms based on general fitness functions for 11 primer constraints. Moreover, MOPSO exhibited superior implementation in all experiments. Practical gel electrophoresis was conducted to verify the PCR experiments and established that MOPSO effectively designs primers based on user-specified parameters.


Asunto(s)
Algoritmos , Programas Informáticos , Secuencia de Bases , Cartilla de ADN/genética , Reacción en Cadena de la Polimerasa/métodos
3.
Brief Bioinform ; 23(1)2022 01 17.
Artículo en Inglés | MEDLINE | ID: mdl-34661627

RESUMEN

Identifying and characterizing the interaction between risk factors for multiple outcomes (multi-outcome interaction) has been one of the greatest challenges faced by complex multifactorial diseases. However, the existing approaches have several limitations in identifying the multi-outcome interaction. To address this issue, we proposed a multi-outcome interaction identification approach called MOAI. MOAI was motivated by the limitations of estimating the interaction simultaneously occurring in multi-outcomes and by the success of Pareto set filter operator for identifying multi-outcome interaction. MOAI permits the identification for the interaction of multiple outcomes and is applicable in population-based study designs. Our experimental results exhibited that the existing approaches are not effectively used to identify the multi-outcome interaction, whereas MOAI obviously exhibited superior performance in identifying multi-outcome interaction. We applied MOAI to identify the interaction between risk factors for colorectal cancer (CRC) in both metastases and mortality prognostic outcomes. An interaction between vaspin and carcinoembryonic antigen (CEA) was found, and the interaction indicated that patients with CRC characterized by higher vaspin (≥30%) and CEA (≥5) levels could simultaneously increase both metastases and mortality risk. The immunostaining evidence revealed that determined multi-outcome interaction could effectively identify the difference between non-metastases/survived and metastases/deceased patients, which offers multi-prognostic outcome risk estimation for CRC. To our knowledge, this is the first report of a multi-outcome interaction associated with a complex multifactorial disease. MOAI is freely available at https://sites.google.com/view/moaitool/home.


Asunto(s)
Antígeno Carcinoembrionario , Neoplasias Colorrectales , Biomarcadores de Tumor , Humanos
4.
Bioinformatics ; 34(13): 2228-2236, 2018 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-29471406

RESUMEN

Motivation: Single-nucleotide polymorphism (SNP)-SNP interactions (SSIs) are popular markers for understanding disease susceptibility. Multifactor dimensionality reduction (MDR) can successfully detect considerable SSIs. Currently, MDR-based methods mainly adopt a single-objective function (a single measure based on contingency tables) to detect SSIs. However, generally, a single-measure function might not yield favorable results due to potential model preferences and disease complexities. Approach: This study proposes a multiobjective MDR (MOMDR) method that is based on a contingency table of MDR as an objective function. MOMDR considers the incorporated measures, including correct classification and likelihood rates, to detect SSIs and adopts set theory to predict the most favorable SSIs with cross-validation consistency. MOMDR enables simultaneously using multiple measures to determine potential SSIs. Results: Three simulation studies were conducted to compare the detection success rates of MOMDR and single-objective MDR (SOMDR), revealing that MOMDR had higher detection success rates than SOMDR. Furthermore, the Wellcome Trust Case Control Consortium dataset was analyzed by MOMDR to detect SSIs associated with coronary artery disease. Availability and implementation: MOMDR is freely available at https://goo.gl/M8dpDg. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Epistasis Genética , Modelos Genéticos , Reducción de Dimensionalidad Multifactorial/métodos , Polimorfismo de Nucleótido Simple , Estudios de Casos y Controles , Enfermedad de la Arteria Coronaria/genética , Predisposición Genética a la Enfermedad , Humanos
5.
Bioinformatics ; 33(15): 2354-2362, 2017 Aug 01.
Artículo en Inglés | MEDLINE | ID: mdl-28379338

RESUMEN

MOTIVATION: Detecting epistatic interactions in genome-wide association studies (GWAS) is a computational challenge. Such huge numbers of single-nucleotide polymorphism (SNP) combinations limit the some of the powerful algorithms to be applied to detect the potential epistasis in large-scale SNP datasets. APPROACH: We propose a new algorithm which combines the differential evolution (DE) algorithm with a classification based multifactor-dimensionality reduction (CMDR), termed DECMDR. DECMDR uses the CMDR as a fitness measure to evaluate values of solutions in DE process for scanning the potential statistical epistasis in GWAS. RESULTS: The results indicated that DECMDR outperforms the existing algorithms in terms of detection success rate by the large simulation and real data obtained from the Wellcome Trust Case Control Consortium. For running time comparison, DECMDR can efficient to apply the CMDR to detect the significant association between cases and controls amongst all possible SNP combinations in GWAS. AVAILABILITY AND IMPLEMENTATION: DECMDR is freely available at https://goo.gl/p9sLuJ . CONTACT: chuang@isu.edu.tw or e0955767257@yahoo.com.tw. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Epistasis Genética , Estudio de Asociación del Genoma Completo/métodos , Reducción de Dimensionalidad Multifactorial/métodos , Polimorfismo de Nucleótido Simple , Humanos
6.
J Theor Biol ; 404: 251-261, 2016 09 07.
Artículo en Inglés | MEDLINE | ID: mdl-27291467

RESUMEN

The identification of transfer RNAs (tRNAs) is critical for a detailed understanding of the evolution of biological organisms and viruses. However, some tRNAs are difficult to recognize due to their unusual sub-structures and may result in the detection of the wrong anticodon. Therefore, the detection of unusual sub-structures of tRNA genes remains an important challenge. In this study, we propose a method to identify tRNA genes based on tRNA features. tRNAfeature attempts to refold the sequence with single-stranded regions longer than those found in the canonical and conventional structural models for tRNA. We predicted a set of 53926 archaeal, eubacterial and eukaryotic tRNA genes annotated in tRNADB-CE and scanned the tRNA genes in whole genome sequencing. The results indicate that tRNAfeature is more powerful than other existing methods for identifying tRNAs.


Asunto(s)
Algoritmos , ADN/genética , ARN de Transferencia/genética , Emparejamiento Base/genética , Secuencia de Bases , Bases de Datos de Ácidos Nucleicos , Genoma Arqueal , Intrones/genética , Conformación de Ácido Nucleico , ARN de Transferencia/química , Selenocisteína/genética
7.
J Biomed Inform ; 63: 112-119, 2016 10.
Artículo en Inglés | MEDLINE | ID: mdl-27507088

RESUMEN

OBJECTIVES: Positively identifying disease-associated single nucleotide polymorphism (SNP) markers in genome-wide studies entails the complex association analysis of a huge number of SNPs. Such large numbers of SNP barcode (SNP/genotype combinations) continue to pose serious computational challenges, especially for high-dimensional data. METHODS: We propose a novel exploiting SNP barcode method based on differential evolution, termed IDE (improved differential evolution). IDE uses a "top combination strategy" to improve the ability of differential evolution to explore high-order SNP barcodes in high-dimensional data. RESULTS: We simulate disease data and use real chronic dialysis data to test four global optimization algorithms. In 48 simulated disease models, we show that IDE outperforms existing global optimization algorithms in terms of exploring ability and power to detect the specific SNP/genotype combinations with a maximum difference between cases and controls. In real data, we show that IDE can be used to evaluate the relative effects of each individual SNP on disease susceptibility. CONCLUSION: IDE generated significant SNP barcode with less computational complexity than the other algorithms, making IDE ideally suited for analysis of high-order SNP barcodes.


Asunto(s)
Algoritmos , ADN Mitocondrial , Procesamiento Automatizado de Datos , Polimorfismo de Nucleótido Simple , Genotipo , Humanos , Diálisis Renal/estadística & datos numéricos
8.
BMC Genomics ; 16: 489, 2015 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-26126977

RESUMEN

BACKGROUND: Multifactor dimensionality reduction (MDR) is widely used to analyze interactions of genes to determine the complex relationship between diseases and polymorphisms in humans. However, the astronomical number of high-order combinations makes MDR a highly time-consuming process which can be difficult to implement for multiple tests to identify more complex interactions between genes. This study proposes a new framework, named fast MDR (FMDR), which is a greedy search strategy based on the joint effect property. RESULTS: Six models with different minor allele frequencies (MAFs) and different sample sizes were used to generate the six simulation data sets. A real data set was obtained from the mitochondrial D-loop of chronic dialysis patients. Comparison of results from the simulation data and real data sets showed that FMDR identified significant gene-gene interaction with less computational complexity than the MDR in high-order interaction analysis. CONCLUSION: FMDR improves the MDR difficulties associated with the computational loading of high-order SNPs and can be used to evaluate the relative effects of each individual SNP on disease susceptibility. FMDR is freely available at http://bioinfo.kmu.edu.tw/FMDR.rar .


Asunto(s)
Epistasis Genética , Reducción de Dimensionalidad Multifactorial/métodos , Polimorfismo de Nucleótido Simple , Algoritmos , Biología Computacional/métodos , Susceptibilidad a Enfermedades , Frecuencia de los Genes , Humanos , Programas Informáticos
9.
Cancer Cell Int ; 14(1): 29, 2014 Mar 31.
Artículo en Inglés | MEDLINE | ID: mdl-24685237

RESUMEN

BACKGROUND: ORAI1 channels play an important role for breast cancer progression and metastasis. Previous studies indicated the strong correlation between breast cancer and individual single nucleotide polymorphisms (SNPs) of ORAI1 gene. However, the possible SNP-SNP interaction of ORAI1 gene was not investigated. RESULTS: To develop the complex analyses of SNP-SNP interaction, we propose a genetic algorithm (GA) to detect the model of breast cancer association between five SNPs (rs12320939, rs12313273, rs7135617, rs6486795 and rs712853) of ORAI1 gene. For individual SNPs, the differences between case and control groups in five SNPs of ORAI1 gene were not significant. In contrast, GA-generated SNP models show that 2-SNP (rs12320939-GT/rs6486795-CT), 3-SNP (rs12320939-GT/rs12313273-TT/rs6486795-TC), 5-SNP (rs12320939-GG/rs12313273-TC/rs7135617-TT/rs6486795-TT/rs712853-TT) have higher risks for breast cancer in terms of odds ratio analysis (1.357, 1.689, and 13.148, respectively). CONCLUSION: Taken together, the cumulative effects of SNPs of ORAI1 gene in breast cancer association study were well demonstrated in terms of GA-generated SNP models.

10.
Ann Gen Psychiatry ; 13: 15, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24955105

RESUMEN

BACKGROUND: Facial emotion perception (FEP) can affect social function. We previously reported that parts of five tested single-nucleotide polymorphisms (SNPs) in the MET and AKT1 genes may individually affect FEP performance. However, the effects of SNP-SNP interactions on FEP performance remain unclear. METHODS: This study compared patients with high and low FEP performances (n = 89 and 93, respectively). A particle swarm optimization (PSO) algorithm was used to identify the best SNP barcodes (i.e., the SNP combinations and genotypes that revealed the largest differences between the high and low FEP groups). RESULTS: The analyses of individual SNPs showed no significant differences between the high and low FEP groups. However, comparisons of multiple SNP-SNP interactions involving different combinations of two to five SNPs showed that the best PSO-generated SNP barcodes were significantly associated with high FEP score. The analyses of the joint effects of the best SNP barcodes for two to five interacting SNPs also showed that the best SNP barcodes had significantly higher odds ratios (2.119 to 3.138; P < 0.05) compared to other SNP barcodes. In conclusion, the proposed PSO algorithm effectively identifies the best SNP barcodes that have the strongest associations with FEP performance. CONCLUSIONS: This study also proposes a computational methodology for analyzing complex SNP-SNP interactions in social cognition domains such as recognition of facial emotion.

11.
Mol Biol Rep ; 40(7): 4227-33, 2013 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-23695493

RESUMEN

Most non-significant individual single nucleotide polymorphisms (SNPs) were undiscovered in hypertension association studies. Their possible SNP-SNP interactions were usually ignored and leaded to missing heritability. In present study, we proposed a particle swarm optimization (PSO) algorithm to analyze the SNP-SNP interaction associated with hypertension. Genotype dataset of eight SNPs of renin-angiotensin system genes for 130 non-hypertension and 313 hypertension subjects were included. Without SNP-SNP interaction, most individual SNPs were non-significant difference between the hypertension and non-hypertension groups. For SNP-SNP interaction, PSO can select the SNP combinations involving different SNP numbers, namely the best SNP barcodes, to show the maximum frequency difference between non-hypertension and hypertension groups. After computation, the best PSO-generated SNP barcodes were dominant in non-hypertension in terms of the occurrences of frequency differences between non-hypertension and hypertension groups. The OR values of the best SNP barcodes involving 2-8 SNPs were 0.705-0.334, suggesting that these SNP barcodes were protective against hypertension. In conclusion, this study demonstrated that non-significant SNPs may generate the joint effect in association study. Our proposed PSO algorithm is effective to identify the best protective SNP barcodes against hypertension.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Epistasis Genética , Hipertensión/genética , Polimorfismo de Nucleótido Simple , Sistema Renina-Angiotensina/genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Modelos Biológicos , Oportunidad Relativa , Reproducibilidad de los Resultados
12.
Artículo en Inglés | MEDLINE | ID: mdl-38117627

RESUMEN

Next-generation sequencing (NGS) genomic data offer valuable high-throughput genomic information for computational applications in medicine. Using genomic data to identify disease-associated genes to estimate cancer mortality risk remains challenging regarding to computational efficiency and risk integration. For determining mortality-related genes, we propose an information fusion system based on a fuzzy system to fuse the numerous deep-learning-based risk scores, consider the significance of features related to time-varying effects and risk stratifications, and interpret the directional relationship and interaction between outcome and predictors. Fuzzy rules were implemented to integrate the considerations mentioned above by merging all the risk score models to achieve advanced risk estimation. The genomic data of head and neck squamous cell carcinoma (HNSCC) were used to evaluate the performance of the proposed computational approach. The results indicated that the proposed computational approach exhibited optimal ability to identify mortality risk-related genes in HNSCC patients. The results also suggest that HNSCC mortality is associated with cancer inflammatory response, the interleukin-17A signaling pathway, stellate cell activation, and the extracellular-regulated protein kinase five signaling pathway, which might offer new therapeutic targets HNSCC through immunologic or antiangiogenic mechanisms. The proposed information fusion system can promote the determination of high-risk genes related to cancer mortality. This study contributes a valid cancer mortality risk estimate that can identify mortality-related genes.

13.
Artículo en Inglés | MEDLINE | ID: mdl-35061588

RESUMEN

Epistasis detection is vital for understanding disease susceptibility in genetics. Multiobjective multifactor dimensionality reduction (MOMDR) was previously proposed to detect epistasis. MOMDR was performed using binary classification to distinguish the high-risk (H) and low-risk (L) groups to reduce multifactor dimensionality. However, the binary classification does not reflect the uncertainty of the H and L classification. In this study, we proposed an empirical fuzzy MOMDR (EFMOMDR) to address the limitations of binary classification using the degree of membership through an empirical fuzzy approach. The EFMOMDR can simultaneously consider two incorporated fuzzy-based measures, including correct classification rate and likelihood rate, and does not require parameter tuning. Simulation studies revealed that EFMOMDR has higher 7.14% detection success rates than MOMDR, indicating that the limitations of binary classification of MOMDR have been successfully improved by empirical fuzzy. Moreover, EFMOMDR was used to analyze coronary artery disease in the Wellcome Trust Case Control Consortium dataset.


Asunto(s)
Enfermedad de la Arteria Coronaria , Epistasis Genética , Humanos , Epistasis Genética/genética , Reducción de Dimensionalidad Multifactorial , Modelos Genéticos , Simulación por Computador , Enfermedad de la Arteria Coronaria/genética , Polimorfismo de Nucleótido Simple , Algoritmos
14.
Front Neurosci ; 16: 1018005, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36620438

RESUMEN

To understand students' learning behaviors, this study uses machine learning technologies to analyze the data of interactive learning environments, and then predicts students' learning outcomes. This study adopted a variety of machine learning classification methods, quizzes, and programming system logs, found that students' learning characteristics were correlated with their learning performance when they encountered similar programming practice. In this study, we used random forest (RF), support vector machine (SVM), logistic regression (LR), and neural network (NN) algorithms to predict whether students would submit on time for the course. Among them, the NN algorithm showed the best prediction results. Education-related data can be predicted by machine learning techniques, and different machine learning models with different hyperparameters can be used to obtain better results.

15.
Artif Intell Med ; 102: 101768, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-31980105

RESUMEN

OBJECTIVE: Epistasis identification is critical for determining susceptibility to human genetic diseases. The rapid development of technology has enabled scalability to make multifactor dimensionality reduction (MDR) measurements an effective calculation tool that achieves superior detection. However, the classification of high-risk (H) or low-risk (L) groups in multidrug resistance operations calls for extensive research. METHODS AND MATERIAL: In this study, an improved fuzzy sigmoid (FS) method using the membership degree in MDR (FSMDR) was proposed for solving the limitations of binary classification. The FS method combined with MDR measurements yielded an improved ability to distinguish similar frequencies of potential multifactor genotypes. RESULTS: We compared our results with other MDR-based methods and FSMDR achieved superior detection rates on simulated data sets. The results indicated that the fuzzy classifications can provide insight into the uncertainty of H/L classification in MDR operation. CONCLUSION: FSMDR successfully detected significant epistasis of coronary artery disease in the Wellcome Trust Case Control Consortium data set.


Asunto(s)
Epistasis Genética , Lógica Difusa , Reducción de Dimensionalidad Multifactorial/métodos , Algoritmos , Inteligencia Artificial , Estudios de Casos y Controles , Resistencia a Medicamentos/genética , Resistencia a Múltiples Medicamentos/genética , Genotipo , Humanos , Modelos Genéticos
16.
Artículo en Inglés | MEDLINE | ID: mdl-30040653

RESUMEN

Detecting gene-gene interactions in single-nucleotide polymorphism data is vital for understanding disease susceptibility. However, existing approaches may be limited by the sample size in case-control studies. Herein, we propose a balance approach for the multifactor dimensionality reduction (BMDR) method to increase the accuracy of estimates of the prediction error rate in small samples. BMDR explicitly selects the best model by evaluating the average of prediction error rates over k-fold cross-validation without cross-validation consistency selection. In this study, we used several epistatic models with and without marginal effects under different parameter settings (heritability and minor allele frequencies) to evaluate the performance of existing approaches. Using simulated data sets, BMDR successfully detected gene-gene interactions, particularly for data sets with small sample sizes. A large data set was obtained from the Wellcome Trust Case Control Consortium, and results indicated that BMDR could effectively detect significant gene-gene interactions.


Asunto(s)
Biología Computacional/métodos , Epistasis Genética/genética , Modelos Genéticos , Reducción de Dimensionalidad Multifactorial/métodos , Algoritmos , Polimorfismo de Nucleótido Simple/genética
17.
Diagnostics (Basel) ; 10(10)2020 Oct 09.
Artículo en Inglés | MEDLINE | ID: mdl-33050209

RESUMEN

Colorectal cancer is a highly heterogeneous malignancy in the Asian population, and it is considered an important prognostic factor for baseline characteristics, tumor burden, and tumor markers. This study investigated the effect of baseline characteristics and tumor burden on tumor marker expression and progressive disease in colorectal cancer by using partial least squares variance-based path modeling (PLS-PM). PLS-PM can be used to evaluate the complex relationship between prognostic variables and progressive disease status with a small sample of measurements and structural models. A total of 89 tissue samples of colorectal cancer were analyzed. Our results suggested that the expression of visceral adipose tissue-derived serpin (vaspin) is a potential indicator of colorectal cancer progression and may be affected by baseline characteristics such as age, sex, body mass index, and diabetes mellitus. Moreover, according to the characteristics of tumor burden, the expression of vaspin was generally higher in each progressive disease patient. The overall findings suggest that vaspin is a potential indicator of the progressive disease and may be affected by the baseline characteristics of patients.

18.
IEEE J Biomed Health Inform ; 23(1): 416-426, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-29993963

RESUMEN

Gene-gene interactions (GGIs) are important markers for determining susceptibility to a disease. Multifactor dimensionality reduction (MDR) is a popular algorithm for detecting GGIs and primarily adopts the correct classification rate (CCR) to assess the quality of a GGI. However, CCR measurement alone may not successfully detect certain GGIs because of potential model preferences and disease complexities. In this study, multiple-criteria decision analysis (MCDA) based on MDR was named MCDA-MDR and proposed for detecting GGIs. MCDA facilitates MDR to simultaneously adopt multiple measures within the two-way contingency table of MDR to assess GGIs; the CCR and rule utility measure were employed. Cross-validation consistency was adopted to determine the most favorable GGIs among the Pareto sets. Simulation studies were conducted to compare the detection success rates of the MDR-only-based measure and MCDA-MDR, revealing that MCDA-MDR had superior detection success rates. The Wellcome Trust Case Control Consortium dataset was analyzed using MCDA-MDR to detect GGIs associated with coronary artery disease, and MCDA-MDR successfully detected numerous significant GGIs (p < 0.001). MCDA-MDR performance assessment revealed that the applied MCDA successfully enhanced the GGI detection success rate of the MDR-based method compared with MDR alone.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Epistasis Genética/genética , Modelos Genéticos , Polimorfismo de Nucleótido Simple/genética , Simulación por Computador , Humanos
19.
Comput Biol Med ; 113: 103397, 2019 10.
Artículo en Inglés | MEDLINE | ID: mdl-31494431

RESUMEN

Hydrophobic-polar (HP) models are widely used to predict protein folding and hydrophobic interactions. Numerous optimization algorithms have been proposed to predict protein folding using the two-dimensional (2D) HP model. However, to obtain an optimal protein structure from the 2D HP model remains challenging. In this study, an algorithm integrating particle swarm optimization (PSO) and Tabu search (TS), named PSO-TS, was proposed to predict protein structures based on the 2D HP model. TS can help PSO to avoid getting trapped in a local optima and thus to remove the limitation of PSO in predicting protein folding by the 2D HP model. In this study, a total of 28 protein sequences were used to evaluate the accuracy of PSO-TS in protein folding prediction. The proposed PSO-TS method was compared with 15 other approaches for predicting short and long protein sequences. Experimental results demonstrated that PSO-TS provides a highly accurate, reproducible, and stabile prediction ability for the protein folding by the 2D HP model.


Asunto(s)
Algoritmos , Modelos Moleculares , Pliegue de Proteína , Proteínas , Análisis de Secuencia de Proteína , Secuencia de Aminoácidos , Interacciones Hidrofóbicas e Hidrofílicas , Dominios Proteicos , Proteínas/química , Proteínas/genética
20.
IEEE Trans Nanobioscience ; 17(3): 291-299, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-29994217

RESUMEN

Single-nucleotide polymorphism (SNP)-SNP interactions are crucial for understanding the association between disease-related multifactorials for disease analysis. Existing statistical methods for determining such interactions are limited by the considerable computation required for evaluating all potential associations between disease-related multifactorials. Identifying SNP-SNP interactions is thus a major challenge in genetic association studies. This paper proposes a catfish Taguchi-based binary differential evolution (CT-BDE) algorithm for identifying SNP-SNP interactions. In the search space, the catfish effect prevents the premature convergence of the population, and the Taguchi method improves the search ability of the BDE algorithm. Hence, the proposed algorithm enables obtaining a favorable solution regarding the identification of high-order SNP-SNP interactions. Additionally, the proposed algorithm applies an effective fitness function derived from a multifactor dimensionality reduction (MDR) operation to evaluate the solutions from BDE-based algorithms. Simulated and real data sets were used to evaluate the ability of several BDE-based algorithms in identifying specific SNP-SNP interactions. We compared the fitness function derived from the MDR operation with that derived according to the difference between cases and controls, by using the different BDE-based algorithms. The results showed that the proposed CT-BDE algorithm applying the fitness function derived from the MDR operation exhibited a superior ability in identifying SNP-SNP interactions compared with the other BDE-based algorithms.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Estudios de Asociación Genética/métodos , Polimorfismo de Nucleótido Simple/genética , Bases de Datos Factuales , Humanos , Reducción de Dimensionalidad Multifactorial/métodos , Diálisis Renal
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA