Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Entropy (Basel) ; 22(1)2020 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-33285864

RESUMO

Entropy is a key concept in the characterization of uncertainty for any given signal, and its extensions such as Spectral Entropy and Permutation Entropy. They have been used to measure the complexity of time series. However, these measures are subject to the discretization employed to study the states of the system, and identifying the relationship between complexity measures and the expected performance of the four selected forecasting methods that participate in the M4 Competition. This relationship allows the decision, in advance, of which algorithm is adequate. Therefore, in this paper, we found the relationships between entropy-based complexity framework and the forecasting error of four selected methods (Smyl, Theta, ARIMA, and ETS). Moreover, we present a framework extension based on the Emergence, Self-Organization, and Complexity paradigm. The experimentation with both synthetic and M4 Competition time series show that the feature space induced by complexities, visually constrains the forecasting method performance to specific regions; where the logarithm of its metric error is poorer, the Complexity based on the emergence and self-organization is maximal.

2.
J Chem Inf Model ; 59(7): 3144-3153, 2019 07 22.
Artigo em Inglês | MEDLINE | ID: mdl-31199647

RESUMO

Ionic liquids (ILs) are ionic compounds with low melting points that can be designed to be used in an extensive set of commercial and industrial applications. However, the design of ILs is limited by the quantity and quality of the available data in the literature; therefore, the estimation of physicochemical properties of ILs by computational methods is a promising way of solving this problem, since it provides approximations of the real values, resulting in savings in both time and money. We studied two data sets of 281 and 134 liquids based on the molecule imidazole that were analyzed with QSPR techniques. This paper presents a software architecture that uses clustering techniques to improve the robustness of estimation models of the melting point of ILs. These results indicate an error of 6.25% in the previously unmodeled data set and an error of 4.43% in the second data set. We have an improvement with the second data set of 1.81% over the last results previously found.


Assuntos
Imidazóis/química , Líquidos Iônicos/química , Temperatura , Aprendizado de Máquina , Modelos Químicos , Reprodutibilidade dos Testes , Solventes/química
3.
Comput Math Methods Med ; 2017: 8424198, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28487747

RESUMO

Background. Guillain-Barré Syndrome (GBS) is a potentially fatal autoimmune neurological disorder. The severity varies among the four main subtypes, named as Acute Inflammatory Demyelinating Polyneuropathy (AIDP), Acute Motor Axonal Neuropathy (AMAN), Acute Motor Sensory Axonal Neuropathy (AMSAN), and Miller-Fisher Syndrome (MF). A proper subtype identification may help to promptly carry out adequate treatment in patients. Method. We perform experiments with 15 single classifiers in two scenarios: four subtypes' classification and One versus All (OvA) classification. We used a dataset with the 16 relevant features identified in a previous phase. Performance evaluation is made by 10-fold cross validation (10-FCV). Typical classification performance measures are used. A statistical test is conducted in order to identify the top five classifiers for each case. Results. In four GBS subtypes' classification, half of the classifiers investigated in this study obtained an average accuracy above 0.90. In OvA classification, the two subtypes with the largest number of instances resulted in the best classification results. Conclusions. This study represents a comprehensive effort on creating a predictive model for Guillain-Barré Syndrome subtypes. Also, the analysis performed in this work provides insight about the best single classifiers for each classification case.


Assuntos
Algoritmos , Síndrome de Guillain-Barré/diagnóstico , Aprendizado de Máquina , Modelos Biológicos , Humanos
4.
Adv Bioinformatics ; 2016: 7357123, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27413369

RESUMO

A new hybrid Multiphase Simulated Annealing Algorithm using Boltzmann and Bose-Einstein distributions (MPSABBE) is proposed. MPSABBE was designed for solving the Protein Folding Problem (PFP) instances. This new approach has four phases: (i) Multiquenching Phase (MQP), (ii) Boltzmann Annealing Phase (BAP), (iii) Bose-Einstein Annealing Phase (BEAP), and (iv) Dynamical Equilibrium Phase (DEP). BAP and BEAP are simulated annealing searching procedures based on Boltzmann and Bose-Einstein distributions, respectively. DEP is also a simulated annealing search procedure, which is applied at the final temperature of the fourth phase, which can be seen as a second Bose-Einstein phase. MQP is a search process that ranges from extremely high to high temperatures, applying a very fast cooling process, and is not very restrictive to accept new solutions. However, BAP and BEAP range from high to low and from low to very low temperatures, respectively. They are more restrictive for accepting new solutions. DEP uses a particular heuristic to detect the stochastic equilibrium by applying a least squares method during its execution. MPSABBE parameters are tuned with an analytical method, which considers the maximal and minimal deterioration of problem instances. MPSABBE was tested with several instances of PFP, showing that the use of both distributions is better than using only the Boltzmann distribution on the classical SA.

5.
Biomed Res Int ; 2015: 542016, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26495300

RESUMO

The proper functioning of a hospital computer system is an arduous work for managers and staff. However, inconsistent policies are frequent and can produce enormous problems, such as stolen information, frequent failures, and loss of the entire or part of the hospital data. This paper presents a new method named EMRlog for computer security systems in hospitals. EMRlog is focused on two kinds of security policies: directive and implemented policies. Security policies are applied to computer systems that handle huge amounts of information such as databases, applications, and medical records. Firstly, a syntactic verification step is applied by using predicate logic. Then data mining techniques are used to detect which security policies have really been implemented by the computer systems staff. Subsequently, consistency is verified in both kinds of policies; in addition these subsets are contrasted and validated. This is performed by an automatic theorem prover. Thus, many kinds of vulnerabilities can be removed for achieving a safer computer system.


Assuntos
Algoritmos , Segurança Computacional/normas , Confidencialidade/normas , Mineração de Dados/normas , Registros Eletrônicos de Saúde/organização & administração , Fidelidade a Diretrizes/organização & administração , México , Garantia da Qualidade dos Cuidados de Saúde/métodos , Garantia da Qualidade dos Cuidados de Saúde/organização & administração
6.
Annu Int Conf IEEE Eng Med Biol Soc ; 2015: 7234-7, 2015 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-26737961

RESUMO

The severity of Guillain-Barré Syndrome (GBS) varies among subtypes, which can be mainly Acute Inflammatory Demyelinating Polyneuropathy (AIDP), Acute Motor Axonal Neuropathy (AMAN), Acute Motor Sensory Axonal Neuropathy (AMSAN) and Miller-Fisher Syndrome (MF). In this study, we use a real dataset that contains clinical, serological, and nerve conduction tests data obtained from 129 GBS patients. We apply C4.5 decision tree, SVM (Support Vector Machines) using a Gaussian kernel, and kNN (k Nearest Neighbour) to predict four GBS subtypes. Accuracies were calculated and averaged across 30 10-fold cross-validation (10-FCV) runs. C4.5 obtained 0.9211 (±0.0109), kNN 0.9179 (±0.0041), and SVM 0.9154 (±0.0069). This is an ongoing research project and further experiments are being conducted.


Assuntos
Árvores de Decisões , Previsões/métodos , Síndrome de Guillain-Barré/diagnóstico , Máquina de Vetores de Suporte , Humanos , Condução Nervosa , Exame Neurológico , Testes Sorológicos
7.
Comput Math Methods Med ; 2014: 432109, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25302074

RESUMO

Guillain-Barré syndrome (GBS) is a neurological disorder which has not been explored using clustering algorithms. Clustering algorithms perform more efficiently when they work only with relevant features. In this work, we applied correlation-based feature selection (CFS), chi-squared, information gain, symmetrical uncertainty, and consistency filter methods to select the most relevant features from a 156-feature real dataset. This dataset contains clinical, serological, and nerve conduction tests data obtained from GBS patients. The most relevant feature subsets, determined with each filter method, were used to identify four subtypes of GBS present in the dataset. We used partitions around medoids (PAM) clustering algorithm to form four clusters, corresponding to the GBS subtypes. We applied the purity of each cluster as evaluation measure. After experimentation, symmetrical uncertainty and information gain determined a feature subset of seven variables. These variables conformed as a dataset were used as input to PAM and reached a purity of 0.7984. This result leads to a first characterization of this syndrome using computational techniques.


Assuntos
Síndrome de Guillain-Barré/classificação , Síndrome de Guillain-Barré/diagnóstico , Reconhecimento Automatizado de Padrão , Algoritmos , Análise por Conglomerados , Biologia Computacional/métodos , Bases de Dados Factuais , Humanos , Condução Nervosa , Reprodutibilidade dos Testes
8.
ScientificWorldJournal ; 2014: 509429, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24977201

RESUMO

Efficient cropping requires yield estimation for each involved crop, where data-driven models are commonly applied. In recent years, some data-driven modeling technique comparisons have been made, looking for the best model to yield prediction. However, attributes are usually selected based on expertise assessment or in dimensionality reduction algorithms. A fairer comparison should include the best subset of features for each regression technique; an evaluation including several crops is preferred. This paper evaluates the most common data-driven modeling techniques applied to yield prediction, using a complete method to define the best attribute subset for each model. Multiple linear regression, stepwise linear regression, M5' regression trees, and artificial neural networks (ANN) were ranked. The models were built using real data of eight crops sowed in an irrigation module of Mexico. To validate the models, three accuracy metrics were used: the root relative square error (RRSE), relative mean absolute error (RMAE), and correlation factor (R). The results show that ANNs are more consistent in the best attribute subset composition between the learning and the training stages, obtaining the lowest average RRSE (86.04%), lowest average RMAE (8.75%), and the highest average correlation factor (0.63).


Assuntos
Irrigação Agrícola/estatística & dados numéricos , Agricultura/métodos , Produtos Agrícolas/crescimento & desenvolvimento , Modelos Lineares , Redes Neurais de Computação , Tempo (Meteorologia) , Agricultura/estatística & dados numéricos , Algoritmos , Produtos Agrícolas/classificação , Dinâmica não Linear , Análise de Regressão
9.
ScientificWorldJournal ; 2014: 364352, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24790563

RESUMO

The Chaotic Multiquenching Annealing algorithm (CMQA) is proposed. CMQA is a new algorithm, which is applied to protein folding problem (PFP). This algorithm is divided into three phases: (i) multiquenching phase (MQP), (ii) annealing phase (AP), and (iii) dynamical equilibrium phase (DEP). MQP enforces several stages of quick quenching processes that include chaotic functions. The chaotic functions can increase the exploration potential of solutions space of PFP. AP phase implements a simulated annealing algorithm (SA) with an exponential cooling function. MQP and AP are delimited by different ranges of temperatures; MQP is applied for a range of temperatures which goes from extremely high values to very high values; AP searches for solutions in a range of temperatures from high values to extremely low values. DEP phase finds the equilibrium in a dynamic way by applying least squares method. CMQA is tested with several instances of PFP.


Assuntos
Modelos Teóricos , Dobramento de Proteína , Proteínas/química , Algoritmos , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA