Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
Comput Chem Eng ; 115: 46-63, 2018 Jul 12.
Artigo em Inglês | MEDLINE | ID: mdl-30386002

RESUMO

This paper presents a novel data-driven framework for process monitoring in batch processes, a critical task in industry to attain a safe operability and minimize loss of productivity and profit. We exploit high dimensional process data with nonlinear Support Vector Machine-based feature selection algorithm, where we aim to retrieve the most informative process measurements for accurate and simultaneous fault detection and diagnosis. The proposed framework is applied to an extensive benchmark dataset which includes process data describing 22,200 batches with 15 faults. We train fault and time-specific models on the prealigned batch data trajectories via three distinct time horizon approaches: one-step rolling, two-step rolling, and evolving which varies the amount of data incorporation during modeling. The results show that two-step rolling and evolving time horizon approaches perform superior to the other. Regardless of the approach, proposed framework provides a promising decision support tool for online simultaneous fault detection and diagnosis for batch processes.

2.
Proteins ; 85(6): 1078-1098, 2017 06.
Artigo em Inglês | MEDLINE | ID: mdl-28241391

RESUMO

Protein structure refinement is the challenging problem of operating on any protein structure prediction to improve its accuracy with respect to the native structure in a blind fashion. Although many approaches have been developed and tested during the last four CASP experiments, a majority of the methods continue to degrade models rather than improve them. Princeton_TIGRESS (Khoury et al., Proteins 2014;82:794-814) was developed previously and utilizes separate sampling and selection stages involving Monte Carlo and molecular dynamics simulations and classification using an SVM predictor. The initial implementation was shown to consistently refine protein structures 76% of the time in our own internal benchmarking on CASP 7-10 targets. In this work, we improved the sampling and selection stages and tested the method in blind predictions during CASP11. We added a decomposition of physics-based and hybrid energy functions, as well as a coordinate-free representation of the protein structure through distance-binning Cα-Cα distances to capture fine-grained movements. We performed parameter estimation to optimize the adjustable SVM parameters to maximize precision while balancing sensitivity and specificity across all cross-validated data sets, finding enrichment in our ability to select models from the populations of similar decoys generated for targets in CASPs 7-10. The MD stage was enhanced such that larger structures could be further refined. Among refinement methods that are currently implemented as web-servers, Princeton_TIGRESS 2.0 demonstrated the most consistent and most substantial net refinement in blind predictions during CASP11. The enhanced refinement protocol Princeton_TIGRESS 2.0 is freely available as a web server at http://atlas.engr.tamu.edu/refinement/. Proteins 2017; 85:1078-1098. © 2017 Wiley Periodicals, Inc.


Assuntos
Modelos Estatísticos , Simulação de Dinâmica Molecular , Proteínas/química , Software , Máquina de Vetores de Suporte , Benchmarking , Biologia Computacional/métodos , Método Duplo-Cego , Internet , Método de Monte Carlo , Conformação Proteica
3.
Expert Rev Proteomics ; 11(1): 31-41, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24308552

RESUMO

Periodontitis is a common chronic and destructive disease whose pathogenetic mechanisms remain unclear. Due to their sensitivity and global scale, proteomics studies offer the opportunity to uncover critical host and pathogen activity indicators and can elucidate clinically applicable biomarkers for improved diagnosis and treatment of the disease. This review summarizes the literature of proteomics studies on periodontitis and comprehensively discusses commonly found candidate biomarkers. Key considerations in the design of an experimental proteomics platform are also outlined. The applicability of protein biomarkers across the progression of periodontitis and unexplored areas of research are highlighted.


Assuntos
Periodontite/diagnóstico , Proteoma/metabolismo , Animais , Biomarcadores/metabolismo , Humanos , Periodontite/metabolismo , Proteômica
4.
J Clin Periodontol ; 40(2): 131-9, 2013 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23190455

RESUMO

AIM: To identify optimal combination(s) of proteomic based biomarkers in gingival crevicular fluid (GCF) samples from chronic periodontitis (CP) and periodontally healthy individuals and validate the predictions through known and blind test sets. MATERIALS AND METHODS: GCF samples were collected from 96 CP and periodontally healthy subjects and analysed using high-performance liquid chromatography, tandem mass spectrometry and the PILOT_PROTEIN algorithm. A mixed-integer linear optimization (MILP) model was then developed to identify the optimal combination of biomarkers which could clearly distinguish a blind subject sample as healthy or diseased. RESULTS: A thorough cross-validation of the MILP model capability was performed on a training set of 55 samples and greater than 99% accuracy was consistently achieved when annotating the testing set samples as healthy or diseased. The model was then trained on all 55 samples and tested on two different blind test sets, and using an optimal combination of 7 human proteins and 3 bacterial proteins, the model was able to correctly predict 40 out of 41 healthy and diseased samples. CONCLUSIONS: The proposed large-scale proteomic analysis and MILP model led to the identification of novel combinations of biomarkers for consistent diagnosis of periodontal status with greater than 95% predictive accuracy.


Assuntos
Periodontite Crônica/metabolismo , Líquido do Sulco Gengival/química , Muramidase/análise , Proteômica/métodos , beta-Defensinas/análise , Adulto , Algoritmos , Proteínas de Bactérias/análise , Biomarcadores/análise , Estudos de Casos e Controles , Cromatografia Líquida de Alta Pressão , Periodontite Crônica/diagnóstico , Proteína 3 de Resposta de Crescimento Precoce/análise , Feminino , Humanos , Modelos Lineares , Masculino , Pessoa de Meia-Idade , Modelos Teóricos , Sensibilidade e Especificidade , Espectrometria de Massas em Tandem
5.
Int Symp Process Syst Eng ; 44: 2077-2082, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30534633

RESUMO

Rapid detection and identification of process faults in industrial applications is crucial to sustain a safe and profitable operation. Today, the advances in sensor technologies have facilitated large amounts of chemical process data collection in real time which subsequently broadened the use of data-driven process monitoring techniques via machine learning and multivariate statistical analysis. One of the well-known machine learning techniques is Support Vector Machines (SVM) which allows the use of high dimensional feature sets for learning problems such as classification and regression. In this paper, we present the application of a novel nonlinear (kernel-dependent) SVM-based feature selection algorithm to process monitoring and fault detection of continuous processes. The developed methodology is derived from sensitivity analysis of the dual SVM objective and utilizes existing and novel greedy algorithms to rank features that also guides fault diagnosis. Specifically, we train fault-specific two-class SVM models to detect faulty operations, while using the feature selection algorithm to improve the accuracy of the fault detection models and perform fault diagnosis. We present results for the Tennessee Eastman process as a case study and compare our approach to existing approaches for fault detection, diagnosis and identification.

6.
PLoS One ; 11(2): e0148974, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26859389

RESUMO

HIV-1 entry into host cells is mediated by interactions between the V3-loop of viral glycoprotein gp120 and chemokine receptor CCR5 or CXCR4, collectively known as HIV-1 coreceptors. Accurate genotypic prediction of coreceptor usage is of significant clinical interest and determination of the factors driving tropism has been the focus of extensive study. We have developed a method based on nonlinear support vector machines to elucidate the interacting residue pairs driving coreceptor usage and provide highly accurate coreceptor usage predictions. Our models utilize centroid-centroid interaction energies from computationally derived structures of the V3-loop:coreceptor complexes as primary features, while additional features based on established rules regarding V3-loop sequences are also investigated. We tested our method on 2455 V3-loop sequences of various lengths and subtypes, and produce a median area under the receiver operator curve of 0.977 based on 500 runs of 10-fold cross validation. Our study is the first to elucidate a small set of specific interacting residue pairs between the V3-loop and coreceptors capable of predicting coreceptor usage with high accuracy across major HIV-1 subtypes. The developed method has been implemented as a web tool named CRUSH, CoReceptor USage prediction for HIV-1, which is available at http://ares.tamu.edu/CRUSH/.


Assuntos
HIV-1/fisiologia , Receptores CCR5/fisiologia , Receptores CXCR4/fisiologia , Metabolismo Energético , Proteína gp120 do Envelope de HIV/fisiologia , HIV-1/crescimento & desenvolvimento , Humanos , Modelos Biológicos , Domínios e Motivos de Interação entre Proteínas/fisiologia , Relação Estrutura-Atividade , Tropismo
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa