Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
BMC Bioinformatics ; 18(1): 453, 2017 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-29029625

RESUMO

BACKGROUND: Researchers have previously developed a multitude of methods designed to identify biological pathways associated with specific clinical or experimental conditions of interest, with the aim of facilitating biological interpretation of high-throughput data. Before practically applying such pathway analysis (PA) methods, we must first evaluate their performance and reliability, using datasets where the pathways perturbed by the conditions of interest have been well characterized in advance. However, such 'ground truths' (or gold standards) are often unavailable. Furthermore, previous evaluation strategies that have focused on defining 'true answers' are unable to systematically and objectively assess PA methods under a wide range of conditions. RESULTS: In this work, we propose a novel strategy for evaluating PA methods independently of any gold standard, either established or assumed. The strategy involves the use of two mutually complementary metrics, recall and discrimination. Recall measures the consistency of the perturbed pathways identified by applying a particular analysis method to an original large dataset and those identified by the same method to a sub-dataset of the original dataset. In contrast, discrimination measures specificity-the degree to which the perturbed pathways identified by a particular method to a dataset from one experiment differ from those identifying by the same method to a dataset from a different experiment. We used these metrics and 24 datasets to evaluate six widely used PA methods. The results highlighted the common challenge in reliably identifying significant pathways from small datasets. Importantly, we confirmed the effectiveness of our proposed dual-metric strategy by showing that previous comparative studies corroborate the performance evaluations of the six methods obtained by our strategy. CONCLUSIONS: Unlike any previously proposed strategy for evaluating the performance of PA methods, our dual-metric strategy does not rely on any ground truth, either established or assumed, of the pathways perturbed by a specific clinical or experimental condition. As such, our strategy allows researchers to systematically and objectively evaluate pathway analysis methods by employing any number of datasets for a variety of conditions.


Assuntos
Transdução de Sinais , Bases de Dados Genéticas , Doença/genética , Expressão Gênica , Humanos , Reprodutibilidade dos Testes
2.
Emerg Infect Dis ; 23(8): 1316-1324, 2017 08.
Artigo em Inglês | MEDLINE | ID: mdl-28726603

RESUMO

The Ebola virus (EBOV) outbreak in West Africa during 2013-2016 demonstrated the need to improve Ebola virus disease (EVD) diagnostics and standards of care. This retrospective study compared laboratory values and clinical features of 3 nonhuman primate models of lethal EVD to assess associations with improved survival time. In addition, the study identified laboratory values useful as predictors of survival, surrogates for EBOV viral loads, and triggers for initiation of therapeutic interventions in these nonhuman primate models. Furthermore, the data support that, in nonhuman primates, the Makona strain of EBOV may be less virulent than the Kikwit strain of EBOV. The applicability of these findings as potential diagnostic and management tools for EVD in humans warrants further investigation.


Assuntos
Ebolavirus , Doença pelo Vírus Ebola/epidemiologia , Doença pelo Vírus Ebola/virologia , Animais , Biomarcadores , Doença pelo Vírus Ebola/mortalidade , Doença pelo Vírus Ebola/transmissão , Humanos , Estimativa de Kaplan-Meier , Primatas , RNA Viral , Curva ROC , Estudos Retrospectivos , Carga Viral
3.
BMC Genomics ; 17: 695, 2016 08 30.
Artigo em Inglês | MEDLINE | ID: mdl-27576376

RESUMO

BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods considering each SNP pair separately have provided insights, a large portion of expected heritability may reside in higher-order interaction effects. RESULTS: We describe an inference approach (discrete discriminant analysis; DDA) designed to probe collective interactions while treating both genotypes and phenotypes as random variables. The genotype distributions in case and control groups are modeled separately based on empirical allele frequency and covariance data, whose differences yield disease risk parameters. We compared pairwise tests and collective inference methods, the latter based both on DDA and logistic regression. Analyses using simulated data demonstrated that significantly higher sensitivity and specificity can be achieved with collective inference in comparison to pairwise tests, and with DDA in comparison to logistic regression. Using age-related macular degeneration (AMD) data, we demonstrated two possible applications of DDA. In the first application, a genome-wide SNP set is reduced into a small number (∼100) of variants via filtering and SNP pairs with significant interactions are identified. We found that interactions between SNPs with highest AMD association were epigenetically active in the liver, adipocytes, and mesenchymal stem cells. In the other application, multiple groups of SNPs were formed from the genome-wide data and their relative strengths of association were compared using cross-validation. This analysis allowed us to discover novel collections of loci for which interactions between SNPs play significant roles in their disease association. In particular, we considered pathway-based groups of SNPs containing up to ∼10, 000 variants in each group. In addition to pathways related to complement activation, our collective inference pointed to pathway groups involved in phospholipid synthesis, oxidative stress, and apoptosis, consistent with the AMD pathogenesis mechanism where the dysfunction of retinal pigment epithelium cells plays central roles. CONCLUSIONS: The simultaneous inference of collective interaction effects within a set of SNPs has the potential to reveal novel aspects of disease association.


Assuntos
Epistasia Genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Degeneração Macular/genética , Frequência do Gene , Genótipo , Humanos , Aprendizado de Máquina , Degeneração Macular/patologia , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Fatores de Risco
4.
BMC Genomics ; 16: 1106, 2015 Dec 29.
Artigo em Inglês | MEDLINE | ID: mdl-26714771

RESUMO

BACKGROUND: Francisella tularensis is a select bio-threat agent and one of the most virulent intracellular pathogens known, requiring just a few organisms to establish an infection. Although several virulence factors are known, we lack an understanding of virulence factors that act through host-pathogen protein interactions to promote infection. To address these issues in the highly infectious F. tularensis subsp. tularensis Schu S4 strain, we deployed a combined in silico, in vitro, and in vivo analysis to identify virulence factors and their interactions with host proteins to characterize bacterial infection mechanisms. RESULTS: We initially used comparative genomics and literature to identify and select a set of 49 putative and known virulence factors for analysis. Each protein was then subjected to proteome-scale yeast two-hybrid (Y2H) screens with human and murine cDNA libraries to identify potential host-pathogen protein-protein interactions. Based on the bacterial protein interaction profile with both hosts, we selected seven novel putative virulence factors for mutant construction and animal validation experiments. We were able to create five transposon insertion mutants and used them in an intranasal BALB/c mouse challenge model to establish 50 % lethal dose estimates. Three of these, ΔFTT0482c, ΔFTT1538c, and ΔFTT1597, showed attenuation in lethality and can thus be considered novel F. tularensis virulence factors. The analysis of the accompanying Y2H data identified intracellular protein trafficking between the early endosome to the late endosome as an important component in virulence attenuation for these virulence factors. Furthermore, we also used the Y2H data to investigate host protein binding of two known virulence factors, showing that direct protein binding was a component in the modulation of the inflammatory response via activation of mitogen-activated protein kinases and in the oxidative stress response. CONCLUSIONS: Direct interactions with specific host proteins and the ability to influence interactions among host proteins are important components for F. tularensis to avoid host-cell defense mechanisms and successfully establish an infection. Although direct host-pathogen protein-protein binding is only one aspect of Francisella virulence, it is a critical component in directly manipulating and interfering with cellular processes in the host cell.


Assuntos
Francisella tularensis/patogenicidade , Interações Hospedeiro-Patógeno/genética , Fatores de Virulência/metabolismo , Animais , Feminino , Francisella tularensis/genética , Camundongos , Camundongos Endogâmicos BALB C , Ligação Proteica/genética , Ligação Proteica/fisiologia , Virulência/genética , Fatores de Virulência/genética
5.
J Neurosci Res ; 93(2): 199-214, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25399920

RESUMO

The multifactorial nature of traumatic brain injury (TBI), especially the complex secondary tissue injury involving intertwined networks of molecular pathways that mediate cellular behavior, has confounded attempts to elucidate the pathology underlying the progression of TBI. Here, systems biology strategies are exploited to identify novel molecular mechanisms and protein indicators of brain injury. To this end, we performed a meta-analysis of four distinct high-throughput gene expression studies involving different animal models of TBI. By using canonical pathways and a large human protein-interaction network as a scaffold, we separately overlaid the gene expression data from each study to identify molecular signatures that were conserved across the different studies. At 24 hr after injury, the significantly activated molecular signatures were nonspecific to TBI, whereas the significantly suppressed molecular signatures were specific to the nervous system. In particular, we identified a suppressed subnetwork consisting of 58 highly interacting, coregulated proteins associated with synaptic function. We selected three proteins from this subnetwork, postsynaptic density protein 95, nitric oxide synthase 1, and disrupted in schizophrenia 1, and hypothesized that their abundance would be significantly reduced after TBI. In a penetrating ballistic-like brain injury rat model of severe TBI, Western blot analysis confirmed our hypothesis. In addition, our analysis recovered 12 previously identified protein biomarkers of TBI. The results suggest that systems biology may provide an efficient, high-yield approach to generate testable hypotheses that can be experimentally validated to identify novel mechanisms of action and molecular indicators of TBI.


Assuntos
Biomarcadores/metabolismo , Lesões Encefálicas , Regulação da Expressão Gênica/fisiologia , Biologia de Sistemas/métodos , Animais , Lesões Encefálicas/diagnóstico , Lesões Encefálicas/genética , Lesões Encefálicas/metabolismo , Modelos Animais de Doenças , Proteína 4 Homóloga a Disks-Large , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Masculino , Proteínas de Membrana/metabolismo , Proteínas do Tecido Nervoso/metabolismo , Óxido Nítrico Sintase Tipo I/metabolismo , Ratos , Ratos Sprague-Dawley
6.
Mol Cell Proteomics ; 12(11): 3036-51, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-23800426

RESUMO

Burkholderia mallei is an infectious intracellular pathogen whose virulence and resistance to antibiotics makes it a potential bioterrorism agent. Given its genetic origin as a commensal soil organism, it is equipped with an extensive and varied set of adapted mechanisms to cope with and modulate host-cell environments. One essential virulence mechanism constitutes the specialized secretion systems that are designed to penetrate host-cell membranes and insert pathogen proteins directly into the host cell's cytosol. However, the secretion systems' proteins and, in particular, their host targets are largely uncharacterized. Here, we used a combined in silico, in vitro, and in vivo approach to identify B. mallei proteins required for pathogenicity. We used bioinformatics tools, including orthology detection and ab initio predictions of secretion system proteins, as well as published experimental Burkholderia data to initially select a small number of proteins as putative virulence factors. We then used yeast two-hybrid assays against normalized whole human and whole murine proteome libraries to detect and identify interactions among each of these bacterial proteins and host proteins. Analysis of such interactions provided both verification of known virulence factors and identification of three new putative virulence proteins. We successfully created insertion mutants for each of these three proteins using the virulent B. mallei ATCC 23344 strain. We exposed BALB/c mice to mutant strains and the wild-type strain in an aerosol challenge model using lethal B. mallei doses. In each set of experiments, mice exposed to mutant strains survived for the 21-day duration of the experiment, whereas mice exposed to the wild-type strain rapidly died. Given their in vivo role in pathogenicity, and based on the yeast two-hybrid interaction data, these results point to the importance of these pathogen proteins in modulating host ubiquitination pathways, phagosomal escape, and actin-cytoskeleton rearrangement processes.


Assuntos
Burkholderia mallei/metabolismo , Burkholderia mallei/patogenicidade , Interações Hospedeiro-Patógeno/fisiologia , Fatores de Virulência/metabolismo , Animais , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Burkholderia mallei/genética , Feminino , Interações Hospedeiro-Patógeno/genética , Humanos , Camundongos , Camundongos Endogâmicos BALB C , Mutagênese Insercional , Mapeamento de Interação de Proteínas , Proteômica , Técnicas do Sistema de Duplo-Híbrido , Virulência/genética , Virulência/fisiologia , Fatores de Virulência/genética
7.
BMC Physiol ; 14: 14, 2014 Dec 24.
Artigo em Inglês | MEDLINE | ID: mdl-25623799

RESUMO

BACKGROUND: Heat illness is a debilitating and potentially life-threatening condition. Limited data are available to identify individuals with heat illness at greatest risk for organ damage. We recently described the transcriptomic and proteomic responses to heat injury and recovery in multiple organs in an in vivo model of conscious rats heated to a maximum core temperature of 41.8°C (Tc,Max). In this study, we examined changes in plasma metabolic networks at Tc,Max, 24, or 48 hours after the heat stress stimulus. RESULTS: Circulating metabolites were identified by gas chromatography/mass spectrometry and liquid chromatography/tandem mass spectrometry. Bioinformatics analysis of the metabolomic data corroborated proteomics and transcriptomics data in the tissue at the pathway level, supporting modulations in metabolic networks including cell death or catabolism (pyrimidine and purine degradation, acetylation, sulfation, redox alterations and glutathione metabolism, and the urea cycle/creatinine metabolism), energetics (stasis in glycolysis and tricarboxylic acid cycle, ß-oxidation), cholesterol and nitric oxide metabolism, and bile acids. Hierarchical clustering identified 15 biochemicals that differentiated animals with histopathological evidence of cardiac injury at 48 hours from uninjured animals. The metabolic networks perturbed in the plasma corroborated the tissue proteomics and transcriptomics pathway data, supporting a model of irreversible cell death and decrements in energetics as key indicators of cardiac damage in response to heat stress. CONCLUSIONS: Integrating plasma metabolomics with tissue proteomics and transcriptomics supports a diagnostic approach to assessing individual susceptibility to organ injury and predicting recovery after heat stress.


Assuntos
Regulação da Temperatura Corporal , Exaustão por Calor/sangue , Resposta ao Choque Térmico , Animais , Biomarcadores/sangue , Morte Celular , Traumatismos Cardíacos/metabolismo , Exaustão por Calor/patologia , Rim/lesões , Rim/metabolismo , Fígado/lesões , Fígado/metabolismo , Lesão Pulmonar/metabolismo , Masculino , Metabolômica , Estresse Oxidativo , Ratos , Ratos Endogâmicos F344 , Espécies Reativas de Oxigênio/sangue
8.
Nucleic Acids Res ; 39(13): e88, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21572104

RESUMO

The unparalleled growth in the availability of genomic data offers both a challenge to develop orthology detection methods that are simultaneously accurate and high throughput and an opportunity to improve orthology detection by leveraging evolutionary evidence in the accumulated sequenced genomes. Here, we report a novel orthology detection method, termed QuartetS, that exploits evolutionary evidence in a computationally efficient manner. Based on the well-established evolutionary concept that gene duplication events can be used to discriminate homologous genes, QuartetS uses an approximate phylogenetic analysis of quartet gene trees to infer the occurrence of duplication events and discriminate paralogous from orthologous genes. We used function- and phylogeny-based metrics to perform a large-scale, systematic comparison of the orthology predictions of QuartetS with those of four other methods [bi-directional best hit (BBH), outgroup, OMA and QuartetS-C (QuartetS followed by clustering)], involving 624 bacterial genomes and >2 million genes. We found that QuartetS slightly, but consistently, outperformed the highly specific OMA method and that, while consuming only 0.5% additional computational time, QuartetS predicted 50% more orthologs with a 50% lower false positive rate than the widely used BBH method. We conclude that, for large-scale phylogenetic and functional analysis, QuartetS and QuartetS-C should be preferred, respectively, in applications where high accuracy and high throughput are required.


Assuntos
Algoritmos , Genes , Filogenia , Duplicação Gênica , Genoma Bacteriano , Genômica/métodos , Alinhamento de Sequência
9.
Shock ; 60(2): 199-205, 2023 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-37335312

RESUMO

ABSTRACT: Background: Hemorrhage remains the leading cause of death on the battlefield. This study aims to assess the ability of an artificial intelligence triage algorithm to automatically analyze vital-sign data and stratify hemorrhage risk in trauma patients. Methods: Here, we developed the APPRAISE-Hemorrhage Risk Index (HRI) algorithm, which uses three routinely measured vital signs (heart rate and diastolic and systolic blood pressures) to identify trauma patients at greatest risk of hemorrhage. The algorithm preprocesses the vital signs to discard unreliable data, analyzes reliable data using an artificial intelligence-based linear regression model, and stratifies hemorrhage risk into low (HRI:I), average (HRI:II), and high (HRI:III). Results: To train and test the algorithm, we used 540 h of continuous vital-sign data collected from 1,659 trauma patients in prehospital and hospital (i.e., emergency department) settings. We defined hemorrhage cases (n = 198) as those patients who received ≥1 unit of packed red blood cells within 24 h of hospital admission and had documented hemorrhagic injuries. The APPRAISE-HRI stratification yielded a hemorrhage likelihood ratio (95% confidence interval) of 0.28 (0.13-0.43) for HRI:I, 1.00 (0.85-1.15) for HRI:II, and 5.75 (3.57-7.93) for HRI:III, suggesting that patients categorized in the low-risk (high-risk) category were at least 3-fold less (more) likely to have hemorrhage than those in the average trauma population. We obtained similar results in a cross-validation analysis. Conclusions: The APPRAISE-HRI algorithm provides a new capability to evaluate routine vital signs and alert medics to specific casualties who have the highest risk of hemorrhage, to optimize decision-making for triage, treatment, and evacuation.


Assuntos
Inteligência Artificial , Triagem , Humanos , Triagem/métodos , Hemorragia/diagnóstico , Hemorragia/terapia , Algoritmos , Serviço Hospitalar de Emergência
10.
BMC Bioinformatics ; 13: 143, 2012 Jun 22.
Artigo em Inglês | MEDLINE | ID: mdl-22726705

RESUMO

BACKGROUND: The concept of orthology is key to decoding evolutionary relationships among genes across different species using comparative genomics. QuartetS is a recently reported algorithm for large-scale orthology detection. Based on the well-established evolutionary principle that gene duplication events discriminate paralogous from orthologous genes, QuartetS has been shown to improve orthology detection accuracy while maintaining computational efficiency. DESCRIPTION: QuartetS-DB is a new orthology database constructed using the QuartetS algorithm. The database provides orthology predictions among 1621 complete genomes (1365 bacterial, 92 archaeal, and 164 eukaryotic), covering more than seven million proteins and four million pairwise orthologs. It is a major source of orthologous groups, containing more than 300,000 groups of orthologous proteins and 236,000 corresponding gene trees. The database also provides over 500,000 groups of inparalogs. In addition to its size, a distinguishing feature of QuartetS-DB is the ability to allow users to select a cutoff value that modulates the balance between prediction accuracy and coverage of the retrieved pairwise orthologs. The database is accessible at https://applications.bioanalysis.org/quartetsdb. CONCLUSIONS: QuartetS-DB is one of the largest orthology resources available to date. Because its orthology predictions are underpinned by evolutionary evidence obtained from sequenced genomes, we expect its accuracy to continue to increase in future releases as the genomes of additional species are sequenced.


Assuntos
Algoritmos , Biologia Computacional/métodos , Bases de Dados Genéticas , Duplicação Gênica , Archaea/genética , Bactérias/genética , Evolução Biológica , Eucariotos/genética , Genômica/métodos , Proteínas/genética
11.
IEEE Trans Biomed Eng ; 69(6): 2119-2129, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-34941497

RESUMO

OBJECTIVE: Observational studies on the use of commercially available wearable devices for infection detection lack the rigor of controlled clinical studies, where time of exposure and onset of infection are exactly known. Towards that end, we carried out a feasibility study using a commercial smartwatch for monitoring heart rate, skin temperature, and body acceleration on subjects as they underwent a controlled human malaria infection (CHMI) challenge. METHODS: Ten subjects underwent CHMI and were asked to wear the smartwatch for at least 12 hours/day from 2 weeks pre-challenge to 4 weeks post-challenge. Using these data, we developed 2B-Healthy, a Bayesian-based infection-prediction algorithm that estimates a probability of infection. We also collected data from eight control subjects for 4 weeks to assess the false-positive rate of 2B-Healthy. RESULTS: Nine of 10 CHMI subjects developed parasitemia, with an average time to parasitemia of 12 days. 2B-Healthy detected infection in seven of nine subjects (78% sensitivity), where in six subjects it detected infection 6 days before parasitemia (on average). In the eight control subjects, we obtained a false-positive rate of 6%/week. CONCLUSION: The 2B-Healthy algorithm was able to reliably detect infection prior to the onset of symptoms using data collected from a commercial smartwatch in a controlled human infection study. SIGNIFICANCE: Our findings demonstrate the feasibility of wearables as a screening tool to provide early warning of infection and support further research on the use of the 2B-Healthy algorithm as the basis for a wearable infection-detection platform.


Assuntos
Malária Falciparum , Malária , Dispositivos Eletrônicos Vestíveis , Teorema de Bayes , Humanos , Malária/diagnóstico , Malária Falciparum/prevenção & controle , Parasitemia , Plasmodium falciparum
13.
Appl Radiat Isot ; 173: 109714, 2021 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-33892251

RESUMO

The production capacity of 89Sr and 90Sr in the 2 MW MSR are evaluated. The gaseous 89Kr and 90Kr are extracted from the core through the helium bubbling system, and then decay to 89Sr and 90Sr, respectively. In order to improve purity of 89Sr product, two cooling devices are adopted in the 89Sr and 90Sr production system. The annual yields of 89Sr and 90Sr are about 9000 Ci and 32 Ci, respectively, and the impurity of 89Sr product is less than 2 ppm which can meet the medical requirement.

14.
Appl Radiat Isot ; 166: 109350, 2020 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-32795697

RESUMO

Four I-131 production methods including irradiated TeO2 target and uranium target in the irradiation channel, batch-wise extracted iodine from the fuel salt, and online extracted solid tellurium through the by-pass loop system have been assessed in a 2 MW molten salt reactor. The latter method can produce a large annual yield of I-131 (about 155,000 Ci). The radioactivity shielding demand of the latter method is much smaller than the other I-131 production methods under the identical annual yield of I-131.

15.
Appl Radiat Isot ; 160: 109134, 2020 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-32351226

RESUMO

The 99Mo production in a 2 MW molten salt reactor using liquid low-enriched uranium (LEU) fuel has been evaluated. The batch-wise extraction period of 99Mo is optimized to be one day corresponding to 9415 6-day Ci/week of the 99Mo production rate. The required amount of uranium is only 4.77 kg annually. The required chemically reprocessed amount of FPs is about 58.4 g annually, accounting for only 4.9% of the solid LEU target method under the identical production capacity of 99Mo.

16.
Proteins ; 74(2): 449-60, 2009 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-18636476

RESUMO

In this article, we present a new method termed CatFam (Catalytic Families) to automatically infer the functions of catalytic proteins, which account for 20-40% of all proteins in living organisms and play a critical role in a variety of biological processes. CatFam is a sequence-based method that generates sequence profiles to represent and infer protein catalytic functions. CatFam generates profiles through a stepwise procedure that carefully controls profile quality and employs nonenzymes as negative samples to establish profile-specific thresholds associated with a predefined nominal false-positive rate (FPR) of predictions. The adjustable FPR allows for fine precision control of each profile and enables the generation of profile databases that meet different needs: function annotation with high precision and hypothesis generation with moderate precision but better recall. Multiple tests of CatFam databases (generated with distinct nominal FPRs) against enzyme and nonenzyme datasets show that the method's predictions have consistently high precision and recall. For example, a 1% FPR database predicts protein catalytic functions for a dataset of enzymes and nonenzymes with 98.6% precision and 95.0% recall. Comparisons of CatFam databases against other established profile-based methods for the functional annotation of 13 bacterial genomes indicate that CatFam consistently achieves higher precision and (in most cases) higher recall, and that (on average) CatFam provides 21.9% additional catalytic functions not inferred by the other similarly reliable methods. These results strongly suggest that the proposed method provides a valuable contribution to the automated prediction of protein catalytic functions. The CatFam databases and the database search program are freely available at http://www.bhsai.org/downloads/catfam.tar.gz.


Assuntos
Algoritmos , Bases de Dados de Proteínas , Análise de Sequência de Proteína/métodos , Animais , Catálise , Análise por Conglomerados , Enzimas/genética , Enzimas/metabolismo , Genoma , Humanos , Redes e Vias Metabólicas , Estrutura Terciária de Proteína , Proteínas/genética , Proteínas/metabolismo , Reprodutibilidade dos Testes , Relação Estrutura-Atividade
17.
BMC Bioinformatics ; 9: 52, 2008 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-18221520

RESUMO

BACKGROUND: Automated protein function prediction methods are needed to keep pace with high-throughput sequencing. With the existence of many programs and databases for inferring different protein functions, a pipeline that properly integrates these resources will benefit from the advantages of each method. However, integrated systems usually do not provide mechanisms to generate customized databases to predict particular protein functions. Here, we describe a tool termed PIPA (Pipeline for Protein Annotation) that has these capabilities. RESULTS: PIPA annotates protein functions by combining the results of multiple programs and databases, such as InterPro and the Conserved Domains Database, into common Gene Ontology (GO) terms. The major algorithms implemented in PIPA are: (1) a profile database generation algorithm, which generates customized profile databases to predict particular protein functions, (2) an automated ontology mapping generation algorithm, which maps various classification schemes into GO, and (3) a consensus algorithm to reconcile annotations from the integrated programs and databases.PIPA's profile generation algorithm is employed to construct the enzyme profile database CatFam, which predicts catalytic functions described by Enzyme Commission (EC) numbers. Validation tests show that CatFam yields average recall and precision larger than 95.0%. CatFam is integrated with PIPA. We use an association rule mining algorithm to automatically generate mappings between terms of two ontologies from annotated sample proteins. Incorporating the ontologies' hierarchical topology into the algorithm increases the number of generated mappings. In particular, it generates 40.0% additional mappings from the Clusters of Orthologous Groups (COG) to EC numbers and a six-fold increase in mappings from COG to GO terms. The mappings to EC numbers show a very high precision (99.8%) and recall (96.6%), while the mappings to GO terms show moderate precision (80.0%) and low recall (33.0%). Our consensus algorithm for GO annotation is based on the computation and propagation of likelihood scores associated with GO terms. The test results suggest that, for a given recall, the application of the consensus algorithm yields higher precision than when consensus is not used. CONCLUSION: The algorithms implemented in PIPA provide automated genome-wide protein function annotation based on reconciled predictions from multiple resources.


Assuntos
Algoritmos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Reconhecimento Automatizado de Padrão/métodos , Proteínas/genética , Proteínas/fisiologia , Proteômica/métodos , Sequência de Aminoácidos , Relação Estrutura-Atividade
18.
PLoS One ; 12(1): e0169918, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28081217

RESUMO

Autoimmune diseases occur when immune cells fail to develop or lose their tolerance toward self and destroy body's own tissues. Both insufficient negative selection of self-reactive T cells and impaired development of regulatory T cells preventing effector cell activation are believed to contribute to autoimmunity. Genetic predispositions center around the major histocompatibility complex (MHC) class II loci involved in antigen presentation, the key determinant of CD4+ T cell activation. Recent studies suggested that variants in the MHC region also exhibit significant non-additive interaction effects. However, collective interactions involving large numbers of single nucleotide polymorphisms (SNPs) contributing to such effects are yet to be characterized. In addition, relatively little is known about the cell-type-specificity of such interactions in the context of cellular pathways. Here, we analyzed type 1 diabetes (T1D) and rheumatoid arthritis (RA) genome-wide association data sets via large-scale, high-performance computations and inferred collective interaction effects involving MHC SNPs using the discrete discriminant analysis. Despite considerable differences in the details of SNP interactions in T1D and RA data, the enrichment pattern of interacting pairs in reference epigenomes was remarkably similar: statistically significant interactions were epigenetically active in cell-type combinations connecting B cells to T cells and intestinal epithelial cells, with both helper and regulatory T cells showing strong disease-associated interactions with B cells. Our results provide direct genetic evidence pointing to the important roles B cells play as antigen-presenting cells toward CD4+ T cells in the context of central and peripheral tolerance. In addition, they are consistent with recent experimental studies suggesting that the repertoire of B cell-specific self-antigens in the thymus are critical to the effective control of corresponding autoimmune activation in peripheral tissues.


Assuntos
Células Apresentadoras de Antígenos/metabolismo , Doenças Autoimunes/genética , Estudo de Associação Genômica Ampla , Células Apresentadoras de Antígenos/citologia , Células Apresentadoras de Antígenos/imunologia , Área Sob a Curva , Artrite Reumatoide/genética , Artrite Reumatoide/patologia , Doenças Autoimunes/patologia , Linfócitos B/citologia , Linfócitos B/imunologia , Linfócitos B/metabolismo , Linfócitos T CD4-Positivos/citologia , Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD4-Positivos/metabolismo , Diabetes Mellitus Tipo 1/genética , Diabetes Mellitus Tipo 1/patologia , Epigenômica , Redes Reguladoras de Genes , Antígenos de Histocompatibilidade Classe II/genética , Humanos , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Curva ROC , Linfócitos T Auxiliares-Indutores/citologia , Linfócitos T Auxiliares-Indutores/imunologia , Linfócitos T Auxiliares-Indutores/metabolismo , Linfócitos T Reguladores/citologia , Linfócitos T Reguladores/imunologia , Linfócitos T Reguladores/metabolismo , Timo/citologia , Timo/metabolismo
19.
J Am Med Inform Assoc ; 13(3): 309-20, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16501184

RESUMO

OBJECTIVE: The development and application of data-driven decision-support systems for medical triage, diagnostics, and prognostics pose special requirements on physiologic data. In particular, that data are reliable in order to produce meaningful results. The authors describe a method that automatically estimates the reliability of reference heart rates (HRr) derived from electrocardiogram (ECG) waveforms and photoplethysmogram (PPG) waveforms recorded by vital-signs monitors. The reliability is quantitatively expressed through a quality index (QI) for each HRr. DESIGN: The proposed method estimates the reliability of heart rates from vital-signs monitors by (1) assessing the quality of the ECG and PPG waveforms, (2) separately computing heart rates from these waveforms, and (3) concisely combining this information into a QI that considers the physical redundancy of the signal sources and independence of heart rate calculations. The assessment of the waveforms is performed by a Support Vector Machine classifier and the independent computation of heart rate from the waveforms is performed by an adaptive peak identification technique, termed ADAPIT, which is designed to filter out motion-induced noise. RESULTS: The authors evaluated the method against 158 randomly selected data samples of trauma patients collected during helicopter transport, each sample consisting of 7-second ECG and PPG waveform segments and their associated HRr. They compared the results of the algorithm against manual analysis performed by human experts and found that in 92% of the cases, the algorithm either matches or is more conservative than the human's QI qualification. In the remaining 8% of the cases, the algorithm infers a less conservative QI, though in most cases this was because of algorithm/human disagreement over ambiguous waveform quality. If these ambiguous waveforms were relabeled, the misclassification rate would drop from 8% to 3%. CONCLUSION: This method provides a robust approach for automatically assessing the reliability of large quantities of heart rate data and the waveforms from which they are derived.


Assuntos
Algoritmos , Eletrocardiografia/métodos , Frequência Cardíaca , Fotopletismografia/métodos , Processamento de Sinais Assistido por Computador , Humanos , Reprodutibilidade dos Testes
20.
J Neurotrauma ; 30(13): 1101-16, 2013 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-23510232

RESUMO

The rate of traumatic brain injury (TBI) in service members with wartime injuries has risen rapidly in recent years, and complex, variable links have emerged between TBI and long-term neurological disorders. The multifactorial nature of TBI secondary cellular response has confounded attempts to find cellular biomarkers for its diagnosis and prognosis or for guiding therapy for brain injury. One possibility is to apply emerging systems biology strategies to holistically probe and analyze the complex interweaving molecular pathways and networks that mediate the secondary cellular response through computational models that integrate these diverse data sets. Here, we review available systems biology strategies, databases, and tools. In addition, we describe opportunities for applying this methodology to existing TBI data sets to identify new biomarker candidates and gain insights about the underlying molecular mechanisms of TBI response. As an exemplar, we apply network and pathway analysis to a manually compiled list of 32 protein biomarker candidates from the literature, recover known TBI-related mechanisms, and generate hypothetical new biomarker candidates.


Assuntos
Biomarcadores/metabolismo , Lesões Encefálicas/diagnóstico , Lesões Encefálicas/metabolismo , Biologia de Sistemas/métodos , Lesões Encefálicas/complicações , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA