Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
1.
N Engl J Med ; 390(2): 118-131, 2024 01 11.
Artículo en Inglés | MEDLINE | ID: mdl-38197815

RESUMEN

BACKGROUND: The early-generation ROS1 tyrosine kinase inhibitors (TKIs) that are approved for the treatment of ROS1 fusion-positive non-small-cell lung cancer (NSCLC) have antitumor activity, but resistance develops in tumors, and intracranial activity is suboptimal. Repotrectinib is a next-generation ROS1 TKI with preclinical activity against ROS1 fusion-positive cancers, including those with resistance mutations such as ROS1 G2032R. METHODS: In this registrational phase 1-2 trial, we assessed the efficacy and safety of repotrectinib in patients with advanced solid tumors, including ROS1 fusion-positive NSCLC. The primary efficacy end point in the phase 2 trial was confirmed objective response; efficacy analyses included patients from phase 1 and phase 2. Duration of response, progression-free survival, and safety were secondary end points in phase 2. RESULTS: On the basis of results from the phase 1 trial, the recommended phase 2 dose of repotrectinib was 160 mg daily for 14 days, followed by 160 mg twice daily. Response occurred in 56 of the 71 patients (79%; 95% confidence interval [CI], 68 to 88) with ROS1 fusion-positive NSCLC who had not previously received a ROS1 TKI; the median duration of response was 34.1 months (95% CI, 25.6 to could not be estimated), and median progression-free survival was 35.7 months (95% CI, 27.4 to could not be estimated). Response occurred in 21 of the 56 patients (38%; 95% CI, 25 to 52) with ROS1 fusion-positive NSCLC who had previously received one ROS1 TKI and had never received chemotherapy; the median duration of response was 14.8 months (95% CI, 7.6 to could not be estimated), and median progression-free survival was 9.0 months (95% CI, 6.8 to 19.6). Ten of the 17 patients (59%; 95% CI, 33 to 82) with the ROS1 G2032R mutation had a response. A total of 426 patients received the phase 2 dose; the most common treatment-related adverse events were dizziness (in 58% of the patients), dysgeusia (in 50%), and paresthesia (in 30%), and 3% discontinued repotrectinib owing to treatment-related adverse events. CONCLUSIONS: Repotrectinib had durable clinical activity in patients with ROS1 fusion-positive NSCLC, regardless of whether they had previously received a ROS1 TKI. Adverse events were mainly of low grade and compatible with long-term administration. (Funded by Turning Point Therapeutics, a wholly owned subsidiary of Bristol Myers Squibb; TRIDENT-1 ClinicalTrials.gov number, NCT03093116.).


Asunto(s)
Antineoplásicos , Carcinoma de Pulmón de Células no Pequeñas , Neoplasias Pulmonares , Proteínas Tirosina Quinasas , Humanos , Carcinoma de Pulmón de Células no Pequeñas/tratamiento farmacológico , Carcinoma de Pulmón de Células no Pequeñas/genética , Neoplasias Pulmonares/tratamiento farmacológico , Neoplasias Pulmonares/genética , Proteínas Tirosina Quinasas/antagonistas & inhibidores , Proteínas Tirosina Quinasas/genética , Proteínas Proto-Oncogénicas/antagonistas & inhibidores , Proteínas Proto-Oncogénicas/genética , Antineoplásicos/uso terapéutico , Resultado del Tratamiento
2.
Mol Cancer Ther ; 20(12): 2446-2456, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34625502

RESUMEN

NTRK chromosomal rearrangements yield oncogenic TRK fusion proteins that are sensitive to TRK inhibitors (larotrectinib and entrectinib) but often mutate, limiting the durability of response for NTRK + patients. Next-generation inhibitors with compact macrocyclic structures (repotrectinib and selitrectinib) were designed to avoid resistance mutations. Head-to-head potency comparisons of TRK inhibitors and molecular characterization of binding interactions are incomplete, obscuring a detailed understanding of how molecular characteristics translate to potency. Larotrectinib, entrectinib, selitrectinib, and repotrectinib were characterized using cellular models of wild-type TRKA/B/C fusions and resistance mutant variants with a subset evaluated in xenograft tumor models. Crystal structures were determined for repotrectinib bound to TRKA (wild-type, solvent-front mutant). TKI-naïve and pretreated case studies are presented. Repotrectinib was the most potent inhibitor of wild-type TRKA/B/C fusions and was more potent than selitrectinib against all tested resistance mutations, underscoring the importance of distinct features of the macrocycle structures. Cocrystal structures of repotrectinib with wild-type TRKA and the TRKAG595R SFM variant elucidated how differences in macrocyclic inhibitor structure, binding orientation, and conformational flexibility affect potency and mutant selectivity. The SFM crystal structure revealed an unexpected intramolecular arginine sidechain interaction. Repotrectinib caused tumor regression in LMNA-NTRK1 xenograft models harboring GKM, SFM, xDFG, and GKM + SFM compound mutations. Durable responses were observed in TKI-naïve and -pretreated patients with NTRK + cancers treated with repotrectinib (NCT03093116). This comprehensive analysis of first- and second-generation TRK inhibitors informs the clinical utility, structural determinants of inhibitor potency, and design of new generations of macrocyclic inhibitors.


Asunto(s)
Compuestos Macrocíclicos/uso terapéutico , Proteínas de Fusión Oncogénica/uso terapéutico , Pirazoles/uso terapéutico , Humanos , Compuestos Macrocíclicos/farmacología , Modelos Moleculares , Mutación , Neoplasias/tratamiento farmacológico , Proteínas de Fusión Oncogénica/farmacología , Pirazoles/farmacología
3.
Blood Adv ; 3(23): 3962-3967, 2019 12 10.
Artículo en Inglés | MEDLINE | ID: mdl-31805192

RESUMEN

Therapy-related acute myeloid leukemia and myelodysplastic syndromes (t-AML/t-MDS) are secondary hematologic malignancies associated with poor prognosis, warranting insights into their predisposing conditions and cells of origin. We identified patients with myeloma who developed t-AML/t-MDS and analyzed their stem and progenitor cells collected years before the onset of secondary disease. We demonstrate that aberrant stem cells with high CD123 expression can be detected long before the onset of overt leukemia. Rigorous sorting, followed by targeted sequencing, resulted in ultradeep functional depth of sequencing and revealed preexisting mutant hematopoietic stem cell (HSC) clones, mainly harboring TP53 mutations, that became the dominant population at the time of leukemic presentation. Taken together, these data show that HSCs can act as reservoirs for leukemia-initiating cells many years before the onset of myeloid leukemia.


Asunto(s)
Células Madre Hematopoyéticas/metabolismo , Leucemia Mieloide Aguda/etiología , Mieloma Múltiple/complicaciones , Neoplasias Primarias Secundarias/etiología , Humanos , Leucemia Mieloide Aguda/patología , Mieloma Múltiple/patología , Mutación , Neoplasias Primarias Secundarias/patología
4.
Blood ; 132(14): 1507-1518, 2018 10 04.
Artículo en Inglés | MEDLINE | ID: mdl-30104217

RESUMEN

Adult T-cell leukemia lymphoma (ATLL) is a rare T cell neoplasm that is endemic in Japanese, Caribbean, and Latin American populations. Most North American ATLL patients are of Caribbean descent and are characterized by high rates of chemo-refractory disease and worse prognosis compared with Japanese ATLL. To determine genomic differences between these 2 cohorts, we performed targeted exon sequencing on 30 North American ATLL patients and compared the results with the Japanese ATLL cases. Although the frequency of TP53 mutations was comparable, the mutation frequency in epigenetic and histone modifying genes (57%) was significantly higher, whereas the mutation frequency in JAK/STAT and T-cell receptor/NF-κB pathway genes was significantly lower. The most common type of epigenetic mutation is that affecting EP300 (20%). As a category, epigenetic mutations were associated with adverse prognosis. Dissimilarities with the Japanese cases were also revealed by RNA sequencing analysis of 9 primary patient samples. ATLL samples with a mutated EP300 gene have decreased total and acetyl p53 protein and a transcriptional signature reminiscent of p53-mutated cancers. Most importantly, decitabine has highly selective single-agent activity in the EP300-mutated ATLL samples, suggesting that decitabine treatment induces a synthetic lethal phenotype in EP300-mutated ATLL cells. In conclusion, we demonstrate that North American ATLL has a distinct genomic landscape that is characterized by frequent epigenetic mutations that are targetable preclinically with DNA methyltransferase inhibitors.


Asunto(s)
Antimetabolitos Antineoplásicos/uso terapéutico , Decitabina/uso terapéutico , Leucemia-Linfoma de Células T del Adulto/tratamiento farmacológico , Leucemia-Linfoma de Células T del Adulto/genética , Adulto , Anciano , Anciano de 80 o más Años , Apoptosis/efectos de los fármacos , Proteína p300 Asociada a E1A/genética , Epigénesis Genética , Femenino , Humanos , Japón/epidemiología , Leucemia-Linfoma de Células T del Adulto/diagnóstico , Leucemia-Linfoma de Células T del Adulto/epidemiología , Masculino , Persona de Mediana Edad , Tasa de Mutación , Pronóstico , Transcriptoma , Proteína p53 Supresora de Tumor/genética , Estados Unidos/epidemiología
5.
Leuk Lymphoma ; 59(12): 2952-2962, 2018 12.
Artículo en Inglés | MEDLINE | ID: mdl-29616851

RESUMEN

To provide biologic insights into mechanisms underlying myelodysplastic syndromes (MDS) we evaluated the CD34+ marrow cells transcriptome using high-throughput RNA sequencing (RNA-Seq). We demonstrated significant differential gene expression profiles (GEPs) between MDS and normal and identified 41 disease classifier genes. Additionally, two main clusters of GEPs distinguished patients based on their major clinical features, particularly between those whose disease remained stable versus patients who transformed into acute myeloid leukemia within 12 months. The genes whose expression was associated with disease outcome were involved in functional pathways and biologic processes highly relevant for MDS. Combined with exomic analysis we identified differential isoform usage of genes in MDS mutational subgroups, with consequent dysregulation of distinct biologic functions. This combination of clinical, transcriptomic and exomic findings provides valuable understanding of mechanisms underlying MDS and its progression to a more aggressive stage and also facilitates prognostic characterization of MDS patients.


Asunto(s)
Células de la Médula Ósea/patología , Exones/genética , Leucemia Mieloide Aguda/genética , Síndromes Mielodisplásicos/genética , Transcriptoma/genética , Adulto , Anciano , Anciano de 80 o más Años , Antígenos CD34/metabolismo , Médula Ósea/patología , Progresión de la Enfermedad , Femenino , Estudios de Seguimiento , Perfilación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Leucemia Mieloide Aguda/patología , Masculino , Persona de Mediana Edad , Síndromes Mielodisplásicos/patología , Pronóstico , Secuenciación del Exoma
6.
BMC Bioinformatics ; 17(Suppl 18): 472, 2016 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-28105913

RESUMEN

BACKGROUND: This work presents a machine learning strategy to increase sensitivity in tandem mass spectrometry (MS/MS) data analysis for peptide/protein identification. MS/MS yields thousands of spectra in a single run which are then interpreted by software. Most of these computer programs use a protein database to match peptide sequences to the observed spectra. The peptide-spectrum matches (PSMs) must also be assessed by computational tools since manual evaluation is not practicable. The target-decoy database strategy is largely used for error estimation in PSM assessment. However, in general, that strategy does not account for sensitivity. RESULTS: In a previous study, we proposed the method MUMAL that applies an artificial neural network to effectively generate a model to classify PSMs using decoy hits with increased sensitivity. Nevertheless, the present approach shows that the sensitivity can be further improved with the use of a cost matrix associated with the learning algorithm. We also demonstrate that using a threshold selector algorithm for probability adjustment leads to more coherent probability values assigned to the PSMs. Our new approach, termed MUMAL2, provides a two-fold contribution to shotgun proteomics. First, the increase in the number of correctly interpreted spectra in the peptide level augments the chance of identifying more proteins. Second, the more appropriate PSM probability values that are produced by the threshold selector algorithm impact the protein inference stage performed by programs that take probabilities into account, such as ProteinProphet. Our experiments demonstrate that MUMAL2 reached around 15% of improvement in sensitivity compared to the best current method. Furthermore, the area under the ROC curve obtained was 0.93, demonstrating that the probabilities generated by our model are in fact appropriate. Finally, Venn diagrams comparing MUMAL2 with the best current method show that the number of exclusive peptides found by our method was nearly 4-fold higher, which directly impacts the proteome coverage. CONCLUSIONS: The inclusion of a cost matrix and a probability threshold selector algorithm to the learning task further improves the target-decoy database analysis for identifying peptides, which optimally contributes to the challenging task of protein level identification, resulting in a powerful computational tool for shotgun proteomics.


Asunto(s)
Redes Neurales de la Computación , Proteómica/métodos , Algoritmos , Bases de Datos de Proteínas/economía , Péptidos/química , Probabilidad , Proteoma/química , Proteómica/economía , Programas Informáticos , Espectrometría de Masas en Tándem/métodos
7.
Source Code Biol Med ; 8(1): 22, 2013 Nov 13.
Artículo en Inglés | MEDLINE | ID: mdl-24225386

RESUMEN

BACKGROUND: Molecular descriptors have been extensively used in the field of structure-oriented drug design and structural chemistry. They have been applied in QSPR and QSAR models to predict ADME-Tox properties, which specify essential features for drugs. Molecular descriptors capture chemical and structural information, but investigating their interpretation and meaning remains very challenging. RESULTS: This paper introduces a large-scale database of molecular descriptors called COMMODE containing more than 25 million compounds originated from PubChem. About 2500 DRAGON-descriptors have been calculated for all compounds and integrated into this database, which is accessible through a web interface at http://commode.i-med.ac.at.

8.
PLoS One ; 8(2): e55207, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23390522

RESUMEN

ERG gene rearrangements are found in about one half of all prostate cancers. Functional analyses do not fully explain the selective pressure causing ERG rearrangement during the development of prostate cancer. To identify transcriptional changes in prostate cancer, including tumors with ERG gene rearrangements, we performed a meta-analysis on published gene expression data followed by validations on mRNA and protein levels as well as first functional investigations. Eight expression studies (n = 561) on human prostate tissues were included in the meta-analysis. Transcriptional changes between prostate cancer and non-cancerous prostate, as well as ERG rearrangement-positive (ERG+) and ERG rearrangement-negative (ERG-) prostate cancer, were analyzed. Detailed results can be accessed through an online database. We validated our meta-analysis using data from our own independent microarray study (n = 57). 84% and 49% (fold-change>2 and >1.5, respectively) of all transcriptional changes between ERG+ and ERG- prostate cancer determined by meta-analysis were verified in the validation study. Selected targets were confirmed by immunohistochemistry: NPY and PLA2G7 (up-regulated in ERG+ cancers), and AZGP1 and TFF3 (down-regulated in ERG+ cancers). First functional investigations for one of the most prominent ERG rearrangement-associated genes - neuropeptide Y (NPY) - revealed increased glucose uptake in vitro indicating the potential role of NPY in regulating cellular metabolism. In summary, we found robust population-independent transcriptional changes in prostate cancer and first signs of ERG rearrangements inducing metabolic changes in cancer cells by activating major metabolic signaling molecules like NPY. Our study indicates that metabolic changes possibly contribute to the selective pressure favoring ERG rearrangements in prostate cancer.


Asunto(s)
Regulación Neoplásica de la Expresión Génica , Neuropéptido Y/genética , Neoplasias de la Próstata/genética , Neoplasias de la Próstata/metabolismo , ARN Mensajero/genética , Transactivadores/genética , 1-Alquil-2-acetilglicerofosfocolina Esterasa , Adipoquinas , Anciano , Transporte Biológico , Proteínas Portadoras/genética , Proteínas Portadoras/metabolismo , Perfilación de la Expresión Génica , Glucosa/metabolismo , Glicoproteínas/genética , Glicoproteínas/metabolismo , Humanos , Masculino , Persona de Mediana Edad , Neuropéptido Y/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Péptidos/genética , Péptidos/metabolismo , Fosfolipasas A2/genética , Fosfolipasas A2/metabolismo , Neoplasias de la Próstata/patología , ARN Mensajero/metabolismo , Transducción de Señal , Transactivadores/metabolismo , Transcripción Genética , Regulador Transcripcional ERG , Factor Trefoil-3
9.
BMC Genomics ; 13 Suppl 5: S4, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-23095859

RESUMEN

BACKGROUND: The shotgun strategy (liquid chromatography coupled with tandem mass spectrometry) is widely applied for identification of proteins in complex mixtures. This method gives rise to thousands of spectra in a single run, which are interpreted by computational tools. Such tools normally use a protein database from which peptide sequences are extracted for matching with experimentally derived mass spectral data. After the database search, the correctness of obtained peptide-spectrum matches (PSMs) needs to be evaluated also by algorithms, as a manual curation of these huge datasets would be impractical. The target-decoy database strategy is largely used to perform spectrum evaluation. Nonetheless, this method has been applied without considering sensitivity, i.e., only error estimation is taken into account. A recently proposed method termed MUDE treats the target-decoy analysis as an optimization problem, where sensitivity is maximized. This method demonstrates a significant increase in the retrieved number of PSMs for a fixed error rate. However, the MUDE model is constructed in such a way that linear decision boundaries are established to separate correct from incorrect PSMs. Besides, the described heuristic for solving the optimization problem has to be executed many times to achieve a significant augmentation in sensitivity. RESULTS: Here, we propose a new method, termed MUMAL, for PSM assessment that is based on machine learning techniques. Our method can establish nonlinear decision boundaries, leading to a higher chance to retrieve more true positives. Furthermore, we need few iterations to achieve high sensitivities, strikingly shortening the running time of the whole process. Experiments show that our method achieves a considerably higher number of PSMs compared with standard tools such as MUDE, PeptideProphet, and typical target-decoy approaches. CONCLUSION: Our approach not only enhances the computational performance, and thus the turn around time of MS-based experiments in proteomics, but also improves the information content with benefits of a higher proteome coverage. This improvement, for instance, increases the chance to identify important drug targets or biomarkers for drug development or molecular diagnostics.


Asunto(s)
Algoritmos , Inteligencia Artificial , Cromatografía Liquida/métodos , Biología Computacional/métodos , Proteínas/análisis , Proteómica/métodos , Espectrometría de Masas en Tándem/métodos , Análisis Multivariante , Redes Neurales de la Computación , Sensibilidad y Especificidad
10.
J Theor Biol ; 310: 216-22, 2012 Oct 07.
Artículo en Inglés | MEDLINE | ID: mdl-22771628

RESUMEN

The identification and interpretation of metabolic biomarkers is a challenging task. In this context, network-based approaches have become increasingly a key technology in systems biology allowing to capture complex interactions in biological systems. In this work, we introduce a novel network-based method to identify highly predictive biomarker candidates for disease. First, we infer two different types of networks: (i) correlation networks, and (ii) a new type of network called ratio networks. Based on these networks, we introduce scores to prioritize features using topological descriptors of the vertices. To evaluate our method we use an example dataset where quantitative targeted MS/MS analysis was applied to a total of 52 blood samples from 22 persons with obesity (BMI >30) and 30 healthy controls. Using our network-based feature selection approach we identified highly discriminating metabolites for obesity (F-score >0.85, accuracy >85%), some of which could be verified by the literature.


Asunto(s)
Algoritmos , Redes y Vías Metabólicas , Metabolómica/métodos , Obesidad/metabolismo , Adulto , Estudios de Casos y Controles , Humanos , Persona de Mediana Edad , Modelos Biológicos
11.
ScientificWorldJournal ; 2012: 278352, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22654582

RESUMEN

A pooling design can be used as a powerful strategy to compensate for limited amounts of samples or high biological variation. In this paper, we perform a comparative study to model and quantify the effects of virtual pooling on the performance of the widely applied classifiers, support vector machines (SVMs), random forest (RF), k-nearest neighbors (k-NN), penalized logistic regression (PLR), and prediction analysis for microarrays (PAMs). We evaluate a variety of experimental designs using mock omics datasets with varying levels of pool sizes and considering effects from feature selection. Our results show that feature selection significantly improves classifier performance for non-pooled and pooled data. All investigated classifiers yield lower misclassification rates with smaller pool sizes. RF mainly outperforms other investigated algorithms, while accuracy levels are comparable among all the remaining ones. Guidelines are derived to identify an optimal pooling scheme for obtaining adequate predictive power and, hence, to motivate a study design that meets best experimental objectives and budgetary conditions, including time constraints.


Asunto(s)
Algoritmos , Biología Computacional/métodos
12.
Toxicol Pathol ; 40(6): 951-64, 2012 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-22573522

RESUMEN

The Liver Toxicity Biomarker Study is a systems toxicology approach to discover biomarkers that are indicative of a drug's potential to cause human idiosyncratic drug-induced liver injury. In phase I, the molecular effects in rat liver and blood plasma induced by tolcapone (a "toxic" drug) were compared with the molecular effects in the same tissues by dosing with entacapone (a "clean" drug, similar to tolcapone in chemical structure and primary pharmacological mechanism). Two durations of drug exposure, 3 and 28 days, were employed. Comprehensive molecular analysis of rat liver and plasma samples yielded marker analytes for various drug-vehicle or drug-drug comparisons. An important finding was that the marker analytes associated with tolcapone only partially overlapped with marker analytes associated with entacapone, despite the fact that both drugs have similar chemical structures and the same primary pharmacological mechanism of action. This result indicates that the molecular analyses employed in the study are detecting substantial "off-target" markers for the two drugs. An additional interesting finding was the modest overlap of the marker data sets for 3-day exposure and 28-day exposure, indicating that the molecular changes in liver and plasma caused by short- and long-term drug treatments do not share common characteristics.


Asunto(s)
Benzofenonas/toxicidad , Catecoles/toxicidad , Enfermedad Hepática Inducida por Sustancias y Drogas/metabolismo , Nitrilos/toxicidad , Nitrofenoles/toxicidad , Animales , Biomarcadores/análisis , Proteínas Sanguíneas/análisis , Enfermedad Hepática Inducida por Sustancias y Drogas/sangre , Femenino , Perfilación de la Expresión Génica , Hígado/química , Hígado/metabolismo , Masculino , Metaboloma/efectos de los fármacos , Metabolómica , Proteoma/análisis , Proteoma/efectos de los fármacos , Proteómica , Ratas , Proyectos de Investigación , Tolcapona , Pruebas de Toxicidad Aguda/métodos , Pruebas de Toxicidad Crónica/métodos
13.
BMC Bioinformatics ; 12: 492, 2011 Dec 24.
Artículo en Inglés | MEDLINE | ID: mdl-22195644

RESUMEN

BACKGROUND: Structural measures for networks have been extensively developed, but many of them have not yet demonstrated their sustainably. That means, it remains often unclear whether a particular measure is useful and feasible to solve a particular problem in network biology. Exemplarily, the classification of complex biological networks can be named, for which structural measures are used leading to a minimal classification error. Hence, there is a strong need to provide freely available software packages to calculate and demonstrate the appropriate usage of structural graph measures in network biology. RESULTS: Here, we discuss topological network descriptors that are implemented in the R-package QuACN and demonstrate their behavior and characteristics by applying them to a set of example graphs. Moreover, we show a representative application to illustrate their capabilities for classifying biological networks. In particular, we infer gene regulatory networks from microarray data and classify them by methods provided by QuACN. Note that QuACN is the first freely available software written in R containing a large number of structural graph measures. CONCLUSION: The R package QuACN is under ongoing development and we add promising groups of topological network descriptors continuously. The package can be used to answer intriguing research questions in network biology, e.g., classifying biological data or identifying meaningful biological features, by analyzing the topology of biological networks.


Asunto(s)
Redes Reguladoras de Genes , Programas Informáticos , Entropía , Mapas de Interacción de Proteínas
14.
J Clin Bioinforma ; 1(1): 34, 2011 Dec 19.
Artículo en Inglés | MEDLINE | ID: mdl-22182709

RESUMEN

BACKGROUND: In metabolomics, biomarker discovery is a highly data driven process and requires sophisticated computational methods for the search and prioritization of novel and unforeseen biomarkers in data, typically gathered in preclinical or clinical studies. In particular, the discovery of biomarker candidates from longitudinal cohort studies is crucial for kinetic analysis to better understand complex metabolic processes in the organism during physical activity. FINDINGS: In this work we introduce a novel computational strategy that allows to identify and study kinetic changes of putative biomarkers using targeted MS/MS profiling data from time series cohort studies or other cross-over designs. We propose a prioritization model with the objective of classifying biomarker candidates according to their discriminatory ability and couple this discovery step with a novel network-based approach to visualize, review and interpret key metabolites and their dynamic interactions within the network. The application of our method on longitudinal stress test data revealed a panel of metabolic signatures, i.e., lactate, alanine, glycine and the short-chain fatty acids C2 and C3 in trained and physically fit persons during bicycle exercise. CONCLUSIONS: We propose a new computational method for the discovery of new signatures in dynamic metabolic profiling data which revealed known and unexpected candidate biomarkers in physical activity. Many of them could be verified and confirmed by literature. Our computational approach is freely available as R package termed BiomarkeR under LGPL via CRAN http://cran.r-project.org/web/packages/BiomarkeR/.

15.
Biol Direct ; 6: 53, 2011 Oct 13.
Artículo en Inglés | MEDLINE | ID: mdl-21995640

RESUMEN

BACKGROUND: Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-independent structural similarities, which pose a special challenge for our endeavour. We quantify specific structural information by using topological network descriptors to classify this set of metabolic networks. Such measures quantify the structural complexity of the underlying networks. In this study, we use such measures to capture domain-specific structural features of the metabolic networks to classify the data set. So far, it has been a challenging undertaking to examine what kind of structural complexity such measures do detect. In this paper, we apply two groups of topological network descriptors to metabolic networks and evaluate their classification performance. Moreover, we combine the two groups to perform a feature selection to estimate the structural features with the highest classification ability in order to optimize the classification performance. RESULTS: By combining the two groups, we can identify seven topological network descriptors that show a group-specific characteristic by ANOVA. A multivariate analysis using feature selection and supervised machine learning leads to a reasonable classification performance with a weighted F-score of 83.7% and an accuracy of 83.9%. We further demonstrate that our approach outperforms alternative methods. Also, our results reveal that entropy-based descriptors show the highest classification ability for this set of networks. CONCLUSIONS: Our results show that these particular topological network descriptors are able to capture domain-specific structural characteristics for classifying metabolic networks between the three domains of life.


Asunto(s)
Archaea/clasificación , Bacterias/clasificación , Eucariontes/clasificación , Redes y Vías Metabólicas , Algoritmos , Análisis de Varianza , Archaea/metabolismo , Inteligencia Artificial , Bacterias/metabolismo , Gráficos por Computador , Eucariontes/metabolismo , Modelos Logísticos , Reproducibilidad de los Resultados , Programas Informáticos
16.
PLoS One ; 6(7): e22843, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-21829532

RESUMEN

Network-based analysis has been proven useful in biologically-oriented areas, e.g., to explore the dynamics and complexity of biological networks. Investigating a set of networks allows deriving general knowledge about the underlying topological and functional properties. The integrative analysis of networks typically combines networks from different studies that investigate the same or similar research questions. In order to perform an integrative analysis it is often necessary to compare the properties of matching edges across the data set. This identification of common edges is often burdensome and computational intensive. Here, we present an approach that is different from inferring a new network based on common features. Instead, we select one network as a graph prototype, which then represents a set of comparable network objects, as it has the least average distance to all other networks in the same set. We demonstrate the usefulness of the graph prototyping approach on a set of prostate cancer networks and a set of corresponding benign networks. We further show that the distances within the cancer group and the benign group are statistically different depending on the utilized distance measure.


Asunto(s)
Gráficos por Computador , Redes Reguladoras de Genes , Neoplasias de la Próstata/genética , Neoplasias de la Próstata/metabolismo , Mapeo de Interacción de Proteínas , Algoritmos , Biología Computacional , Simulación por Computador , Humanos , Masculino
17.
J Clin Bioinforma ; 12011 Mar 08.
Artículo en Inglés | MEDLINE | ID: mdl-21743835

RESUMEN

BACKGROUND: Using serum, plasma or tumor tissue specimens from biobanks for biomarker discovery studies is attractive as samples are often readily available. However, storage over longer periods of time can alter concentrations of proteins in those specimens. We therefore assessed the bias in estimates of association from case-control studies conducted using banked specimens when maker levels changed over time for single markers and also for multiple correlated markers in simulations. Data from a small laboratory experiment using serum samples guided the choices of simulation parameters for various functions of changes of biomarkers over time. RESULTS: In the laboratory experiment levels of two serum markers measured at sample collection and again in the same samples after approximately ten years in storage increased by 15%. For a 15% increase in marker levels over ten years, odds ratios (ORs) of association were significantly underestimated, with a relative bias of -10%, while for a 15% decrease in marker levels over time ORs were too high, with a relative bias of 20%. CONCLUSION: Biases in estimates of parameters of association need to be considered in sample size calculations for studies to replicate markers identified in exploratory analyses.

18.
Bioinformatics ; 27(1): 140-1, 2011 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-21075747

RESUMEN

MOTIVATION: Network-based representations of biological data have become an important way to analyze high-throughput data. To interpret the large amount of data that is produced by different high-throughput technologies, networks offer multifaceted aspects to analyze the data. As networks represent biological relationships within their structure, it turned out to be fruitful to analyze their topology. Therefore, we developed a freely available, open source R-package called Quantitative Analysis of Complex Networks (QuACN) to meet this challenge. QuACN contains different, information-theoretic and non-information-theoretic, topological network descriptors to analyze, classify and compare biological networks. AVAILABILITY: QuACN is freely available under LGPL via CRAN (http://cran.r-project.org/web/packages/QuACN/).


Asunto(s)
Modelos Biológicos , Programas Informáticos
19.
Biomarkers ; 15(8): 677-83, 2010 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-20923317

RESUMEN

Sample handling and storage conditions in specimens frozen over long periods of time can severely impact marker levels. If laboratory technologies, practices and related protocols change over time, biomarker studies are potentially biased and report erroneous results. These issues and pitfalls are often overlooked in system biology studies using previously collected and stored materials, and are likely to be one notable cause for biomarker candidates failing to be validated. We present results from simulation studies quantifying the loss in statistical power to detect true biomarkers, due to diminishing concentration of analytes in samples subject to poor handling and storage conditions.


Asunto(s)
Biomarcadores , Manejo de Especímenes/métodos , Biología de Sistemas , Humanos , Modelos Teóricos
20.
BMC Bioinformatics ; 11: 447, 2010 Sep 03.
Artículo en Inglés | MEDLINE | ID: mdl-20815881

RESUMEN

BACKGROUND: data generated using 'omics' technologies are characterized by high dimensionality, where the number of features measured per subject vastly exceeds the number of subjects in the study. In this paper, we consider issues relevant in the design of biomedical studies in which the goal is the discovery of a subset of features and an associated algorithm that can predict a binary outcome, such as disease status. We compare the performance of four commonly used classifiers (K-Nearest Neighbors, Prediction Analysis for Microarrays, Random Forests and Support Vector Machines) in high-dimensionality data settings. We evaluate the effects of varying levels of signal-to-noise ratio in the dataset, imbalance in class distribution and choice of metric for quantifying performance of the classifier. To guide study design, we present a summary of the key characteristics of 'omics' data profiled in several human or animal model experiments utilizing high-content mass spectrometry and multiplexed immunoassay based techniques. RESULTS: the analysis of data from seven 'omics' studies revealed that the average magnitude of effect size observed in human studies was markedly lower when compared to that in animal studies. The data measured in human studies were characterized by higher biological variation and the presence of outliers. The results from simulation studies indicated that the classifier Prediction Analysis for Microarrays (PAM) had the highest power when the class conditional feature distributions were Gaussian and outcome distributions were balanced. Random Forests was optimal when feature distributions were skewed and when class distributions were unbalanced. We provide a free open-source R statistical software library (MVpower) that implements the simulation strategy proposed in this paper. CONCLUSION: no single classifier had optimal performance under all settings. Simulation studies provide useful guidance for the design of biomedical studies involving high-dimensionality data.


Asunto(s)
Algoritmos , Clasificación/métodos , Animales , Bases de Datos Factuales , Perfilación de la Expresión Génica/métodos , Humanos , Modelos Estadísticos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Reconocimiento de Normas Patrones Automatizadas , Tamaño de la Muestra
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...