Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 101
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Methods ; 218: 125-132, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37574160

RESUMO

Hepatocellular carcinoma (HCC) has been an approved indication for the administration of immunotherapy since 2017, but biomarkers that predict therapeutic response have remained limited. Understanding and characterizing the tumor immune microenvironment enables better classification of these tumors and may reveal biomarkers that predict immunotherapeutic efficacy. In this paper, we applied a cell-type deconvolution algorithm using DNA methylation array data to investigate the composition of the tumor microenvironment in HCC. Using publicly available and in-house datasets with a total cohort size of 57 patients, each with tumor and matched normal tissue samples, we identified key differences in immune cell composition. We found that NK cell abundance was significantly decreased in HCC tumors compared to adjacent normal tissue. We also applied DNA methylation "clocks" which estimate phenotypic aging and compared these findings to expression-based determinations of cellular senescence. Senescence and epigenetic aging were significantly increased in HCC tumors, and the degree of age acceleration and senescence was strongly associated with decreased NK cell abundance. In summary, we found that NK cell infiltration in the tumor microenvironment is significantly diminished, and that this loss of NK abundance is strongly associated with increased senescence and age-related phenotype. These findings point to key interactions between NK cells and the senescent tumor microenvironment and offer insights into the pathogenesis of HCC as well as potential biomarkers of therapeutic efficacy.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Humanos , Carcinoma Hepatocelular/genética , Neoplasias Hepáticas/genética , Metilação de DNA/genética , Microambiente Tumoral/genética , Senescência Celular/genética , Biomarcadores Tumorais/genética
2.
J Sep Sci ; 45(23): 4236-4244, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-36168850

RESUMO

Human serum N-linked glycans expression levels change during the disease progression. The low abundance, structural diversity, and coexisting matrices hinder their detection in mass spectrometry analysis. Considering the hydrophilic nature of N-glycans, cellulose/polymer (1,2-Epoxy-5-hexene) nanohybrid is fabricated with oxirane groups functionalized of asparagine to develop solid phase extraction based hydrophilic interaction liquid chromatography sorbent (cellulose/1,2-Epoxy-5-hexene/asparagine). The morphology, elemental analysis, and surface properties are studied through scanning electron microscopy, energy dispersive X-ray spectroscopy, and Fourier-transform infrared spectroscopy. The large surface area of cellulose/polymer nanohybrid (2.09 × 102  m2 /g) facilitates the high density of asparagine immobilization resulting in better hydrophilic interaction liquid chromatography enrichment under optimized conditions. The enrichment capability of nanohybrid/asparagine is assessed by the N-Linked glycans released from ovalbumin and immunoglobulin G where 23 and 13 N-glycans are detected respectively. The nanohybrid/asparagine shows selectivity of 1:1200 with spiked bovine serum albumin and sensitivity down to 100 attomole. Human serum profiling for N-glycans identifies 52 glycan structures. This new enrichment strategy enriches serum N-linked glycans in the presence of salts, proteins, endogenous serum peptides, and so forth.


Assuntos
Celulose , Polímeros , Humanos , Asparagina
3.
Mikrochim Acta ; 189(8): 277, 2022 07 13.
Artigo em Inglês | MEDLINE | ID: mdl-35829791

RESUMO

A new polymeric (methyl methacrylate/ethylene glycol dimethacrylate/1,2-epoxy-5-hexene) base/matrix has been fabricated and decorated with zwitterionic hydrophilic cysteic acid (Cya) for the enrichment of intact N-glycopeptides from standards and biological samples. Terpolymer-Cya provides good enrichment efficiency, improved hydrophilicity, and selectivity by virtue of better surface area (2.09 × 102 m2/g) provided by terpolymer and the zwitterionic property offered by cysteic acid. Cysteic acid-functionalized polymeric hydrophilic interaction liquid chromatography (HILIC) sorbent enriches 35 and 24 N-linked glycopeptides via SPE (solid phase extraction) mode from tryptic digests of model glycoproteins, i.e., immunoglobulin G (IgG) and horseradish peroxidase (HRP), respectively. Zwitterionic chemistry of cysteine helps in achieving higher selectivity with BSA digest (1:200), and lower detection limit down to 100 attomoles with a complete glycosylation profile of each standard digest. The recovery of 81% and good reproducibility define the application of terpolymer-Cya for complex samples like a serum. Analysis of human serum provides a profile of 807 intact N-linked glycopeptides via nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS). To the best of our knowledge, this is the highest number of glycopeptides enriched by any HILIC sorbent. Selected glycoproteins are evaluated in link to various cancers including the breast, lung, uterine, and melanoma using single-nucleotide variances (BioMuta). This study represents the complete idea of using an in-house developed strategy as a successful tool to help analyze, relate, and answer glycoprotein-based clinical issues regarding cancers.


Assuntos
Ácido Cisteico , Glicopeptídeos , Glicopeptídeos/análise , Glicoproteínas , Humanos , Interações Hidrofóbicas e Hidrofílicas , Reprodutibilidade dos Testes , Espectrometria de Massas em Tandem
4.
Mikrochim Acta ; 188(12): 417, 2021 11 11.
Artigo em Inglês | MEDLINE | ID: mdl-34762162

RESUMO

A three-step strategy is introduced to develop inherent iminodiacetic (IDA)-functionalized nanopolymer. SEM micrographs show homogenous spherical beads with a particle size of 500 nm. Further modification to COOH-functionalized 1,2-epoxy-5-hexene/DVB mesoporous nanopolymer enriches glycopeptides via hydrophilic interactions followed by their MS determination. Significantly high BET surface area 433.4336 m2 g-1 contributes to the improved surface hydrophilicity which is also shown by high concentration of ionizable carboxylic acids, 14.59 ± 0.25 mmol g-1. Measured surface area is the highest among DVB-based polymers and in general much higher in comparison to the previously reported BET surface areas of co-polymers, terpolymers, MOFs, and graphene-based composites. Thirty-one, 19, and 16 N-glycopeptides are enriched/identified by nanopolymer beads from tryptic digests of immunoglobulin G, horseradish peroxidase, and chicken avidin, respectively, without additional desalting steps. Material exhibits high selectivity (1:400 IgG:BSA), sensitivity (down to 0.1 fmol), regeneration ability up to three cycles, and batch-to-batch reproducibility (RSD > 1%). Furthermore, from 1 µL of digested human serum, 343 N-glycopeptide characteristics of 134 glycoproteins including 30 FDA-approved serum biomarkers are identified via nano-LC-MS/MS. The developed strategy to self-generate IDA on polymeric surface with improved surface area, porosity, and ordered morphology is insignia of its potential as chromatographic tool contributing to future developments in large-scale biomedical glycoproteomics studies.


Assuntos
Glicopeptídeos/química , Iminoácidos/química , Nanoestruturas/química , Polímeros/química , Humanos , Interações Hidrofóbicas e Hidrofílicas , Microscopia Eletrônica de Varredura , Nanoestruturas/ultraestrutura , Porosidade , Propriedades de Superfície
5.
Metabolomics ; 16(10): 104, 2020 09 30.
Artigo em Inglês | MEDLINE | ID: mdl-32997169

RESUMO

INTRODUCTION: Metabolite annotation is a critical and challenging step in mass spectrometry-based metabolomic profiling. In a typical untargeted MS/MS-based metabolomic study, experimental MS/MS spectra are matched against those in spectral libraries for metabolite annotation. Yet, existing spectral libraries comprise merely a marginal percentage of known compounds. OBJECTIVE: The objective is to develop a method that helps rank putative metabolite IDs for analytes whose reference MS/MS spectra are not present in spectral libraries. METHODS: We introduce MetFID, which uses an artificial neural network (ANN) trained for predicting molecular fingerprints based on experimental MS/MS data. To narrow the search space, MetFID retrieves candidates from metabolite databases using molecular formula or m/z value of the precursor ions of the analytes. The candidate whose fingerprint is most analogous to the predicted fingerprint is used for metabolite annotation. A comprehensive evaluation was performed by training MetFID using MS/MS spectra from the MoNA repository and NIST library and by testing with structure-disjoint MS/MS spectra from the NIST library, the CASMI 2016 dataset, and in-house MS/MS data from a cancer biomarker discovery study. RESULTS: We observed that training separate models for distinct ranges of collision energies enhanced model performance compared to a single model that covers a wide range of collision energies. Using MetaboQuest to retrieve candidates, MetFID prioritized the correct putative ID in the first place rank for about 50% of the testing cases. Through the independent testing dataset, we demonstrated that MetFID has the potential to improve the accuracy of ranking putative metabolite IDs by more than 5% compared to other tools such as ChemDistiller, CSI:FingerID, and MetFrag. CONCLUSION: MetFID offers a promising opportunity to enhance the accuracy of metabolite annotation by using ANN for molecular fingerprint prediction.


Assuntos
Metabolômica/métodos , Algoritmos , Bases de Dados Factuais/normas , Humanos , Redes Neurais de Computação , Padrões de Referência , Valores de Referência , Software , Espectrometria de Massas por Ionização por Electrospray/métodos , Espectrometria de Massas em Tandem/métodos
6.
Exp Cell Res ; 384(1): 111621, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31513782

RESUMO

A long-term hepatocyte culture maintaining liver-specific functions is very essential for both basic research and the development of bioartificial liver devices in clinical application. However, primary hepatocytes rapidly lose their proliferation and hepatic functions over a few days in culture. This work is to establish an ornithine transcarbamylase deficiency (OTCD) patient-derived primary human hepatocyte (OTCD-PHH) culture with hepatic functions for providing an in vitro cell model. Liver tissue from an infant with OTCD was dispersed into single cells. The cells were cultured using conditional reprogramming. To characterize the cells, we assessed activities and mRNA expression of CYP3A4, 1A1, 2C9, as well as albumin and urea secretion. We found that the OTCD-PHH can be subpassaged for more than 15 passages. The cells do not express mRNA of fibroblast-specific maker, whereas they highly express markers of epithelial cells and hepatocytes. In addition, the OTCD-PHH retain native CYP3A4, 1A1, 2C9 activities and albumin secretion function at early passages. The OTCD-PHH at passages 2, 6, 9 and 13 have identical DNA fingerprint as the original tissue. Furthermore, under 3D culture environment, low urea production and hepatocyte marker staining of the OTCD-PHH were detected. The established OTCD-PHH maintain liver-specific functions at early passages and can be long-term cultured in vitro. We believe the established long-term OTCD-PHH culture is highly relevant to study liver diseases, particularly in infants with OTCD.


Assuntos
Hepatócitos/patologia , Hepatopatias/patologia , Fígado/patologia , Doença da Deficiência de Ornitina Carbomoiltransferase/patologia , Células 3T3 , Animais , Linhagem Celular , Linhagem Celular Tumoral , Citocromo P-450 CYP1A1/metabolismo , Citocromo P-450 CYP3A/metabolismo , Células Epiteliais/metabolismo , Células Hep G2 , Hepatócitos/metabolismo , Humanos , Lactente , Fígado/metabolismo , Hepatopatias/metabolismo , Masculino , Camundongos , Doença da Deficiência de Ornitina Carbomoiltransferase/metabolismo , RNA Mensageiro/metabolismo
7.
J Proteome Res ; 18(8): 3067-3076, 2019 08 02.
Artigo em Inglês | MEDLINE | ID: mdl-31188000

RESUMO

Hepatocellular carcinoma (HCC) causes more than half a million annual deaths worldwide. Understanding the mechanisms contributing to HCC development is highly desirable for improved surveillance, diagnosis, and treatment. Liver tissue metabolomics has the potential to reflect the physiological changes behind HCC development. Also, it allows identification of biomarker candidates for future evaluation in biofluids and investigation of racial disparities in HCC. Tumor and nontumor tissues from 40 patients were analyzed by both gas chromatography-mass spectrometry (GC-MS) and liquid chromatography-mass spectrometry (LC-MS) platforms to increase the metabolome coverage. The levels of the metabolites extracted from solid liver tissue of the HCC area and adjacent non-HCC area were compared. Among the analytes detected by GC-MS and LC-MS with significant alterations, 18 were selected based on biological relevance and confirmed metabolite identification. These metabolites belong to TCA cycle, glycolysis, purines, and lipid metabolism and have been previously reported in liver metabolomic studies where high correlation with HCC progression is implied. We demonstrated that metabolites related to HCC pathogenesis can be identified through liver tissue metabolomic analysis. Additionally, this study has enabled us to identify race-specific metabolites associated with HCC.


Assuntos
Carcinoma Hepatocelular/metabolismo , Neoplasias Hepáticas/metabolismo , Metaboloma/genética , Metabolômica , Biomarcadores Tumorais/genética , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/patologia , Feminino , Cromatografia Gasosa-Espectrometria de Massas , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Metabolismo dos Lipídeos/genética , Fígado/metabolismo , Fígado/patologia , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/patologia , Masculino , Pessoa de Meia-Idade
8.
Methods ; 124: 89-99, 2017 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-28651964

RESUMO

In this paper, we introduce a novel computational method for constructing protein networks based on reverse phase protein array (RPPA) data to identify complex patterns in protein signaling. The method is applied to phosphoproteomic profiles of basal expression and activation/phosphorylation of 76 key signaling proteins in three breast cancer cell lines (MCF7, LCC1, and LCC9). Temporal RPPA data are acquired at 48h, 96h, and 144h after knocking down four genes in separate experiments. These genes are selected from a previous study as important determinants for breast cancer survival. Interaction networks are constructed by analyzing the expression levels of protein pairs using a multivariate analysis of variance model. A new scoring criterion is introduced to determine relevant protein pairs. Through a network topology based analysis, we search for wiring patterns to identify key proteins that are associated with significant changes in expression levels across various experimental conditions.


Assuntos
Neoplasias da Mama/genética , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Proteínas de Neoplasias/genética , Análise Serial de Proteínas/estatística & dados numéricos , Processamento de Proteína Pós-Traducional , ATPases Associadas a Diversas Atividades Celulares/antagonistas & inibidores , ATPases Associadas a Diversas Atividades Celulares/genética , ATPases Associadas a Diversas Atividades Celulares/metabolismo , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Linhagem Celular Tumoral , Proteína Rica em Cisteína 61/antagonistas & inibidores , Proteína Rica em Cisteína 61/genética , Proteína Rica em Cisteína 61/metabolismo , Feminino , Humanos , Peptídeos e Proteínas de Sinalização Intracelular/antagonistas & inibidores , Peptídeos e Proteínas de Sinalização Intracelular/genética , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Células MCF-7 , Análise Multivariada , Proteínas de Neoplasias/antagonistas & inibidores , Proteínas de Neoplasias/metabolismo , Fosforilação , Complexo de Endopeptidases do Proteassoma/genética , Complexo de Endopeptidases do Proteassoma/metabolismo , RNA Polimerase II/antagonistas & inibidores , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , RNA Interferente Pequeno/genética , RNA Interferente Pequeno/metabolismo , Transdução de Sinais , Proteínas Supressoras de Tumor/antagonistas & inibidores , Proteínas Supressoras de Tumor/genética , Proteínas Supressoras de Tumor/metabolismo
9.
BMC Bioinformatics ; 18(1): 99, 2017 Feb 10.
Artigo em Inglês | MEDLINE | ID: mdl-28187708

RESUMO

BACKGROUND: Conventional differential gene expression analysis by methods such as student's t-test, SAM, and Empirical Bayes often searches for statistically significant genes without considering the interactions among them. Network-based approaches provide a natural way to study these interactions and to investigate the rewiring interactions in disease versus control groups. In this paper, we apply weighted graphical LASSO (wgLASSO) algorithm to integrate a data-driven network model with prior biological knowledge (i.e., protein-protein interactions) for biological network inference. We propose a novel differentially weighted graphical LASSO (dwgLASSO) algorithm that builds group-specific networks and perform network-based differential gene expression analysis to select biomarker candidates by considering their topological differences between the groups. RESULTS: Through simulation, we showed that wgLASSO can achieve better performance in building biologically relevant networks than purely data-driven models (e.g., neighbor selection, graphical LASSO), even when only a moderate level of information is available as prior biological knowledge. We evaluated the performance of dwgLASSO for survival time prediction using two microarray breast cancer datasets previously reported by Bild et al. and van de Vijver et al. Compared with the top 10 significant genes selected by conventional differential gene expression analysis method, the top 10 significant genes selected by dwgLASSO in the dataset from Bild et al. led to a significantly improved survival time prediction in the independent dataset from van de Vijver et al. Among the 10 genes selected by dwgLASSO, UBE2S, SALL2, XBP1 and KIAA0922 have been confirmed by literature survey to be highly relevant in breast cancer biomarker discovery study. Additionally, we tested dwgLASSO on TCGA RNA-seq data acquired from patients with hepatocellular carcinoma (HCC) on tumors samples and their corresponding non-tumorous liver tissues. Improved sensitivity, specificity and area under curve (AUC) were observed when comparing dwgLASSO with conventional differential gene expression analysis method. CONCLUSIONS: The proposed network-based differential gene expression analysis algorithm dwgLASSO can achieve better performance than conventional differential gene expression analysis methods by integrating information at both gene expression and network topology levels. The incorporation of prior biological knowledge can lead to the identification of biologically meaningful genes in cancer biomarker studies.


Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes/genética , Área Sob a Curva , Biomarcadores/metabolismo , Neoplasias da Mama/diagnóstico , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/patologia , Feminino , Humanos , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/patologia , RNA/química , RNA/isolamento & purificação , RNA/metabolismo , Curva ROC , Análise de Sequência de RNA
10.
Mol Carcinog ; 56(2): 594-606, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27341184

RESUMO

Smoking-related biomarkers for lung cancer and other diseases are needed to enhance early detection strategies and to provide a science base for tobacco product regulation. An untargeted metabolomics approach by ultra-performance liquid chromatography-quadrupole-time of flight mass spectrometry (UHPLC-Q-TOF MS) totaling 957 assays was used in a novel experimental design where 105 current smokers smoked two cigarettes 1 h apart. Blood was collected immediately before and after each cigarette allowing for within-subject replication. Dynamic changes of the metabolomic profiles from smokers' four blood samples were observed and biomarkers affected by cigarette smoking were identified. Thirty-one metabolites were definitively shown to be affected by acute effect of cigarette smoking, uniquely including menthol-glucuronide, the reduction of glutamate, oleamide, and 13 glycerophospholipids. This first time identification of a menthol metabolite in smokers' blood serves as proof-of-principle for using metabolomics to identify new tobacco-exposure biomarkers, and also provides new opportunities in studying menthol-containing tobacco products in humans. Gender and race differences also were observed. Network analysis revealed 12 molecules involved in cancer, notably inhibition of cAMP. These novel tobacco-related biomarkers provide new insights to the effects of smoking which may be important in carcinogenesis but not previously linked with tobacco-related diseases. © 2016 Wiley Periodicals, Inc.


Assuntos
Glucuronatos/sangue , Mentol/análogos & derivados , Metaboloma , Fumar/sangue , Adolescente , Adulto , Idoso , Biomarcadores/sangue , Biomarcadores/metabolismo , Feminino , Glucuronatos/metabolismo , Humanos , Masculino , Mentol/sangue , Mentol/metabolismo , Metabolômica , Pessoa de Meia-Idade , Fumar/metabolismo , Adulto Jovem
11.
Methods ; 111: 12-20, 2016 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-27592383

RESUMO

Differential expression (DE) analysis is commonly used to identify biomarker candidates that have significant changes in their expression levels between distinct biological groups. One drawback of DE analysis is that it only considers the changes on single biomolecule level. Recently, differential network (DN) analysis has become popular due to its capability to measure the changes on biomolecular pair level. In DN analysis, network is typically built based on correlation and biomarker candidates are selected by investigating the network topology. However, correlation tends to generate over-complicated networks and the selection of biomarker candidates purely based on network topology ignores the changes on single biomolecule level. In this paper, we propose a novel approach, INDEED, that builds sparse differential network based on partial correlation and integrates DE and DN analyses for biomarker discovery. We applied this approach on real proteomic and glycomic data generated by liquid chromatography coupled with mass spectrometry for hepatocellular carcinoma (HCC) biomarker discovery study. For each omic data, we used one dataset to select biomarker candidates, built a disease classifier and evaluated the performance of the classifier on an independent dataset. The biomarker candidates, selected by INDEED, were more reproducible across independent datasets, and led to a higher classification accuracy in predicting HCC cases and cirrhotic controls compared with those selected by separate DE and DN analyses. INDEED also identified some candidates previously reported to be relevant to HCC, such as intercellular adhesion molecule 2 (ICAM2) and c4b-binding protein alpha chain (C4BPA), which were missed by both DE and DN analyses. In addition, we applied INDEED for survival time prediction based on transcriptomic data acquired by analysis of samples from breast cancer patients. We selected biomarker candidates and built a regression model for survival time prediction based on a gene expression dataset and patients' survival records. We evaluated the performance of the regression model on an independent dataset. Compared with the biomarker candidates selected by DE and DN analyses, those selected through INDEED led to more accurate survival time prediction.


Assuntos
Antígenos CD/genética , Biomarcadores Tumorais/genética , Moléculas de Adesão Celular/genética , Proteína de Ligação ao Complemento C4b/genética , Proteômica/métodos , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/metabolismo , Cromatografia Líquida , Regulação Neoplásica da Expressão Gênica , Glicômica/métodos , Humanos , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/metabolismo , Espectrometria de Massas , Transcriptoma/genética
12.
BMC Genomics ; 17 Suppl 4: 545, 2016 08 18.
Artigo em Inglês | MEDLINE | ID: mdl-27535232

RESUMO

BACKGROUND: A fundamental challenge in quantitation of biomolecules for cancer biomarker discovery is owing to the heterogeneous nature of human biospecimens. Although this issue has been a subject of discussion in cancer genomic studies, it has not yet been rigorously investigated in mass spectrometry based proteomic and metabolomic studies. Purification of mass spectometric data is highly desired prior to subsequent analysis, e.g., quantitative comparison of the abundance of biomolecules in biological samples. METHODS: We investigated topic models to computationally analyze mass spectrometric data considering both integrated peak intensities and scan-level features, i.e., extracted ion chromatograms (EICs). Probabilistic generative models enable flexible representation in data structure and infer sample-specific pure resources. Scan-level modeling helps alleviate information loss during data preprocessing. We evaluated the capability of the proposed models in capturing mixture proportions of contaminants and cancer profiles on LC-MS based serum proteomic and GC-MS based tissue metabolomic datasets acquired from patients with hepatocellular carcinoma (HCC) and liver cirrhosis as well as synthetic data we generated based on the serum proteomic data. RESULTS: The results we obtained by analysis of the synthetic data demonstrated that both intensity-level and scan-level purification models can accurately infer the mixture proportions and the underlying true cancerous sources with small average error ratios (<7 %) between estimation and ground truth. By applying the topic model-based purification to mass spectrometric data, we found more proteins and metabolites with significant changes between HCC cases and cirrhotic controls. Candidate biomarkers selected after purification yielded biologically meaningful pathway analysis results and improved disease discrimination power in terms of the area under ROC curve compared to the results found prior to purification. CONCLUSIONS: We investigated topic model-based inference methods to computationally address the heterogeneity issue in samples analyzed by LC/GC-MS. We observed that incorporation of scan-level features have the potential to lead to more accurate purification results by alleviating the loss in information as a result of integrating peaks. We believe cancer biomarker discovery studies that use mass spectrometric analysis of human biospecimens can greatly benefit from topic model-based purification of the data prior to statistical and pathway analyses.


Assuntos
Biomarcadores Tumorais/sangue , Espectrometria de Massas/estatística & dados numéricos , Neoplasias/sangue , Proteômica/métodos , Carcinoma Hepatocelular/sangue , Carcinoma Hepatocelular/genética , Humanos , Cirrose Hepática/sangue , Cirrose Hepática/genética , Neoplasias Hepáticas/sangue , Neoplasias Hepáticas/genética , Metabolômica , Neoplasias/genética
13.
Proteomics ; 15(13): 2369-81, 2015 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-25778709

RESUMO

Associating changes in protein levels with the onset of cancer has been widely investigated to identify clinically relevant diagnostic biomarkers. In the present study, we analyzed sera from 205 patients recruited in the United States and Egypt for biomarker discovery using label-free proteomic analysis by LC-MS/MS. We performed untargeted proteomic analysis of sera to identify candidate proteins with statistically significant differences between hepatocellular carcinoma (HCC) and patients with liver cirrhosis. We further evaluated the significance of 101 proteins in sera from the same 205 patients through targeted quantitation by MRM on a triple quadrupole mass spectrometer. This led to the identification of 21 candidate protein biomarkers that were significantly altered in both the United States and Egyptian cohorts. Among the 21 candidates, ten were previously reported as HCC-associated proteins (eight exhibiting consistent trends with our observation), whereas 11 are new candidates discovered by this study. Pathway analysis based on the significant proteins reveals upregulation of the complement and coagulation cascades pathway and downregulation of the antigen processing and presentation pathway in HCC cases versus patients with liver cirrhosis. The results of this study demonstrate the power of combining untargeted and targeted quantitation methods for a comprehensive serum proteomic analysis, to evaluate changes in protein levels and discover novel diagnostic biomarkers. All MS data have been deposited in the ProteomeXchange with identifier PXD001171 (http://proteomecentral.proteomexchange.org/dataset/PXD001171).


Assuntos
Carcinoma Hepatocelular/metabolismo , Cromatografia Líquida/métodos , Neoplasias Hepáticas/metabolismo , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Feminino , Humanos , Masculino , Pessoa de Meia-Idade
14.
BMC Bioinformatics ; 16: 259, 2015 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-26283310

RESUMO

BACKGROUND: Gas chromatography coupled with mass spectrometry (GC-MS) is one of the technologies widely used for qualitative and quantitative analysis of small molecules. In particular, GC coupled to single quadrupole MS can be utilized for targeted analysis by selected ion monitoring (SIM). However, to our knowledge, there are no software tools specifically designed for analysis of GC-SIM-MS data. In this paper, we introduce a new R/Bioconductor package called SIMAT for quantitative analysis of the levels of targeted analytes. SIMAT provides guidance in choosing fragments for a list of targets. This is accomplished through an optimization algorithm that has the capability to select the most appropriate fragments from overlapping chromatographic peaks based on a pre-specified library of background analytes. The tool also allows visualization of the total ion chromatograms (TIC) of runs and extracted ion chromatograms (EIC) of analytes of interest. Moreover, retention index (RI) calibration can be performed and raw GC-SIM-MS data can be imported in netCDF or NIST mass spectral library (MSL) formats. RESULTS: We evaluated the performance of SIMAT using two GC-SIM-MS datasets obtained by targeted analysis of: (1) plasma samples from 86 patients in a targeted metabolomic experiment; and (2) mixtures of internal standards spiked in plasma samples at varying concentrations in a method development study. Our results demonstrate that SIMAT offers alternative solutions to AMDIS and MetaboliteDetector to achieve accurate detection of targets and estimation of their relative intensities by analysis of GC-SIM-MS data. CONCLUSIONS: We introduce a new R package called SIMAT that allows the selection of the optimal set of fragments and retention time windows for target analytes in GC-SIM-MS based analysis. Also, various functions and algorithms are implemented in the tool to: (1) read and import raw data and spectral libraries; (2) perform GC-SIM-MS data preprocessing; and (3) plot and visualize EICs and TICs.


Assuntos
Software , Algoritmos , Cromatografia Gasosa-Espectrometria de Massas , Internet , Metabolômica
15.
Methods ; 69(3): 266-73, 2014 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-25003577

RESUMO

Biological network inference is a major challenge in systems biology. Traditional correlation-based network analysis results in too many spurious edges since correlation cannot distinguish between direct and indirect associations. To address this issue, Gaussian graphical models (GGM) were proposed and have been widely used. Though they can significantly reduce the number of spurious edges, GGM are insufficient to uncover a network structure faithfully due to the fact that they only consider the full order partial correlation. Moreover, when the number of samples is smaller than the number of variables, further technique based on sparse regularization needs to be incorporated into GGM to solve the singular covariance inversion problem. In this paper, we propose an efficient and mathematically solid algorithm that infers biological networks by computing low order partial correlation (LOPC) up to the second order. The bias introduced by the low order constraint is minimal compared to the more reliable approximation of the network structure achieved. In addition, the algorithm is suitable for a dataset with small sample size but large number of variables. Simulation results show that LOPC yields far less spurious edges and works well under various conditions commonly seen in practice. The application to a real metabolomics dataset further validates the performance of LOPC and suggests its potential power in detecting novel biomarkers for complex disease.


Assuntos
Biomarcadores , Biologia Computacional/métodos , Modelos Teóricos , Biologia de Sistemas , Algoritmos , Perfilação da Expressão Gênica , Humanos , Distribuição Normal
16.
J Proteome Res ; 13(11): 4859-68, 2014 Nov 07.
Artigo em Inglês | MEDLINE | ID: mdl-25077556

RESUMO

Defining clinically relevant biomarkers for early stage hepatocellular carcinoma (HCC) in a high-risk population of cirrhotic patients has potentially far-reaching implications for disease management and patient health. Changes in glycan levels have been associated with the onset of numerous diseases including cancer. In the present study, we used liquid chromatography coupled with electrospray ionization mass spectrometry (LC-ESI-MS) to analyze N-glycans in sera from 183 participants recruited in Egypt and the U.S. and identified candidate biomarkers that distinguish HCC cases from cirrhotic controls. N-Glycans were released from serum proteins and permethylated prior to the LC-ESI-MS analysis. Through two complementary LC-ESI-MS quantitation approaches, global profiling and targeted quantitation, we identified 11 N-glycans with statistically significant differences between HCC cases and cirrhotic controls. These glycans can further be categorized into four structurally related clusters, matching closely with the implications of important glycosyltransferases in cancer progression and metastasis. The results of this study illustrate the power of the integrative approach combining complementary LC-ESI-MS based quantitation approaches to investigate changes in N-glycan levels between HCC cases and patients with liver cirrhosis.


Assuntos
Biomarcadores Tumorais/sangue , Carcinoma Hepatocelular/diagnóstico , Cirrose Hepática/sangue , Neoplasias Hepáticas/diagnóstico , Polissacarídeos/sangue , Carcinoma Hepatocelular/sangue , Carcinoma Hepatocelular/etiologia , Cromatografia Líquida , Egito , Perfilação da Expressão Gênica/métodos , Humanos , Cirrose Hepática/complicações , Neoplasias Hepáticas/sangue , Neoplasias Hepáticas/etiologia , Espectrometria de Massas , Estados Unidos
17.
Brief Bioinform ; 13(4): 406-19, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22287794

RESUMO

Many complex diseases such as cancer are associated with changes in biological pathways and molecular networks rather than being caused by single gene alterations. A major challenge in the diagnosis and treatment of such diseases is to identify characteristic aberrancies in the biological pathways and molecular network activities and elucidate their relationship to the disease. This review presents recent progress in using high-throughput biological assays to decipher aberrant pathways and network activities. In particular, this review provides specific examples in which high-throughput data have been applied to identify relationships between diseases and aberrant pathways and network activities. The achievements in this field have been remarkable, but many challenges have yet to be addressed.


Assuntos
Redes Reguladoras de Genes , Ensaios de Triagem em Larga Escala , Bases de Dados Factuais , Biologia de Sistemas
18.
Bioinformatics ; 29(21): 2774-80, 2013 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-24013927

RESUMO

MOTIVATION: Liquid chromatography-mass spectrometry (LC-MS) has been widely used for profiling expression levels of biomolecules in various '-omic' studies including proteomics, metabolomics and glycomics. Appropriate LC-MS data preprocessing steps are needed to detect true differences between biological groups. Retention time (RT) alignment, which is required to ensure that ion intensity measurements among multiple LC-MS runs are comparable, is one of the most important yet challenging preprocessing steps. Current alignment approaches estimate RT variability using either single chromatograms or detected peaks, but do not simultaneously take into account the complementary information embedded in the entire LC-MS data. RESULTS: We propose a Bayesian alignment model for LC-MS data analysis. The alignment model provides estimates of the RT variability along with uncertainty measures. The model enables integration of multiple sources of information including internal standards and clustered chromatograms in a mathematically rigorous framework. We apply the model to LC-MS metabolomic, proteomic and glycomic data. The performance of the model is evaluated based on ground-truth data, by measuring correlation of variation, RT difference across runs and peak-matching performance. We demonstrate that Bayesian alignment model improves significantly the RT alignment performance through appropriate integration of relevant information. AVAILABILITY AND IMPLEMENTATION: MATLAB code, raw and preprocessed LC-MS data are available at http://omics.georgetown.edu/alignLCMS.html. CONTACT: hwr@georgetown.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Cromatografia Líquida/métodos , Espectrometria de Massas/métodos , Algoritmos , Teorema de Bayes , Cromatografia Líquida/normas , Glicômica , Humanos , Espectrometria de Massas/normas , Metabolômica , Modelos Estatísticos , Proteômica , Padrões de Referência
19.
Methods Mol Biol ; 2822: 263-290, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38907924

RESUMO

RNA-Seq data analysis stands as a vital part of genomics research, turning vast and complex datasets into meaningful biological insights. It is a field marked by rapid evolution and ongoing innovation, necessitating a thorough understanding for anyone seeking to unlock the potential of RNA-Seq data. In this chapter, we describe the intricate landscape of RNA-seq data analysis, elucidating a comprehensive pipeline that navigates through the entirety of this complex process. Beginning with quality control, the chapter underscores the paramount importance of ensuring the integrity of RNA-seq data, as it lays the groundwork for subsequent analyses. Preprocessing is then addressed, where the raw sequence data undergoes necessary modifications and enhancements, setting the stage for the alignment phase. This phase involves mapping the processed sequences to a reference genome, a step pivotal for decoding the origins and functions of these sequences.Venturing into the heart of RNA-seq analysis, the chapter then explores differential expression analysis-the process of identifying genes that exhibit varying expression levels across different conditions or sample groups. Recognizing the biological context of these differentially expressed genes is pivotal; hence, the chapter transitions into functional analysis. Here, methods and tools like Gene Ontology and pathway analyses help contextualize the roles and interactions of the identified genes within broader biological frameworks. However, the chapter does not stop at conventional analysis methods. Embracing the evolving paradigms of data science, it delves into machine learning applications for RNA-seq data, introducing advanced techniques in dimension reduction and both unsupervised and supervised learning. These approaches allow for patterns and relationships to be discerned in the data that might be imperceptible through traditional methods.


Assuntos
Biologia Computacional , RNA-Seq , Software , RNA-Seq/métodos , Humanos , Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Genômica/métodos , Análise de Dados , Ontologia Genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos
20.
Methods Mol Biol ; 2822: 245-262, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38907923

RESUMO

RNA sequencing (RNA-Seq) has emerged as a powerful and versatile tool for the comprehensive analysis of transcriptomes and has been widely used to investigate gene expression, copy number variation, alternative splicing, and novel transcript discovery. This chapter outlines the methodology for conducting short-read RNA-Seq, starting from RNA enrichment to library preparation and sequencing. Throughout the chapter, practical tips and best practices are provided to guide researchers in order to optimize each step of the RNA-Seq workflow. Multiple quality control steps throughout the workflow that are critical to obtain high-quality RNA-Seq data are also discussed.


Assuntos
RNA-Seq , Humanos , RNA-Seq/métodos , Perfilação da Expressão Gênica/métodos , Transcriptoma/genética , Análise de Sequência de RNA/métodos , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Controle de Qualidade , RNA/genética , Fluxo de Trabalho , Software , Processamento Alternativo/genética , Biologia Computacional/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA