Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 88
Filtrar
1.
Biostatistics ; 24(4): 1031-1044, 2023 10 18.
Artigo em Inglês | MEDLINE | ID: mdl-35536588

RESUMO

Experimental design usually focuses on the setting where treatments and/or other aspects of interest can be manipulated. However, in observational biomedical studies with sequential processing, the set of available samples is often fixed, and the problem is thus rather the ordering and allocation of samples to batches such that comparisons between different treatments can be made with similar precision. In certain situations, this allocation can be done by hand, but this rapidly becomes impractical with more challenging cohort setups. Here, we present a fast and intuitive algorithm to generate balanced allocations of samples to batches for any single-variable model where the treatment variable is nominal. This greatly simplifies the grouping of samples into batches, makes the process reproducible, and provides a marked improvement over completely random allocations. The general challenges of allocation and why good solutions can be hard to find are also discussed, as well as potential extensions to multivariable settings.


Assuntos
Algoritmos , Estudos Observacionais como Assunto , Humanos , Projetos de Pesquisa
2.
Bioinformatics ; 39(10)2023 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-37756698

RESUMO

MOTIVATION: Biological network analysis for high-throughput biomedical data interpretation relies heavily on topological characteristics. Networks are commonly composed of nodes representing genes or proteins that are connected by edges when interacting. In this study, we use the rich information available in the Reactome pathway database to build biological networks accounting for small molecules and proteoforms modeled using protein isoforms and post-translational modifications to study the topological changes induced by this refinement of the network representation. RESULTS: We find that improving the interactome modeling increases the number of nodes and interactions, but that isoform and post-translational modification annotation is still limited compared to what can be expected biologically. We also note that small molecule information can distort the topology of the network due to the high connectedness of these molecules, which does not necessarily represent the reality of biology. However, by restricting the connections of small molecules to the context of biochemical reactions, we find that these improve the overall connectedness of the network and reduce the prevalence of isolated components and nodes. Overall, changing the representation of the network alters the prevalence of articulation points and bridges globally but also within and across pathways. Hence, some molecules can gain or lose in biological importance depending on the level of detail of the representation of the biological system, which might in turn impact network-based studies of diseases or druggability. AVAILABILITY AND IMPLEMENTATION: Networks are constructed based on data publicly available in the Reactome Pathway knowledgebase: reactome.org.

3.
BMC Pregnancy Childbirth ; 24(1): 238, 2024 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-38575863

RESUMO

BACKGROUND: The causal relationship between maternal smoking in pregnancy and reduced offspring birth weight is well established and is likely due to impaired placental function. However, observational studies have given conflicting results on the association between smoking and placental weight. We aimed to estimate the causal effect of newly pregnant mothers quitting smoking on their placental weight at the time of delivery. METHODS: We used one-sample Mendelian randomization, drawing data from the Avon Longitudinal Study of Parents and Children (ALSPAC) (N = 690 to 804) and the Norwegian Mother, Father and Child Cohort Study (MoBa) (N = 4267 to 4606). The sample size depends on the smoking definition used for different analyses. The analysis was performed in pre-pregnancy smokers only, due to the specific role of the single-nucleotide polymorphism (SNP) rs1051730 (CHRNA5 - CHRNA3 - CHRNB4) in affecting smoking cessation but not initiation. RESULTS: Fixed effect meta-analysis showed a 182 g [95%CI: 29,335] higher placental weight for pre-pregnancy smoking mothers who continued smoking at the beginning of pregnancy, compared with those who stopped smoking. Using the number of cigarettes smoked per day in the first trimester as the exposure, the causal effect on placental weight was 11 g [95%CI: 1,21] per cigarette per day. Similarly, smoking at the end of pregnancy was causally associated with higher placental weight. Using the residuals of birth weight regressed on placental weight as the outcome, we showed evidence of lower offspring birth weight relative to the placental weight, both for continuing smoking at the start of pregnancy as well as continuing smoking throughout pregnancy (change in z-score birth weight adjusted for z-score placental weight: -0.8 [95%CI: -1.6,-0.1]). CONCLUSION: Our results suggest that continued smoking during pregnancy causes higher placental weights.


Assuntos
Análise da Randomização Mendeliana , Placenta , Feminino , Humanos , Gravidez , Peso ao Nascer/genética , Estudos de Coortes , Estudos Longitudinais , Fumar/efeitos adversos
4.
J Proteome Res ; 22(10): 3190-3199, 2023 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-37656829

RESUMO

Precision medicine focuses on adapting care to the individual profile of patients, for example, accounting for their unique genetic makeup. Being able to account for the effect of genetic variation on the proteome holds great promise toward this goal. However, identifying the protein products of genetic variation using mass spectrometry has proven very challenging. Here we show that the identification of variant peptides can be improved by the integration of retention time and fragmentation predictors into a unified proteogenomic pipeline. By combining these intrinsic peptide characteristics using the search-engine post-processor Percolator, we demonstrate improved discrimination power between correct and incorrect peptide-spectrum matches. Our results demonstrate that the drop in performance that is induced when expanding a protein sequence database can be compensated, hence enabling efficient identification of genetic variation products in proteomics data. We anticipate that this enhancement of proteogenomic pipelines can provide a more refined picture of the unique proteome of patients and thereby contribute to improving patient care.

5.
Hum Mol Genet ; 29(23): 3845-3858, 2021 02 04.
Artigo em Inglês | MEDLINE | ID: mdl-33291140

RESUMO

Parental genetic relatedness may lead to adverse health and fitness outcomes in the offspring. However, the degree to which it affects human delivery timing is unknown. We use genotype data from ≃25 000 parent-offspring trios from the Norwegian Mother, Father and Child Cohort Study to optimize runs of homozygosity (ROH) calling by maximizing the correlation between parental genetic relatedness and offspring ROHs. We then estimate the effect of maternal, paternal and fetal autozygosity and that of autozygosity mapping (common segments and gene burden test) on the timing of spontaneous onset of delivery. The correlation between offspring ROH using a variety of parameters and parental genetic relatedness ranged between -0.2 and 0.6, revealing the importance of the minimum number of genetic variants included in an ROH and the use of genetic distance. The optimized compared to predefined parameters showed a ≃45% higher correlation between parental genetic relatedness and offspring ROH. We found no evidence of an effect of maternal, paternal nor fetal overall autozygosity on spontaneous delivery timing. Yet, through autozygosity mapping, we identified three maternal loci TBC1D1, SIGLECs and EDN1 gene regions reducing the median time-to-spontaneous onset of delivery by ≃2-5% (P-value < 2.3 × 10-6). We also found suggestive evidence of a fetal locus at 3q22.2, near the RYK gene region (P-value = 2.0 × 10-6). Autozygosity mapping may provide new insights on the genetic determinants of delivery timing beyond traditional genome-wide association studies, but particular and rigorous attention should be given to ROH calling parameter selection.


Assuntos
Mapeamento Cromossômico/métodos , Genética Populacional , Estudo de Associação Genômica Ampla , Homozigoto , Polimorfismo de Nucleotídeo Único , Criança , Estudos de Coortes , Genoma Humano , Humanos , Noruega , Pais
6.
J Proteome Res ; 20(1): 122-128, 2021 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-32969222

RESUMO

Randomization is used in experimental design to reduce the prevalence of unanticipated confounders. Complete randomization can however create imbalanced designs, for example, grouping all samples of the same condition in the same batch. Block randomization is an approach that can prevent severe imbalances in sample allocation with respect to both known and unknown confounders. This feature provides the reader with an introduction to blocking and randomization, and insights into how to effectively organize samples during experimental design, with special considerations with respect to proteomics.


Assuntos
Proteômica , Projetos de Pesquisa , Distribuição Aleatória
7.
J Proteome Res ; 20(12): 5419-5423, 2021 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-34709836

RESUMO

Mass spectrometry-based proteomics is a high-throughput technology generating ever-larger amounts of data per project. However, storing, processing, and interpreting these data can be a challenge. A key element in simplifying this process is the development of interactive frameworks focusing on visualization that can greatly simplify both the interpretation of data and the generation of new knowledge. Here we present PeptideShaker Online, a user-friendly web-based framework for the identification of mass spectrometry-based proteomics data, from raw file conversion to interactive visualization of the resulting data. Storage and processing of the data are performed via the versatile Galaxy platform (through SearchGUI, PeptideShaker, and moFF), while the interaction with the results happens via a locally installed web server, thus enabling researchers to process and interpret their own data without requiring advanced bioinformatics skills or direct access to compute-intensive infrastructures. The source code, additional documentation, and a fully functional demo is available at https://github.com/barsnes-group/peptide-shaker-online.


Assuntos
Proteômica , Software , Biologia Computacional/métodos , Internet , Espectrometria de Massas , Proteômica/métodos
8.
Hum Mol Genet ; 28(19): 3327-3338, 2019 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-31504550

RESUMO

Although hundreds of genome-wide association studies-implicated loci have been reported for adult obesity-related traits, less is known about the genetics specific for early-onset obesity and with only a few studies conducted in non-European populations to date. Searching for additional genetic variants associated with childhood obesity, we performed a trans-ancestral meta-analysis of 30 studies consisting of up to 13 005 cases (≥95th percentile of body mass index (BMI) achieved 2-18 years old) and 15 599 controls (consistently <50th percentile of BMI) of European, African, North/South American and East Asian ancestry. Suggestive loci were taken forward for replication in a sample of 1888 cases and 4689 controls from seven cohorts of European and North/South American ancestry. In addition to observing 18 previously implicated BMI or obesity loci, for both early and late onset, we uncovered one completely novel locus in this trans-ancestral analysis (nearest gene, METTL15). The variant was nominally associated with only the European subgroup analysis but had a consistent direction of effect in other ethnicities. We then utilized trans-ancestral Bayesian analysis to narrow down the location of the probable causal variant at each genome-wide significant signal. Of all the fine-mapped loci, we were able to narrow down the causative variant at four known loci to fewer than 10 single nucleotide polymorphisms (SNPs) (FAIM2, GNPDA2, MC4R and SEC16B loci). In conclusion, an ethnically diverse setting has enabled us to both identify an additional pediatric obesity locus and further fine-map existing loci.


Assuntos
Mapeamento Cromossômico/métodos , Estudo de Associação Genômica Ampla/métodos , Obesidade Infantil/genética , Polimorfismo de Nucleotídeo Único , Tumor de Wilms/genética , Teorema de Bayes , Estudos de Casos e Controles , Criança , Feminino , Loci Gênicos , Predisposição Genética para Doença , Humanos , Masculino
9.
Mass Spectrom Rev ; 39(3): 292-306, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-28902424

RESUMO

Sequence database search engines are bioinformatics algorithms that identify peptides from tandem mass spectra using a reference protein sequence database. Two decades of development, notably driven by advances in mass spectrometry, have provided scientists with more than 30 published search engines, each with its own properties. In this review, we present the common paradigm behind the different implementations, and its limitations for modern mass spectrometry datasets. We also detail how the search engines attempt to alleviate these limitations, and provide an overview of the different software frameworks available to the researcher. Finally, we highlight alternative approaches for the identification of proteomic mass spectrometry datasets, either as a replacement for, or as a complement to, sequence database search engines.


Assuntos
Espectrometria de Massas/métodos , Proteínas/química , Proteômica/métodos , Ferramenta de Busca/métodos , Animais , Humanos , Fluxo de Trabalho
10.
Stem Cells ; 38(4): 542-555, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-31828876

RESUMO

A comprehensive characterization of the molecular processes controlling cell fate decisions is essential to derive stable progenitors and terminally differentiated cells that are functional from human pluripotent stem cells (hPSCs). Here, we report the use of quantitative proteomics to describe early proteome adaptations during hPSC differentiation toward pancreatic progenitors. We report that the use of unbiased quantitative proteomics allows the simultaneous profiling of numerous proteins at multiple time points, and is a valuable tool to guide the discovery of signaling events and molecular signatures underlying cellular differentiation. We also monitored the activity level of pathways whose roles are pivotal in the early pancreas differentiation, including the Hippo signaling pathway. The quantitative proteomics data set provides insights into the dynamics of the global proteome during the transition of hPSCs from a pluripotent state toward pancreatic differentiation.


Assuntos
Pâncreas/metabolismo , Células-Tronco Pluripotentes/metabolismo , Proteoma/metabolismo , Proteômica/métodos , Diferenciação Celular , Humanos , Pâncreas/citologia
11.
Bioinformatics ; 35(7): 1249-1251, 2019 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-30169737

RESUMO

SUMMARY: Data visualization plays critical roles in proteomics studies, ranging from quality control of MS/MS data to validation of peptide identification results. Herein, we present PDV, an integrative proteomics data viewer that can be used to visualize a wide range of proteomics data, including database search results, de novo sequencing results, proteogenomics files, MS/MS data in mzML/mzXML format and data from public proteomics repositories. PDV is a lightweight visualization tool that enables intuitive and fast exploration of diverse, large-scale proteomics datasets on standard desktop computers in both graphical user interface and command line modes. AVAILABILITY AND IMPLEMENTATION: PDV software and the user manual are freely available at http://pdv.zhang-lab.org. The source code is available at https://github.com/wenbostar/PDV and is released under the GPL-3 license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Proteômica , Peptídeos , Software , Espectrometria de Massas em Tandem
12.
Behav Genet ; 50(1): 51-66, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31493278

RESUMO

There is increasing interest within the genetics community in estimating the relative contribution of parental genetic effects on offspring phenotypes. Here we describe the user-friendly M-GCTA software package used to estimate the proportion of phenotypic variance explained by maternal (or alternatively paternal) and offspring genotypes on offspring phenotypes. The tool requires large studies where genome-wide genotype data are available on mother- (or alternatively father-) offspring pairs. The software includes several options for data cleaning and quality control, including the ability to detect and automatically remove cryptically related pairs of individuals. It also allows users to construct genetic relationship matrices indexing genetic similarity across the genome between parents and offspring, enabling the estimation of variance explained by maternal (or alternatively paternal) and offspring genetic effects. We evaluated the performance of the software using a range of data simulations and estimated the computing time and memory requirements. We demonstrate the use of M-GCTA on previously analyzed birth weight data from two large population based birth cohorts, the Avon Longitudinal Study of Parents and Children (ALSPAC) and the Norwegian Mother and Child Cohort Study (MoBa). We show how genetic variation in birth weight is predominantly explained by fetal genetic rather than maternal genetic sources of variation.


Assuntos
Peso ao Nascer/genética , Previsões/métodos , Criança , Estudos de Coortes , Simulação por Computador , Pai , Feminino , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Estudos Longitudinais , Masculino , Herança Materna/fisiologia , Modelos Genéticos , Mães , Pais , Herança Paterna/fisiologia , Fenótipo , Software
13.
Mol Cell Proteomics ; 16(7): 1275-1285, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-28515314

RESUMO

The first stable version of the Proteomics Standards Initiative mzIdentML open data standard (version 1.1) was published in 2012-capturing the outputs of peptide and protein identification software. In the intervening years, the standard has become well-supported in both commercial and open software, as well as a submission and download format for public repositories. Here we report a new release of mzIdentML (version 1.2) that is required to keep pace with emerging practice in proteome informatics. New features have been added to support: (1) scores associated with localization of modifications on peptides; (2) statistics performed at the level of peptides; (3) identification of cross-linked peptides; and (4) support for proteogenomics approaches. In addition, there is now improved support for the encoding of de novo sequencing of peptides, spectral library searches, and protein inference. As a key point, the underlying XML schema has only undergone very minor modifications to simplify as much as possible the transition from version 1.1 to version 1.2 for implementers, but there have been several notable updates to the format specification, implementation guidelines, controlled vocabularies and validation software. mzIdentML 1.2 can be described as backwards compatible, in that reading software designed for mzIdentML 1.1 should function in most cases without adaptation. We anticipate that these developments will provide a continued stable base for software teams working to implement the standard. All the related documentation is accessible at http://www.psidev.info/mzidentml.


Assuntos
Biologia Computacional/normas , Proteômica/normas , Bases de Dados de Proteínas , Software
14.
Proteomics ; 18(18): e1700150, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-29968278

RESUMO

In shotgun proteomics, peptide and protein identification is most commonly conducted using database search engines, the method of choice when reference protein sequences are available. Despite its widespread use the database-driven approach is limited, mainly because of its static search space. In contrast, de novo sequencing derives peptide sequence information in an unbiased manner, using only the fragment ion information from the tandem mass spectra. In recent years, with the improvements in MS instrumentation, various new methods have been proposed for de novo sequencing. This review article provides an overview of existing de novo sequencing algorithms and software tools ranging from peptide sequencing to sequence-to-protein mapping. Various use cases are described for which de novo sequencing was successfully applied. Finally, limitations of current methods are highlighted and new directions are discussed for a wider acceptance of de novo sequencing in the community.


Assuntos
Proteômica/métodos , Análise de Sequência de Proteína/métodos , Software , Algoritmos , Animais , Biologia Computacional/métodos , Humanos , Proteínas/análise , Proteínas/metabolismo
15.
J Proteome Res ; 17(7): 2552-2555, 2018 07 06.
Artigo em Inglês | MEDLINE | ID: mdl-29774740

RESUMO

Mass-spectrometry-based proteomics has become the standard approach for identifying and quantifying proteins. A vital step consists of analyzing experimentally generated mass spectra to identify the underlying peptide sequences for later mapping to the originating proteins. We here present the latest developments in SearchGUI, a common open-source interface for the most frequently used freely available proteomics search and de novo engines that has evolved into a central component in numerous bioinformatics workflows.


Assuntos
Proteômica/métodos , Ferramenta de Busca/métodos , Espectrometria de Massas em Tandem , Algoritmos , Biologia Computacional , Proteínas/análise , Fluxo de Trabalho
16.
J Proteome Res ; 17(11): 3801-3809, 2018 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-30251541

RESUMO

Biochemical pathways are commonly used as a reference to conduct functional analysis on biomedical omics data sets, where experimental results are mapped to knowledgebases comprising known molecular interactions collected from the literature. Due to their central role, the content of the functional knowledgebases directly influences the outcome of pathway analyses. In this study, we investigate the structure of the current pathway knowledge, as exemplified by Reactome, discuss the consequences for biological interpretation, and outline possible improvements in the use of pathway knowledgebases. By providing a view of the underlying protein interaction network, we aim to help pathway analysis users manage their expectations and better identify possible artifacts in the results.


Assuntos
Biologia Computacional/métodos , Linfócitos/metabolismo , Células Mieloides/metabolismo , Mapeamento de Interação de Proteínas/métodos , Proteômica/métodos , Bases de Dados de Proteínas , Humanos , Bases de Conhecimento , Linfócitos/citologia , Redes e Vias Metabólicas/fisiologia , Células Mieloides/citologia , Mapas de Interação de Proteínas
17.
Bioinformatics ; 33(13): 2042-2044, 2017 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-28334306

RESUMO

AVAILABILITY AND IMPLEMENTATION: PeptideMapper is implemented in the open source Java CompOmics framework under the permissive Apache 2.0 license https://github.com/compomics/compomics-utilities . CONTACT: robert.ahrends@isas.de or marc.vaudel@uib.no. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Análise de Sequência de Proteína/métodos , Software , Animais , Humanos , Camundongos/metabolismo , Leveduras
18.
Bioinformatics ; 33(16): 2580-2582, 2017 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-28379341

RESUMO

MOTIVATION: BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters). AVAILABILITY AND IMPLEMENTATION: The software is freely available at github.com/BioContainers/. CONTACT: yperez@ebi.ac.uk.


Assuntos
Biologia Computacional/métodos , Software , Genômica/métodos , Metabolômica/métodos , Proteômica/métodos
19.
J Proteome Res ; 16(1): 179-194, 2017 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-27728768

RESUMO

In the current study, we conducted a quantitative in-depth proteome and deglycoproteome analysis of cerebrospinal fluid (CSF) from relapsing-remitting multiple sclerosis (RRMS) and neurological controls using mass spectrometry and pathway analysis. More than 2000 proteins and 1700 deglycopeptides were quantified, with 484 proteins and 180 deglycopeptides significantly changed between pools of RRMS and pools of controls. Approximately 300 of the significantly changed proteins were assigned to various biological processes including inflammation, extracellular matrix organization, cell adhesion, immune response, and neuron development. Ninety-six significantly changed deglycopeptides mapped to proteins that were not found changed in the global protein study. In addition, four mapped to the proteins oligo-myelin glycoprotein and noelin, which were found oppositely changed in the global study. Both are ligands to the nogo receptor, and the glycosylation of these proteins appears to be affected by RRMS. Our study gives the most extensive overview of the RRMS affected processes observed from the CSF proteome to date, and the list of differential proteins will have great value for selection of biomarker candidates for further verification.


Assuntos
Proteínas do Líquido Cefalorraquidiano/genética , Matriz Extracelular/genética , Esclerose Múltipla Recidivante-Remitente/genética , Glicoproteína Mielina-Oligodendrócito/genética , Proteoma/genética , Biomarcadores/líquido cefalorraquidiano , Estudos de Casos e Controles , Adesão Celular , Proteínas do Líquido Cefalorraquidiano/líquido cefalorraquidiano , Proteínas do Líquido Cefalorraquidiano/imunologia , Matriz Extracelular/imunologia , Proteínas da Matriz Extracelular/líquido cefalorraquidiano , Proteínas da Matriz Extracelular/genética , Proteínas da Matriz Extracelular/imunologia , Expressão Gênica , Glicoproteínas/líquido cefalorraquidiano , Glicoproteínas/genética , Glicoproteínas/imunologia , Humanos , Imunidade Inata , Inflamação , Esclerose Múltipla Recidivante-Remitente/líquido cefalorraquidiano , Esclerose Múltipla Recidivante-Remitente/imunologia , Esclerose Múltipla Recidivante-Remitente/patologia , Glicoproteína Mielina-Oligodendrócito/líquido cefalorraquidiano , Glicoproteína Mielina-Oligodendrócito/imunologia , Neurogênese/genética , Neurogênese/imunologia , Receptor Nogo 1/genética , Receptor Nogo 1/imunologia , Receptor Nogo 1/metabolismo , Mapeamento de Interação de Proteínas , Proteoma/imunologia , Proteoma/metabolismo
20.
Expert Rev Proteomics ; 14(8): 649-663, 2017 08.
Artigo em Inglês | MEDLINE | ID: mdl-28693350

RESUMO

INTRODUCTION: Mass spectrometry (MS)-based proteomics has become an indispensable tool for the characterization of the proteome and its post-translational modifications (PTM). In addition to standard protein sequence databases, proteogenomics strategies search the spectral data against the theoretical spectra obtained from customized protein sequence databases. Up to date, there are no published proteogenomics studies on acute myeloid leukemia (AML) samples. Areas covered: Proteogenomics involves the understanding of genomic and proteomic data. The intersection of both datatypes requires advanced bioinformatics skills. A standard proteogenomics workflow that could be used for the study of AML samples is described. The generation of customized protein sequence databases as well as bioinformatics tools and pipelines commonly used in proteogenomics are discussed in detail. Expert commentary: Drawing on evidence from recent cancer proteogenomics studies and taking into account the public availability of AML genomic data, the interpretation of present and future MS-based AML proteomic data using AML-specific protein sequence databases could discover new biological mechanisms and targets in AML. However, proteogenomics workflows including bioinformatics guidelines can be challenging for the wide AML research community. It is expected that further automation and simplification of the bioinformatics procedures might attract AML investigators to adopt the proteogenomics strategy.


Assuntos
Biomarcadores Tumorais/metabolismo , Leucemia Mieloide Aguda/genética , Proteogenômica/métodos , Biologia Computacional , Humanos , Espectrometria de Massas
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa