1.
Nucleic Acids Res ; 51(W1): W411-W418, 2023 Jul 05.
Article in English | MEDLINE | ID: mdl-37207338

ABSTRACT

Genomics studies routinely confront researchers with long lists of tumor alterations detected in patients. Such lists are difficult to interpret since only a minority of the alterations are relevant biomarkers for diagnosis and for designing therapeutic strategies. PanDrugs is a methodology that facilitates the interpretation of tumor molecular alterations and guides the selection of personalized treatments. To do so, PanDrugs scores gene actionability and drug feasibility to provide a prioritized evidence-based list of drugs. Here, we introduce PanDrugs2, a major upgrade of PanDrugs that, in addition to somatic variant analysis, supports a new integrated multi-omics analysis that simultaneously combines somatic and germline variants, copy number variation and gene expression data. Moreover, PanDrugs2 now considers cancer genetic dependencies to extend tumor vulnerabilities, providing therapeutic options for untargetable genes. Importantly, a novel intuitive report to support clinical decision-making is generated. The PanDrugs database has been updated, integrating 23 primary sources that support >74K drug-gene associations obtained from 4,642 genes and 14,659 unique compounds. The database has also been reimplemented to allow semi-automatic updates to facilitate maintenance and release of future versions. PanDrugs2 does not require login and is freely available at https://www.pandrugs.org/.


Subjects
Multiomics, Neoplasms, Humans, DNA Copy Number Variations, Genomics/methods, Neoplasms/drug therapy, Neoplasms/genetics, Neoplasms/pathology, Precision Medicine/methods
2.
BMC Bioinformatics ; 25(1): 200, 2024 May 27.
Article in English | MEDLINE | ID: mdl-38802733

ABSTRACT

BACKGROUND: The initial version of SEDA assists life science researchers without programming skills with the preparation of DNA and protein sequence FASTA files for multiple bioinformatics applications. However, the initial version of SEDA lacks a command-line interface for more advanced users and does not allow the creation of automated analysis pipelines. RESULTS: The present paper discusses the updates of the new SEDA release, including the addition of a complete command-line interface, new functionalities like gene annotation, a framework for automated pipelines, and improved integration in Linux environments. CONCLUSION: SEDA is an open-source Java application and can be installed using the different distributions available (https://www.sing-group.org/seda/download.html) as well as through a Docker image (https://hub.docker.com/r/pegi3s/seda). It is released under a GPL-3.0 license, and its source code is publicly accessible on GitHub (https://github.com/sing-group/seda). The software version at the time of submission is archived at Zenodo (version v1.6.0, http://doi.org/10.5281/zenodo.10201605).


Subjects
Computational Biology, Software, Computational Biology/methods, Data Analysis
3.
Int J Mol Sci ; 25(4)2024 Feb 19.
Article in English | MEDLINE | ID: mdl-38397104

ABSTRACT

SARS-CoV-2 amino acid variants that contribute to an increased transmissibility or to host immune system escape are likely to increase in frequency due to positive selection and may be identified using different methods, such as codeML, FEL, FUBAR, and MEME. Nevertheless, when using different methods, the results do not always agree. The sampling scheme used in different studies may partially explain the differences that are found, but there is also the possibility that some of the identified positively selected amino acid sites are false positives. This is especially important in the context of very large-scale projects where hundreds of analyses have been performed for the same protein-coding gene. To account for these issues, in this work, we have identified positively selected amino acid sites in SARS-CoV-2 and 15 other coronavirus species, using both codeML and FUBAR, and compared the location of such sites in the different species. Moreover, we also compared our results to those that are available in the COV2Var database and the frequency of the 10 most frequent variants and predicted protein location to identify those sites that are supported by multiple lines of evidence. Amino acid changes observed at these sites should always be of concern. The information reported for SARS-CoV-2 can also be used to identify variants of concern in other coronaviruses.


Subjects
COVID-19, SARS-CoV-2, Humans, SARS-CoV-2/genetics, Amino Acids/genetics
4.
Clin Proteomics ; 20(1): 54, 2023 Nov 28.
Article in English | MEDLINE | ID: mdl-38017382

ABSTRACT

BACKGROUND: This study investigates the proteomic landscapes of chromophobe renal cell carcinoma (chRCC) and renal oncocytomas (RO), two subtypes of renal cell carcinoma that together account for approximately 10% of all renal tumors. Despite their histological similarities and shared origins, chRCC is a malignant tumor necessitating aggressive intervention, while RO, a benign growth, is often subject to overtreatment due to difficulties in accurate differentiation. METHODS: We conducted a label-free quantitative proteomic analysis on solid biopsies of chRCC (n = 5), RO (n = 5), and normal adjacent tissue (NAT, n = 5). The quantitative analysis was carried out by comparing protein abundances between tumor and NAT specimens. Our analysis identified a total of 1610 proteins across all samples, with 1379 (85.7%) of these proteins quantified in at least seven out of ten LC‒MS/MS runs for one renal tissue type (chRCC, RO, or NAT). RESULTS: Our findings revealed significant similarities in the dysregulation of key metabolic pathways, including carbohydrate, lipid, and amino acid metabolism, in both chRCC and RO. Compared to NAT, both chRCC and RO showed a marked downregulation in gluconeogenesis proteins, but a significant upregulation of proteins integral to the citrate cycle. Interestingly, we observed a distinct divergence in the oxidative phosphorylation pathway, with RO showing a significant increase in the number and degree of alterations in proteins, surpassing that observed in chRCC. CONCLUSIONS: This study underscores the value of integrating high-resolution mass spectrometry protein quantification to effectively characterize and differentiate the proteomic landscapes of solid tumor biopsies diagnosed as chRCC and RO. The insights gained from this research offer valuable information for enhancing our understanding of these conditions and may aid in the development of improved diagnostic and therapeutic strategies.
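
The quantification criterion in the METHODS above (a protein counts as quantified when it is measured in at least seven of ten LC-MS/MS runs for one tissue type) can be sketched as below. This is only an illustration under stated assumptions: the data layout, the function name `passes_filter`, and the toy abundance values are hypothetical and do not come from the study. The last lines simply re-derive the 85.7% figure from the counts reported in the abstract.

```python
from typing import Dict, List, Optional

MIN_RUNS = 7  # threshold stated in the abstract: at least 7 of 10 runs


def passes_filter(
    runs_by_tissue: Dict[str, List[Optional[float]]],
    min_runs: int = MIN_RUNS,
) -> bool:
    """Return True if any tissue type has >= min_runs non-missing abundances.

    `None` marks a run in which the protein was not quantified
    (a hypothetical encoding chosen for this sketch).
    """
    return any(
        sum(value is not None for value in runs) >= min_runs
        for runs in runs_by_tissue.values()
    )


# Toy, made-up abundance vectors (10 runs per tissue type).
protein_a = {
    "chRCC": [1.0] * 8 + [None] * 2,   # 8 of 10 runs: passes
    "RO": [None] * 10,
    "NAT": [2.0] * 3 + [None] * 7,
}
protein_b = {
    "chRCC": [1.0] * 6 + [None] * 4,   # 6 of 10 runs: fails
    "RO": [1.5] * 5 + [None] * 5,
    "NAT": [2.0] * 3 + [None] * 7,
}

kept_a = passes_filter(protein_a)
kept_b = passes_filter(protein_b)

# Share of quantified proteins, from the counts given in the abstract.
quantified_share = round(100 * 1379 / 1610, 1)
```

Requiring the threshold in only one tissue type, rather than in all of them, keeps proteins that are strongly present in one condition and absent in the others, which is often exactly the signal of interest in a tumor-versus-normal comparison.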

5.
Int J Mol Sci ; 24(3)2023 Jan 18.
Article in English | MEDLINE | ID: mdl-36768211

ABSTRACT

Schizophrenia (SZ) is a serious mental disorder that is typically treated with antipsychotic medication. Treatment-resistant schizophrenia (TRS) is the condition where symptoms remain after pharmacological intervention, resulting in long-lasting functional and social impairments. As the identification and treatment of a TRS patient requires previous failed treatments, early mechanisms of detection are needed in order to quicken the access to effective therapy, as well as improve treatment adherence. In this study, we aim to find a microRNA (miRNA) signature for TRS, as well as to shed some light on the molecular pathways potentially involved in this severe condition. To do this, we compared the blood miRNAs of schizophrenia patients that respond to medication and TRS patients, thus obtaining a 16-miRNA TRS profile. Then, we assessed the ability of this signature to separate responders and TRS patients using hierarchical clustering, observing that most of them are grouped correctly (~70% accuracy). We also conducted a network, pathway analysis, and bibliography search to spot molecular pathways potentially altered in TRS. We found that the response to stress seems to be a key factor in TRS and that proteins p53, SIRT1, MDM2, and TRIM28 could be the potential mediators of such responses. Finally, we suggest a molecular pathway potentially regulated by the miRNAs of the TRS profile.


Subjects
Antipsychotic Agents, MicroRNAs, Schizophrenia, Humans, Schizophrenia/drug therapy, Schizophrenia/genetics, Schizophrenia/diagnosis, MicroRNAs/genetics, MicroRNAs/therapeutic use, Treatment-Resistant Schizophrenia, Antipsychotic Agents/pharmacology, Antipsychotic Agents/therapeutic use, Drug Resistance/genetics
6.
Bioinformatics ; 37(4): 578-579, 2021 May 01.
Article in English | MEDLINE | ID: mdl-32818254

ABSTRACT

MOTIVATION: Drug immunomodulation modifies the response of the immune system and can be therapeutically exploited in pathologies such as cancer and autoimmune diseases. RESULTS: DREIMT is a new hypothesis-generation web tool, which performs drug prioritization analysis for immunomodulation. DREIMT provides significant immunomodulatory drugs targeting up to 70 immune cell subtypes through a curated database that integrates 4960 drug profiles and ∼2600 immune gene expression signatures. The tool also suggests potential immunomodulatory drugs targeting user-supplied gene expression signatures. Final output includes drug-signature association scores, FDRs and downloadable plots and results tables. AVAILABILITY AND IMPLEMENTATION: http://www.dreimt.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Drug Repositioning, Transcriptome, Factual Databases, Pharmaceutical Databases, Immunomodulation
7.
BMC Evol Biol ; 19(1): 126, 2019 Jun 18.
Article in English | MEDLINE | ID: mdl-31215418

ABSTRACT

BACKGROUND: L-ascorbate (Vitamin C) is an important antioxidant and co-factor in eukaryotic cells, and in mammals it is indispensable for brain development and cognitive function. Vertebrates usually become L-ascorbate auxotrophs when the last enzyme of the synthetic pathway, an L-gulonolactone oxidase (GULO), is lost. Since Protostomes were until recently thought not to have a GULO gene, they were considered to be auxotrophs for Vitamin C. RESULTS: By performing phylogenetic analyses with tens of non-Bilateria and Protostomian genomes, it is shown that a GULO gene is present in the non-Bilateria Placozoa, Myxozoa (here reported for the first time) and Anthozoa groups, and in Protostomians, in the Araneae family, the Gastropoda class, the Acari subclass (here reported for the first time), and the Priapulida, Annelida (here reported for the first time) and Brachiopoda phyla lineages. GULO is an old gene that predates the separation of Animals and Fungi, although it could be much older. We also show that within Protostomes, GULO has been lost multiple times in large taxonomic groups, namely the Pancrustacea, Nematoda, Platyhelminthes and Bivalvia groups, a pattern similar to that reported for Vertebrate species. Nevertheless, we show that Drosophila melanogaster seems to be capable of synthesizing L-ascorbate, likely through an alternative pathway, as recently reported for Caenorhabditis elegans. CONCLUSIONS: Non-Bilaterians and Protostomians seem to be able to synthesize Vitamin C either through the conventional animal pathway or an alternative pathway, but in this animal group, not being able to synthesize L-ascorbate seems to be the exception rather than the rule.


Subjects
Ascorbic Acid/metabolism, Eukaryota/enzymology, Eukaryota/genetics, Molecular Evolution, L-Gulonolactone Oxidase/genetics, Animals, Drosophila melanogaster/genetics, Eukaryota/classification, Eukaryota/metabolism, Genome, L-Gulonolactone Oxidase/chemistry, L-Gulonolactone Oxidase/metabolism, Molecular Models, Phylogeny, Vertebrates/classification, Vertebrates/genetics
8.
Bioinformatics ; 34(8): 1414-1415, 2018 Apr 15.
Article in English | MEDLINE | ID: mdl-29211825

ABSTRACT

Summary: High-throughput sequencing of bisulfite-converted DNA is a technique used to measure DNA methylation levels. Although a considerable number of computational pipelines have been developed to analyze such data, none of them tackles all the peculiarities of the analysis together, revealing limitations that can force the user to manually perform additional steps needed for a complete processing of the data. This article presents bicycle, an integrated, flexible analysis pipeline for bisulfite sequencing data. Bicycle analyzes whole genome bisulfite sequencing data, targeted bisulfite sequencing data and hydroxymethylation data. To show how bicycle overtakes other available pipelines, we compared them on a defined number of features that are summarized in a table. We also tested bicycle with both simulated and real datasets, to show its level of performance, and compared it to different state-of-the-art methylation analysis pipelines. Availability and implementation: Bicycle is publicly available under GNU LGPL v3.0 license at http://www.sing-group.org/bicycle. Users can also download a customized Ubuntu LiveCD including bicycle and other bisulfite sequencing data pipelines compared here. In addition, a docker image with bicycle and its dependencies, which allows a straightforward use of bicycle in any platform (e.g. Linux, OS X or Windows), is also available. Contact: ograna@cnio.es or dgpena@uvigo.es. Supplementary information: Supplementary data are available at Bioinformatics online.


Subjects
DNA Methylation, High-Throughput Nucleotide Sequencing/methods, DNA Sequence Analysis/methods, Software, Computational Biology, Sulfites
9.
Brief Bioinform ; 15(5): 788-97, 2014 Sep.
Article in English | MEDLINE | ID: mdl-23632294

ABSTRACT

Web services are the de facto standard in biomedical data integration. However, there are data integration scenarios that cannot be fully covered by Web services. A number of Web databases and tools do not support Web services, and existing Web services do not cover all possible user data demands. As a consequence, Web data scraping, one of the oldest techniques for extracting Web contents, is still in a position to offer a valid and valuable service to a wide range of bioinformatics applications, ranging from simple extraction robots to online meta-servers. This article reviews existing scraping frameworks and tools, identifying their strengths and limitations in terms of extraction capabilities. The main focus is set on showing how straightforward it is today to set up a data scraping pipeline, with minimal programming effort, and answer a number of practical needs. For exemplification purposes, we introduce a biomedical data extraction scenario where the desired data sources, well-known in clinical microbiology and similar domains, do not offer programmatic interfaces yet. Moreover, we describe the operation of WhichGenes and PathJam, two bioinformatics meta-servers that use scraping as a means to cope with gene set enrichment analysis.


Subjects
Internet, Database Management Systems, User-Computer Interface
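
As a rough illustration of the kind of minimal-effort scraping pipeline the abstract above refers to, the sketch below extracts table rows from an HTML page using only Python's standard `html.parser` module. It is not one of the frameworks reviewed in the article; the class name `TableCellExtractor`, the helper `scrape_rows`, and the inline sample page are hypothetical placeholders, and a real pipeline would first download the page and respect the site's terms of use.

```python
from html.parser import HTMLParser


class TableCellExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []        # finished rows, each a list of cell strings
        self._row = None      # row currently being built, or None
        self._in_cell = False
        self._cell = []       # text fragments of the current cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._cell = []

    def handle_data(self, data):
        if self._in_cell:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._in_cell:
            self._row.append("".join(self._cell).strip())
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Made-up placeholder page standing in for a source without an API.
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>Value</th></tr>
  <tr><td>Record A</td><td>value-1</td></tr>
  <tr><td>Record B</td><td>value-2</td></tr>
</table>
"""


def scrape_rows(html: str):
    """Parse an HTML string and return its table rows as lists of strings."""
    parser = TableCellExtractor()
    parser.feed(html)
    parser.close()
    return parser.rows


rows = scrape_rows(SAMPLE_HTML)
```

The extractor depends on the page's markup, so any layout change on the source site can silently break it; that fragility is the usual trade-off of scraping compared with a documented Web service.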
10.
J Integr Bioinform ; 21(2)2024 Jun 01.
Article in English | MEDLINE | ID: mdl-39054685

ABSTRACT

When inferring the evolution of a gene/gene family, it is advisable to use all available coding sequences (CDS) from as many species genomes as possible in order to infer and date all gene duplications and losses. Nowadays, this means using hundreds or even thousands of CDSs, which makes the inferred phylogenetic trees difficult to visualize and interpret. Therefore, it is useful to have an automated way of collapsing large phylogenetic trees according to a taxonomic term decided by the user (family, class, or order, for instance), in order to highlight the minimal set of sequences that should be used to recapitulate the full history of the gene/gene family being studied at that taxonomic level, that can be refined using additional software. Here we present the Phylogenetic Tree Collapser (PTC) program (https://github.com/pegi3s/phylogenetic-tree-collapser), a flexible tool for automated tree collapsing using taxonomic information, that can be easily used by researchers without a background in informatics, since it only requires the installation of Docker, Podman or Singularity. The utility of PTC is demonstrated by addressing the evolution of the ascorbic acid synthesis pathway in insects. A Docker image is available at Docker Hub (https://hub.docker.com/r/pegi3s/phylogenetic-tree-collapser) with PTC installed and ready-to-run.


Subjects
Ascorbic Acid, Molecular Evolution, Insects, Phylogeny, Software, Animals, Ascorbic Acid/biosynthesis, Ascorbic Acid/metabolism, Insects/genetics, Insects/metabolism
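
The idea of collapsing a tree by a user-chosen taxonomic term, described in the abstract above, can be illustrated with a toy sketch. This is not PTC's algorithm or interface, which the abstract does not detail; it assumes a tree encoded as nested tuples of leaf names and a hypothetical `taxon_of` mapping from each sequence to its taxon at the chosen rank. Any subtree whose leaves all share one taxon is replaced by a single labeled node.

```python
def leaves(tree):
    """Return the leaf names of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return [tree]
    result = []
    for child in tree:
        result.extend(leaves(child))
    return result


def collapse(tree, taxon_of):
    """Collapse every subtree whose leaves all belong to the same taxon.

    A collapsed subtree becomes the string "<taxon> (<number of leaves>)".
    Subtrees that mix taxa are kept and their children processed in turn.
    """
    subtree_leaves = leaves(tree)
    taxa = {taxon_of[leaf] for leaf in subtree_leaves}
    if len(taxa) == 1:
        (taxon,) = taxa
        return f"{taxon} ({len(subtree_leaves)})"
    return tuple(collapse(child, taxon_of) for child in tree)


# Hypothetical toy tree and taxonomy (family level), for illustration only.
toy_tree = ((("seq1", "seq2"), "seq3"), ("seq4", "seq5"))
toy_taxonomy = {
    "seq1": "FamilyA",
    "seq2": "FamilyA",
    "seq3": "FamilyB",
    "seq4": "FamilyB",
    "seq5": "FamilyB",
}

collapsed = collapse(toy_tree, toy_taxonomy)
```

In this toy case `seq3` is not merged with `seq4` and `seq5` even though all three are in FamilyB, because they do not form a single clade; keeping such sequences separate is what preserves the duplication and loss history at the chosen taxonomic level.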
11.
J Integr Bioinform ; 21(2)2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38529929

ABSTRACT

The vast amount of genome sequence data that is available, and that is predicted to drastically increase in the near future, can only be efficiently dealt with by building automated pipelines. Indeed, the Earth Biogenome Project will produce high-quality reference genome sequences for all 1.8 million named living eukaryote species, providing unprecedented insight into the evolution of genes and gene families, and thus on biological issues. Here, new modules for gene annotation, further BLAST search algorithms, further multiple sequence alignment methods, the addition of reference sequences, further tree rooting methods, the estimation of rates of synonymous and nonsynonymous substitutions, and the identification of positively selected amino acid sites, have been added to auto-phylo (version 2), a recently developed software to address biological problems using phylogenetic inferences. Additionally, we present auto-phylo-pipeliner, a graphical user interface application that further facilitates the creation and running of auto-phylo pipelines. Inferences on S-RNase specificity are critical for both cross-based breeding and the establishment of pollination requirements. Therefore, as a test case, we develop an auto-phylo pipeline to identify amino acid sites under positive selection, which are, in principle, those determining S-RNase specificity, starting from both non-annotated Prunus genomes and sequences available in public databases.


Subjects
Phylogeny, Software, Algorithms, Sequence Alignment, Genetic Selection, Amino Acids/genetics, Amino Acids/chemistry
12.
Front Oncol ; 14: 1393815, 2024.
Article in English | MEDLINE | ID: mdl-38846970

ABSTRACT

Background: PolyDeep is a computer-aided detection and classification (CADe/x) system trained to detect and classify polyps. During colonoscopy, CADe/x systems help endoscopists to predict the histology of colonic lesions. Objective: To compare the diagnostic performance of PolyDeep and expert endoscopists for the optical diagnosis of colorectal polyps on still images. Methods: PolyDeep Image Classification (PIC) is an in vitro diagnostic test study. The PIC database contains NBI images of 491 colorectal polyps with histological diagnosis. We evaluated the diagnostic performance of PolyDeep and four expert endoscopists for neoplasia (adenoma, sessile serrated lesion, traditional serrated adenoma) and adenoma characterization and compared them with the McNemar test. Receiver operating characteristic curves were constructed to assess the overall discriminatory ability, comparing the area under the curve of endoscopists and PolyDeep with the chi-square homogeneity areas test. Results: The diagnostic performance of the endoscopists and PolyDeep in the characterization of neoplasia is similar in terms of sensitivity (PolyDeep: 89.05%; E1: 91.23%, p=0.5; E2: 96.11%, p<0.001; E3: 86.65%, p=0.3; E4: 91.26%, p=0.3) and specificity (PolyDeep: 35.53%; E1: 33.80%, p=0.8; E2: 34.72%, p=1; E3: 39.24%, p=0.8; E4: 46.84%, p=0.2). The overall discriminative ability also showed no statistically significant differences (PolyDeep: 0.623; E1: 0.625, p=0.8; E2: 0.654, p=0.2; E3: 0.629, p=0.9; E4: 0.690, p=0.09). In the optical diagnosis of adenomatous polyps, we found that PolyDeep had a significantly higher sensitivity and a significantly lower specificity. The overall discriminative ability of adenomatous lesions by expert endoscopists is significantly higher than PolyDeep (PolyDeep: 0.582; E1: 0.685, p < 0.001; E2: 0.677, p < 0.0001; E3: 0.658, p < 0.01; E4: 0.694, p < 0.0001). Conclusion: PolyDeep and endoscopists have similar diagnostic performance in the optical diagnosis of neoplastic lesions. However, endoscopists have a better global discriminatory ability than PolyDeep in the optical diagnosis of adenomatous polyps.
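
The quantities compared in this abstract (sensitivity, specificity, and a McNemar test on paired readings of the same images) can be sketched as follows. The abstract does not say which McNemar variant was used, so the exact binomial form below is an assumption, and all counts are made up for illustration; none are taken from the study.

```python
from math import comb


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of truly positive lesions that were called positive."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Fraction of truly negative lesions that were called negative."""
    return tn / (tn + fp)


def mcnemar_exact(b: int, c: int) -> float:
    """Two-sided exact McNemar p-value from the discordant pair counts.

    b: cases reader 1 got right and reader 2 got wrong
    c: cases reader 1 got wrong and reader 2 got right
    Under the null hypothesis, b follows Binomial(b + c, 0.5).
    """
    n = b + c
    if n == 0:
        return 1.0
    k = min(b, c)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical counts, for illustration only.
sens = sensitivity(tp=90, fn=10)    # 0.9
spec = specificity(tn=35, fp=65)    # 0.35
p_value = mcnemar_exact(b=3, c=9)   # discordant pairs between two readers
```

McNemar's test uses only the discordant pairs because both readers classify the same images: cases on which they agree carry no information about which reader is more accurate.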

13.
Nucleic Acids Res ; 39(Web Server issue): W562-6, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21646339

ABSTRACT

Next-generation sequencing (NGS) technologies are making sequence data available on an unprecedented scale. In this context, new catalogs of Single Nucleotide Polymorphisms and mutations generated by resequencing studies are usually stored in genome position files (e.g. Variant Call Format, SAMTools pileup, BED, GFF) comprising large lists of genomic positions, which are difficult for researchers to handle. Here, we present PileLineGUI, a novel desktop application primarily designed for manipulating, browsing and analysing genome position files (GPF), with specific support for somatic mutation finding studies. The developed tool also integrates a new genome browser module specially designed for inspecting GPFs. PileLineGUI is free, multiplatform and designed to be intuitively used by biomedical researchers. PileLineGUI is available at: http://sing.ei.uvigo.es/pileline/pilelinegui.html.


Subjects
Genome, High-Throughput Nucleotide Sequencing, DNA Sequence Analysis, Software, Mutation, Single Nucleotide Polymorphism
14.
Diagnostics (Basel) ; 13(5)2023 Mar 03.
Article in English | MEDLINE | ID: mdl-36900110

ABSTRACT

Deep learning object-detection models are being successfully applied to develop computer-aided diagnosis systems for aiding polyp detection during colonoscopies. Here, we show the need to include negative samples for both (i) reducing false positives during the polyp-finding phase, by including images with artifacts that may confuse the detection models (e.g., medical instruments, water jets, feces, blood, excessive proximity of the camera to the colon wall, blurred images, etc.) that are usually not included in model development datasets, and (ii) correctly estimating a more realistic performance of the models. By retraining our previously developed YOLOv3-based detection model with a dataset that includes 15% of additional not-polyp images with a variety of artifacts, we were able to generally improve its F1 performance in our internal test datasets (from an average F1 of 0.869 to 0.893), which now include this type of image, as well as in four public datasets that include not-polyp images (from an average F1 of 0.695 to 0.722).
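
To make the role of negative (not-polyp) images concrete, the sketch below computes F1 from detection counts and shows how false positives raised on artifact images lower the score. The function name `f1_score` and all counts are hypothetical; they are not the study's data.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall."""
    denominator = 2 * tp + fp + fn
    return 0.0 if denominator == 0 else 2 * tp / denominator


# Hypothetical test set containing only polyp images.
f1_polyps_only = f1_score(tp=80, fp=10, fn=14)

# Same detections plus 20 extra false positives fired on artifact images
# (instruments, water jets, blur) once negative images are added.
f1_with_negatives = f1_score(tp=80, fp=10 + 20, fn=14)
```

A test set without negative images cannot register those extra false positives at all, which is why the first score would overstate performance in real colonoscopy video.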

15.
Biomedicines ; 11(4)2023 Apr 21.
Article in English | MEDLINE | ID: mdl-37189848

ABSTRACT

High-throughput sequencing of small RNA molecules such as microRNAs (miRNAs) has become a widely used approach for studying gene expression and regulation. However, analyzing miRNA-Seq data can be challenging because it requires multiple steps, from quality control and preprocessing to differential expression and pathway-enrichment analyses, with many tools and databases available for each step. Furthermore, reproducibility of the analysis pipeline is crucial to ensure that the results are accurate and reliable. Here, we present myBrain-Seq, a comprehensive and reproducible pipeline for analyzing miRNA-Seq data that incorporates miRNA-specific solutions at each step of the analysis. The pipeline was designed to be flexible and user-friendly, allowing researchers with different levels of expertise to perform the analysis in a standardized and reproducible manner, using the most common and widely used tools for each step. In this work, we describe the implementation of myBrain-Seq and demonstrate its capacity to consistently and reproducibly identify differentially expressed miRNAs and enriched pathways by applying it to a real case study in which we compared schizophrenia patients who responded to medication with treatment-resistant schizophrenia patients to obtain a 16-miRNA treatment-resistant schizophrenia profile.

16.
J Integr Bioinform ; 20(2)2023 Jun 01.
Article in English | MEDLINE | ID: mdl-36848492

ABSTRACT

EvoPPI (http://evoppi.i3s.up.pt), a meta-database for protein-protein interactions (PPI), has been upgraded (EvoPPI3) to accept new types of data, namely, PPI from patients, cell lines, and animal models, as well as data from gene modifier experiments, for nine neurodegenerative polyglutamine (polyQ) diseases caused by an abnormal expansion of the polyQ tract. The integration of the different types of data allows users to easily compare them, as here shown for Ataxin-1, the polyQ protein involved in spinocerebellar ataxia type 1 (SCA1) disease. Using all available datasets and the data here obtained for Drosophila melanogaster wt and exp Ataxin-1 mutants (also available at EvoPPI3), we show that, in humans, the Ataxin-1 network is much larger than previously thought (380 interactors), with at least 909 interactors. The functional profiling of the newly identified interactors is similar to the ones already reported in the main PPI databases. 16 out of 909 interactors are putative novel SCA1 therapeutic targets, and all but one are already being studied in the context of this disease. The 16 proteins are mainly involved in binding and catalytic activity (mainly kinase activity), functional features already thought to be important in the SCA1 disease.


Subjects
Drosophila melanogaster, Spinocerebellar Ataxias, Animals, Humans, Ataxin-1/genetics, Ataxin-1/metabolism, Drosophila melanogaster/genetics, Spinocerebellar Ataxias/genetics, Spinocerebellar Ataxias/metabolism
17.
Commun Med (Lond) ; 3(1): 8, 2023 Jan 16.
Article in English | MEDLINE | ID: mdl-36646893

ABSTRACT

BACKGROUND: Monitoring bladder cancer over time requires invasive and costly procedures. Less invasive approaches are required using readily available biological samples such as urine. In this study, we demonstrate a method for longitudinal analysis of the urine proteome to monitor the disease course in patients with bladder cancer. METHODS: We compared the urine proteomes of patients who experienced recurrence and/or progression (n = 13) with those who did not (n = 17). We identified differentially expressed proteins within various pathways related to the hallmarks of cancer. The variation of such pathways during the disease course was determined using our differential personal pathway index (dPPi) calculation, which could indicate disease progression and the need for medical intervention. RESULTS: Seven hallmark pathways are used to develop the dPPi. We demonstrate that we can successfully longitudinally monitor the disease course in bladder cancer patients through a combination of urine proteomic analysis and the dPPi calculation, over a period of 62 months. CONCLUSIONS: Using the information contained in the patient's urinary proteome, the dPPi reflects the individual's course of bladder cancer, and helps to optimise the use of more invasive procedures such as cystoscopy.


Bladder cancer must be closely monitored for progression, but this requires expensive and invasive procedures such as cystoscopy. Less invasive procedures using readily available samples such as urine are needed. Here, we present an approach that measures the levels of various proteins in the urine. We compare protein levels at different points during the disease course in patients with bladder cancer, and show this helps to flag disease recurrence and the need for medical intervention. Our approach could help clinicians to determine which patients require more invasive testing and treatment.

18.
Diagnostics (Basel) ; 12(4)2022 Apr 04.
Article in English | MEDLINE | ID: mdl-35453946

ABSTRACT

Colorectal cancer is one of the most frequent malignancies. Colonoscopy is the de facto standard for precancerous lesion detection in the colon, i.e., polyps, during screening studies or after facultative recommendation. In recent years, artificial intelligence, and especially deep learning techniques such as convolutional neural networks, have been applied to polyp detection and localization in order to develop real-time CADe systems. However, the performance of machine learning models is very sensitive to changes in the nature of the testing instances, especially when trying to reproduce results on datasets entirely different from those used for model development, i.e., inter-dataset testing. Here, we report the results of testing of our previously published polyp detection model using ten public colonoscopy image datasets and analyze them in the context of the results of 20 other state-of-the-art publications using the same datasets. The F1-score of our recently published model was 0.88 when evaluated on a private test partition, i.e., intra-dataset testing, but it decayed, on average, by 13.65% when tested on ten public datasets. In the published research, the average intra-dataset F1-score is 0.91, and we observed that it also decays in the inter-dataset setting to an average F1-score of 0.83.
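
For comparison with the 13.65% figure, the literature averages quoted above (0.91 intra-dataset, 0.83 inter-dataset) can be turned into a relative decay as sketched below. The abstract does not state whether its percentage is a relative or an absolute change, so treating decay as relative to the intra-dataset score is an assumption of this sketch.

```python
def relative_decay(intra_f1: float, inter_f1: float) -> float:
    """Percentage drop of the inter-dataset score relative to the intra-dataset one."""
    return 100 * (intra_f1 - inter_f1) / intra_f1


# Average F1-scores of the published research, as quoted in the abstract.
literature_decay = relative_decay(0.91, 0.83)
```

Under that assumption the published models lose roughly 8.8% of their intra-dataset F1 when moved to unseen datasets.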

19.
IEEE/ACM Trans Comput Biol Bioinform ; 19(3): 1850-1860, 2022.
Article in English | MEDLINE | ID: mdl-33237866

ABSTRACT

SEDA (SEquence DAtaset builder) is a multiplatform desktop application for the manipulation of FASTA files containing DNA or protein sequences. The convenient graphical user interface gives access to a collection of simple (filtering, sorting, or file reformatting, among others) and advanced (BLAST searching, protein domain annotation, gene annotation, and sequence alignment) utilities not present in similar applications, which eases the work of life science researchers working with DNA and/or protein sequences, especially those who have no programming skills. This paper presents general guidelines on how to build efficient data handling protocols using SEDA, as well as practical examples on how to prepare high-quality datasets for single gene phylogenetic studies, the characterization of protein families, or phylogenomic studies. The user-friendliness of SEDA also relies on two important features: (i) the availability of easy-to-install distributable versions and installers of SEDA, including a Docker image for Linux, and (ii) the facility with which users can manage large datasets. SEDA is open-source, released under the GNU General Public License v3.0, and publicly available on GitHub (https://github.com/sing-group/seda). SEDA installers and documentation are available at https://www.sing-group.org/seda/.


Subjects
Proteins, Software, Amino Acid Sequence, Phylogeny, Sequence Alignment
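
For readers who do script, the simplest of the operations listed in the abstract above (filtering and reformatting FASTA files) looks roughly like the sketch below. This is plain Python and does not use or reproduce SEDA's code or interface; the function names and the two toy sequences are hypothetical.

```python
def parse_fasta(text: str):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def filter_by_length(records, min_length: int):
    """Keep only the records whose sequence has at least min_length residues."""
    return [(h, s) for h, s in records if len(s) >= min_length]


def to_fasta(records, width: int = 60) -> str:
    """Serialize records back to FASTA, wrapping sequences at `width` columns."""
    lines = []
    for header, sequence in records:
        lines.append(f">{header}")
        lines.extend(sequence[i:i + width] for i in range(0, len(sequence), width))
    return "\n".join(lines) + "\n"


# Toy input: one sequence split over two lines, one short sequence.
toy_fasta = ">seq1 toy\nATGGCC\nTTAA\n>seq2 toy\nATG\n"

parsed = parse_fasta(toy_fasta)
long_enough = filter_by_length(parsed, min_length=5)
reformatted = to_fasta(long_enough)
```

Joining the wrapped lines before filtering matters: a length filter applied line by line would wrongly judge `seq1` by its fragments rather than by its full ten-residue sequence.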
20.
Comput Biol Med ; 135: 104603, 2021 Aug.
Article in English | MEDLINE | ID: mdl-34216893

ABSTRACT

MiRNAs are emerging as key molecules to study neuropsychiatric diseases. However, despite the large number of methodologies and software for miRNA-seq analyses, there is little supporting literature for researchers in this area. This review focuses on evaluating how miRNA-seq has been used to study neuropsychiatric diseases to date, analyzing both the main findings discovered and the bioinformatics workflows and tools used from a methodological perspective. The objective of this review is two-fold: first, to evaluate current miRNA-seq procedures used in neuropsychiatry; and second, to offer comprehensive information that can serve as a guide to new researchers in bioinformatics. After conducting a systematic search (from 2016 to June 30, 2020) of articles using miRNA-seq in neuropsychiatry, we have seen that it has already been used for different types of studies in three main categories: diagnosis, prognosis, and mechanism. We carefully analyzed the bioinformatics workflows of each study, observing a high degree of variability with respect to the tools and methods used and several methodological complexities that are identified and discussed in this review.


Subjects
MicroRNAs, Neuropsychiatry, Computational Biology, MicroRNAs/genetics, RNA Sequence Analysis, Software