Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
1.
Comput Struct Biotechnol J ; 23: 2011-2033, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38765606

RESUMO

The fields of Metagenomics and Metatranscriptomics involve the examination of complete nucleotide sequences, gene identification, and analysis of potential biological functions within diverse organisms or environmental samples. Despite the vast opportunities for discovery in metagenomics, the sheer volume and complexity of sequence data often present challenges in processing analysis and visualization. This article highlights the critical role of advanced visualization tools in enabling effective exploration, querying, and analysis of these complex datasets. Emphasizing the importance of accessibility, the article categorizes various visualizers based on their intended applications and highlights their utility in empowering bioinformaticians and non-bioinformaticians to interpret and derive insights from meta-omics data effectively.

2.
Comput Struct Biotechnol J ; 23: 1919-1928, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38711760

RESUMO

The decrease in sequencing expenses has facilitated the creation of reference genomes and proteomes for an expanding array of organisms. Nevertheless, no established repository that details organism-specific genomic and proteomic sequences of specific lengths, referred to as kmers, exists to our knowledge. In this article, we present kmerDB, a database accessible through an interactive web interface that provides kmer-based information from genomic and proteomic sequences in a systematic way. kmerDB currently contains 202,340,859,107 base pairs and 19,304,903,356 amino acids, spanning 54,039 and 21,865 reference genomes and proteomes, respectively, as well as 6,905,362 and 149,305,183 genomic and proteomic species-specific sequences, termed quasi-primes. Additionally, we provide access to 5,186,757 nucleic and 214,904,089 peptide sequences absent from every genome and proteome, termed primes. kmerDB features a user-friendly interface offering various search options and filters for easy parsing and searching. The service is available at: www.kmerdb.com.

3.
Nucleic Acids Res ; 52(D1): D502-D512, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37811892

RESUMO

The Novel Metagenome Protein Families Database (NMPFamsDB) is a database of metagenome- and metatranscriptome-derived protein families, whose members have no hits to proteins of reference genomes or Pfam domains. Each protein family is accompanied by multiple sequence alignments, Hidden Markov Models, taxonomic information, ecosystem and geolocation metadata, sequence and structure predictions, as well as 3D structure models predicted with AlphaFold2. In its current version, NMPFamsDB hosts over 100 000 protein families, each with at least 100 members. The reported protein families significantly expand (more than double) the number of known protein sequence clusters from reference genomes and reveal new insights into their habitat distribution, origins, functions and taxonomy. We expect NMPFamsDB to be a valuable resource for microbial proteome-wide analyses and for further discovery and characterization of novel functions. NMPFamsDB is publicly available in http://www.nmpfamsdb.org/ or https://bib.fleming.gr/NMPFamsDB.


Assuntos
Bases de Dados de Proteínas , Metagenoma , Proteínas , Sequência de Aminoácidos , Bases de Dados Factuais , Ecossistema , Proteínas/química , Geografia
4.
bioRxiv ; 2023 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-38045264

RESUMO

Massively parallel reporter assays (MPRAs) represent a set of high-throughput technologies that measure the functional effects of thousands of sequences/variants on gene regulatory activity. There are several different variations of MPRA technology and they are used for numerous applications, including regulatory element discovery, variant effect measurement, saturation mutagenesis, synthetic regulatory element generation or characterization of evolutionary gene regulatory differences. Despite their many designs and uses, there is no comprehensive database that incorporates the results of these experiments. To address this, we developed MPRAbase, a manually curated database that currently harbors 129 experiments, encompassing 17,718,677 elements tested across 35 cell types and 4 organisms. The MPRAbase web interface (http://www.mprabase.com) serves as a centralized user-friendly repository to download existing MPRA data for independent analysis and is designed with the ability to allow researchers to share their published data for rapid dissemination to the community.

5.
Comput Struct Biotechnol J ; 21: 5382-5393, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38022693

RESUMO

Analysis and interpretation of high-throughput transcriptional and chromatin accessibility data at single-cell (sc) resolution are still open challenges in the biomedical field. The existence of countless bioinformatics tools, for the different analytical steps, increases the complexity of data interpretation and the difficulty to derive biological insights. In this article, we present SCALA, a bioinformatics tool for analysis and visualization of single-cell RNA sequencing (scRNA-seq) and Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) datasets, enabling either independent or integrative analysis of the two modalities. SCALA combines standard types of analysis by integrating multiple software packages varying from quality control to the identification of distinct cell populations and cell states. Additional analysis options enable functional enrichment, cellular trajectory inference, ligand-receptor analysis, and regulatory network reconstruction. SCALA is fully parameterizable, presenting data in tabular format and producing publication-ready visualizations. The different available analysis modules can aid biomedical researchers in exploring, analyzing, and visualizing their data without any prior experience in coding. We demonstrate the functionality of SCALA through two use-cases related to TNF-driven arthritic mice, handling both scRNA-seq and scATAC-seq datasets. SCALA is developed in R, Shiny and JavaScript and is mainly available as a standalone version, while an online service of more limited capacity can be found at http://scala.pavlopouloslab.info or https://scala.fleming.gr.

6.
Nature ; 622(7983): 594-602, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37821698

RESUMO

Metagenomes encode an enormous diversity of proteins, reflecting a multiplicity of functions and activities1,2. Exploration of this vast sequence space has been limited to a comparative analysis against reference microbial genomes and protein families derived from those genomes. Here, to examine the scale of yet untapped functional diversity beyond what is currently possible through the lens of reference genomes, we develop a computational approach to generate reference-free protein families from the sequence space in metagenomes. We analyse 26,931 metagenomes and identify 1.17 billion protein sequences longer than 35 amino acids with no similarity to any sequences from 102,491 reference genomes or the Pfam database3. Using massively parallel graph-based clustering, we group these proteins into 106,198 novel sequence clusters with more than 100 members, doubling the number of protein families obtained from the reference genomes clustered using the same approach. We annotate these families on the basis of their taxonomic, habitat, geographical and gene neighbourhood distributions and, where sufficient sequence diversity is available, predict protein three-dimensional models, revealing novel structures. Overall, our results uncover an enormously diverse functional space, highlighting the importance of further exploring the microbial functional dark matter.


Assuntos
Metagenoma , Metagenômica , Microbiologia , Proteínas , Análise por Conglomerados , Metagenoma/genética , Metagenômica/métodos , Proteínas/química , Proteínas/classificação , Proteínas/genética , Bases de Dados de Proteínas , Conformação Proteica
7.
Bioinformatics ; 39(8)2023 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-37540207

RESUMO

Functional enrichment is the process of identifying implicated functional terms from a given input list of genes or proteins. In this article, we present Flame (v2.0), a web tool which offers a combinatorial approach through merging and visualizing results from widely used functional enrichment applications while also allowing various flexible input options. In this version, Flame utilizes the aGOtool, g: Profiler, WebGestalt, and Enrichr pipelines and presents their outputs separately or in combination following a visual analytics approach. For intuitive representations and easier interpretation, it uses interactive plots such as parameterizable networks, heatmaps, barcharts, and scatter plots. Users can also: (i) handle multiple protein/gene lists and analyse union and intersection sets simultaneously through interactive UpSet plots, (ii) automatically extract genes and proteins from free text through text-mining and Named Entity Recognition (NER) techniques, (iii) upload single nucleotide polymorphisms (SNPs) and extract their relative genes, or (iv) analyse multiple lists of differentially expressed proteins/genes after selecting them interactively from a parameterizable volcano plot. Compared to the previous version of 197 supported organisms, Flame (v2.0) currently allows enrichment for 14 436 organisms. AVAILABILITY AND IMPLEMENTATION: Web Application: http://flame.pavlopouloslab.info. Code: https://github.com/PavlopoulosLab/Flame. Docker: https://hub.docker.com/r/pavlopouloslab/flame.


Assuntos
Proteínas , Software , Mineração de Dados
8.
NAR Genom Bioinform ; 5(2): lqad053, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37260509

RESUMO

Arena3Dweb is an interactive web tool that visualizes multi-layered networks in 3D space. In this update, Arena3Dweb supports directed networks as well as up to nine different types of connections between pairs of nodes with the use of Bézier curves. It comes with different color schemes (light/gray/dark mode), custom channel coloring, four node clustering algorithms which one can run on-the-fly, visualization in VR mode and predefined layer layouts (zig-zag, star and cube). This update also includes enhanced navigation controls (mouse orbit controls, layer dragging and layer/node selection), while its newly developed API allows integration with external applications as well as saving and loading of sessions in JSON format. Finally, a dedicated Cytoscape app has been developed, through which users can automatically send their 2D networks from Cytoscape to Arena3Dweb for 3D multi-layer visualization. Arena3Dweb is accessible at http://arena3d.pavlopouloslab.info or http://arena3d.org.

9.
Front Bioeng Biotechnol ; 11: 1182500, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37064232

RESUMO

[This corrects the article DOI: 10.3389/fbioe.2020.00034.].

10.
Front Bioinform ; 3: 1157956, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36959975

RESUMO

Metagenomics has enabled accessing the genetic repertoire of natural microbial communities. Metagenome shotgun sequencing has become the method of choice for studying and classifying microorganisms from various environments. To this end, several methods have been developed to process and analyze the sequence data from raw reads to end-products such as predicted protein sequences or families. In this article, we provide a thorough review to simplify such processes and discuss the alternative methodologies that can be followed in order to explore biodiversity at the protein family level. We provide details for analysis tools and we comment on their scalability as well as their advantages and disadvantages. Finally, we report the available data repositories and recommend various approaches for protein family annotation related to phylogenetic distribution, structure prediction and metadata enrichment.

11.
Maturitas ; 170: 51-57, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-36773500

RESUMO

Undeniably, biological age can significantly differ between individuals of similar chronological age. Longitudinal, deep multi-omic profiling has recently enabled the identification of individuals with distinct aging phenotypes, termed 'ageotypes'. This effort has provided a plethora of data and new insights into the diverse molecular mechanisms presumed to drive aging. Translational opportunities stemming from this knowledge continue to evolve, providing an opportunity for the provision of nutritional interventions aiming to decelerate the aging process. In this framework, the contemporary ageotypes classification was revisited via in silico analyses, with the brain and nervous system being identified as the primary targets of age-related biomolecules, acting through inflammatory and metabolic pathways. Nutritional and lifestyle factors affecting these pathways in the brain and central nervous system that could help guide personalized recommendations for the attainment of healthy aging are discussed.


Assuntos
Envelhecimento Saudável , Humanos , Estilo de Vida , Fenótipo , Sistema Nervoso Central , Encéfalo
12.
J Immunol ; 209(10): 1906-1917, 2022 11 15.
Artigo em Inglês | MEDLINE | ID: mdl-36426957

RESUMO

Rheumatoid arthritis (RA) is characterized by autoimmune joint destruction with debilitating consequences. Despite treatment advancements with biologic therapies, a significant proportion of RA patients show an inadequate clinical response, and restoration of immune self-tolerance represents an unmet therapeutic need. We have previously described a tolerogenic phenotype of plasmacytoid dendritic cells (pDCs) in RA patients responding to anti-TNF-α agents. However, the molecular mechanisms involved in tolerogenic reprogramming of pDCs in RA remain elusive. In this study, guided by transcriptomic analysis of CD303+CD123+ pDCs from RA patients in remission, we revealed enhanced expression of IL-6R and its downstream signaling compared with healthy pDCs. Functional assessment demonstrated that IL-6R engagement resulted in marked reduction of TNF-α secretion by pDCs whereas intracellular TNF-α was significantly increased. Accordingly, pharmacologic inhibition of IL-6R signaling restored TNF-α secretion levels by pDCs. Mechanistic analysis demonstrated impaired activity and decreased lysosomal degradation of ADAM17 (a disintegrin and metalloproteinase 17) sheddase in pDCs, which is essential for TNF-α cleavage. Importantly, reduction of TNF-α secretion by IL-6-treated pDCs attenuated the inflammatory potential of RA patient-derived synovial fibroblasts. Collectively, these findings position pDCs as an important source of TNF-α in RA pathogenesis and unravel an anti-inflammatory mechanism of IL-6 by limiting the pDC-derived TNF-α secretion.


Assuntos
Artrite Reumatoide , Interleucina-6 , Humanos , Inibidores do Fator de Necrose Tumoral , Células Dendríticas , Transdução de Sinais , Fator de Necrose Tumoral alfa
13.
Int J Mol Sci ; 23(19)2022 Sep 21.
Artigo em Inglês | MEDLINE | ID: mdl-36232413

RESUMO

Protein-protein interactions (PPIs) are of key importance for understanding how cells and organisms function. Thus, in recent decades, many approaches have been developed for the identification and discovery of such interactions. These approaches addressed the problem of PPI identification either by an experimental point of view or by a computational one. Here, we present an updated version of UniReD, a computational prediction tool which takes advantage of biomedical literature aiming to extract documented, already published protein associations and predict undocumented ones. The usefulness of this computational tool has been previously evaluated by experimentally validating predicted interactions and by benchmarking it against public databases of experimentally validated PPIs. In its updated form, UniReD allows the user to provide a list of proteins of known implication in, e.g., a particular disease, as well as another list of proteins that are potentially associated with the proteins of the first list. UniReD then automatically analyzes both lists and ranks the proteins of the second list by their association with the proteins of the first list, thus serving as a potential biomarker discovery/validation tool.


Assuntos
Mapeamento de Interação de Proteínas , Proteínas , Biomarcadores , Biologia Computacional , Proteínas/metabolismo
14.
Pharmacogenomics J ; 22(5-6): 294-302, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36171417

RESUMO

Available drugs have been used as an urgent attempt through clinical trials to minimize severe cases of hospitalizations with Coronavirus disease (COVID-19), however, there are limited data on common pharmacogenomics affecting concomitant medications response in patients with comorbidities. To identify the genomic determinants that influence COVID-19 susceptibility, we use a computational, statistical, and network biology approach to analyze relationships of ineffective concomitant medication with an adverse effect on patients. We statistically construct a pharmacogenetic/biomarker network with significant drug-gene interactions originating from gene-disease associations. Investigation of the predicted pharmacogenes encompassing the gene-disease-gene pharmacogenomics (PGx) network suggests that these genes could play a significant role in COVID-19 clinical manifestation due to their association with autoimmune, metabolic, neurological, cardiovascular, and degenerative disorders, some of which have been reported to be crucial comorbidities in a COVID-19 patient.


Assuntos
Tratamento Farmacológico da COVID-19 , Humanos , Mineração de Dados , Farmacogenética , Genômica
15.
STAR Protoc ; 3(2): 101418, 2022 06 17.
Artigo em Inglês | MEDLINE | ID: mdl-35669050

RESUMO

Whole Exome Sequencing (WES) is used for querying DNA variants using the protein coding parts of genomes (exomes). However, WES analysis can be challenging because of the complexity of the data. Here, we describe a consolidated protocol for unbiased WES analysis. The protocol uses three variant callers (HaplotypeCaller, FreeBayes, and DeepVariant), which have different underlying models. We provide detailed execution steps, as well as basic variant filtering, annotation, visualization, and consolidation aspects.


Assuntos
Exoma , Sequenciamento de Nucleotídeos em Larga Escala , Exoma/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento do Exoma
16.
Mol Ecol Resour ; 22(7): 2506-2523, 2022 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-35593171

RESUMO

Honeybees (Apis mellifera) continue to succumb to human and environmental pressures despite their crucial role in providing essential ecosystem services. Owing to their foraging and honey production activities, honeybees form complex relationships with species across all domains, such as plants, viruses, bacteria and other hive pests, making honey a valuable biomonitoring tool for assessing their ecological niche. Thus, the application of honey shotgun metagenomics (SM) has paved the way for a detailed description of the species honeybees interact with. Nevertheless, SM bioinformatics tools and DNA extraction methods rely on resources not necessarily optimized for honey. In this study, we compared five widely used taxonomic classifiers using simulated species communities commonly found in honey. We found that Kraken 2 with a threshold of 0.5 performs best in assessing species distribution. We also optimized a simple NaOH-based honey DNA extraction methodology (Direct-SM), which profiled species seasonal variability similarly to an established column-based DNA extraction approach (SM). Both approaches produce results consistent with melissopalinology analysis describing the botanical landscape surrounding the apiary. Interestingly, we detected a strong stability of the bacteria constituting the core and noncore gut microbiome across seasons, pointing to the potential utility of honey for noninvasive assessment of bee microbiota. Finally, the Direct-SM approach to detect Varroa correlates well with the biomonitoring of mite infestation observed in hives. These observations suggest that Direct-SM methodology has the potential to comprehensively describe honeybee ecological niches and can be tested as a building block for large-scale studies to assess bee health in the field.


Assuntos
Microbioma Gastrointestinal , Mel , Microbiota , Animais , Bactérias/genética , Abelhas/genética , DNA , Humanos , Metagenômica
17.
Biomolecules ; 12(4)2022 03 30.
Artigo em Inglês | MEDLINE | ID: mdl-35454109

RESUMO

Finding, exploring and filtering frequent sentence-based associations between a disease and a biomedical entity, co-mentioned in disease-related PubMed literature, is a challenge, as the volume of publications increases. Darling is a web application, which utilizes Name Entity Recognition to identify human-related biomedical terms in PubMed articles, mentioned in OMIM, DisGeNET and Human Phenotype Ontology (HPO) disease records, and generates an interactive biomedical entity association network. Nodes in this network represent genes, proteins, chemicals, functions, tissues, diseases, environments and phenotypes. Users can search by identifiers, terms/entities or free text and explore the relevant abstracts in an annotated format.


Assuntos
Proteínas , Software , Mineração de Dados , Fenótipo , PubMed
18.
Microorganisms ; 10(2)2022 Jan 26.
Artigo em Inglês | MEDLINE | ID: mdl-35208748

RESUMO

To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO's capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.

19.
Genomics Proteomics Bioinformatics ; 20(3): 578-586, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-34171457

RESUMO

The Network Makeup Artist (NORMA) is a web tool for interactive network annotation visualization and topological analysis, able to handle multiple networks and annotations simultaneously. Precalculated annotations (e.g., Gene Ontology, Pathway enrichment, community detection, or clustering results) can be uploaded and visualized in a network, either as colored pie-chart nodes or as color-filled areas in a 2D/3D Venn-diagram-like style. In the case where no annotation exists, algorithms for automated community detection are offered. Users can adjust the network views using standard layout algorithms or allow NORMA to slightly modify them for visually better group separation. Once a network view is set, users can interactively select and highlight any group of interest in order to generate publication-ready figures. Briefly, with NORMA, users can encode three types of information simultaneously. These are 1) the network, 2) the communities or annotations of interest, and 3) node categories or expression values. Finally, NORMA offers basic topological analysis and direct topological comparison across any of the selected networks. NORMA service is available at http://norma.pavlopouloslab.info, whereas the code is available at https://github.com/PavlopoulosLab/NORMA.


Assuntos
Algoritmos , Software
20.
Bioinform Adv ; 2(1): vbac036, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36699373

RESUMO

Motivation: Network biology is a dominant player in today's multi-omics era. Therefore, the need for visualization tools which can efficiently cope with intra-network heterogeneity emerges. Results: NORMA-2.0 is a web application which uses efficient layouts to group together areas of interest in a network. In this version, NORMA-2.0 utilizes three different strategies to make such groupings as distinct as possible while it preserves all of the properties from its first version where one can handle multiple networks and annotation files simultaneously. Availability and implementation: The web resource is available at http://norma.pavlopouloslab.info/. The source code is freely available at https://github.com/PavlopoulosLab/NORMA.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA