Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros

Base de dados
Tipo de estudo
Tipo de documento
Intervalo de ano de publicação
1.
J Comput Biol ; 31(6): 539-548, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38781420

RESUMO

The thresholding problem is studied in the context of graph theoretical analysis of gene co-expression data. A number of thresholding methodologies are described, implemented, and tested over a large collection of graphs derived from real high-throughput biological data. Comparative results are presented and discussed.


Assuntos
Algoritmos , Perfilação da Expressão Gênica , Perfilação da Expressão Gênica/métodos , Humanos , Biologia Computacional/métodos , Redes Reguladoras de Genes
2.
Front Plant Sci ; 15: 1352253, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38919818

RESUMO

Potato (Solanum tuberosum) is the most popular tuber crop and a model organism. A variety of gene models for potato exist, and despite frequent updates, they are not unified. This hinders the comparison of gene models across versions, limits the ability to reuse experimental data without significant re-analysis, and leads to missing or wrongly annotated genes. Here, we unify the recent potato double monoploid v4 and v6 gene models by developing an automated merging protocol, resulting in a Unified poTato genome model (UniTato). We subsequently established an Apollo genome browser (unitato.nib.si) that enables public access to UniTato and further community-based curation. We demonstrate how the UniTato resource can help resolve problems with missing or misplaced genes and can be used to update or consolidate a wider set of gene models or genome information. The automated protocol, genome annotation files, and a comprehensive translation table are provided at github.com/NIB-SI/unitato.

3.
Plant Commun ; 5(6): 100920, 2024 Jun 10.
Artigo em Inglês | MEDLINE | ID: mdl-38616489

RESUMO

Stress Knowledge Map (SKM; https://skm.nib.si) is a publicly available resource containing two complementary knowledge graphs that describe the current knowledge of biochemical, signaling, and regulatory molecular interactions in plants: a highly curated model of plant stress signaling (PSS; 543 reactions) and a large comprehensive knowledge network (488 390 interactions). Both were constructed by domain experts through systematic curation of diverse literature and database resources. SKM provides a single entry point for investigations of plant stress response and related growth trade-offs, as well as interactive explorations of current knowledge. PSS is also formulated as a qualitative and quantitative model for systems biology and thus represents a starting point for a plant digital twin. Here, we describe the features of SKM and show, through two case studies, how it can be used for complex analyses, including systematic hypothesis generation and design of validation experiments, or to gain new insights into experimental observations in plant biology.


Assuntos
Plantas , Estresse Fisiológico , Biologia de Sistemas , Plantas/genética , Plantas/metabolismo , Fenômenos Fisiológicos Vegetais/genética , Transdução de Sinais/genética , Bases de Dados Factuais
4.
Commun Biol ; 4(1): 117, 2021 01 26.
Artigo em Inglês | MEDLINE | ID: mdl-33500552

RESUMO

In this study, more than one hundred thousand Escherichia coli and Shigella genomes were examined and classified. This is, to our knowledge, the largest E. coli genome dataset analyzed to date. A Mash-based analysis of a cleaned set of 10,667 E. coli genomes from GenBank revealed 14 distinct phylogroups. A representative genome or medoid identified for each phylogroup was used as a proxy to classify 95,525 unassembled genomes from the Sequence Read Archive (SRA). We find that most of the sequenced E. coli genomes belong to four phylogroups (A, C, B1 and E2(O157)). Authenticity of the 14 phylogroups is supported by several different lines of evidence: phylogroup-specific core genes, a phylogenetic tree constructed with 2613 single copy core genes, and differences in the rates of gene gain/loss/duplication. The methodology used in this work is able to reproduce known phylogroups, as well as to identify previously uncharacterized phylogroups in E. coli species.


Assuntos
Escherichia coli/classificação , Escherichia coli/genética , Genoma Bacteriano , Biologia Computacional/métodos , Proteínas de Escherichia coli/genética , Especiação Genética , Genômica/métodos , Filogenia , Análise de Sequência de DNA , Shigella/classificação , Shigella/genética
5.
Front Genet ; 10: 417, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31134130

RESUMO

Various patterns of multi-phenotype associations (MPAs) exist in the results of Genome Wide Association Studies (GWAS) involving different topologies of single nucleotide polymorphism (SNP)-phenotype associations. These can provide interesting information about the different impacts of a gene on closely related phenotypes or disparate phenotypes (pleiotropy). In this work we present MPA Decomposition, a new network-based approach which decomposes the results of a multi-phenotype GWAS study into three bipartite networks, which, when used together, unravel the multi-phenotype signatures of genes on a genome-wide scale. The decomposition involves the construction of a phenotype powerset space, and subsequent mapping of genes into this new space. Clustering of genes in this powerset space groups genes based on their detailed MPA signatures. We show that this method allows us to find multiple different MPA and pleiotropic signatures within individual genes and to classify and cluster genes based on these SNP-phenotype association topologies. We demonstrate the use of this approach on a GWAS analysis of a large population of 882 Populus trichocarpa genotypes using untargeted metabolomics phenotypes. This method should prove invaluable in the interpretation of large GWAS datasets and aid in future synthetic biology efforts designed to optimize phenotypes of interest.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa