Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Nucleic Acids Res ; 51(W1): W207-W212, 2023 07 05.
Artículo en Inglés | MEDLINE | ID: mdl-37144459

RESUMEN

g:Profiler is a reliable and up-to-date functional enrichment analysis tool that supports various evidence types, identifier types and organisms. The toolset integrates many databases, including Gene Ontology, KEGG and TRANSFAC, to provide a comprehensive and in-depth analysis of gene lists. It also provides interactive and intuitive user interfaces and supports ordered queries and custom statistical backgrounds, among other settings. g:Profiler provides multiple programmatic interfaces to access its functionality. These can be easily integrated into custom workflows and external tools, making them valuable resources for researchers who want to develop their own solutions. g:Profiler has been available since 2007 and is used to analyse millions of queries. Research reproducibility and transparency are achieved by maintaining working versions of all past database releases since 2015. g:Profiler supports 849 species, including vertebrates, plants, fungi, insects and parasites, and can analyse any organism through user-uploaded custom annotation files. In this update article, we introduce a novel filtering method highlighting Gene Ontology driver terms, accompanied by new graph visualizations providing a broader context for significant Gene Ontology terms. As a leading enrichment analysis and gene list interoperability service, g:Profiler offers a valuable resource for genetics, biology and medical researchers. It is freely accessible at https://biit.cs.ut.ee/gprofiler.


Asunto(s)
Mapeo Cromosómico , Biología Computacional , Genes , Programas Informáticos , Animales , Mapeo Cromosómico/instrumentación , Mapeo Cromosómico/métodos , Bases de Datos Genéticas , Internet , Reproducibilidad de los Resultados , Interfaz Usuario-Computador , Biología Computacional/instrumentación , Biología Computacional/métodos , Genes/genética , Humanos
2.
Nucleic Acids Res ; 47(W1): W191-W198, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31066453

RESUMEN

Biological data analysis often deals with lists of genes arising from various studies. The g:Profiler toolset is widely used for finding biological categories enriched in gene lists, conversions between gene identifiers and mappings to their orthologs. The mission of g:Profiler is to provide a reliable service based on up-to-date high quality data in a convenient manner across many evidence types, identifier spaces and organisms. g:Profiler relies on Ensembl as a primary data source and follows their quarterly release cycle while updating the other data sources simultaneously. The current update provides a better user experience due to a modern responsive web interface, standardised API and libraries. The results are delivered through an interactive and configurable web design. Results can be downloaded as publication ready visualisations or delimited text files. In the current update we have extended the support to 467 species and strains, including vertebrates, plants, fungi, insects and parasites. By supporting user uploaded custom GMT files, g:Profiler is now capable of analysing data from any organism. All past releases are maintained for reproducibility and transparency. The 2019 update introduces an extensive technical rewrite making the services faster and more flexible. g:Profiler is freely available at https://biit.cs.ut.ee/gprofiler.


Asunto(s)
Bases de Datos Genéticas , Genoma , Almacenamiento y Recuperación de la Información , Programas Informáticos , Animales , Hongos/genética , Humanos , Parásitos/genética , Plantas/genética
3.
Int J Immunogenet ; 46(2): 49-58, 2019 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-30659741

RESUMEN

Allele-specific analyses to understand frequency differences across populations, particularly populations not well studied, are important to help identify variants that may have a functional effect on disease mechanisms and phenotypic predisposition, facilitating new Genome-Wide Association Studies (GWAS). We aimed to compare the allele frequency of 11 asthma-associated and 16 liver disease-associated single nucleotide polymorphisms (SNPs) between the Estonian, HapMap and 1000 genome project populations. When comparing EGCUT with HapMap populations, the largest difference in allele frequencies was observed with the Maasai population in Kinyawa, Kenya, with 12 SNP variants reporting statistical significance. Similarly, when comparing EGCUT with 1000 genomes project populations, the largest difference in allele frequencies was observed with pooled African populations with 22 SNP variants reporting statistical significance. For 11 asthma-associated and 16 liver disease-associated SNPs, Estonians are genetically similar to other European populations but significantly different from African populations. Understanding differences in genetic architecture between ethnic populations is important to facilitate new GWAS targeted at underserved ethnic groups to enable novel genetic findings to aid the development of new therapies to reduce morbidity and mortality.


Asunto(s)
Asma/genética , Frecuencia de los Genes/genética , Genética de Población , Genoma Humano , Proyecto Mapa de Haplotipos , Hepatopatías/genética , Polimorfismo de Nucleótido Simple/genética , Estonia , Humanos
4.
BMC Genomics ; 19(1): 817, 2018 Nov 14.
Artículo en Inglés | MEDLINE | ID: mdl-30428831

RESUMEN

BACKGROUND: A widely applied approach to extract knowledge from high-throughput genomic data is clustering of gene expression profiles followed by functional enrichment analysis. This type of analysis, when done manually, is highly subjective and has limited reproducibility. Moreover, this pipeline can be very time-consuming and resource-demanding as enrichment analysis is done for tens to hundreds of clusters at a time. Thus, the task often needs programming skills to form a pipeline of different software tools or R packages to enable an automated approach. Furthermore, visualising the results can be challenging. RESULTS: We developed a web tool, funcExplorer, which automatically combines hierarchical clustering and enrichment analysis to detect functionally related gene clusters. The functional characterisation is achieved using structured knowledge from data sources such as Gene Ontology, KEGG and Reactome pathways, Human Protein Atlas, and Human Phenotype Ontology. funcExplorer includes various measures for finding biologically meaningful clusters, provides a modern graphical user interface, and has wide-ranging data export and sharing options as well as software transparency by open-source code. The results are presented in a visually compact and interactive format, enabling users to explore the biological essence of the data. We compared our results with previously published gene clusters to demonstrate that funcExplorer can perform the data characterisation equally well, but without requiring labour-intensive manual interference. CONCLUSIONS: The open-source web tool funcExplorer enables scientists with high-throughput genomic data to obtain a preliminary interactive overview of the expression patterns, gene names, and shared functionalities in their dataset in a visually pleasing format. funcExplorer is publicly available at https://biit.cs.ut.ee/funcexplorer.


Asunto(s)
Redes Reguladoras de Genes , Genómica/métodos , Proteómica/métodos , Programas Informáticos , Transcriptoma , Análisis por Conglomerados , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Interfaz Usuario-Computador
5.
Nucleic Acids Res ; 44(W1): W83-9, 2016 07 08.
Artículo en Inglés | MEDLINE | ID: mdl-27098042

RESUMEN

Functional enrichment analysis is a key step in interpreting gene lists discovered in diverse high-throughput experiments. g:Profiler studies flat and ranked gene lists and finds statistically significant Gene Ontology terms, pathways and other gene function related terms. Translation of hundreds of gene identifiers is another core feature of g:Profiler. Since its first publication in 2007, our web server has become a popular tool of choice among basic and translational researchers. Timeliness is a major advantage of g:Profiler as genome and pathway information is synchronized with the Ensembl database in quarterly updates. g:Profiler supports 213 species including mammals and other vertebrates, plants, insects and fungi. The 2016 update of g:Profiler introduces several novel features. We have added further functional datasets to interpret gene lists, including transcription factor binding site predictions, Mendelian disease annotations, information about protein expression and complexes and gene mappings of human genetic polymorphisms. Besides the interactive web interface, g:Profiler can be accessed in computational pipelines using our R package, Python interface and BioJS component. g:Profiler is freely available at http://biit.cs.ut.ee/gprofiler/.


Asunto(s)
Regulación de la Expresión Génica , Ontología de Genes , Factores de Transcripción/genética , Interfaz Usuario-Computador , Animales , Sitios de Unión , Gráficos por Computador , Hongos/genética , Perfilación de la Expresión Génica , Humanos , Insectos/genética , Internet , Anotación de Secuencia Molecular , Plantas/genética , Unión Proteica , Factores de Transcripción/metabolismo , Vertebrados/genética
6.
Reprod Biomed Online ; 32(6): 597-613, 2016 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-27090967

RESUMEN

Little consensus has been reached on the best protocol for endometrial preparation for frozen embryo transfer (FET). It is not known how, and to what extent, hormone supplementation in artificial cycles influences endometrial preparation for embryo implantation at a molecular level, especially in patients who have experienced recurrent implantation failure. Transcriptome analysis of 15 endometrial biopsy samples at the time of embryo implantation was used to compare two different endometrial preparation protocols, natural versus artificial cycles, for FET in women who have experienced recurrent implantation failure compared with fertile women. IPA and DAVID were used for functional analyses of differentially expressed genes. The TRANSFAC database was used to identify oestrogen and progesterone response elements upstream of differentially expressed genes. Cluster analysis demonstrated that natural cycles are associated with a better endometrial receptivity transcriptome than artificial cycles. Artificial cycles seemed to have a stronger negative effect on expression of genes and pathways crucial for endometrial receptivity, including ESR2, FSHR, LEP, and several interleukins and matrix metalloproteinases. Significant overrepresentation of oestrogen response elements among the genes with deteriorated expression in artificial cycles (P < 0.001) was found; progesterone response elements predominated in genes with amended expression with artificial cycles (P = 0.0052).


Asunto(s)
Implantación del Embrión/fisiología , Transferencia de Embrión/métodos , Endometrio/patología , Adulto , Biopsia , Análisis por Conglomerados , Criopreservación/métodos , Estradiol/uso terapéutico , Estrógenos/metabolismo , Femenino , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Hormonas/metabolismo , Humanos , Metaloproteinasas de la Matriz/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Embarazo , Índice de Embarazo , Análisis de Componente Principal , Progesterona/metabolismo , Recurrencia , Transcriptoma , Resultado del Tratamiento
7.
Nat Genet ; 53(9): 1290-1299, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-34493866

RESUMEN

Many gene expression quantitative trait locus (eQTL) studies have published their summary statistics, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and co-localization. However, technical differences between these datasets are a barrier to their widespread use. Consequently, target genes for most genome-wide association study (GWAS) signals have still not been identified. In the present study, we present the eQTL Catalogue ( https://www.ebi.ac.uk/eqtl ), a resource of quality-controlled, uniformly re-computed gene expression and splicing QTLs from 21 studies. We find that, for matching cell types and tissues, the eQTL effect sizes are highly reproducible between studies. Although most QTLs were shared between most bulk tissues, we identified a greater diversity of cell-type-specific QTLs from purified cell types, a subset of which also manifested as new disease co-localizations. Our summary statistics are freely available to enable the systematic interpretation of human GWAS associations across many cell types and tissues.


Asunto(s)
Bases de Datos Genéticas , Regulación de la Expresión Génica/genética , Sitios de Carácter Cuantitativo/genética , Carácter Cuantitativo Heredable , Linfocitos T CD4-Positivos/citología , Conjuntos de Datos como Asunto , Estudio de Asociación del Genoma Completo , Humanos , Herencia Multifactorial/genética , Polimorfismo de Nucleótido Simple/genética
8.
Elife ; 92020 09 03.
Artículo en Inglés | MEDLINE | ID: mdl-32880574

RESUMEN

Understanding the causal processes that contribute to disease onset and progression is essential for developing novel therapies. Although trans-acting expression quantitative trait loci (trans-eQTLs) can directly reveal cellular processes modulated by disease variants, detecting trans-eQTLs remains challenging due to their small effect sizes. Here, we analysed gene expression and genotype data from six blood cell types from 226 to 710 individuals. We used co-expression modules inferred from gene expression data with five methods as traits in trans-eQTL analysis to limit multiple testing and improve interpretability. In addition to replicating three established associations, we discovered a novel trans-eQTL near SLC39A8 regulating a module of metallothionein genes in LPS-stimulated monocytes. Interestingly, this effect was mediated by a transient cis-eQTL present only in early LPS response and lost before the trans effect appeared. Our analyses highlight how co-expression combined with functional enrichment analysis improves the identification and prioritisation of trans-eQTLs when applied to emerging cell-type-specific datasets.


Asunto(s)
Células Sanguíneas/metabolismo , Expresión Génica , Redes Reguladoras de Genes/genética , Genotipo , Sitios de Carácter Cuantitativo , Humanos
9.
F1000Res ; 92020.
Artículo en Inglés | MEDLINE | ID: mdl-33564394

RESUMEN

g:Profiler ( https://biit.cs.ut.ee/gprofiler) is a widely used gene list functional profiling and namespace conversion toolset that has been contributing to reproducible biological data analysis already since 2007. Here we introduce the accompanying R package, gprofiler2, developed to facilitate programmatic access to g:Profiler computations and databases via REST API. The gprofiler2 package provides an easy-to-use functionality that enables researchers to incorporate functional enrichment analysis into automated analysis pipelines written in R. The package also implements interactive visualisation methods to help to interpret the enrichment results and to illustrate them for publications. In addition, gprofiler2 gives access to the versatile gene/protein identifier conversion functionality in g:Profiler enabling to map between hundreds of different identifier types or orthologous species. The gprofiler2 package is freely available at the CRAN repository.


Asunto(s)
Biología Computacional , Perfilación de la Expresión Génica , Programas Informáticos
10.
PLoS One ; 14(4): e0215026, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-30978214

RESUMEN

The Estonian Biobank, governed by the Institute of Genomics at the University of Tartu (Biobank), has stored genetic material/DNA and continuously collected data since 2002 on a total of 52,274 individuals representing ~5% of the Estonian adult population and is increasing. To explore the utility of data available in the Biobank, we conducted a phenome-wide association study (PheWAS) in two areas of interest to healthcare researchers; asthma and liver disease. We used 11 asthma and 13 liver disease-associated single nucleotide polymorphisms (SNPs), identified from published genome-wide association studies, to test our ability to detect established associations. We confirmed 2 asthma and 5 liver disease associated variants at nominal significance and directionally consistent with published results. We found 2 associations that were opposite to what was published before (rs4374383:AA increases risk of NASH/NAFLD, rs11597086 increases ALT level). Three SNP-diagnosis pairs passed the phenome-wide significance threshold: rs9273349 and E06 (thyroiditis, p = 5.50x10-8); rs9273349 and E10 (type-1 diabetes, p = 2.60x10-7); and rs2281135 and K76 (non-alcoholic liver diseases, including NAFLD, p = 4.10x10-7). We have validated our approach and confirmed the quality of the data for these conditions. Importantly, we demonstrate that the extensive amount of genetic and medical information from the Estonian Biobank can be successfully utilized for scientific research.


Asunto(s)
Asma/genética , Bancos de Muestras Biológicas/estadística & datos numéricos , Registros Electrónicos de Salud/estadística & datos numéricos , Hepatopatías/genética , Fenómica/métodos , Polimorfismo de Nucleótido Simple , Adulto , Asma/complicaciones , Asma/epidemiología , Estonia/epidemiología , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Hepatopatías/complicaciones , Hepatopatías/epidemiología , Masculino , Fenotipo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA