Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Bioinformatics ; 37(24): 4744-4755, 2021 12 11.
Artículo en Inglés | MEDLINE | ID: mdl-34270685

RESUMEN

MOTIVATION: The rise in the number of genotyped ancient individuals provides an opportunity to estimate population admixture models for many populations. However, in models describing modern populations as mixtures of ancient ones, it is typically difficult to estimate the model mixing coefficients and to evaluate its fit to the data. RESULTS: We present LINADMIX, designed to tackle this problem by solving a constrained linear model when both the ancient and the modern genotypes are represented in a low-dimensional space. LINADMIX estimates the mixing coefficients and their standard errors, and computes a P-value for testing the model fit to the data. We quantified the performance of LINADMIX using an extensive set of simulated studies. We show that LINADMIX can accurately estimate admixture coefficients, and is robust to factors such as population size, genetic drift, proportion of missing data and various types of model misspecification. AVAILABILITY AND IMPLEMENTATION: LINADMIX is available as a python code at https://github.com/swidler/linadmix. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Humanos , Genotipo
2.
Database (Oxford) ; 20172017 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-28605766

RESUMEN

A major challenge in understanding gene regulation is the unequivocal identification of enhancer elements and uncovering their connections to genes. We present GeneHancer, a novel database of human enhancers and their inferred target genes, in the framework of GeneCards. First, we integrated a total of 434 000 reported enhancers from four different genome-wide databases: the Encyclopedia of DNA Elements (ENCODE), the Ensembl regulatory build, the functional annotation of the mammalian genome (FANTOM) project and the VISTA Enhancer Browser. Employing an integration algorithm that aims to remove redundancy, GeneHancer portrays 285 000 integrated candidate enhancers (covering 12.4% of the genome), 94 000 of which are derived from more than one source, and each assigned an annotation-derived confidence score. GeneHancer subsequently links enhancers to genes, using: tissue co-expression correlation between genes and enhancer RNAs, as well as enhancer-targeted transcription factor genes; expression quantitative trait loci for variants within enhancers; and capture Hi-C, a promoter-specific genome conformation assay. The individual scores based on each of these four methods, along with gene­enhancer genomic distances, form the basis for GeneHancer's combinatorial likelihood-based scores for enhancer­gene pairing. Finally, we define 'elite' enhancer­gene relations reflecting both a high-likelihood enhancer definition and a strong enhancer­gene association. GeneHancer predictions are fully integrated in the widely used GeneCards Suite, whereby candidate enhancers and their annotations are displayed on every relevant GeneCard. This assists in the mapping of non-coding variants to enhancers, and via the linked genes, forms a basis for variant­phenotype interpretation of whole-genome sequences in health and disease. Database URL: http://www.genecards.org/.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Elementos de Facilitación Genéticos , Genoma , Análisis de Secuencia de ADN/métodos , Navegador Web , Estudio de Asociación del Genoma Completo , Valor Predictivo de las Pruebas
3.
Curr Protoc Bioinformatics ; 54: 1.30.1-1.30.33, 2016 06 20.
Artículo en Inglés | MEDLINE | ID: mdl-27322403

RESUMEN

GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It automatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. VarElect's capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses. © 2016 by John Wiley & Sons, Inc.


Asunto(s)
Minería de Datos/métodos , Bases de Datos Genéticas , Genómica/métodos , Análisis de Secuencia/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Fenotipo , Proteoma , Programas Informáticos/normas
4.
Bioinformatics ; 29(2): 255-61, 2013 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-23172862

RESUMEN

MOTIVATION: Non-coding RNA (ncRNA) genes are increasingly acknowledged for their importance in the human genome. However, there is no comprehensive non-redundant database for all such human genes. RESULTS: We leveraged the effective platform of GeneCards, the human gene compendium, together with the power of fRNAdb and additional primary sources, to judiciously unify all ncRNA gene entries obtainable from 15 different primary sources. Overlapping entries were clustered to unified locations based on an algorithm employing genomic coordinates. This allowed GeneCards' gamut of relevant entries to rise ∼5-fold, resulting in ∼80,000 human non-redundant ncRNAs, belonging to 14 classes. Such 'grand unification' within a regularly updated data structure will assist future ncRNA research. AVAILABILITY AND IMPLEMENTATION: All of these non-coding RNAs are included among the ∼122,500 entries in GeneCards V3.09, along with pertinent annotation, automatically mined by its built-in pipeline from 100 data sources. This information is available at www.genecards.org. CONTACT: Frida.Belinky@weizmann.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Bases de Datos Genéticas , ARN no Traducido/genética , Algoritmos , Análisis por Conglomerados , Genes , Genoma Humano , Genómica , Humanos , Internet , Anotación de Secuencia Molecular
5.
Hum Genomics ; 5(6): 709-17, 2011 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-22155609

RESUMEN

Since 1998, the bioinformatics, systems biology, genomics and medical communities have enjoyed a synergistic relationship with the GeneCards database of human genes (http://www.genecards.org). This human gene compendium was created to help to introduce order into the increasing chaos of information flow. As a consequence of viewing details and deep links related to specific genes, users have often requested enhanced capabilities, such that, over time, GeneCards has blossomed into a suite of tools (including GeneDecks, GeneALaCart, GeneLoc, GeneNote and GeneAnnot) for a variety of analyses of both single human genes and sets thereof. In this paper, we focus on inhouse and external research activities which have been enabled, enhanced, complemented and, in some cases, motivated by GeneCards. In turn, such interactions have often inspired and propelled improvements in GeneCards. We describe here the evolution and architecture of this project, including examples of synergistic applications in diverse areas such as synthetic lethality in cancer, the annotation of genetic variations in disease, omics integration in a systems biology approach to kidney disease, and bioinformatics tools.


Asunto(s)
Bases de Datos Genéticas , Genes/genética , Genoma Humano , Genómica , Biología Computacional , Humanos
6.
Database (Oxford) ; 2010: baq020, 2010 Aug 05.
Artículo en Inglés | MEDLINE | ID: mdl-20689021

RESUMEN

GeneCards (www.genecards.org) is a comprehensive, authoritative compendium of annotative information about human genes, widely used for nearly 15 years. Its gene-centric content is automatically mined and integrated from over 80 digital sources, resulting in a web-based deep-linked card for each of >73,000 human gene entries, encompassing the following categories: protein coding, pseudogene, RNA gene, genetic locus, cluster and uncategorized. We now introduce GeneCards Version 3, featuring a speedy and sophisticated search engine and a revamped, technologically enabling infrastructure, catering to the expanding needs of biomedical researchers. A key focus is on gene-set analyses, which leverage GeneCards' unique wealth of combinatorial annotations. These include the GeneALaCart batch query facility, which tabulates user-selected annotations for multiple genes and GeneDecks, which identifies similar genes with shared annotations, and finds set-shared annotations by descriptor enrichment analysis. Such set-centric features address a host of applications, including microarray data analysis, cross-database annotation mapping and gene-disorder associations for drug targeting. We highlight the new Version 3 database architecture, its multi-faceted search engine, and its semi-automated quality assurance system. Data enhancements include an expanded visualization of gene expression patterns in normal and cancer tissues, an integrated alternative splicing pattern display, and augmented multi-source SNPs and pathways sections. GeneCards now provides direct links to gene-related research reagents such as antibodies, recombinant proteins, DNA clones and inhibitory RNAs and features gene-related drugs and compounds lists. We also portray the GeneCards Inferred Functionality Score annotation landscape tool for scoring a gene's functional information status. Finally, we delineate examples of applications and collaborations that have benefited from the GeneCards suite. Database URL: www.genecards.org.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Empalme Alternativo , Bases de Datos de Proteínas , Expresión Génica , Redes Reguladoras de Genes , Enfermedades Genéticas Congénitas/genética , Humanos , Internet , Mutación , Polimorfismo de Nucleótido Simple , Mapeo de Interacción de Proteínas , Motor de Búsqueda
7.
Bioinformatics ; 20(9): 1457-8, 2004 Jun 12.
Artículo en Inglés | MEDLINE | ID: mdl-14962929

RESUMEN

MOTIVATION: High density oligonucleotide arrays are usually annotated in a one-to-one fashion, with each probeset assigned to one gene. However, in reality, subsets of oligonucleotides in a probeset may match sequences within more than one gene, potentially leading to misinterpretations. Moreover, a gene is often represented by more than one probeset, and analyzing probe matches at the mRNA level can help one deduce whether these probesets are derived from the same or different splice variants. RESULTS: The GeneAnnot system comprehensively documents the many-to-many relationship between oligonucleotide array probesets and annotated genes in GeneCards. It performs pairwise alignments between the probe sequences and gene transcripts, and assigns sensitivity and specificity scores to each probeset/gene pair. AVAILABILITY: http://genecards.weizmann.ac.il/geneannot/ SUPPLEMENTARY INFORMATION: Program description and statistics http://genecards.weizmann.ac.il/geneannot/DOC/index.html


Asunto(s)
Algoritmos , Mapeo Cromosómico/métodos , Documentación/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Alineación de Secuencia/métodos , Programas Informáticos , Análisis de Secuencia de ADN/métodos
8.
Bioinformatics ; 19 Suppl 1: i222-4, 2003.
Artículo en Inglés | MEDLINE | ID: mdl-12855462

RESUMEN

MOTIVATION: Despite the numerous available whole-genome mapping resources, no comprehensive, integrated map of the human genome yet exists. RESULTS: GeneLoc, software adjunct to GeneCards and UDB, integrates gene lists by comparing genomic coordinates at the exon level and assigns unique and meaningful identifiers to each gene.


Asunto(s)
Mapeo Cromosómico/métodos , Bases de Datos Genéticas , Exones/genética , Genoma Humano , Alineación de Secuencia/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Sistemas de Administración de Bases de Datos , Proyecto Genoma Humano , Humanos , Almacenamiento y Recuperación de la Información/métodos , Homología de Secuencia de Ácido Nucleico , Integración de Sistemas
9.
Nucleic Acids Res ; 31(1): 142-6, 2003 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-12519968

RESUMEN

Recent enhancements and current research in the GeneCards (GC) (http://bioinfo.weizmann.ac.il/cards/) project are described, including the addition of gene expression profiles and integrated gene locations. Also highlighted are the contributions of specialized associated human gene-centric databases developed at the Weizmann Institute. These include the Unified Database (UDB) (http://bioinfo.weizmann.ac.il/udb) for human genome mapping, the human Chromosome 21 database at the Weizmann Insti-tute (CroW 21) (http://bioinfo.weizmann.ac.il/crow21), and the Human Olfactory Receptor Data Explora-torium (HORDE) (http://bioinfo.weizmann.ac.il/HORDE). The synergistic relationships amongst these efforts have positively impacted the quality, quantity and usefulness of the GeneCards gene compendium.


Asunto(s)
Cromosomas Humanos Par 21 , Bases de Datos Genéticas , Genoma Humano , Receptores Odorantes/genética , Algoritmos , Mapeo Cromosómico , Perfilación de la Expresión Génica , Humanos , Israel
10.
Brief Bioinform ; 4(4): 349-60, 2003 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-14725348

RESUMEN

The interpretation of microarray expression results often includes extensive efforts to identify and annotate the gene representatives immobilised on the arrays. In this paper we describe the usage of our automatic GeneAnnot system, which links between Affymetrix arrays and the rich human gene annotations available in GeneCards. We explain GeneCards search options and results display; elaborate on the presentation of expression information in GeneCards, including both our whole-genome GeneNote project and external expression resources; describe the various parameters and displays used by GeneAnnot to assess the annotation quality and probeset specificity; and show how to search GeneAnnot and GeneNote websites directly.


Asunto(s)
Interpretación Estadística de Datos , Análisis de Secuencia por Matrices de Oligonucleótidos , Programas Informáticos
11.
Bioinformatics ; 18(11): 1542-3, 2002 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-12424129

RESUMEN

MOTIVATION: In the post-genomic era, functional analysis of genes requires a sophisticated interdisciplinary arsenal. Comprehensive resources are challenged to provide consistently improving, state-of-the-art tools. RESULTS: GeneCards (Rebhan et al., 1998) has made innovative strides: (a). regular updates and enhancements incorporating new genes enriched with sequences, genomic locations, cDNA assemblies, orthologies, medical information, 3D protein structures, gene expression, and focused SNP summaries; (b). restructured software using object-oriented Perl, migration to schema-driven XML, and (c). pilot studies, introducing methods to produce cards for novel and predicted genes.


Asunto(s)
Algoritmos , Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas , Genoma Humano , Almacenamiento y Recuperación de la Información/métodos , Internet , Mapeo Cromosómico/métodos , Redes de Comunicación de Computadores , Perfilación de la Expresión Génica/métodos , Humanos , Alineación de Secuencia/métodos , Análisis de Secuencia/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...