Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Bioinformatics ; 37(24): 4744-4755, 2021 12 11.
Artigo em Inglês | MEDLINE | ID: mdl-34270685

RESUMO

MOTIVATION: The rise in the number of genotyped ancient individuals provides an opportunity to estimate population admixture models for many populations. However, in models describing modern populations as mixtures of ancient ones, it is typically difficult to estimate the model mixing coefficients and to evaluate its fit to the data. RESULTS: We present LINADMIX, designed to tackle this problem by solving a constrained linear model when both the ancient and the modern genotypes are represented in a low-dimensional space. LINADMIX estimates the mixing coefficients and their standard errors, and computes a P-value for testing the model fit to the data. We quantified the performance of LINADMIX using an extensive set of simulated studies. We show that LINADMIX can accurately estimate admixture coefficients, and is robust to factors such as population size, genetic drift, proportion of missing data and various types of model misspecification. AVAILABILITY AND IMPLEMENTATION: LINADMIX is available as a python code at https://github.com/swidler/linadmix. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Humanos , Genótipo
2.
Bioinformatics ; 29(2): 255-61, 2013 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-23172862

RESUMO

MOTIVATION: Non-coding RNA (ncRNA) genes are increasingly acknowledged for their importance in the human genome. However, there is no comprehensive non-redundant database for all such human genes. RESULTS: We leveraged the effective platform of GeneCards, the human gene compendium, together with the power of fRNAdb and additional primary sources, to judiciously unify all ncRNA gene entries obtainable from 15 different primary sources. Overlapping entries were clustered to unified locations based on an algorithm employing genomic coordinates. This allowed GeneCards' gamut of relevant entries to rise ∼5-fold, resulting in ∼80,000 human non-redundant ncRNAs, belonging to 14 classes. Such 'grand unification' within a regularly updated data structure will assist future ncRNA research. AVAILABILITY AND IMPLEMENTATION: All of these non-coding RNAs are included among the ∼122,500 entries in GeneCards V3.09, along with pertinent annotation, automatically mined by its built-in pipeline from 100 data sources. This information is available at www.genecards.org. CONTACT: Frida.Belinky@weizmann.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bases de Dados Genéticas , RNA não Traduzido/genética , Algoritmos , Análise por Conglomerados , Genes , Genoma Humano , Genômica , Humanos , Internet , Anotação de Sequência Molecular
3.
Hum Genomics ; 5(6): 709-17, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22155609

RESUMO

Since 1998, the bioinformatics, systems biology, genomics and medical communities have enjoyed a synergistic relationship with the GeneCards database of human genes (http://www.genecards.org). This human gene compendium was created to help to introduce order into the increasing chaos of information flow. As a consequence of viewing details and deep links related to specific genes, users have often requested enhanced capabilities, such that, over time, GeneCards has blossomed into a suite of tools (including GeneDecks, GeneALaCart, GeneLoc, GeneNote and GeneAnnot) for a variety of analyses of both single human genes and sets thereof. In this paper, we focus on inhouse and external research activities which have been enabled, enhanced, complemented and, in some cases, motivated by GeneCards. In turn, such interactions have often inspired and propelled improvements in GeneCards. We describe here the evolution and architecture of this project, including examples of synergistic applications in diverse areas such as synthetic lethality in cancer, the annotation of genetic variations in disease, omics integration in a systems biology approach to kidney disease, and bioinformatics tools.


Assuntos
Bases de Dados Genéticas , Genes/genética , Genoma Humano , Genômica , Biologia Computacional , Humanos
4.
Database (Oxford) ; 20172017 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-28605766

RESUMO

A major challenge in understanding gene regulation is the unequivocal identification of enhancer elements and uncovering their connections to genes. We present GeneHancer, a novel database of human enhancers and their inferred target genes, in the framework of GeneCards. First, we integrated a total of 434 000 reported enhancers from four different genome-wide databases: the Encyclopedia of DNA Elements (ENCODE), the Ensembl regulatory build, the functional annotation of the mammalian genome (FANTOM) project and the VISTA Enhancer Browser. Employing an integration algorithm that aims to remove redundancy, GeneHancer portrays 285 000 integrated candidate enhancers (covering 12.4% of the genome), 94 000 of which are derived from more than one source, and each assigned an annotation-derived confidence score. GeneHancer subsequently links enhancers to genes, using: tissue co-expression correlation between genes and enhancer RNAs, as well as enhancer-targeted transcription factor genes; expression quantitative trait loci for variants within enhancers; and capture Hi-C, a promoter-specific genome conformation assay. The individual scores based on each of these four methods, along with gene­enhancer genomic distances, form the basis for GeneHancer's combinatorial likelihood-based scores for enhancer­gene pairing. Finally, we define 'elite' enhancer­gene relations reflecting both a high-likelihood enhancer definition and a strong enhancer­gene association. GeneHancer predictions are fully integrated in the widely used GeneCards Suite, whereby candidate enhancers and their annotations are displayed on every relevant GeneCard. This assists in the mapping of non-coding variants to enhancers, and via the linked genes, forms a basis for variant­phenotype interpretation of whole-genome sequences in health and disease. Database URL: http://www.genecards.org/.


Assuntos
Bases de Dados de Ácidos Nucleicos , Elementos Facilitadores Genéticos , Genoma , Análise de Sequência de DNA/métodos , Navegador , Estudo de Associação Genômica Ampla , Valor Preditivo dos Testes
5.
Nucleic Acids Res ; 31(1): 142-6, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12519968

RESUMO

Recent enhancements and current research in the GeneCards (GC) (http://bioinfo.weizmann.ac.il/cards/) project are described, including the addition of gene expression profiles and integrated gene locations. Also highlighted are the contributions of specialized associated human gene-centric databases developed at the Weizmann Institute. These include the Unified Database (UDB) (http://bioinfo.weizmann.ac.il/udb) for human genome mapping, the human Chromosome 21 database at the Weizmann Insti-tute (CroW 21) (http://bioinfo.weizmann.ac.il/crow21), and the Human Olfactory Receptor Data Explora-torium (HORDE) (http://bioinfo.weizmann.ac.il/HORDE). The synergistic relationships amongst these efforts have positively impacted the quality, quantity and usefulness of the GeneCards gene compendium.


Assuntos
Cromossomos Humanos Par 21 , Bases de Dados Genéticas , Genoma Humano , Receptores Odorantes/genética , Algoritmos , Mapeamento Cromossômico , Perfilação da Expressão Gênica , Humanos , Israel
6.
Curr Protoc Bioinformatics ; 54: 1.30.1-1.30.33, 2016 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-27322403

RESUMO

GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It automatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. VarElect's capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses. © 2016 by John Wiley & Sons, Inc.


Assuntos
Mineração de Dados/métodos , Bases de Dados Genéticas , Genômica/métodos , Análise de Sequência/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Fenótipo , Proteoma , Software/normas
7.
Database (Oxford) ; 2010: baq020, 2010 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-20689021

RESUMO

GeneCards (www.genecards.org) is a comprehensive, authoritative compendium of annotative information about human genes, widely used for nearly 15 years. Its gene-centric content is automatically mined and integrated from over 80 digital sources, resulting in a web-based deep-linked card for each of >73,000 human gene entries, encompassing the following categories: protein coding, pseudogene, RNA gene, genetic locus, cluster and uncategorized. We now introduce GeneCards Version 3, featuring a speedy and sophisticated search engine and a revamped, technologically enabling infrastructure, catering to the expanding needs of biomedical researchers. A key focus is on gene-set analyses, which leverage GeneCards' unique wealth of combinatorial annotations. These include the GeneALaCart batch query facility, which tabulates user-selected annotations for multiple genes and GeneDecks, which identifies similar genes with shared annotations, and finds set-shared annotations by descriptor enrichment analysis. Such set-centric features address a host of applications, including microarray data analysis, cross-database annotation mapping and gene-disorder associations for drug targeting. We highlight the new Version 3 database architecture, its multi-faceted search engine, and its semi-automated quality assurance system. Data enhancements include an expanded visualization of gene expression patterns in normal and cancer tissues, an integrated alternative splicing pattern display, and augmented multi-source SNPs and pathways sections. GeneCards now provides direct links to gene-related research reagents such as antibodies, recombinant proteins, DNA clones and inhibitory RNAs and features gene-related drugs and compounds lists. We also portray the GeneCards Inferred Functionality Score annotation landscape tool for scoring a gene's functional information status. Finally, we delineate examples of applications and collaborations that have benefited from the GeneCards suite. Database URL: www.genecards.org.


Assuntos
Bases de Dados Genéticas , Genoma Humano , Processamento Alternativo , Bases de Dados de Proteínas , Expressão Gênica , Redes Reguladoras de Genes , Doenças Genéticas Inatas/genética , Humanos , Internet , Mutação , Polimorfismo de Nucleotídeo Único , Mapeamento de Interação de Proteínas , Ferramenta de Busca
8.
Bioinformatics ; 19 Suppl 1: i222-4, 2003.
Artigo em Inglês | MEDLINE | ID: mdl-12855462

RESUMO

MOTIVATION: Despite the numerous available whole-genome mapping resources, no comprehensive, integrated map of the human genome yet exists. RESULTS: GeneLoc, software adjunct to GeneCards and UDB, integrates gene lists by comparing genomic coordinates at the exon level and assigns unique and meaningful identifiers to each gene.


Assuntos
Mapeamento Cromossômico/métodos , Bases de Dados Genéticas , Éxons/genética , Genoma Humano , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Software , Sistemas de Gerenciamento de Base de Dados , Projeto Genoma Humano , Humanos , Armazenamento e Recuperação da Informação/métodos , Homologia de Sequência do Ácido Nucleico , Integração de Sistemas
9.
Brief Bioinform ; 4(4): 349-60, 2003 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-14725348

RESUMO

The interpretation of microarray expression results often includes extensive efforts to identify and annotate the gene representatives immobilised on the arrays. In this paper we describe the usage of our automatic GeneAnnot system, which links between Affymetrix arrays and the rich human gene annotations available in GeneCards. We explain GeneCards search options and results display; elaborate on the presentation of expression information in GeneCards, including both our whole-genome GeneNote project and external expression resources; describe the various parameters and displays used by GeneAnnot to assess the annotation quality and probeset specificity; and show how to search GeneAnnot and GeneNote websites directly.


Assuntos
Interpretação Estatística de Dados , Análise de Sequência com Séries de Oligonucleotídeos , Software
10.
Bioinformatics ; 20(9): 1457-8, 2004 Jun 12.
Artigo em Inglês | MEDLINE | ID: mdl-14962929

RESUMO

MOTIVATION: High density oligonucleotide arrays are usually annotated in a one-to-one fashion, with each probeset assigned to one gene. However, in reality, subsets of oligonucleotides in a probeset may match sequences within more than one gene, potentially leading to misinterpretations. Moreover, a gene is often represented by more than one probeset, and analyzing probe matches at the mRNA level can help one deduce whether these probesets are derived from the same or different splice variants. RESULTS: The GeneAnnot system comprehensively documents the many-to-many relationship between oligonucleotide array probesets and annotated genes in GeneCards. It performs pairwise alignments between the probe sequences and gene transcripts, and assigns sensitivity and specificity scores to each probeset/gene pair. AVAILABILITY: http://genecards.weizmann.ac.il/geneannot/ SUPPLEMENTARY INFORMATION: Program description and statistics http://genecards.weizmann.ac.il/geneannot/DOC/index.html


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Documentação/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Alinhamento de Sequência/métodos , Software , Análise de Sequência de DNA/métodos
11.
Bioinformatics ; 18(11): 1542-3, 2002 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-12424129

RESUMO

MOTIVATION: In the post-genomic era, functional analysis of genes requires a sophisticated interdisciplinary arsenal. Comprehensive resources are challenged to provide consistently improving, state-of-the-art tools. RESULTS: GeneCards (Rebhan et al., 1998) has made innovative strides: (a). regular updates and enhancements incorporating new genes enriched with sequences, genomic locations, cDNA assemblies, orthologies, medical information, 3D protein structures, gene expression, and focused SNP summaries; (b). restructured software using object-oriented Perl, migration to schema-driven XML, and (c). pilot studies, introducing methods to produce cards for novel and predicted genes.


Assuntos
Algoritmos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Genoma Humano , Armazenamento e Recuperação da Informação/métodos , Internet , Mapeamento Cromossômico/métodos , Redes de Comunicação de Computadores , Perfilação da Expressão Gênica/métodos , Humanos , Alinhamento de Sequência/métodos , Análise de Sequência/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA