Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 59
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Annu Rev Genomics Hum Genet ; 24: 347-368, 2023 08 25.
Article in English | MEDLINE | ID: mdl-37253596

ABSTRACT

Continued advances in precision medicine rely on the widespread sharing of data that relate human genetic variation to disease. However, data sharing is severely limited by legal, regulatory, and ethical restrictions that safeguard patient privacy. Federated analysis addresses this problem by transferring the code to the data-providing the technical and legal capability to analyze the data within their secure home environment rather than transferring the data to another institution for analysis. This allows researchers to gain new insights from data that cannot be moved, while respecting patient privacy and the data stewards' legal obligations. Because federated analysis is a technical solution to the legal challenges inherent in data sharing, the technology and policy implications must be evaluated together. Here, we summarize the technical approaches to federated analysis and provide a legal analysis of their policy implications.


Subject(s)
Fenbendazole , Privacy , Humans , Health Facilities , Information Dissemination , Policy
2.
Nucleic Acids Res ; 52(D1): D1082-D1088, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37953330

ABSTRACT

The UCSC Genome Browser (https://genome.ucsc.edu) is a web-based genomic visualization and analysis tool that serves data to over 7,000 distinct users per day worldwide. It provides annotation data on thousands of genome assemblies, ranging from human to SARS-CoV2. This year, we have introduced new data from the Human Pangenome Reference Consortium and on viral genomes including SARS-CoV2. We have added 1,200 new genomes to our GenArk genome system, increasing the overall diversity of our genomic representation. We have added support for nine new user-contributed track hubs to our public hub system. Additionally, we have released 29 new tracks on the human genome and 11 new tracks on the mouse genome. Collectively, these new features expand both the breadth and depth of the genomic knowledge that we share publicly with users worldwide.


Subject(s)
Databases, Genetic , Genomics , RNA, Viral , Animals , Humans , Mice , Genome, Human , Genome, Viral , Internet , Molecular Sequence Annotation , Software
3.
PLoS Genet ; 14(12): e1007752, 2018 12.
Article in English | MEDLINE | ID: mdl-30586411

ABSTRACT

The BRCA Challenge is a long-term data-sharing project initiated within the Global Alliance for Genomics and Health (GA4GH) to aggregate BRCA1 and BRCA2 data to support highly collaborative research activities. Its goal is to generate an informed and current understanding of the impact of genetic variation on cancer risk across the iconic cancer predisposition genes, BRCA1 and BRCA2. Initially, reported variants in BRCA1 and BRCA2 available from public databases were integrated into a single, newly created site, www.brcaexchange.org. The purpose of the BRCA Exchange is to provide the community with a reliable and easily accessible record of variants interpreted for a high-penetrance phenotype. More than 20,000 variants have been aggregated, three times the number found in the next-largest public database at the project's outset, of which approximately 7,250 have expert classifications. The data set is based on shared information from existing clinical databases-Breast Cancer Information Core (BIC), ClinVar, and the Leiden Open Variation Database (LOVD)-as well as population databases, all linked to a single point of access. The BRCA Challenge has brought together the existing international Evidence-based Network for the Interpretation of Germline Mutant Alleles (ENIGMA) consortium expert panel, along with expert clinicians, diagnosticians, researchers, and database providers, all with a common goal of advancing our understanding of BRCA1 and BRCA2 variation. Ongoing work includes direct contact with national centers with access to BRCA1 and BRCA2 diagnostic data to encourage data sharing, development of methods suitable for extraction of genetic variation at the level of individual laboratory reports, and engagement with participant communities to enable a more comprehensive understanding of the clinical significance of genetic variation in BRCA1 and BRCA2.


Subject(s)
Databases, Genetic , Genes, BRCA1 , Genes, BRCA2 , Genetic Variation , Alleles , Breast Neoplasms/genetics , Databases, Genetic/ethics , Female , Gene Frequency , Genetic Predisposition to Disease , Humans , Information Dissemination/ethics , Information Dissemination/legislation & jurisprudence , Male , Mutation , Ovarian Neoplasms/genetics , Penetrance , Phenotype , Risk Factors
4.
Hum Mutat ; 40(9): 1546-1556, 2019 09.
Article in English | MEDLINE | ID: mdl-31294896

ABSTRACT

Testing for variation in BRCA1 and BRCA2 (commonly referred to as BRCA1/2), has emerged as a standard clinical practice and is helping countless women better understand and manage their heritable risk of breast and ovarian cancer. Yet the increased rate of BRCA1/2 testing has led to an increasing number of Variants of Uncertain Significance (VUS), and the rate of VUS discovery currently outpaces the rate of clinical variant interpretation. Computational prediction is a key component of the variant interpretation pipeline. In the CAGI5 ENIGMA Challenge, six prediction teams submitted predictions on 326 newly-interpreted variants from the ENIGMA Consortium. By evaluating these predictions against the new interpretations, we have gained a number of insights on the state of the art of variant prediction and specific steps to further advance this state of the art.


Subject(s)
BRCA1 Protein/genetics , BRCA2 Protein/genetics , Breast Neoplasms/diagnosis , Computational Biology/methods , Ovarian Neoplasms/diagnosis , Breast Neoplasms/genetics , Early Detection of Cancer , Female , Genetic Predisposition to Disease , Genetic Testing , Genetic Variation , Humans , Models, Genetic , Ovarian Neoplasms/genetics
5.
Hum Mutat ; 38(9): 1266-1276, 2017 09.
Article in English | MEDLINE | ID: mdl-28544481

ABSTRACT

The advent of next-generation sequencing has dramatically decreased the cost for whole-genome sequencing and increased the viability for its application in research and clinical care. The Personal Genome Project (PGP) provides unrestricted access to genomes of individuals and their associated phenotypes. This resource enabled the Critical Assessment of Genome Interpretation (CAGI) to create a community challenge to assess the bioinformatics community's ability to predict traits from whole genomes. In the CAGI PGP challenge, researchers were asked to predict whether an individual had a particular trait or profile based on their whole genome. Several approaches were used to assess submissions, including ROC AUC (area under receiver operating characteristic curve), probability rankings, the number of correct predictions, and statistical significance simulations. Overall, we found that prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within the general population, whereas matching genomes to trait profiles relies heavily upon a small number of common traits including ancestry, blood type, and eye color. When a rare genetic disorder is present, profiles can be matched when one or more pathogenic variants are identified. Prediction accuracy has improved substantially over the last 6 years due to improved methodology and a better understanding of features.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Whole Genome Sequencing/methods , Area Under Curve , Genetic Predisposition to Disease , Human Genome Project , Humans , Phenotype , Quantitative Trait Loci
6.
Nucleic Acids Res ; 43(Database issue): D812-7, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25392408

ABSTRACT

The UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu/) is a web-based application that integrates relevant data, analysis and visualization, allowing users to easily discover and share their research observations. Users can explore the relationship between genomic alterations and phenotypes by visualizing various -omic data alongside clinical and phenotypic features, such as age, subtype classifications and genomic biomarkers. The Cancer Genomics Browser currently hosts 575 public datasets from genome-wide analyses of over 227,000 samples, including datasets from TCGA, CCLE, Connectivity Map and TARGET. Users can download and upload clinical data, generate Kaplan-Meier plots dynamically, export data directly to Galaxy for analysis, plus generate URL bookmarks of specific views of the data to share with others.


Subject(s)
Databases, Genetic , Neoplasms/genetics , Cell Line, Tumor , Child , Genomics , Humans , Internet , Kaplan-Meier Estimate , Neoplasms/diagnosis , Neoplasms/mortality , Phenotype
7.
Nucleic Acids Res ; 42(Web Server issue): W361-7, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24829458

ABSTRACT

Regulation of gene expression is executed in many cases by RNA-binding proteins (RBPs) that bind to mRNAs as well as to non-coding RNAs. RBPs recognize their RNA target via specific binding sites on the RNA. Predicting the binding sites of RBPs is known to be a major challenge. We present a new webserver, RBPmap, freely accessible through the website http://rbpmap.technion.ac.il/ for accurate prediction and mapping of RBP binding sites. RBPmap has been developed specifically for mapping RBPs in human, mouse and Drosophila melanogaster genomes, though it supports other organisms too. RBPmap enables the users to select motifs from a large database of experimentally defined motifs. In addition, users can provide any motif of interest, given as either a consensus or a PSSM. The algorithm for mapping the motifs is based on a Weighted-Rank approach, which considers the clustering propensity of the binding sites and the overall tendency of regulatory regions to be conserved. In addition, RBPmap incorporates a position-specific background model, designed uniquely for different genomic regions, such as splice sites, 5' and 3' UTRs, non-coding RNA and intergenic regions. RBPmap was tested on high-throughput RNA-binding experiments and was proved to be highly accurate.


Subject(s)
RNA-Binding Proteins/metabolism , RNA/chemistry , RNA/metabolism , Software , Algorithms , Animals , Binding Sites , Drosophila melanogaster/genetics , Humans , Internet , Mice , Nucleotide Motifs , Sequence Analysis, RNA
8.
Nucleic Acids Res ; 42(Database issue): D764-70, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24270787

ABSTRACT

The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a large collection of organisms, primarily vertebrates, with an emphasis on the human and mouse genomes. The Browser's web-based tools provide an integrated environment for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic data sets. As of September 2013, the database contained genomic sequence and a basic set of annotation 'tracks' for ∼90 organisms. Significant new annotations include a 60-species multiple alignment conservation track on the mouse, updated UCSC Genes tracks for human and mouse, and several new sets of variation and ENCODE data. New software tools include a Variant Annotation Integrator that returns predicted functional effects of a set of variants uploaded as a custom track, an extension to UCSC Genes that displays haplotype alleles for protein-coding genes and an expansion of data hubs that includes the capability to display remotely hosted user-provided assembly sequence in addition to annotation data. To improve European access, we have added a Genome Browser mirror (http://genome-euro.ucsc.edu) hosted at Bielefeld University in Germany.


Subject(s)
Databases, Genetic , Genome , Genomics , Alleles , Animals , Genome, Human , Humans , Internet , Mice , Molecular Sequence Annotation , Polymorphism, Single Nucleotide , Sequence Alignment , Software
9.
PLoS Genet ; 9(1): e1003186, 2013.
Article in English | MEDLINE | ID: mdl-23300487

ABSTRACT

Facioscapulohumeral muscular dystrophy (FSHD) is a common muscle disease whose molecular pathogenesis remains largely unknown. Over-expression of FSHD region gene 1 (FRG1) in mice, frogs, and worms perturbs muscle development and causes FSHD-like phenotypes. FRG1 has been implicated in splicing, and we asked how splicing might be involved in FSHD by conducting a genome-wide analysis in FRG1 mice. We find that splicing perturbations parallel the responses of different muscles to FRG1 over-expression and disease progression. Interestingly, binding sites for the Rbfox family of splicing factors are over-represented in a subset of FRG1-affected splicing events. Rbfox1 knockdown, over-expression, and RNA-IP confirm that these are direct Rbfox1 targets. We find that FRG1 is associated to the Rbfox1 RNA and decreases its stability. Consistent with this, Rbfox1 expression is down-regulated in mice and cells over-expressing FRG1 as well as in FSHD patients. Among the genes affected is Calpain 3, which is mutated in limb girdle muscular dystrophy, a disease phenotypically similar to FSHD. In FRG1 mice and FSHD patients, the Calpain 3 isoform lacking exon 6 (Capn3 E6-) is increased. Finally, Rbfox1 knockdown and over-expression of Capn3 E6- inhibit muscle differentiation. Collectively, our results suggest that a component of FSHD pathogenesis may arise by over-expression of FRG1, reducing Rbfox1 levels and leading to aberrant expression of an altered Calpain 3 protein through dysregulated splicing.


Subject(s)
Calpain , Muscle Proteins , Muscular Dystrophy, Facioscapulohumeral , Proteins , RNA-Binding Proteins/genetics , Alternative Splicing/genetics , Animals , Calpain/genetics , Calpain/metabolism , Cells, Cultured , Disease Models, Animal , Exons , Gene Expression Regulation, Developmental , HEK293 Cells , Humans , Mice , Microfilament Proteins , Muscle Development/genetics , Muscle Proteins/genetics , Muscle Proteins/metabolism , Muscular Dystrophy, Facioscapulohumeral/genetics , Muscular Dystrophy, Facioscapulohumeral/metabolism , Muscular Dystrophy, Facioscapulohumeral/pathology , Myoblasts/cytology , Myoblasts/metabolism , Proteins/genetics , Proteins/metabolism , RNA Splicing Factors , RNA-Binding Proteins/metabolism
10.
RNA ; 19(5): 627-38, 2013 May.
Article in English | MEDLINE | ID: mdl-23525800

ABSTRACT

Alternative splicing contributes to muscle development, but a complete set of muscle-splicing factors and their combinatorial interactions are unknown. Previous work identified ACUAA ("STAR" motif) as an enriched intron sequence near muscle-specific alternative exons such as Capzb exon 9. Mass spectrometry of myoblast proteins selected by the Capzb exon 9 intron via RNA affinity chromatography identifies Quaking (QK), a protein known to regulate mRNA function through ACUAA motifs in 3' UTRs. We find that QK promotes inclusion of Capzb exon 9 in opposition to repression by polypyrimidine tract-binding protein (PTB). QK depletion alters inclusion of 406 cassette exons whose adjacent intron sequences are also enriched in ACUAA motifs. During differentiation of myoblasts to myotubes, QK levels increase two- to threefold, suggesting a mechanism for QK-responsive exon regulation. Combined analysis of the PTB- and QK-splicing regulatory networks during myogenesis suggests that 39% of regulated exons are under the control of one or both of these splicing factors. This work provides the first evidence that QK is a global regulator of splicing during muscle development in vertebrates and shows how overlapping splicing regulatory networks contribute to gene expression programs during differentiation.


Subject(s)
Cell Differentiation/genetics , Polypyrimidine Tract-Binding Protein , RNA Splicing/genetics , RNA-Binding Proteins , 3' Untranslated Regions/genetics , Binding Sites , Cells, Cultured , Exons , Gene Expression Regulation, Developmental , Gene Regulatory Networks , HeLa Cells , Humans , Introns , Muscle Cells/cytology , Muscle Cells/metabolism , Muscle Development/genetics , Organ Specificity , Polypyrimidine Tract-Binding Protein/genetics , Polypyrimidine Tract-Binding Protein/metabolism , RNA-Binding Proteins/genetics , RNA-Binding Proteins/metabolism
11.
Nucleic Acids Res ; 41(Database issue): D949-54, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23109555

ABSTRACT

The UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu/) is a set of web-based tools to display, investigate and analyse cancer genomics data and its associated clinical information. The browser provides whole-genome to base-pair level views of several different types of genomics data, including some next-generation sequencing platforms. The ability to view multiple datasets together allows users to make comparisons across different data and cancer types. Biological pathways, collections of genes, genomic or clinical information can be used to sort, aggregate and zoom into a group of samples. We currently display an expanding set of data from various sources, including 201 datasets from 22 TCGA (The Cancer Genome Atlas) cancers as well as data from Cancer Cell Line Encyclopedia and Stand Up To Cancer. New features include a completely redesigned user interface with an interactive tutorial and updated documentation. We have also added data downloads, additional clinical heatmap features, and an updated Tumor Image Browser based on Google Maps. New security features allow authenticated users access to private datasets hosted by several different consortia through the public website.


Subject(s)
Databases, Genetic , Genomics , Neoplasms/genetics , Cell Line, Tumor , Humans , Internet
12.
Nucleic Acids Res ; 41(Database issue): D64-9, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23155063

ABSTRACT

The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic datasets. As of September 2012, genomic sequence and a basic set of annotation 'tracks' are provided for 63 organisms, including 26 mammals, 13 non-mammal vertebrates, 3 invertebrate deuterostomes, 13 insects, 6 worms, yeast and sea hare. In the past year 19 new genome assemblies have been added, and we anticipate releasing another 28 in early 2013. Further, a large number of annotation tracks have been either added, updated by contributors or remapped to the latest human reference genome. Among these are an updated UCSC Genes track for human and mouse assemblies. We have also introduced several features to improve usability, including new navigation menus. This article provides an update to the UCSC Genome Browser database, which has been previously featured in the Database issue of this journal.


Subject(s)
Databases, Genetic , Genomics , Animals , Genome, Human , Humans , Internet , Mice , Molecular Sequence Annotation , Software
13.
Ann Neurol ; 74(6): 862-72, 2013 Dec.
Article in English | MEDLINE | ID: mdl-23929620

ABSTRACT

OBJECTIVE: To develop RNA splicing biomarkers of disease severity and therapeutic response in myotonic dystrophy type 1 (DM1) and type 2 (DM2). METHODS: In a discovery cohort, we used microarrays to perform global analysis of alternative splicing in DM1 and DM2. The newly identified splicing changes were combined with previous data to create a panel of 50 putative splicing defects. In a validation cohort of 50 DM1 subjects, we measured the strength of ankle dorsiflexion (ADF) and then obtained a needle biopsy of tibialis anterior (TA) to analyze splice events in muscle RNA. The specificity of DM-associated splicing defects was assessed in disease controls. The CTG expansion size in muscle tissue was determined by Southern blot. The reversibility of splicing defects was assessed in transgenic mice by using antisense oligonucleotides to reduce levels of toxic RNA. RESULTS: Forty-two splicing defects were confirmed in TA muscle in the validation cohort. Among these, 20 events showed graded changes that correlated with ADF weakness. Five other splice events were strongly affected in DM1 subjects with normal ADF strength. Comparison to disease controls and mouse models indicated that splicing changes were DM-specific, mainly attributable to MBNL1 sequestration, and reversible in mice by targeted knockdown of toxic RNA. Splicing defects and weakness were not correlated with CTG expansion size in muscle tissue. INTERPRETATION: Alternative splicing changes in skeletal muscle may serve as biomarkers of disease severity and therapeutic response in myotonic dystrophy.


Subject(s)
Alternative Splicing , Myotonic Dystrophy/genetics , Adolescent , Adult , Aged , Animals , Biomarkers , Cohort Studies , DNA-Binding Proteins/genetics , Disease Models, Animal , Female , Humans , Male , Mice , Mice, Knockout , Mice, Transgenic , Middle Aged , Muscle, Skeletal/pathology , Muscle, Skeletal/physiopathology , Myotonic Disorders/genetics , Myotonic Disorders/pathology , Myotonic Disorders/physiopathology , Myotonic Dystrophy/pathology , Myotonic Dystrophy/physiopathology , Oligonucleotides, Antisense/genetics , RNA-Binding Proteins/genetics , Severity of Illness Index , Young Adult
14.
Nucleic Acids Res ; 40(Database issue): D918-23, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22086951

ABSTRACT

The University of California Santa Cruz Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analyzing and sharing both publicly available and user-generated genomic data sets. In the past year, the local database has been updated with four new species assemblies, and we anticipate another four will be released by the end of 2011. Further, a large number of annotation tracks have been either added, updated by contributors, or remapped to the latest human reference genome. Among these are new phenotype and disease annotations, UCSC genes, and a major dbSNP update, which required new visualization methods. Growing beyond the local database, this year we have introduced 'track data hubs', which allow the Genome Browser to provide access to remotely located sets of annotations. This feature is designed to significantly extend the number and variety of annotation tracks that are publicly available for visualization and analysis from within our site. We have also introduced several usability features including track search and a context-sensitive menu of options available with a right-click anywhere on the Browser's image.


Subject(s)
Databases, Nucleic Acid , Genome , Animals , Disease/genetics , Genome, Human , Genomics , Humans , Internet , Molecular Sequence Annotation , Phenotype
15.
Nucleic Acids Res ; 40(Database issue): D912-7, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22075998

ABSTRACT

The Encyclopedia of DNA Elements (ENCODE) Consortium is entering its 5th year of production-level effort generating high-quality whole-genome functional annotations of the human genome. The past year has brought the ENCODE compendium of functional elements to critical mass, with a diverse set of 27 biochemical assays now covering 200 distinct human cell types. Within the mouse genome, which has been under study by ENCODE groups for the past 2 years, 37 cell types have been assayed. Over 2000 individual experiments have been completed and submitted to the Data Coordination Center for public use. UCSC makes this data available on the quality-reviewed public Genome Browser (http://genome.ucsc.edu) and on an early-access Preview Browser (http://genome-preview.ucsc.edu). Visual browsing, data mining and download of raw and processed data files are all supported. An ENCODE portal (http://encodeproject.org) provides specialized tools and information about the ENCODE data sets.


Subject(s)
Databases, Nucleic Acid , Genome, Human , Genome , Mice/genetics , Animals , Humans , Internet , Molecular Sequence Annotation , Software
16.
Nat Genet ; 37(8): 844-52, 2005 Aug.
Article in English | MEDLINE | ID: mdl-16041372

ABSTRACT

Alternative RNA splicing greatly increases proteome diversity and may thereby contribute to tissue-specific functions. We carried out genome-wide quantitative analysis of alternative splicing using a custom Affymetrix microarray to assess the role of the neuronal splicing factor Nova in the brain. We used a stringent algorithm to identify 591 exons that were differentially spliced in the brain relative to immune tissues, and 6.6% of these showed major splicing defects in the neocortex of Nova2-/- mice. We tested 49 exons with the largest predicted Nova-dependent splicing changes and validated all 49 by RT-PCR. We analyzed the encoded proteins and found that all those with defined brain functions acted in the synapse (34 of 40, including neurotransmitter receptors, cation channels, adhesion and scaffold proteins) or in axon guidance (8 of 40). Moreover, of the 35 proteins with known interaction partners, 74% (26) interact with each other. Validating a large set of Nova RNA targets has led us to identify a multi-tiered network in which Nova regulates the exon content of RNAs encoding proteins that interact in the synapse.


Subject(s)
Alternative Splicing/physiology , Antigens, Neoplasm/physiology , Nerve Tissue Proteins/physiology , RNA-Binding Proteins/physiology , Synapses/metabolism , Animals , Mice , Mice, Knockout , Neocortex/metabolism , Neuro-Oncological Ventral Antigen , Oligonucleotide Array Sequence Analysis
17.
medRxiv ; 2024 May 29.
Article in English | MEDLINE | ID: mdl-38854136

ABSTRACT

The ClinGen Hereditary Breast, Ovarian and Pancreatic Cancer (HBOP) Variant Curation Expert Panel (VCEP) is composed of internationally recognized experts in clinical genetics, molecular biology and variant interpretation. This VCEP made specifications for ACMG/AMP guidelines for the ataxia telangiectasia mutated (ATM) gene according to the Food and Drug Administration (FDA)-approved ClinGen protocol. These gene-specific rules for ATM were modified from the American College of Medical Genetics and Association for Molecular Pathology (ACMG/AMP) guidelines and were tested against 33 ATM variants of various types and classifications in a pilot curation phase. The pilot revealed a majority agreement between the HBOP VCEP classifications and the ClinVar-deposited classifications. Six pilot variants had conflicting interpretations in ClinVar and reevaluation with the VCEP's ATM-specific rules resulted in four that were classified as benign, one as likely pathogenic and one as a variant of uncertain significance (VUS) by the VCEP, improving the certainty of interpretations in the public domain. Overall, 28 the 33 pilot variants were not VUS leading to an 85% classification rate. The ClinGen-approved, modified rules demonstrated value for improved interpretation of variants in ATM.

18.
Nucleic Acids Res ; 39(Database issue): D871-5, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21037257

ABSTRACT

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.


Subject(s)
Databases, Genetic , Genome, Human , Gene Expression Regulation , Genomics , Humans , Internet , Software , User-Computer Interface
19.
Nucleic Acids Res ; 39(Database issue): D876-82, 2011 Jan.
Article in English | MEDLINE | ID: mdl-20959295

ABSTRACT

The University of California, Santa Cruz Genome Browser (http://genome.ucsc.edu) offers online access to a database of genomic sequence and annotation data for a wide variety of organisms. The Browser also has many tools for visualizing, comparing and analyzing both publicly available and user-generated genomic data sets, aligning sequences and uploading user data. Among the features released this year are a gene search tool and annotation track drag-reorder functionality as well as support for BAM and BigWig/BigBed file formats. New display enhancements include overlay of multiple wiggle tracks through use of transparent coloring, options for displaying transformed wiggle data, a 'mean+whiskers' windowing function for display of wiggle data at high zoom levels, and more color schemes for microarray data. New data highlights include seven new genome assemblies, a Neandertal genome data portal, phenotype and disease association data, a human RNA editing track, and a zebrafish Conservation track. We also describe updates to existing tracks.


Subject(s)
Databases, Genetic , Genomics , Animals , Disease/genetics , Genes , Genome, Human , Hominidae/genetics , Humans , Internet , Molecular Sequence Annotation , Phenotype , RNA Editing , Software
20.
Proc Natl Acad Sci U S A ; 107(23): 10514-9, 2010 Jun 08.
Article in English | MEDLINE | ID: mdl-20498046

ABSTRACT

Two major goals of regenerative medicine are to reproducibly transform adult somatic cells into a pluripotent state and to control their differentiation into specific cell fates. Progress toward these goals would be greatly helped by obtaining a complete picture of the RNA isoforms produced by these cells due to alternative splicing (AS) and alternative promoter selection (APS). To investigate the roles of AS and APS, reciprocal exon-exon junctions were interrogated on a genome-wide scale in differentiating mouse embryonic stem (ES) cells with a prototype Affymetrix microarray. Using a recently released open-source software package named AltAnalyze, we identified 144 genes for 170 putative isoform variants, the majority (67%) of which were predicted to alter protein sequence and domain composition. Verified alternative exons were largely associated with pathways of Wnt signaling and cell-cycle control, and most were conserved between mouse and human. To examine the functional impact of AS, we characterized isoforms for two genes. As predicted by AltAnalyze, we found that alternative isoforms of the gene Serca2 were targeted by distinct microRNAs (miRNA-200b, miRNA-214), suggesting a critical role for AS in cardiac development. Analysis of the Wnt transcription factor Tcf3, using selective knockdown of an ES cell-enriched and characterized isoform, revealed several distinct targets for transcriptional repression (Stmn2, Ccnd2, Atf3, Klf4, Nodal, and Jun) as well as distinct differentiation outcomes in ES cells. The findings herein illustrate a critical role for AS in the specification of ES cells with differentiation, and highlight the utility of global functional analyses of AS.


Subject(s)
Alternative Splicing , Cell Differentiation , Embryonic Stem Cells/cytology , Embryonic Stem Cells/metabolism , Pluripotent Stem Cells/cytology , Pluripotent Stem Cells/metabolism , Animals , Exons , Female , Gene Expression Profiling , Gene Expression Regulation , Humans , Kruppel-Like Factor 4 , Mice , MicroRNAs/genetics , Promoter Regions, Genetic , Selection, Genetic , Signal Transduction , Transcription, Genetic , Wnt Proteins/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL