Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
PLoS Genet ; 5(6): e1000522, 2009 Jun.
Article in English | MEDLINE | ID: mdl-19543368

ABSTRACT

To date, the contribution of disrupted potentially cis-regulatory conserved non-coding sequences (CNCs) to human disease is most likely underestimated, as no systematic screens for putative deleterious variations in CNCs have been conducted. As a model for monogenic disease we studied the involvement of genetic changes of CNCs in the cis-regulatory domain of FOXL2 in blepharophimosis syndrome (BPES). Fifty-seven molecularly unsolved BPES patients underwent high-resolution copy number screening and targeted sequencing of CNCs. Apart from three larger distant deletions, a de novo deletion as small as 7.4 kb was found at 283 kb 5' to FOXL2. The deletion appeared to be triggered by an H-DNA-induced double-stranded break (DSB). In addition, it disrupts a novel long non-coding RNA (ncRNA) PISRT1 and 8 CNCs. The regulatory potential of the deleted CNCs was substantiated by in vitro luciferase assays. Interestingly, Chromosome Conformation Capture (3C) of a 625 kb region surrounding FOXL2 in expressing cellular systems revealed physical interactions of three upstream fragments and the FOXL2 core promoter. Importantly, one of these contains the 7.4 kb deleted fragment. Overall, this study revealed the smallest distant deletion causing monogenic disease and impacts upon the concept of mutation screening in human disease and developmental disorders in particular.


Subject(s)
5' Untranslated Regions , Blepharophimosis/genetics , Forkhead Transcription Factors/genetics , Promoter Regions, Genetic , Regulatory Sequences, Nucleic Acid , Sequence Deletion , Cell Line , Conserved Sequence , DNA Mutational Analysis , Forkhead Box Protein L2 , Humans , Protein Binding
2.
Science ; 324(5926): 522-8, 2009 Apr 24.
Article in English | MEDLINE | ID: mdl-19390049

ABSTRACT

To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.


Subject(s)
Biological Evolution , Genome , Alternative Splicing , Animals , Animals, Domestic , Cattle , Evolution, Molecular , Female , Genetic Variation , Humans , Male , MicroRNAs/genetics , Molecular Sequence Data , Proteins/genetics , Sequence Analysis, DNA , Species Specificity , Synteny
3.
Nat Methods ; 5(7): 629-35, 2008 Jul.
Article in English | MEDLINE | ID: mdl-18500348

ABSTRACT

Rapid amplification of cDNA ends (RACE) is a widely used approach for transcript identification. Random clone selection from the RACE mixture, however, is an ineffective sampling strategy if the dynamic range of transcript abundances is large. To improve sampling efficiency of human transcripts, we hybridized the products of the RACE reaction onto tiling arrays and used the detected exons to delineate a series of reverse-transcriptase (RT)-PCRs, through which the original RACE transcript population was segregated into simpler transcript populations. We independently cloned the products and sequenced randomly selected clones. This approach, RACEarray, is superior to direct cloning and sequencing of RACE products because it specifically targets new transcripts and often results in overall normalization of transcript abundance. We show theoretically and experimentally that this strategy leads indeed to efficient sampling of new transcripts, and we investigated multiplexing the strategy by pooling RACE reactions from multiple interrogated loci before hybridization.


Subject(s)
DNA, Complementary/genetics , Gene Expression Profiling/methods , Gene Library , Nucleic Acid Amplification Techniques/methods , RNA/genetics , Alternative Splicing , Chromosomes, Human, Pair 21/genetics , Chromosomes, Human, Pair 22/genetics , Cloning, Molecular , Exons , Genome, Human , Humans , Molecular Sequence Data , Oligonucleotide Array Sequence Analysis/methods , Protein Isoforms/genetics , Reverse Transcriptase Polymerase Chain Reaction , Transcription, Genetic
4.
Genome Res ; 17(6): 746-59, 2007 Jun.
Article in English | MEDLINE | ID: mdl-17567994

ABSTRACT

This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.


Subject(s)
Chromosome Mapping , Exons , Genome, Human , Promoter Regions, Genetic , Quantitative Trait Loci , Transcription, Genetic/physiology , DNA, Complementary/genetics , Human Genome Project , Humans , Open Reading Frames
5.
Genome Res ; 17(6): 852-64, 2007 Jun.
Article in English | MEDLINE | ID: mdl-17568003

ABSTRACT

Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).


Subject(s)
3' Untranslated Regions/genetics , GC Rich Sequence , Genome, Human , Quantitative Trait Loci , RNA, Untranslated/genetics , Transcription, Genetic , Base Sequence , Humans , Molecular Sequence Data , RNA, Messenger/genetics , Reverse Transcriptase Polymerase Chain Reaction
6.
Am J Hum Genet ; 79(2): 332-41, 2006 Aug.
Article in English | MEDLINE | ID: mdl-16826523

ABSTRACT

Genomic imbalance is a common cause of phenotypic abnormalities. We measured the relative expression level of genes that map within the microdeletion that causes Williams-Beuren syndrome and within its flanking regions. We found, unexpectedly, that not only hemizygous genes but also normal-copy neighboring genes show decreased relative levels of expression. Our results suggest that not only the aneuploid genes but also the flanking genes that map several megabases away from a genomic rearrangement should be considered possible contributors to the phenotypic variation in genomic disorders.


Subject(s)
Chromosomes, Human, Pair 7/genetics , Gene Deletion , Gene Expression , Williams Syndrome/genetics , Cell Line, Transformed , Cells, Cultured , Humans
7.
BMC Bioinformatics ; 6: 131, 2005 May 30.
Article in English | MEDLINE | ID: mdl-15924626

ABSTRACT

BACKGROUND: Despite the continuous production of genome sequence for a number of organisms, reliable, comprehensive, and cost effective gene prediction remains problematic. This is particularly true for genomes for which there is not a large collection of known gene sequences, such as the recently published chicken genome. We used the chicken sequence to test comparative and homology-based gene-finding methods followed by experimental validation as an effective genome annotation method. RESULTS: We performed experimental evaluation by RT-PCR of three different computational gene finders, Ensembl, SGP2 and TWINSCAN, applied to the chicken genome. A Venn diagram was computed and each component of it was evaluated. The results showed that de novo comparative methods can identify up to about 700 chicken genes with no previous evidence of expression, and can correctly extend about 40% of homology-based predictions at the 5' end. CONCLUSIONS: De novo comparative gene prediction followed by experimental verification is effective at enhancing the annotation of the newly sequenced genomes provided by standard homology-based methods.


Subject(s)
Computational Biology/methods , Genome , Animals , Chickens , Chromosome Mapping , DNA Primers , DNA, Complementary/metabolism , Data Interpretation, Statistical , Databases, Genetic , Exons , Expressed Sequence Tags , Gene Expression Profiling , Introns , Models, Statistical , Reverse Transcriptase Polymerase Chain Reaction , Sequence Alignment , Sequence Analysis, DNA , Software , Tissue Distribution
8.
Nucleic Acids Res ; 33(6): 1935-9, 2005.
Article in English | MEDLINE | ID: mdl-15809229

ABSTRACT

The recent availability of the chicken genome sequence poses the question of whether there are human protein-coding genes conserved in chicken that are currently not included in the human gene catalog. Here, we show, using comparative gene finding followed by experimental verification of exon pairs by RT-PCR, that the addition to the multi-exonic subset of this catalog could be as little as 0.2%, suggesting that we may be closing in on the human gene set. Our protocol, however, has two shortcomings: (i) the bioinformatic screening of the predicted genes, applied to filter out false positives, cannot handle intronless genes; and (ii) the experimental verification could fail to identify expression at a specific developmental time. This highlights the importance of developing methods that could provide a reliable estimate of the number of these two types of genes.


Subject(s)
Chickens/genetics , Computational Biology , Exons , Genome, Human , Genomics , Animals , Computational Biology/methods , Gene Expression , Genomics/methods , Humans , Reverse Transcriptase Polymerase Chain Reaction
SELECTION OF CITATIONS
SEARCH DETAIL
...