Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 36(20): 5000-5006, 2020 12 22.
Article in English | MEDLINE | ID: mdl-32910174

ABSTRACT

MOTIVATION: Long-read sequencing technologies are invaluable for determining complex RNA transcript architectures but are error-prone. Numerous 'hybrid correction' algorithms have been developed for genomic data that correct long reads by exploiting the accuracy and depth of short reads sequenced from the same sample. These algorithms are not suited for correcting more complex transcriptome sequencing data. RESULTS: We have created a novel reference-free algorithm called Transcript-level Aware Long-Read Correction (TALC) which models changes in RNA expression and isoform representation in a weighted De Bruijn graph to correct long reads from transcriptome studies. We show that transcript-level aware correction by TALC improves the accuracy of the whole spectrum of downstream RNA-seq applications and is thus necessary for transcriptome analyses that use long read technology. AVAILABILITY AND IMPLEMENTATION: TALC is implemented in C++ and available at https://github.com/lbroseus/TALC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , Software , Genomics , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA
2.
Commun Biol ; 2: 222, 2019.
Article in English | MEDLINE | ID: mdl-31240260

ABSTRACT

Comparative analysis of high throughput sequencing data between multiple conditions often involves mapping of sequencing reads to a reference and downstream bioinformatics analyses. Both of these steps may introduce heavy bias and potential data loss. This is especially true in studies where patient transcriptomes or genomes may vary from their references, such as in cancer. Here we describe a novel approach and associated software that makes use of advances in genetic algorithms and feature selection to comprehensively explore massive volumes of sequencing data to classify and discover new sequences of interest without a mapping step and without intensive use of specialized bioinformatics pipelines. We demonstrate that our approach called GECKO for GEnetic Classification using k-mer Optimization is effective at classifying and extracting meaningful sequences from multiple types of sequencing approaches including mRNA, microRNA, and DNA methylome data.


Subject(s)
Algorithms , High-Throughput Nucleotide Sequencing/methods , Blood Cells , Breast Neoplasms/classification , Breast Neoplasms/genetics , Computational Biology/methods , DNA Methylation , Humans , MicroRNAs , Mutation , RNA, Messenger , Software
3.
Genome Biol ; 18(1): 51, 2017 03 15.
Article in English | MEDLINE | ID: mdl-28298237

ABSTRACT

Intron retention (IR) occurs when an intron is transcribed into pre-mRNA and remains in the final mRNA. We have developed a program and database called IRFinder to accurately detect IR from mRNA sequencing data. Analysis of 2573 samples showed that IR occurs in all tissues analyzed, affects over 80% of all coding genes and is associated with cell differentiation and the cell cycle. Frequently retained introns are enriched for specific RNA binding protein sites and are often retained in clusters in the same gene. IR is associated with lower protein levels and intron-retaining transcripts that escape nonsense-mediated decay are not actively translated.


Subject(s)
Computational Biology/methods , Gene Expression Regulation , Introns , RNA Splicing , Software , Alternative Splicing , Animals , Binding Sites , Cell Cycle/genetics , Cell Differentiation/genetics , Exons , Humans , Nucleotide Motifs , RNA-Binding Proteins/metabolism
4.
Nat Genet ; 48(11): 1436-1442, 2016 11.
Article in English | MEDLINE | ID: mdl-27643538

ABSTRACT

Polycomb group proteins form two main complexes, PRC2 and PRC1, which generally coregulate their target genes. Here we show that PRC1 components act as neoplastic tumor suppressors independently of PRC2 function. By mapping the distribution of PRC1 components and trimethylation of histone H3 at Lys27 (H3K27me3) across the genome, we identify a large set of genes that acquire PRC1 in the absence of H3K27me3 in Drosophila larval tissues. These genes massively outnumber canonical targets and are mainly involved in the regulation of cell proliferation, signaling and polarity. Alterations in PRC1 components specifically deregulate this set of genes, whereas canonical targets are derepressed in both PRC1 and PRC2 mutants. In human embryonic stem cells, PRC1 components colocalize with H3K27me3 as in Drosophila embryos, whereas in differentiated cell types they are selectively recruited to a large set of proliferation and signaling-associated genes that lack H3K27me3, suggesting that the redeployment of PRC1 components during development is evolutionarily conserved.


Subject(s)
Drosophila Proteins/metabolism , Drosophila/growth & development , Polycomb Repressive Complex 1/metabolism , Polycomb-Group Proteins/metabolism , Tumor Suppressor Proteins/metabolism , Animals , Drosophila/genetics , Drosophila/metabolism , Drosophila Proteins/genetics , Genes, Insect , Histone Demethylases/metabolism , Histone-Lysine N-Methyltransferase/genetics , Histone-Lysine N-Methyltransferase/metabolism , Histones/metabolism , Humans , Larva/genetics , Larva/growth & development , Larva/metabolism , Microtubule-Associated Proteins , Polycomb-Group Proteins/genetics , Tumor Suppressor Proteins/genetics
5.
Cell Rep ; 9(1): 219-233, 2014 Oct 09.
Article in English | MEDLINE | ID: mdl-25284790

ABSTRACT

Metazoan genomes are partitioned into modular chromosomal domains containing active or repressive chromatin. In flies, Polycomb group (PcG) response elements (PREs) recruit PHO and other DNA-binding factors and act as nucleation sites for the formation of Polycomb repressive domains. The sequence specificity of PREs is not well understood. Here, we use comparative epigenomics and transgenic assays to show that Drosophila domain organization and PRE specification are evolutionarily conserved despite significant cis-element divergence within Polycomb domains, whereas cis-element evolution is strongly correlated with transcription factor binding divergence outside of Polycomb domains. Cooperative interactions of PcG complexes and their recruiting factor PHO stabilize PHO recruitment to low-specificity sequences. Consistently, PHO recruitment to sites within Polycomb domains is stabilized by PRC1. These data suggest that cooperative rather than hierarchical interactions among low-affinity sequences, DNA-binding factors, and the Polycomb machinery are giving rise to specific and strongly conserved 3D structures in Drosophila.


Subject(s)
Drosophila Proteins/metabolism , Drosophila/metabolism , Polycomb-Group Proteins/metabolism , Animals , Protein Binding , Transcription Factors/metabolism
6.
Mol Cell ; 54(3): 485-99, 2014 May 08.
Article in English | MEDLINE | ID: mdl-24703951

ABSTRACT

Polycomb group (PcG) proteins dynamically define cellular identities through epigenetic repression of key developmental genes. PcG target gene repression can be stabilized through the interaction in the nucleus at PcG foci. Here, we report the results of a high-resolution microscopy genome-wide RNAi screen that identifies 129 genes that regulate the nuclear organization of Pc foci. Candidate genes include PcG components and chromatin factors, as well as many protein-modifying enzymes, including components of the SUMOylation pathway. In the absence of SUMO, Pc foci coagulate into larger aggregates. Conversely, loss of function of the SUMO peptidase Velo disperses Pc foci. Moreover, SUMO and Velo colocalize with PcG proteins at PREs, and Pc SUMOylation affects its chromatin targeting, suggesting that the dynamic regulation of Pc SUMOylation regulates PcG-mediated silencing by modulating the kinetics of Pc binding to chromatin as well as its ability to form Polycomb foci.


Subject(s)
Drosophila Proteins/metabolism , Drosophila melanogaster/metabolism , Polycomb-Group Proteins/metabolism , Animals , Cell Line , Cell Nucleus/metabolism , Chromatin/metabolism , Cluster Analysis , Drosophila melanogaster/cytology , Drosophila melanogaster/genetics , Gene Knockdown Techniques , Gene Ontology , Imaginal Discs/cytology , Imaginal Discs/metabolism , Phenotype , Protein Binding , Protein Transport , RNA Interference , Sumoylation
7.
Nat Struct Mol Biol ; 17(11): 1391-7, 2010 Nov.
Article in English | MEDLINE | ID: mdl-20972444

ABSTRACT

Maintenance of genome integrity relies on surveillance mechanisms that detect and signal arrested replication forks. Although evidence from budding yeast indicates that the DNA replication checkpoint (DRC) is primarily activated by single-stranded DNA (ssDNA), studies in higher eukaryotes have implicated primer ends in this process. To identify factors that signal primed ssDNA in Saccharomyces cerevisiae, we have screened a collection of checkpoint mutants for their ability to activate the DRC, using the repression of late origins as readout for checkpoint activity. This quantitative analysis reveals that neither RFC(Rad24) and the 9-1-1 clamp nor the alternative clamp loader RFC(Elg1) is required to signal paused forks. In contrast, we found that RFC(Ctf18) is essential for the Mrc1-dependent activation of Rad53 and for the maintenance of paused forks. These data identify RFC(Ctf18) as a key DRC mediator, potentially bridging Mrc1 and primed ssDNA to signal paused forks.


Subject(s)
DNA Replication/physiology , Saccharomyces cerevisiae Proteins/physiology , Saccharomyces cerevisiae/genetics , Carrier Proteins/metabolism , Carrier Proteins/physiology , Cell Cycle Proteins/metabolism , Cell Cycle Proteins/physiology , DNA Damage , DNA Replication/drug effects , DNA, Single-Stranded/physiology , Hydroxyurea/pharmacology , Intracellular Signaling Peptides and Proteins/metabolism , Intracellular Signaling Peptides and Proteins/physiology , Saccharomyces cerevisiae/drug effects , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism , Stress, Physiological
8.
Nat Cell Biol ; 11(11): 1315-24, 2009 Nov.
Article in English | MEDLINE | ID: mdl-19838172

ABSTRACT

Topoisomerase I (Top1) is a key enzyme in functioning at the interface between DNA replication, transcription and mRNA maturation. Here, we show that Top1 suppresses genomic instability in mammalian cells by preventing a conflict between transcription and DNA replication. Using DNA combing and ChIP (chromatin immunoprecipitation)-on-chip, we found that Top1-deficient cells accumulate stalled replication forks and chromosome breaks in S phase, and that breaks occur preferentially at gene-rich regions of the genome. Notably, these phenotypes were suppressed by preventing the formation of RNA-DNA hybrids (R-loops) during transcription. Moreover, these defects could be mimicked by depletion of the splicing factor ASF/SF2 (alternative splicing factor/splicing factor 2), which interacts functionally with Top1. Taken together, these data indicate that Top1 prevents replication fork collapse by suppressing the formation of R-loops in an ASF/SF2-dependent manner. We propose that interference between replication and transcription represents a major source of spontaneous replication stress, which could drive genomic instability during the early stages of tumorigenesis.


Subject(s)
DNA Replication/physiology , DNA Topoisomerases, Type I/physiology , Genomic Instability/physiology , Transcription, Genetic , Animals , Chromatin Immunoprecipitation , S Phase
SELECTION OF CITATIONS
SEARCH DETAIL
...