Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 60
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 151(3): 533-46, 2012 Oct 26.
Artículo en Inglés | MEDLINE | ID: mdl-23063653

RESUMEN

Drosophila Dicer-1 produces microRNAs (miRNAs) from pre-miRNA, whereas Dicer-2 generates small interfering RNAs (siRNAs) from long dsRNA. Alternative splicing of the loquacious (loqs) mRNA generates three distinct Dicer partner proteins. To understand the function of each, we constructed flies expressing Loqs-PA, Loqs-PB, or Loqs-PD. Loqs-PD promotes both endo- and exo-siRNA production by Dicer-2. Loqs-PA or Loqs-PB is required for viability, but the proteins are not fully redundant: a specific subset of miRNAs requires Loqs-PB. Surprisingly, Loqs-PB tunes where Dicer-1 cleaves pre-miR-307a, generating a longer miRNA isoform with a distinct seed sequence and target specificity. The longer form of miR-307a represses glycerol kinase and taranis mRNA expression. The mammalian Dicer-partner TRBP, a Loqs-PB homolog, similarly tunes where Dicer cleaves pre-miR-132. Thus, Dicer-binding partner proteins change the choice of cleavage site by Dicer, producing miRNAs with target specificities different from those made by Dicer alone or Dicer bound to alternative protein partners.


Asunto(s)
ARN Helicasas DEAD-box/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/metabolismo , ARN Helicasas/metabolismo , Proteínas de Unión al ARN/metabolismo , Ribonucleasa III/metabolismo , Animales , Secuencia de Bases , Drosophila melanogaster/genética , Femenino , Humanos , Masculino , Ratones , MicroARNs/metabolismo , Datos de Secuencia Molecular
2.
Cell ; 147(7): 1498-510, 2011 Dec 23.
Artículo en Inglés | MEDLINE | ID: mdl-22196727

RESUMEN

Numerous chromatin regulators are required for embryonic stem (ES) cell self-renewal and pluripotency, but few have been studied in detail. Here, we examine the roles of several chromatin regulators whose loss affects the pluripotent state of ES cells. We find that Mbd3 and Brg1 antagonistically regulate a common set of genes by regulating promoter nucleosome occupancy. Furthermore, both Mbd3 and Brg1 play key roles in the biology of 5-hydroxymethylcytosine (5hmC): Mbd3 colocalizes with Tet1 and 5hmC in vivo, Mbd3 knockdown preferentially affects expression of 5hmC-marked genes, Mbd3 localization is Tet1-dependent, and Mbd3 preferentially binds to 5hmC relative to 5-methylcytosine in vitro. Finally, both Mbd3 and Brg1 are themselves required for normal levels of 5hmC in vivo. Together, our results identify an effector for 5hmC, and reveal that control of gene expression by antagonistic chromatin regulators is a surprisingly common regulatory strategy in ES cells.


Asunto(s)
Citosina/análogos & derivados , Proteínas de Unión al ADN/metabolismo , Células Madre Embrionarias/metabolismo , Complejo Desacetilasa y Remodelación del Nucleosoma Mi-2/metabolismo , Factores de Transcripción/metabolismo , 5-Metilcitosina/análogos & derivados , Animales , Ensamble y Desensamble de Cromatina , Citosina/metabolismo , ADN Helicasas/metabolismo , Proteínas de Unión al ADN/genética , Técnicas de Silenciamiento del Gen , Humanos , Ratones , Proteínas Nucleares/metabolismo , Proteínas Proto-Oncogénicas/genética , Proteínas Proto-Oncogénicas/metabolismo , ARN Polimerasa II/metabolismo
3.
Bioinformatics ; 2024 Jun 19.
Artículo en Inglés | MEDLINE | ID: mdl-38897667

RESUMEN

MOTIVATION: The Full-text index in Minute space (FM-index) is a memory-efficient data structure widely used in bioinformatics for solving the fundamental pattern-matching task of searching for short patterns within a long reference. With the demand for short query patterns, the k-ordered concept has been proposed for FM-indexes. However, few construction algorithms in the state of the art fully exploit this idea to achieve significant speedups in the pan-genome era. RESULTS: We introduce the k-ordered Induced Suffix Sorting (kISS) for efficient construction and utilization of k-ordered FM-indexes. We present an algorithmic workflow for building k-ordered suffix arrays, incorporating two novel strategies to improve time and memory efficiency. We also demonstrate the compatibility of integrating k-ordered FM-indexes with locate operations in FMtree. Experiments show that kISS can improve the construction time, and the generated k-ordered suffix array can also be applied to FMtree without any additional in computation or memory usage. AVAILABILITY: https://github.com/jhhung/kISS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

4.
Anal Chem ; 96(12): 4835-4844, 2024 Mar 26.
Artículo en Inglés | MEDLINE | ID: mdl-38488022

RESUMEN

The rapid proliferation of new psychoactive substances (NPS) poses significant challenges to conventional mass-spectrometry-based identification methods due to the absence of reference spectra for these emerging substances. This paper introduces PS2MS, an AI-powered predictive system designed specifically to address the limitations of identifying the emergence of unidentified novel illicit drugs. PS2MS builds a synthetic NPS database by enumerating feasible derivatives of known substances and uses deep learning to generate mass spectra and chemical fingerprints. When the mass spectrum of an analyte does not match any known reference, PS2MS simultaneously examines the chemical fingerprint and mass spectrum against the putative NPS database using integrated metrics to deduce possible identities. Experimental results affirm the effectiveness of PS2MS in identifying cathinone derivatives within real evidence specimens, signifying its potential for practical use in identifying emerging drugs of abuse for researchers and forensic experts.


Asunto(s)
Aprendizaje Profundo , Drogas Ilícitas , Cromatografía Liquida/métodos , Psicotrópicos/análisis , Espectrometría de Masas/métodos , Drogas Ilícitas/análisis , Detección de Abuso de Sustancias/métodos
5.
Mol Cell ; 59(2): 203-16, 2015 Jul 16.
Artículo en Inglés | MEDLINE | ID: mdl-26145176

RESUMEN

Uridylation of RNA species represents an emerging theme in post-transcriptional gene regulation. In the microRNA pathway, such modifications regulate small RNA biogenesis and stability in plants, worms, and mammals. Here, we report Tailor, an uridylyltransferase that is required for the majority of 3' end modifications of microRNAs in Drosophila and predominantly targets precursor hairpins. Uridylation modulates the characteristic two-nucleotide 3' overhang of microRNA hairpins, which regulates processing by Dicer-1 and destabilizes RNA hairpins. Tailor preferentially uridylates mirtron hairpins, thereby impeding the production of non-canonical microRNAs. Mirtron selectivity is explained by primary sequence specificity of Tailor, selecting substrates ending with a 3' guanosine. In contrast to mirtrons, conserved Drosophila precursor microRNAs are significantly depleted in 3' guanosine, thereby escaping regulatory uridylation. Our data support the hypothesis that evolutionary adaptation to Tailor-directed uridylation shapes the nucleotide composition of precursor microRNA 3' ends. Hence, hairpin uridylation may serve as a barrier for the de novo creation of microRNAs in Drosophila.


Asunto(s)
Proteínas de Drosophila/metabolismo , Drosophila melanogaster/metabolismo , MicroARNs/química , MicroARNs/metabolismo , ARN Nucleotidiltransferasas/metabolismo , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Línea Celular , Proteínas de Drosophila/antagonistas & inhibidores , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Drosophila melanogaster/fisiología , Femenino , Fertilidad/genética , Fertilidad/fisiología , Técnicas de Silenciamiento del Gen , Genes de Insecto , Masculino , MicroARNs/genética , Datos de Secuencia Molecular , Mutación , Conformación de Ácido Nucleico , ARN Nucleotidiltransferasas/antagonistas & inhibidores , ARN Nucleotidiltransferasas/genética , Procesamiento Postranscripcional del ARN , Estabilidad del ARN , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN Interferente Pequeño/genética , Especificidad por Sustrato
6.
Bioinformatics ; 37(13): 1846-1852, 2021 Jul 27.
Artículo en Inglés | MEDLINE | ID: mdl-33459339

RESUMEN

MOTIVATION: Cross-sample comparisons or large-scale meta-analyses based on the next generation sequencing (NGS) involve replicable and universal data preprocessing, including removing adapter fragments in contaminated reads (i.e. adapter trimming). While modern adapter trimmers require users to provide candidate adapter sequences for each sample, which are sometimes unavailable or falsely documented in the repositories (such as GEO or SRA), large-scale meta-analyses are therefore jeopardized by suboptimal adapter trimming. RESULTS: Here we introduce a set of fast and accurate adapter detection and trimming algorithms that entail no a priori adapter sequences. These algorithms were implemented in modern C++ with SIMD and multithreading to accelerate its speed. Our experiments and benchmarks show that the implementation (i.e. EARRINGS), without being given any hint of adapter sequences, can reach comparable accuracy and higher throughput than that of existing adapter trimmers. EARRINGS is particularly useful in meta-analyses of a large batch of datasets and can be incorporated in any sequence analysis pipelines in all scales. AVAILABILITY AND IMPLEMENTATION: EARRINGS is open-source software and is available at https://github.com/jhhung/EARRINGS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

7.
Rapid Commun Mass Spectrom ; 36(18): e9351, 2022 Sep 30.
Artículo en Inglés | MEDLINE | ID: mdl-35802517

RESUMEN

RATIONALE: Thermogravimetry (TG) combined with electrospray and atmospheric chemical ionization (ESI+APCI) mass spectrometry (MS) was developed to rapidly characterize thermal decomposition products of synthetic polymers and plastic products. The ESI-based TG-MS method is useful for characterizing thermally labile, nonvolatile, and polar compounds over an extensive mass range; and the APCI-based TG-MS counterpart is useful for characterizing volatile and nonpolar compounds. Both polar and nonpolar compounds can be simultaneously detected by ESI+APCI-based TG-MS. METHODS: Analytes with different volatility were produced from TG operated at different temperatures, which were delivered through a heated stainless-steel tube to the ESI+APCI source where they reacted with the primary charged species generated from electrospray and atmospheric pressure chemical ionization (ESI+APCI) of solvent and nitrogen. The analyte ions were then detected by an ion trap mass spectrometer. RESULTS: A semi-volatile PEG 600 standard was used as the sample and protonated and sodiated molecular ions together with adduct ions including [(PEG)n + 15]+ , [(PEG)n + 18]+ , and [(PEG)n + 29]+ were detected by TG-ESI+APCI-MS. The technique was further utilized to characterize thermal decomposition products of nonvolatile polypropylene glycol (PPG) and polystyrene (PS) standards, as well as a PS-made water cup and coffee cup lid. The characteristic fragments of PPG and PS with mass differences of 58 and 104 between respective ion peaks were detected at the maximum decomposition temperature (Tmax ). CONCLUSIONS: The information obtained from the TG-ESI+APCI-MS analysis is useful in rapidly distinguishing different types of polymers and their products. In addition, the signals of the additives in the polymer products, including antioxidants and plasticizers, were also detected before the TG temperature reached Tmax .


Asunto(s)
Presión Atmosférica , Espectrometría de Masa por Ionización de Electrospray , Polímeros , Solventes , Espectrometría de Masa por Ionización de Electrospray/métodos , Termogravimetría
9.
Nucleic Acids Res ; 47(19): 10115-10133, 2019 11 04.
Artículo en Inglés | MEDLINE | ID: mdl-31555818

RESUMEN

Pluripotency and cell fates can be modulated through the regulation of super-enhancers; however, the underlying mechanisms are unclear. Here, we showed a novel mechanism in which Ash2l directly binds to super-enhancers of several stemness genes to regulate pluripotency and self-renewal in pluripotent stem cells. Ash2l recruits Oct4/Sox2/Nanog (OSN) to form Ash2l/OSN complex at the super-enhancers of Jarid2, Nanog, Sox2 and Oct4, and further drives enhancer activation, upregulation of stemness genes, and maintains the pluripotent circuitry. Ash2l knockdown abrogates the OSN recruitment to all super-enhancers and further hinders the enhancer activation. In addition, CRISPRi/dCas9-mediated blocking of Ash2l-binding motifs at these super-enhancers also prevents OSN recruitment and enhancer activation, validating that Ash2l directly binds to super-enhancers and initiates the pluripotency network. Transfection of Ash2l with W118A mutation to disrupt Ash2l-Oct4 interaction fails to rescue Ash2l-driven enhancer activation and pluripotent gene upregulation in Ash2l-depleted pluripotent stem cells. Together, our data demonstrated Ash2l formed an enhancer-bound Ash2l/OSN complex that can drive enhancer activation, govern pluripotency network and stemness circuitry.


Asunto(s)
Proteínas de Unión al ADN/genética , Elementos de Facilitación Genéticos , Células Madre Embrionarias de Ratones/metabolismo , Factor 3 de Transcripción de Unión a Octámeros/genética , Factores de Transcripción/genética , Animales , Sistemas CRISPR-Cas/genética , Diferenciación Celular/genética , Linaje de la Célula/genética , Autorrenovación de las Células/genética , Reprogramación Celular/genética , Elementos de Facilitación Genéticos/genética , Regulación del Desarrollo de la Expresión Génica/genética , Humanos , Ratones , Mutación/genética , Proteína Homeótica Nanog/genética , Células Madre Pluripotentes/metabolismo , Factores de Transcripción SOXB1/genética , Transfección
10.
Int J Mol Sci ; 21(5)2020 Mar 10.
Artículo en Inglés | MEDLINE | ID: mdl-32164275

RESUMEN

Traumatic brain injury is known to reprogram the epigenome. Chromatin immunoprecipitation-sequencing of histone H3 lysine 27 acetylation (H3K27ac) and tri-methylation of histone H3 at lysine 4 (H3K4me3) marks was performed to address the transcriptional regulation of candidate regeneration-associated genes. In this study, we identify a novel enhancer region for induced WNT3A transcription during regeneration of injured cortical neurons. We further demonstrated an increased mono-methylation of histone H3 at lysine 4 (H3K4me1) modification at this enhancer concomitant with a topological interaction between sub-regions of this enhancer and with promoter of WNT3A gene. Together, this study reports a novel mechanism for WNT3A gene transcription and reveals a potential therapeutic intervention for neuronal regeneration.


Asunto(s)
Lesiones Traumáticas del Encéfalo/genética , Histonas/metabolismo , Neuronas/fisiología , Proteína Wnt3A/genética , Acetilación , Animales , Lesiones Traumáticas del Encéfalo/metabolismo , Inmunoprecipitación de Cromatina , Modelos Animales de Enfermedad , Elementos de Facilitación Genéticos , Epigénesis Genética , Metilación , Neuronas/metabolismo , Regiones Promotoras Genéticas , Ratas , Ratas Sprague-Dawley , Regeneración
11.
Bioinformatics ; 32(22): 3498-3500, 2016 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-27412087

RESUMEN

MOTIVATION: The Full-text index in Minute space (FM-index) derived from the Burrows-Wheeler transform (BWT) is broadly used for fast string matching in large genomes or a huge set of sequencing reads. Several graphic processing unit (GPU) accelerated aligners based on the FM-index have been proposed recently; however, the construction of the index is still handled by central processing unit (CPU), only parallelized in data level (e.g. by performing blockwise suffix sorting in GPU), or not scalable for large genomes. RESULTS: To fulfill the need for a more practical, hardware-parallelizable indexing and matching approach, we herein propose sBWT based on a BWT variant (i.e. Schindler transform) that can be built with highly simplified hardware-acceleration-friendly algorithms and still suffices accurate and fast string matching in repetitive references. In our tests, the implementation achieves significant speedups in indexing and searching compared with other BWT-based tools and can be applied to a variety of domains. AVAILABILITY AND IMPLEMENTATION: sBWT is implemented in C ++ with CPU-only and GPU-accelerated versions. sBWT is open-source software and is available at http://jhhung.github.io/sBWT/Supplementary information: Supplementary data are available at Bioinformatics online. CONTACT: chyee@ntu.edu.tw or jhhung@nctu.edu.tw (also juihunghung@gmail.com).


Asunto(s)
Algoritmos , Programas Informáticos , Indización y Redacción de Resúmenes , Genoma , Humanos
12.
Nucleic Acids Res ; 43(17): e109, 2015 Sep 30.
Artículo en Inglés | MEDLINE | ID: mdl-26007652

RESUMEN

Small silencing RNAs, including microRNAs, endogenous small interfering RNAs (endo-siRNAs) and Piwi-interacting RNAs (piRNAs), have been shown to play important roles in fine-tuning gene expression, defending virus and controlling transposons. Loss of small silencing RNAs or components in their pathways often leads to severe developmental defects, including lethality and sterility. Recently, non-templated addition of nucleotides to the 3' end, namely tailing, was found to associate with the processing and stability of small silencing RNAs. Next Generation Sequencing has made it possible to detect such modifications at nucleotide resolution in an unprecedented throughput. Unfortunately, detecting such events from millions of short reads confounded by sequencing errors and RNA editing is still a tricky problem. Here, we developed a computational framework, Tailor, driven by an efficient and accurate aligner specifically designed for capturing the tailing events directly from the alignments without extensive post-processing. The performance of Tailor was fully tested and compared favorably with other general-purpose aligners using both simulated and real datasets for tailing analysis. Moreover, to show the broad utility of Tailor, we used Tailor to reanalyze published datasets and revealed novel findings worth further experimental validation. The source code and the executable binaries are freely available at https://github.com/jhhung/Tailor.


Asunto(s)
Algoritmos , MicroARNs/química , ARN Interferente Pequeño/química , Alineación de Secuencia/métodos , Análisis de Secuencia de ARN/métodos , Animales , Arabidopsis/genética , Drosophila melanogaster/genética , Células HeLa , Humanos , Programas Informáticos , Pez Cebra/genética
13.
PLoS Genet ; 10(8): e1004515, 2014 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-25102063

RESUMEN

Dynamic exchange of a subset of nucleosomes in vivo plays important roles in epigenetic inheritance of chromatin states, chromatin insulator function, chromosome folding, and the maintenance of the pluripotent state of embryonic stem cells. Here, we extend a pulse-chase strategy for carrying out genome-wide measurements of histone dynamics to several histone variants in murine embryonic stem cells and somatic tissues, recapitulating expected characteristics of the well characterized H3.3 histone variant. We extended this system to the less-studied MacroH2A2 variant, commonly described as a "repressive" histone variant whose accumulation in chromatin is thought to fix the epigenetic state of differentiated cells. Unexpectedly, we found that while large intergenic blocks of MacroH2A2 were stably associated with the genome, promoter-associated peaks of MacroH2A2 exhibited relatively rapid exchange dynamics in ES cells, particularly at highly-transcribed genes. Upon differentiation to embryonic fibroblasts, MacroH2A2 was gained primarily in additional long, stably associated blocks across gene-poor regions, while overall turnover at promoters was greatly dampened. Our results reveal unanticipated dynamic behavior of the MacroH2A2 variant in pluripotent cells, and provide a resource for future studies of tissue-specific histone dynamics in vivo.


Asunto(s)
Cromatina/genética , Células Madre Embrionarias/metabolismo , Epigenómica , Histonas/genética , Animales , Islas de CpG/genética , Células Madre Embrionarias/citología , Genoma , Histonas/metabolismo , Ratones , Nucleosomas/genética , Nucleosomas/metabolismo , Regiones Promotoras Genéticas
14.
Opt Lett ; 41(17): 4028-31, 2016 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-27607964

RESUMEN

We have generated optical pulses of 1.2 MW peak power and 0.6 ps duration using a 1060 nm band gain-switched laser diode pulse oscillator. Optical pulses are amplified by three-stage ytterbium-doped fiber amplifiers, and remarkable reductions of amplified spontaneous emission noise and temporal duration have been accomplished based on self-phase modulation in the middle-stage amplifier. After the main amplifier, optical pulses were temporally compressed by a grating pair, and this enabled generation of subpicosecond optical pulses with over 1 MW peak power.

15.
BMC Bioinformatics ; 16 Suppl 1: S2, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25707528

RESUMEN

BACKGROUND: In modern paired-end sequencing protocols short DNA fragments lead to adapter-appended reads. Current paired-end adapter removal approaches trim adapter by scanning the fragment of adapter on the 3' end of the reads, which are not competent in some applications. RESULTS: Here, we propose a fast and highly accurate adapter-trimming algorithm, PEAT, designed specifically for paired-end sequencing. PEAT requires no a priori adaptor sequence, which is convenient for large-scale meta-analyses. We assessed the performance of PEAT with many adapter trimmers in both simulated and real life paired-end sequencing libraries. The importance of adapter trimming was exemplified by the influence of the downstream analyses on RNA-seq, ChIP-seq and MNase-seq. Several useful guidelines of applying adapter trimmers with aligners were suggested. CONCLUSIONS: PEAT can be easily included in the routine paired-end sequencing pipeline. The executable binaries and the standalone C++ source code package of PEAT are freely available online.


Asunto(s)
Algoritmos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Inmunoprecipitación de Cromatina , Análisis de Secuencia de ADN , Análisis de Secuencia de ARN , Programas Informáticos , Factores de Tiempo
16.
Nat Methods ; 9(4): 403-9, 2012 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-22388288

RESUMEN

Understanding the function of individual microRNA (miRNA) species in mice would require the production of hundreds of loss-of-function strains. To accelerate analysis of miRNA biology in mammals, we combined recombinant adeno-associated virus (rAAV) vectors with miRNA 'tough decoys' (TuDs) to inhibit specific miRNAs. Intravenous injection of rAAV9 expressing anti-miR-122 or anti-let-7 TuDs depleted the corresponding miRNA and increased its mRNA targets. rAAV producing anti-miR-122 TuD but not anti-let-7 TuD reduced serum cholesterol by >30% for 25 weeks in wild-type mice. High-throughput sequencing of liver miRNAs from the treated mice confirmed that the targeted miRNAs were depleted and revealed that TuDs induced miRNA tailing and trimming in vivo. rAAV-mediated miRNA inhibition thus provides a simple way to study miRNA function in adult mammals and a potential therapy for dyslipidemia and other diseases caused by miRNA deregulation.


Asunto(s)
Dependovirus/genética , Vectores Genéticos/genética , MicroARNs/antagonistas & inhibidores , MicroARNs/genética , Animales , Secuencia de Bases , Sitios de Unión , Línea Celular , Colesterol/metabolismo , Hígado/metabolismo , Ratones , Ratones Endogámicos C57BL , MicroARNs/metabolismo , Datos de Secuencia Molecular , ARN sin Sentido/genética , ARN sin Sentido/metabolismo , Proteínas Recombinantes/genética
17.
Nucleic Acids Res ; 41(Database issue): D171-6, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23203885

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) consortium aims to identify all functional elements in the human genome including transcripts, transcriptional regulatory regions, along with their chromatin states and DNA methylation patterns. The ENCODE project generates data utilizing a variety of techniques that can enrich for regulatory regions, such as chromatin immunoprecipitation (ChIP), micrococcal nuclease (MNase) digestion and DNase I digestion, followed by deeply sequencing the resulting DNA. As part of the ENCODE project, we have developed a Web-accessible repository accessible at http://factorbook.org. In Wiki format, factorbook is a transcription factor (TF)-centric repository of all ENCODE ChIP-seq datasets on TF-binding regions, as well as the rich analysis results of these data. In the first release, factorbook contains 457 ChIP-seq datasets on 119 TFs in a number of human cell lines, the average profiles of histone modifications and nucleosome positioning around the TF-binding regions, sequence motifs enriched in the regions and the distance and orientation preferences between motif sites.


Asunto(s)
Bases de Datos Genéticas , Elementos Reguladores de la Transcripción , Factores de Transcripción/metabolismo , Sitios de Unión , Línea Celular , Inmunoprecipitación de Cromatina , Secuenciación de Nucleótidos de Alto Rendimiento , Histonas , Humanos , Internet , Nucleosomas/metabolismo , Motivos de Nucleótidos , Análisis de Secuencia de ADN
18.
Genome Res ; 21(2): 203-15, 2011 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-21177969

RESUMEN

Since the initial annotation of miRNAs from cloned short RNAs by the Ambros, Tuschl, and Bartel groups in 2001, more than a hundred studies have sought to identify additional miRNAs in various species. We report here a meta-analysis of short RNA data from Drosophila melanogaster, aggregating published libraries with 76 data sets that we generated for the modENCODE project. In total, we began with more than 1 billion raw reads from 187 libraries comprising diverse developmental stages, specific tissue- and cell-types, mutant conditions, and/or Argonaute immunoprecipitations. We elucidated several features of known miRNA loci, including multiple phased byproducts of cropping and dicing, abundant alternative 5' termini of certain miRNAs, frequent 3' untemplated additions, and potential editing events. We also identified 49 novel genomic locations of miRNA production, and 61 additional candidate loci with limited evidence for miRNA biogenesis. Although these loci broaden the Drosophila miRNA catalog, this work supports the notion that a restricted set of cellular transcripts is competent to be specifically processed by the Drosha/Dicer-1 pathway. Unexpectedly, we detected miRNA production from coding and untranslated regions of mRNAs and found the phenomenon of miRNA production from the antisense strand of known loci to be common. Altogether, this study lays a comprehensive foundation for the study of miRNA diversity and evolution in a complex animal model.


Asunto(s)
Drosophila melanogaster/genética , Regulación de la Expresión Génica , MicroARNs/genética , MicroARNs/metabolismo , Anotación de Secuencia Molecular , Animales , Secuencia de Bases , Línea Celular , Biología Computacional , Drosophila melanogaster/metabolismo , Femenino , Masculino , MicroARNs/química , Edición de ARN/genética , ARN sin Sentido/química , ARN sin Sentido/genética , ARN Mensajero/química , ARN Mensajero/genética , Ribonucleasa III/genética , Ribonucleasa III/metabolismo , Alineación de Secuencia
19.
Brief Bioinform ; 13(3): 281-91, 2012 May.
Artículo en Inglés | MEDLINE | ID: mdl-21900207

RESUMEN

A central goal of biology is understanding and describing the molecular basis of plasticity: the sets of genes that are combinatorially selected by exogenous and endogenous environmental changes, and the relations among the genes. The most viable current approach to this problem consists of determining whether sets of genes are connected by some common theme, e.g. genes from the same pathway are overrepresented among those whose differential expression in response to a perturbation is most pronounced. There are many approaches to this problem, and the results they produce show a fair amount of dispersion, but they all fall within a common framework consisting of a few basic components. We critically review these components, suggest best practices for carrying out each step, and propose a voting method for meeting the challenge of assessing different methods on a large number of experimental data sets in the absence of a gold standard.


Asunto(s)
Biología Computacional/métodos , Algoritmos , Bases de Datos Genéticas , Expresión Génica , Guías como Asunto , Humanos
20.
BMC Genomics ; 14 Suppl 1: S2, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23368412

RESUMEN

BACKGROUND: MicroRNAs (miRNAs) play a critical role in down-regulating gene expression. By coupling with Argonaute family proteins, miRNAs bind to target sites on mRNAs and employ translational repression. A large amount of miRNA-target interactions (MTIs) have been identified by the crosslinking and immunoprecipitation (CLIP) and the photoactivatable-ribonucleoside-enhanced CLIP (PAR-CLIP) along with the next-generation sequencing (NGS). PAR-CLIP shows high efficiency of RNA co-immunoprecipitation, but it also lead to T to C conversion in miRNA-RNA-protein crosslinking regions. This artificial error obviously reduces the mappability of reads. However, a specific tool to analyze CLIP and PAR-CLIP data that takes T to C conversion into account is still in need. RESULTS: We herein propose the first CLIP and PAR-CLIP sequencing analysis platform specifically for miRNA target analysis, namely miRTarCLIP. From scratch, it automatically removes adaptor sequences from raw reads, filters low quality reads, reverts C to T, aligns reads to 3'UTRs, scans for read clusters, identifies high confidence miRNA target sites, and provides annotations from external databases. With multi-threading techniques and our novel C to T reversion procedure, miRTarCLIP greatly reduces the running time comparing to conventional approaches. In addition, miRTarCLIP serves with a web-based interface to provide better user experiences in browsing and searching targets of interested miRNAs. To demonstrate the superior functionality of miRTarCLIP, we applied miRTarCLIP to two public available CLIP and PAR-CLIP sequencing datasets. miRTarCLIP not only shows comparable results to that of other existing tools in a much faster speed, but also reveals interesting features among these putative target sites. Specifically, we used miRTarCLIP to disclose that T to C conversion within position 1-7 and that within position 8-14 of miRNA target sites are significantly different (p value = 0.02), and even more significant when focusing on sites targeted by top 102 highly expressed miRNAs only (p value = 0.01). These results comply with previous findings and further suggest that combining miRNA expression and PAR-CLIP data can improve accuracy of the miRNA target prediction. CONCLUSION: To sum up, we devised a systematic approach for mining miRNA-target sites from CLIP-seq and PAR-CLIP sequencing data, and integrated the workflow with a graphical web-based browser, which provides a user friendly interface and detailed annotations of MTIs. We also showed through real-life examples that miRTarCLIP is a powerful tool for understanding miRNAs. Our integrated tool can be accessed online freely at http://miRTarCLIP.mbc.nctu.edu.tw.


Asunto(s)
Algoritmos , MicroARNs/metabolismo , Programas Informáticos , Regiones no Traducidas 3' , Ensayos Analíticos de Alto Rendimiento , Humanos , Inmunoprecipitación , Internet , ARN/genética , ARN/metabolismo , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA