Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
2.
Nat Biotechnol ; 2023 Sep 07.
Artículo en Inglés | MEDLINE | ID: mdl-37679545

RESUMEN

Certified RNA reference materials are indispensable for assessing the reliability of RNA sequencing to detect intrinsically small biological differences in clinical settings, such as molecular subtyping of diseases. As part of the Quartet Project for quality control and data integration of multi-omics profiling, we established four RNA reference materials derived from immortalized B-lymphoblastoid cell lines from four members of a monozygotic twin family. Additionally, we constructed ratio-based transcriptome-wide reference datasets between two samples, providing cross-platform and cross-laboratory 'ground truth'. Investigation of the intrinsically subtle biological differences among the Quartet samples enables sensitive assessment of cross-batch integration of transcriptomic measurements at the ratio level. The Quartet RNA reference materials, combined with the ratio-based reference datasets, can serve as unique resources for assessing and improving the quality of transcriptomic data in clinical and biological settings.

3.
bioRxiv ; 2020 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-32511352

RESUMEN

The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has caused thousands of deaths worldwide, including >18,000 in New York City (NYC) alone. The sudden emergence of this pandemic has highlighted a pressing clinical need for rapid, scalable diagnostics that can detect infection, interrogate strain evolution, and identify novel patient biomarkers. To address these challenges, we designed a fast (30-minute) colorimetric test (LAMP) for SARS-CoV-2 infection from naso/oropharyngeal swabs, plus a large-scale shotgun metatranscriptomics platform (total-RNA-seq) for host, bacterial, and viral profiling. We applied both technologies across 857 SARS-CoV-2 clinical specimens and 86 NYC subway samples, providing a broad molecular portrait of the COVID-19 NYC outbreak. Our results define new features of SARS-CoV-2 evolution, nominate a novel, NYC-enriched viral subclade, reveal specific host responses in interferon, ACE, hematological, and olfaction pathways, and examine risks associated with use of ACE inhibitors and angiotensin receptor blockers. Together, these findings have immediate applications to SARS-CoV-2 diagnostics, public health, and new therapeutic targets.

4.
J Pain ; 21(9-10): 988-1004, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-31931229

RESUMEN

Understanding molecular alterations associated with peripheral inflammation is a critical factor in selectively controlling acute and persistent pain. The present report employs in situ hybridization of the 2 opioid precursor mRNAs coupled with quantitative measurements of 2 peptides derived from the prodynorphin and proenkephalin precursor proteins: dynorphin A 1-8 and [Met5]-enkephalin-Arg6-Gly7-Leu8. In dorsal spinal cord ipsilateral to the inflammation, dynorphin A 1-8 was elevated after inflammation, and persisted as long as the inflammation was sustained. Qualitative identification by high performance liquid chromatography and gel permeation chromatography revealed the major immunoreactive species in control and inflamed extracts to be dynorphin A 1-8. In situ hybridization in spinal cord after administration of the inflammatory agent, carrageenan, showed increased expression of prodynorphin (Pdyn) mRNA somatotopically in medial superficial dorsal horn neurons. The fold increase in preproenkephalin mRNA (Penk) was comparatively lower, although the basal expression is substantially higher than Pdyn. While Pdyn is not expressed in the dorsal root ganglion (DRG) in basal conditions, it can be induced by nerve injury, but not by inflammation alone. A bioinformatic meta-analysis of multiple nerve injury datasets confirmed Pdyn upregulation in DRG across different nerve injury models. These data support the idea that activation of endogenous opioids, notably dynorphin, is a dynamic indicator of persistent pain states in spinal cord and of nerve injury in DRG. PERSPECTIVE: This is a systematic, quantitative assessment of dynorphin and enkephalin peptides and mRNA in dorsal spinal cord and DRG neurons in response to peripheral inflammation and axotomy. These studies form the foundational framework for understanding how endogenous spinal opioid peptides are involved in nociceptive circuit modulation.


Asunto(s)
Dinorfinas/metabolismo , Encefalinas/metabolismo , Ganglios Espinales/metabolismo , Hiperalgesia/metabolismo , Mediadores de Inflamación/metabolismo , Médula Espinal/metabolismo , Animales , Dinorfinas/análisis , Encefalinas/análisis , Ganglios Espinales/química , Mediadores de Inflamación/análisis , Masculino , Péptidos Opioides/análisis , Péptidos Opioides/metabolismo , Fragmentos de Péptidos/análisis , Fragmentos de Péptidos/metabolismo , ARN Mensajero/análisis , ARN Mensajero/metabolismo , Ratas , Ratas Sprague-Dawley , Médula Espinal/química
5.
Front Cell Dev Biol ; 7: 299, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31824949

RESUMEN

Secreted proteins (SPs) play important roles in diverse important biological processes; however, a comprehensive and high-quality list of human SPs is still lacking. Here we identified 6,943 high-confidence human SPs (3,522 of them are novel) based on 330,427 human proteins derived from databases of UniProt, Ensembl, AceView, and RefSeq. Notably, 6,267 of 6,943 (90.3%) SPs have the supporting evidences from a large amount of mass spectrometry (MS) and RNA-seq data. We found that the SPs were broadly expressed in diverse tissues as well as human body fluid, and a significant portion of them exhibited tissue-specific expression. Moreover, 14 cancer-specific SPs that their expression levels were significantly associated with the patients' survival of eight different tumors were identified, which could be potential prognostic biomarkers. Strikingly, 89.21% of 6,943 SPs (2,927 novel SPs) contain known protein domains. Those novel SPs we mainly enriched with the known domains regarding immunity, such as Immunoglobulin V-set and C1-set domain. Specifically, we constructed a user-friendly and freely accessible database, SPRomeDB (www.unimd.org/SPRomeDB), to catalog those SPs. Our comprehensive SP identification and characterization gain insights into human secretome and provide valuable resource for future researches.

6.
BMC Bioinformatics ; 20(1): 405, 2019 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-31345161

RESUMEN

BACKGROUND: Next-generation sequencing technologies can produce tens of millions of reads, often paired-end, from transcripts or genomes. But few programs can align RNA on the genome and accurately discover introns, especially with long reads. We introduce Magic-BLAST, a new aligner based on ideas from the Magic pipeline. RESULTS: Magic-BLAST uses innovative techniques that include the optimization of a spliced alignment score and selective masking during seed selection. We evaluate the performance of Magic-BLAST to accurately map short or long sequences and its ability to discover introns on real RNA-seq data sets from PacBio, Roche and Illumina runs, and on six benchmarks, and compare it to other popular aligners. Additionally, we look at alignments of human idealized RefSeq mRNA sequences perfectly matching the genome. CONCLUSIONS: We show that Magic-BLAST is the best at intron discovery over a wide range of conditions and the best at mapping reads longer than 250 bases, from any platform. It is versatile and robust to high levels of mismatches or extreme base composition, and reasonably fast. It can align reads to a BLAST database or a FASTA file. It can accept a FASTQ file as input or automatically retrieve an accession from the SRA repository at the NCBI.


Asunto(s)
ARN/genética , Alineación de Secuencia , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Algoritmos , Secuencia de Bases , Bases de Datos de Ácidos Nucleicos , Humanos , Intrones/genética , Curva ROC , Factores de Tiempo
7.
Cephalalgia ; 38(5): 912-932, 2018 04.
Artículo en Inglés | MEDLINE | ID: mdl-28699403

RESUMEN

Background The trigeminal ganglion contains neurons that relay sensations of pain, touch, pressure, and many other somatosensory modalities to the central nervous system. The ganglion is also a reservoir for latent herpes virus 1 infection. To gain a better understanding of molecular factors contributing to migraine and headache, transcriptome analyses were performed on postmortem human trigeminal ganglia. Methods RNA-Seq measurements of gene expression were conducted on small sub-regions of 16 human trigeminal ganglia. The samples were also characterized for transcripts derived from viral and microbial genomes. Herpes simplex virus 1 (HSV-1) antibodies in blood were measured using the luciferase immunoprecipitation assay. Results Observed molecular heterogeneity could be explained by sampling of anatomically distinct sub-regions of the excised ganglia consistent with neurally-enriched and non-neural, i.e. Schwann cell, enriched subregions. The levels of HSV-1 transcripts detected in trigeminal ganglia correlated with blood levels of HSV-1 antibodies. Multiple migraine susceptibility genes were strongly expressed in neurally-enriched trigeminal samples, while others were expressed in blood vessels. Conclusions These data provide a comprehensive human trigeminal transcriptome and a framework for evaluation of inhomogeneous post-mortem tissues through extensive quality control and refined downstream analyses for RNA-Seq methodologies. Expression profiling of migraine susceptibility genes identified by genetic association appears to emphasize the blood vessel component of the trigeminovascular system. Other genes displayed enriched expression in the trigeminal compared to dorsal root ganglion, and in-depth transcriptomic analysis of the KCNK18 gene underlying familial migraine shows selective neural expression within two specific populations of ganglionic neurons. These data suggest that expression profiling of migraine-associated genes can extend and amplify the underlying neurobiological insights obtained from genetic association studies.


Asunto(s)
Herpesvirus Humano 1/genética , Canales de Potasio/genética , ARN/genética , Análisis de Secuencia de ARN/métodos , Ganglio del Trigémino/patología , Adolescente , Adulto , Autopsia , Femenino , Humanos , Masculino , Persona de Mediana Edad , Ganglio del Trigémino/fisiología , Ganglio del Trigémino/virología , Adulto Joven
8.
BMC Syst Biol ; 9: 75, 2015 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-26542228

RESUMEN

BACKGROUND: Cellular function and diversity are orchestrated by complex interactions of fundamental biomolecules including DNA, RNA and proteins. Technological advances in genomics, epigenomics, transcriptomics and proteomics have enabled massively parallel and unbiased measurements. Such high-throughput technologies have been extensively used to carry out broad, unbiased studies, particularly in the context of human diseases. Nevertheless, a unified analysis of the genome, epigenome, transcriptome and proteome of a single human cell type to obtain a coherent view of the complex interplay between various biomolecules has not yet been undertaken. Here, we report the first multi-omic analysis of human primary naïve CD4+ T cells isolated from a single individual. RESULTS: Integrating multi-omics datasets allowed us to investigate genome-wide methylation and its effect on mRNA/protein expression patterns, extent of RNA editing under normal physiological conditions and allele specific expression in naïve CD4+ T cells. In addition, we carried out a multi-omic comparative analysis of naïve with primary resting memory CD4+ T cells to identify molecular changes underlying T cell differentiation. This analysis provided mechanistic insights into how several molecules involved in T cell receptor signaling are regulated at the DNA, RNA and protein levels. Phosphoproteomics revealed downstream signaling events that regulate these two cellular states. Availability of multi-omics data from an identical genetic background also allowed us to employ novel proteogenomics approaches to identify individual-specific variants and putative novel protein coding regions in the human genome. CONCLUSIONS: We utilized multiple high-throughput technologies to derive a comprehensive profile of two primary human cell types, naïve CD4+ T cells and memory CD4+ T cells, from a single donor. Through vertical as well as horizontal integration of whole genome sequencing, methylation arrays, RNA-Seq, miRNA-Seq, proteomics, and phosphoproteomics, we derived an integrated and comparative map of these two closely related immune cells and identified potential molecular effectors of immune cell differentiation following antigen encounter.


Asunto(s)
Linfocitos T CD4-Positivos/metabolismo , Inmunidad Innata/fisiología , Modelos Biológicos , Metilación de ADN , Epigenómica , Perfilación de la Expresión Génica , Variación Genética , Genoma Humano , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Inmunidad Innata/genética , Fosforilación , Proteómica , Edición de ARN/efectos de los fármacos , ARN Mensajero/metabolismo , Transducción de Señal/genética , Transcriptoma
9.
Nature ; 526(7575): 700-4, 2015 Oct 29.
Artículo en Inglés | MEDLINE | ID: mdl-26466568

RESUMEN

Neuroblastoma is a malignant paediatric tumour of the sympathetic nervous system. Roughly half of these tumours regress spontaneously or are cured by limited therapy. By contrast, high-risk neuroblastomas have an unfavourable clinical course despite intensive multimodal treatment, and their molecular basis has remained largely elusive. Here we have performed whole-genome sequencing of 56 neuroblastomas (high-risk, n = 39; low-risk, n = 17) and discovered recurrent genomic rearrangements affecting a chromosomal region at 5p15.33 proximal of the telomerase reverse transcriptase gene (TERT). These rearrangements occurred only in high-risk neuroblastomas (12/39, 31%) in a mutually exclusive fashion with MYCN amplifications and ATRX mutations, which are known genetic events in this tumour type. In an extended case series (n = 217), TERT rearrangements defined a subgroup of high-risk tumours with particularly poor outcome. Despite a large structural diversity of these rearrangements, they all induced massive transcriptional upregulation of TERT. In the remaining high-risk tumours, TERT expression was also elevated in MYCN-amplified tumours, whereas alternative lengthening of telomeres was present in neuroblastomas without TERT or MYCN alterations, suggesting that telomere lengthening represents a central mechanism defining this subtype. The 5p15.33 rearrangements juxtapose the TERT coding sequence to strong enhancer elements, resulting in massive chromatin remodelling and DNA methylation of the affected region. Supporting a functional role of TERT, neuroblastoma cell lines bearing rearrangements or amplified MYCN exhibited both upregulated TERT expression and enzymatic telomerase activity. In summary, our findings show that remodelling of the genomic context abrogates transcriptional silencing of TERT in high-risk neuroblastoma and places telomerase activation in the centre of transformation in a large fraction of these tumours.


Asunto(s)
Regulación Neoplásica de la Expresión Génica/genética , Genoma Humano/genética , Neuroblastoma/genética , Neuroblastoma/patología , Recombinación Genética/genética , Telomerasa/genética , Telomerasa/metabolismo , Línea Celular Tumoral , Transformación Celular Neoplásica/genética , Cromatina/genética , Cromatina/metabolismo , Cromosomas Humanos Par 5/genética , ADN Helicasas/genética , Metilación de ADN , Elementos de Facilitación Genéticos/genética , Activación Enzimática/genética , Amplificación de Genes/genética , Silenciador del Gen , Humanos , Lactante , Proteína Proto-Oncogénica N-Myc , Neuroblastoma/clasificación , Neuroblastoma/enzimología , Proteínas Nucleares/genética , Proteínas Oncogénicas/genética , Pronóstico , ARN Mensajero/análisis , ARN Mensajero/genética , Riesgo , Translocación Genética/genética , Regulación hacia Arriba/genética , Proteína Nuclear Ligada al Cromosoma X
10.
Genome Biol ; 16: 133, 2015 Jun 25.
Artículo en Inglés | MEDLINE | ID: mdl-26109056

RESUMEN

BACKGROUND: Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. RESULTS: We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. CONCLUSIONS: We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.


Asunto(s)
Perfilación de la Expresión Génica , Neuroblastoma/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Análisis de Secuencia de ARN , Adolescente , Adulto , Niño , Preescolar , Determinación de Punto Final , Femenino , Humanos , Lactante , Recién Nacido , Masculino , Modelos Genéticos , Neuroblastoma/clasificación , Neuroblastoma/diagnóstico , Células Tumorales Cultivadas , Adulto Joven
11.
Nucleic Acids Res ; 43(Database issue): D737-42, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25392405

RESUMEN

The non-human primate reference transcriptome resource (NHPRTR, available online at http://nhprtr.org/) aims to generate comprehensive RNA-seq data from a wide variety of non-human primates (NHPs), from lemurs to hominids. In the 2012 Phase I of the NHPRTR project, 19 billion fragments or 3.8 terabases of transcriptome sequences were collected from pools of ∼ 20 tissues in 15 species and subspecies. Here we describe a major expansion of NHPRTR by adding 10.1 billion fragments of tissue-specific RNA-seq data. For this effort, we selected 11 of the original 15 NHP species and subspecies and constructed total RNA libraries for the same ∼ 15 tissues in each. The sequence quality is such that 88% of the reads align to human reference sequences, allowing us to compute the full list of expression abundance across all tissues for each species, using the reads mapped to human genes. This update also includes improved transcript annotations derived from RNA-seq data for rhesus and cynomolgus macaques, two of the most commonly used NHP models and additional RNA-seq data compiled from related projects. Together, these comprehensive reference transcriptomes from multiple primates serve as a valuable community resource for genome annotation, gene dynamics and comparative functional analysis.


Asunto(s)
Bases de Datos Genéticas , Perfilación de la Expresión Génica , Primates/genética , Análisis de Secuencia de ARN , Animales , Internet , Macaca , Anotación de Secuencia Molecular , Especificidad de Órganos , Estándares de Referencia , Alineación de Secuencia/normas
12.
Nat Biotechnol ; 32(9): 888-95, 2014 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-25150837

RESUMEN

High-throughput RNA sequencing (RNA-seq) enables comprehensive scans of entire transcriptomes, but best practices for analyzing RNA-seq data have not been fully defined, particularly for data collected with multiple sequencing platforms or at multiple sites. Here we used standardized RNA samples with built-in controls to examine sources of error in large-scale RNA-seq studies and their impact on the detection of differentially expressed genes (DEGs). Analysis of variations in guanine-cytosine content, gene coverage, sequencing error rate and insert size allowed identification of decreased reproducibility across sites. Moreover, commonly used methods for normalization (cqn, EDASeq, RUV2, sva, PEER) varied in their ability to remove these systematic biases, depending on sample complexity and initial data quality. Normalization methods that combine data from genes across sites are strongly recommended to identify and remove site-specific effects and can substantially improve RNA-seq studies.


Asunto(s)
Análisis de Secuencia de ARN/métodos , Control de Calidad , Reproducibilidad de los Resultados
13.
Nat Biotechnol ; 32(9): 926-32, 2014 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-25150839

RESUMEN

The concordance of RNA-sequencing (RNA-seq) with microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed using a range of chemical treatment conditions. Here we use a comprehensive study design to generate Illumina RNA-seq and Affymetrix microarray data from the same liver samples of rats exposed in triplicate to varying degrees of perturbation by 27 chemicals representing multiple modes of action (MOAs). The cross-platform concordance in terms of differentially expressed genes (DEGs) or enriched pathways is linearly correlated with treatment effect size (R(2)0.8). Furthermore, the concordance is also affected by transcript abundance and biological complexity of the MOA. RNA-seq outperforms microarray (93% versus 75%) in DEG verification as assessed by quantitative PCR, with the gain mainly due to its improved accuracy for low-abundance transcripts. Nonetheless, classifiers to predict MOAs perform similarly when developed using data from either platform. Therefore, the endpoint studied and its biological complexity, transcript abundance and the genomic application are important factors in transcriptomic research and for clinical and regulatory decision making.


Asunto(s)
Análisis de Secuencia por Matrices de Oligonucleótidos , ARN Mensajero/genética , Análisis de Secuencia de ARN , Animales , Ratas
14.
Mol Pain ; 10: 44, 2014 Aug 14.
Artículo en Inglés | MEDLINE | ID: mdl-25123163

RESUMEN

BACKGROUND: Three neuropeptides, gastrin releasing peptide (GRP), natriuritic precursor peptide B (NPPB), and neuromedin B (NMB) have been proposed to play roles in itch sensation. However, the tissues in which these peptides are expressed and their positions in the itch circuit has recently become the subject of debate. Here we used next-gen RNA-Seq to examine the expression of transcripts coding for GRP, NPPB, NMB, and other peptides in DRG, trigeminal ganglion, and the spinal cord as well as expression levels for their cognate receptors in these tissues. RESULTS: RNA-Seq demonstrates that GRP is not transcribed in mouse, rat, or human sensory ganglia. NPPB, which activates natriuretic peptide receptor 1 (NPR1), is well expressed in mouse DRG and less so in rat and human, whereas NPPA, which also acts on the NPR1 receptor, is expressed in all three species. Analysis of transcripts expressed in the spinal cord of mouse, rat, and human reveals no expression of Nppb, but unambiguously detects expression of Grp and the GRP-receptor (Grpr). The transcripts coding for NMB and tachykinin peptides are among the most highly expressed in DRG. Bioinformatics comparisons using the sequence of the peptides used to produce GRP-antibodies with proteome databases revealed that the C-terminal primary sequence of NMB and Substance P can potentially account for results from previous studies which showed GRP-immunostaining in the DRG. CONCLUSIONS: RNA-Seq corroborates a primary itch afferent role for NPPB in mouse and potentially NPPB and NPPA in rats and humans, but does not support GRP as a primary itch neurotransmitter in mouse, rat, or humans. As such, our results are at odds with the initial proposal of Sun and Chen (2007) that GRP is expressed in DRG. By contrast, our data strongly support an itch pathway where the itch-inducing actions of GRP are exerted through its release from spinal cord neurons.


Asunto(s)
Ganglios Espinales/metabolismo , Péptido Liberador de Gastrina/metabolismo , Péptido Natriurético Encefálico/metabolismo , Médula Espinal/citología , Ganglio del Trigémino/metabolismo , Animales , Secuencia de Bases , Biología Computacional , Péptido Liberador de Gastrina/genética , Humanos , Ratones , Péptido Natriurético Encefálico/genética , Ratas , Receptores de Neuropéptido/genética , Receptores de Neuropéptido/metabolismo , Especificidad de la Especie
15.
Nat Commun ; 5: 3230, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24510058

RESUMEN

The rat has been used extensively as a model for evaluating chemical toxicities and for understanding drug mechanisms. However, its transcriptome across multiple organs, or developmental stages, has not yet been reported. Here we show, as part of the SEQC consortium efforts, a comprehensive rat transcriptomic BodyMap created by performing RNA-Seq on 320 samples from 11 organs of both sexes of juvenile, adolescent, adult and aged Fischer 344 rats. We catalogue the expression profiles of 40,064 genes, 65,167 transcripts, 31,909 alternatively spliced transcript variants and 2,367 non-coding genes/non-coding RNAs (ncRNAs) annotated in AceView. We find that organ-enriched, differentially expressed genes reflect the known organ-specific biological activities. A large number of transcripts show organ-specific, age-dependent or sex-specific differential expression patterns. We create a web-based, open-access rat BodyMap database of expression profiles with crosslinks to other widely used databases, anticipating that it will serve as a primary resource for biomedical research using the rat model.


Asunto(s)
Ratas Endogámicas F344/metabolismo , Transcriptoma , Empalme Alternativo , Animales , Femenino , Perfilación de la Expresión Génica , Masculino , Isoformas de Proteínas/metabolismo , Ratas Endogámicas F344/crecimiento & desarrollo , Análisis de Secuencia de ARN , Caracteres Sexuales
16.
Sci Data ; 1: 140020, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25977777

RESUMEN

Whole-transcriptome sequencing ('RNA-Seq') has been drastically changing the scale and scope of genomic research. In order to fully understand the power and limitations of this technology, the US Food and Drug Administration (FDA) launched the third phase of the MicroArray Quality Control (MAQC-III) project, also known as the SEquencing Quality Control (SEQC) project. Using two well-established human reference RNA samples from the first phase of the MAQC project, three sequencing platforms were tested across more than ten sites with built-in truths including spike-in of external RNA controls (ERCC), titration data and qPCR verification. The SEQC project generated over 30 billion sequence reads representing the largest RNA-Seq data ever generated by a single project on individual RNA samples. This extraordinarily ultradeep transcriptomic data set and the known truths built into the study design provide many opportunities for further research and development to advance the improvement and application of RNA-Seq.


Asunto(s)
Perfilación de la Expresión Génica , ARN/genética , Análisis de Secuencia de ARN , Transcriptoma , Perfilación de la Expresión Génica/métodos , Perfilación de la Expresión Génica/normas , Humanos , Control de Calidad , Estándares de Referencia
17.
Sci Data ; 1: 140021, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25977778

RESUMEN

RNA-Seq provides the capability to characterize the entire transcriptome in multiple levels including gene expression, allele specific expression, alternative splicing, fusion gene detection, and etc. The US FDA-led SEQC (i.e., MAQC-III) project conducted a comprehensive study focused on the transcriptome profiling of rat liver samples treated with 27 chemicals to evaluate the utility of RNA-Seq in safety assessment and toxicity mechanism elucidation. The chemicals represented multiple chemogenomic modes of action (MOA) and exhibited varying degrees of transcriptional response. The paired-end 100 bp sequencing data were generated using Illumina HiScanSQ and/or HiSeq 2000. In addition to the core study, six animals (i.e., three aflatoxin B1 treated rats and three vehicle control rats) were sequenced three times, with two separate library preparations on two sequencing machines. This large toxicogenomics dataset can serve as a resource to characterize various aspects of transcriptomic changes (e.g., alternative splicing) that are byproduct of chemical perturbation.


Asunto(s)
Perfilación de la Expresión Génica , Hígado , ARN/genética , Transcriptoma , Empalme Alternativo , Animales , Biblioteca de Genes , Ratas
18.
Nucleic Acids Res ; 41(Database issue): D906-14, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23203872

RESUMEN

RNA-based next-generation sequencing (RNA-Seq) provides a tremendous amount of new information regarding gene and transcript structure, expression and regulation. This is particularly true for non-coding RNAs where whole transcriptome analyses have revealed that the much of the genome is transcribed and that many non-coding transcripts have widespread functionality. However, uniform resources for raw, cleaned and processed RNA-Seq data are sparse for most organisms and this is especially true for non-human primates (NHPs). Here, we describe a large-scale RNA-Seq data and analysis infrastructure, the NHP reference transcriptome resource (http://nhprtr.org); it presently hosts data from12 species of primates, to be expanded to 15 species/subspecies spanning great apes, old world monkeys, new world monkeys and prosimians. Data are collected for each species using pools of RNA from comparable tissues. We provide data access in advance of its deposition at NCBI, as well as browsable tracks of alignments against the human genome using the UCSC genome browser. This resource will continue to host additional RNA-Seq data, alignments and assemblies as they are generated over the coming years and provide a key resource for the annotation of NHP genomes as well as informing primate studies on evolution, reproduction, infection, immunity and pharmacology.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genómica , Primates/genética , Transcriptoma , Animales , Genoma Humano , Humanos , Internet , Primates/metabolismo , Alineación de Secuencia , Análisis de Secuencia de ARN
19.
Science ; 329(5990): 432-5, 2010 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-20522740

RESUMEN

Three-prime untranslated regions (3'UTRs) of metazoan messenger RNAs (mRNAs) contain numerous regulatory elements, yet remain largely uncharacterized. Using polyA capture, 3' rapid amplification of complementary DNA (cDNA) ends, full-length cDNAs, and RNA-seq, we defined approximately 26,000 distinct 3'UTRs in Caenorhabditis elegans for approximately 85% of the 18,328 experimentally supported protein-coding genes and revised approximately 40% of gene models. Alternative 3'UTR isoforms are frequent, often differentially expressed during development. Average 3'UTR length decreases with animal age. Surprisingly, no polyadenylation signal (PAS) was detected for 13% of polyadenylation sites, predominantly among shorter alternative isoforms. Trans-spliced (versus non-trans-spliced) mRNAs possess longer 3'UTRs and frequently contain no PAS or variant PAS. We identified conserved 3'UTR motifs, isoform-specific predicted microRNA target sites, and polyadenylation of most histone genes. Our data reveal a rich complexity of 3'UTRs, both genome-wide and throughout development.


Asunto(s)
Regiones no Traducidas 3' , Caenorhabditis elegans/genética , Genes de Helminto , ARN de Helminto/genética , Animales , Sitios de Unión , Caenorhabditis elegans/embriología , Caenorhabditis elegans/crecimiento & desarrollo , Biología Computacional , Secuencia Conservada , Trastornos del Desarrollo Sexual , Regulación del Desarrollo de la Expresión Génica , Biblioteca de Genes , Proteínas del Helminto/genética , Histonas/genética , Masculino , MicroARNs/metabolismo , Operón , Poli A/metabolismo , Poliadenilación , ARN Mensajero/genética , Trans-Empalme
20.
Proc Natl Acad Sci U S A ; 106(44): 18674-9, 2009 Nov 03.
Artículo en Inglés | MEDLINE | ID: mdl-19846761

RESUMEN

Endogenous small interfering RNAs (endo-siRNAs) regulate diverse gene expression programs in eukaryotes by either binding and cleaving mRNA targets or mediating heterochromatin formation; however, the mechanisms of endo-siRNA biogenesis, sorting, and target regulation remain poorly understood. Here we report the identification and function of a specific class of germline-generated endo-siRNAs in Caenorhabditis elegans that are 26 nt in length and contain a guanine at the first nucleotide position (i.e., 26G RNAs). 26G RNAs regulate gene expression during spermatogenesis and zygotic development, and their biogenesis requires the ERI-1 exonuclease and the RRF-3 RNA-dependent RNA polymerase (RdRP). Remarkably, we identified two nonoverlapping subclasses of 26G RNAs that sort into specific RNA-induced silencing complexes (RISCs) and differentially regulate distinct mRNA targets. Class I 26G RNAs target genes are expressed during spermatogenesis, whereas class II 26G RNAs are maternally inherited and silence gene expression during zygotic development. These findings implicate a class of endo-siRNAs in the global regulation of transcriptional programs required for fertility and development.


Asunto(s)
Caenorhabditis elegans/embriología , Caenorhabditis elegans/genética , Regulación del Desarrollo de la Expresión Génica , Guanina/metabolismo , ARN Interferente Pequeño/metabolismo , Espermatogénesis/genética , Cigoto/metabolismo , Animales , Proteínas de Caenorhabditis elegans/metabolismo , Exorribonucleasas/metabolismo , Silenciador del Gen , Células Germinativas/metabolismo , Masculino , ARN de Helminto/clasificación , ARN de Helminto/metabolismo , ARN Interferente Pequeño/biosíntesis , ARN Interferente Pequeño/clasificación , Análisis de Secuencia de ADN
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...