Global transcript structure resolution of high gene density genomes through multi-platform data integration.
Nucleic Acids Res
; 44(18): e145, 2016 Oct 14.
Article
en En
| MEDLINE
| ID: mdl-27407110
ABSTRACT
Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome.
Texto completo:
1
Colección:
01-internacional
Banco de datos:
MEDLINE
Asunto principal:
ARN Mensajero
/
Estadística como Asunto
/
Genoma Viral
/
Herpesvirus Humano 4
Tipo de estudio:
Prognostic_studies
Límite:
Humans
Idioma:
En
Revista:
Nucleic Acids Res
Año:
2016
Tipo del documento:
Article
País de afiliación:
Estados Unidos