RESUMEN
Personalized medicine is expected to benefit from combining genomic information with regular monitoring of physiological states by multiple high-throughput methods. Here, we present an integrative personal omics profile (iPOP), an analysis that combines genomic, transcriptomic, proteomic, metabolomic, and autoantibody profiles from a single individual over a 14 month period. Our iPOP analysis revealed various medical risks, including type 2 diabetes. It also uncovered extensive, dynamic changes in diverse molecular components and biological pathways across healthy and diseased conditions. Extremely high-coverage genomic and transcriptomic data, which provide the basis of our iPOP, revealed extensive heteroallelic changes during healthy and diseased states and an unexpected RNA editing mechanism. This study demonstrates that longitudinal iPOP can be used to interpret healthy and diseased states by connecting genomic information with additional dynamic omics activity.
Asunto(s)
Genoma Humano , Genómica , Medicina de Precisión , Diabetes Mellitus Tipo 2/genética , Femenino , Perfilación de la Expresión Génica , Humanos , Masculino , Metabolómica , Persona de Mediana Edad , Mutación , Proteómica , Virus Sincitiales Respiratorios/aislamiento & purificación , Rhinovirus/aislamiento & purificaciónRESUMEN
One of the major forms of alternative splicing, which generates multiple mRNA isoforms differing in the precise combinations of their exon sequences, is exon skipping. While in constitutive splicing all exons are included, in the skipped pattern(s) one or more exons are skipped. The regulation of this process is still not well understood; so far, cis- regulatory elements (such as exonic splicing enhancers) were identified in individual cases. We therefore set to investigate the possibility that exon skipping is controlled by sequences in the adjacent introns. We employed a computer analysis on 54 sequences documented as undergoing exon skipping, and identified two motifs both in the upstream and downstream introns of the skipped exons. One motif is highly enriched in pyrimidines (mostly C residues), and the other motif is highly enriched in purines (mostly G residues). The two motifs differ from the known cis-elements present at the 5' and 3' splice site. Interestingly, the two motifs are complementary, and their relative positional order is conserved in the flanking introns. These suggest that base pairing interactions can underlie a mechanism that involves secondary structure to regulate exon skipping. Remarkably, the two motifs are conserved in mouse orthologous genes that undergo exon skipping.
Asunto(s)
Empalme Alternativo/genética , Secuencia Conservada/genética , Exones/genética , Animales , Secuencia de Bases , Sitios de Unión/genética , Bases de Datos de Ácidos Nucleicos , Secuencia Rica en GC/genética , HumanosRESUMEN
Splice site selection is a key element of pre-mRNA splicing and involves specific recognition of consensus sequences at the 5(') and 3(') splice sites. Evidently, the compliance of a given sequence with the consensus 5(') splice site sequence is not sufficient to define it as a functional 5(') splice site, because not all sequences that conform with the consensus are used for splicing. We have previously hypothesized that the necessity to avoid the inclusion of premature termination codons within mature mRNAs may serve as a criterion that differentiates normal 5(') splice sites from unused (latent) ones. We further provided experimental support to this idea, by analyzing the splicing of pre-mRNAs in which in-frame stop codons upstream of a latent 5(') splice site were mutated, and showing that splicing using the latent site is indeed activated by such mutations. Here we evaluate this hypothesis by a computerized survey for latent 5(') splice sites in 446 protein-coding human genes. This data set contains 2311 introns, in which we found 10490 latent 5(') splice sites. The utilization of 10045 (95.8%) of these sites for splicing would have led to the inclusion of an in-frame stop codon within the resultant mRNA. The validity of this finding is confirmed here by statistical analyses. This finding, together with our previous experimental results, invokes a nuclear scanning mechanism, as part of the splicing machine, which identifies in-frame stop codons within the pre-mRNA and prevents splicing that could lead to the formation of a prematurely terminated protein.
Asunto(s)
Sistemas de Lectura Abierta , Algoritmos , Codón , Bases de Datos como Asunto , Exones , Técnicas Genéticas , Humanos , Intrones , Modelos Genéticos , Mutación , Empalme del ARN , ARN Mensajero/metabolismo , Programas InformáticosRESUMEN
RNA sequences that conform to the consensus sequence of 5' splice sites but are not used for splicing occur frequently in protein coding genes. Mutational analyses have shown that suppression of splicing at such latent sites may be dictated by the necessity to maintain an open reading frame in the mRNA. Here we show that stop codon frequency in introns having latent 5' splice sites is significantly greater than that of introns lacking such sites and significantly greater than the expected occurrence by chance alone. Both observations suggest the occurrence of a general mechanism that recognizes the mRNA reading frame in the context of pre-mRNA.
Asunto(s)
Intrones/genética , Sistemas de Lectura Abierta , Biosíntesis de Proteínas , Procesamiento Postranscripcional del ARN , Empalme del ARN , ARN Mensajero/genética , Humanos , Precursores del ARN/genética , ARN Mensajero/metabolismoRESUMEN
Pre-mRNA splicing involves recognition of a consensus sequence at the 5' splice site (SS). However, only some of the many potential sites that conform to the consensus are true ones, whereas the majority remain silent and are not normally used for splicing. We noticed that in most cases the utilization of such a latent intronic 5' SS for splicing would introduce an in-frame stop codon into the resultant mRNA. This finding suggested a link between SS selection and maintenance of an ORF within the mRNA. Here we tested this idea by analyzing the splicing of pre-mRNAs in which in-frame stop codons upstream of a latent 5' SS were mutated. We found that splicing with the latent site is indeed activated by such mutations. Our findings predict the existence of a checking mechanism, as a component of the nuclear pre-mRNA splicing machine, to ensure the maintenance of an ORF. This notion is highly important for accurate gene expression, as perturbations that would lead to splicing at these latent sites are expected to introduce in-frame stop codons into the majority of mRNAs.
Asunto(s)
Codón , Empalme Alternativo , Animales , Sitios de Unión , Línea Celular , Codón sin Sentido , Codón de Terminación , Cricetinae , Fibroblastos/metabolismo , Humanos , Iduronidasa/química , Mesocricetus , Modelos Genéticos , Mutación , Sistemas de Lectura Abierta , Plásmidos/metabolismo , Mutación Puntual , Unión Proteica , ARN Mensajero/metabolismo , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , TransfecciónRESUMEN
Cytochrome b5/cytochrome b5 reductase (cb5/cb5r) is a cytosolic fusion protein between the hemoprotein cytochrome b5 and the flavoprotein cytochrome b5 reductase. We describe the identification and characterization of a novel splice variant of cb5/cb5r in the mouse and rat and show that expression of the variant is conserved in both species but is not expressed in human tissue. Characterization of the exon structure of cb5/cb5r indicated that the variant was due to the deletion of the whole of exon 12, thus the variant was named cb5/cb5rdelta12. Exon 12 codes for the flavin-adenine dinucleotide binding domain of cb5/cb5r. Expression analysis revealed the transcript of cb5/cb5rdelta12 in mouse and rat testis, brain, and skeletal muscle and also in the male germ line. We postulate that cb5/cb5rdelta12 may function in a dominant negative fashion, limiting the amount of damage caused by the production of reactive oxygen species by cb5/cb5r.