RESUMO
Exonic sequences contain both protein-coding and RNA splicing information but the interplay of the protein and splicing code is complex and poorly understood. Here, we have studied traditional and auxiliary splicing codes of human exons that encode residues coordinating two essential divalent metals at the opposite ends of the Irving-Williams series, a universal order of relative stabilities of metal-organic complexes. We show that exons encoding Zn2+-coordinating amino acids are supported much less by the auxiliary splicing motifs than exons coordinating Ca2+. The handicap of the former is compensated by stronger splice sites and uridine-richer polypyrimidine tracts, except for position -3 relative to 3' splice junctions. However, both Ca2+ and Zn2+ exons exhibit close-to-constitutive splicing in multiple tissues, consistent with their critical importance for metalloprotein function and a relatively small fraction of expendable, alternatively spliced exons. These results indicate that constraints imposed by metal coordination spheres on RNA splicing have been efficiently overcome by the plasticity of exon-intron architecture to ensure adequate metalloprotein expression.
Assuntos
Cálcio , Metaloproteínas , Splicing de RNA , Zinco , Humanos , Processamento Alternativo , Éxons , Íntrons , Metaloproteínas/genética , Sítios de Splice de RNARESUMO
Transcriptomic diversity in primates was considerably expanded by exonizations of intronic Alu elements. To better understand their cellular mechanisms we have used structure-based mutagenesis coupled with functional and proteomic assays to study the impact of successive primate mutations and their combinations on inclusion of a sense-oriented AluJ exon in the human F8 gene. We show that the splicing outcome was better predicted by consecutive RNA conformation changes than by computationally derived splicing regulatory motifs. We also demonstrate an involvement of SRP9/14 (signal recognition particle) heterodimer in splicing regulation of Alu-derived exons. Nucleotide substitutions that accumulated during primate evolution relaxed the conserved left-arm AluJ structure including helix H1 and reduced the capacity of SRP9/14 to stabilize the closed Alu conformation. RNA secondary structure-constrained mutations that promoted open Y-shaped conformations of the Alu made the Alu exon inclusion reliant on DHX9. Finally, we identified additional SRP9/14 sensitive Alu exons and predicted their functional roles in the cell. Together, these results provide unique insights into architectural elements required for sense Alu exonization, identify conserved pre-mRNA structures involved in exon selection and point to a possible chaperone activity of SRP9/14 outside the mammalian signal recognition particle.
Assuntos
RNA , Partícula de Reconhecimento de Sinal , Animais , Humanos , RNA/química , Partícula de Reconhecimento de Sinal/genética , Partícula de Reconhecimento de Sinal/metabolismo , Proteômica , Splicing de RNA , Primatas/genética , Elementos Alu , Conformação de Ácido Nucleico , Mamíferos/genéticaRESUMO
Auxilliary splicing sequences in exons, known as enhancers (ESEs) and silencers (ESSs), have been subject to strong selection pressures at the RNA and protein level. The protein component of this splicing code is substantial, recently estimated at â¼50% of the total information within ESEs, but remains poorly understood. The ESE/ESS profiles were previously associated with the Irving-Williams (I-W) stability series for divalent metals, suggesting that the ESE/ESS evolution was shaped by metal binding sites. Here, we have examined splicing activities of exonic sequences that encode protein binding sites for Ca2+, a weak binder in the I-W affinity order. We found that predicted exon inclusion levels for the EF-hand motifs and for Ca2+-binding residues in nonEF-hand proteins were higher than for average exons. For canonical EF-hands, the increase was centred on the EF-hand chelation loop and, in particular, on Ca2+-coordinating residues, with a 1>12>3â¼5>9 hierarchy in the 12-codon loop consensus and usage bias at codons 1 and 12. The same hierarchy but a lower increase was observed for noncanonical EF-hands, except for S100 proteins. EF-hand loops preferentially accumulated exon splits in two clusters, one located in their N-terminal halves and the other around codon 12. Using splicing assays and published crosslinking and immunoprecipitation data, we identify candidate trans-acting factors that preferentially bind conserved GA-rich motifs encoding negatively charged amino acids in the loops. Together, these data provide evidence for the high capacity of codons for Ca2+-coordinating residues to be retained in mature transcripts, facilitating their exon-level expansion during eukaryotic evolution.
Assuntos
Cálcio , Splicing de RNA , Processamento Alternativo , Sítios de Ligação/genética , Códon , Éxons , Ligação ProteicaRESUMO
Ca2+-insensitive and -sensitive E1 subunits of the 2-oxoglutarate dehydrogenase complex (OGDHC) regulate tissue-specific NADH and ATP supply by mutually exclusive OGDH exons 4a and 4b. Here we show that their splicing is enforced by distant lariat branch points (dBPs) located near the 5' splice site of the intervening intron. dBPs restrict the intron length and prevent transposon insertions, which can introduce or eliminate dBP competitors. The size restriction was imposed by a single dominant dBP in anamniotes that expanded into a conserved constellation of four dBP adenines in amniotes. The amniote clusters exhibit taxon-specific usage of individual dBPs, reflecting accessibility of their extended motifs within a stable RNA hairpin rather than U2 snRNA:dBP base-pairing. The dBP expansion took place in early terrestrial species and was followed by a uridine enrichment of large downstream polypyrimidine tracts in mammals. The dBP-protected megatracts permit reciprocal regulation of exon 4a and 4b by uridine-binding proteins, including TIA-1/TIAR and PUF60, which promote U1 and U2 snRNP recruitment to the 5' splice site and BP, respectively, but do not significantly alter the relative dBP usage. We further show that codons for residues critically contributing to protein binding sites for Ca2+ and other divalent metals confer the exon inclusion order that mirrors the Irving-Williams affinity series, linking the evolution of auxiliary splicing motifs in exons to metallome constraints. Finally, we hypothesize that the dBP-driven selection for Ca2+-dependent ATP provision by E1 facilitated evolution of endothermy by optimizing the aerobic scope in target tissues.
Assuntos
Processamento Alternativo , Regulação da Temperatura Corporal/genética , Íntrons , Complexo Cetoglutarato Desidrogenase/genética , Animais , Cálcio/metabolismo , Evolução Molecular , Éxons , Células HEK293 , Humanos , Sequências Repetitivas Dispersas , Complexo Cetoglutarato Desidrogenase/metabolismo , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Precursores de RNA/química , Precursores de RNA/metabolismo , Sítios de Splice de RNA , Fatores de Processamento de RNA/metabolismo , RNA Mensageiro/química , RNA Mensageiro/metabolismo , Spliceossomos/metabolismo , Vertebrados/genéticaRESUMO
Transposed elements (TEs) have dramatically shaped evolution of the exon-intron structure and significantly contributed to morbidity, but how recent TE invasions into older TEs cooperate in generating new coding sequences is poorly understood. Employing an updated repository of new exon-intron boundaries induced by pathogenic mutations, termed DBASS, here we identify novel TE clusters that facilitated exon selection. To explore the extent to which such TE exons maintain RNA secondary structure of their progenitors, we carried out structural studies with a composite exon that was derived from a long terminal repeat (LTR78) and AluJ and was activated by a C > T mutation optimizing the 5' splice site. Using a combination of SHAPE, DMS and enzymatic probing, we show that the disease-causing mutation disrupted a conserved AluJ stem that evolved from helix 3.3 (or 5b) of 7SL RNA, liberating a primordial GC 5' splice site from the paired conformation for interactions with the spliceosome. The mutation also reduced flexibility of conserved residues in adjacent exon-derived loops of the central Alu hairpin, revealing a cross-talk between traditional and auxilliary splicing motifs that evolved from opposite termini of 7SL RNA and were approximated by Watson-Crick base-pairing already in organisms without spliceosomal introns. We also identify existing Alu exons activated by the same RNA rearrangement. Collectively, these results provide valuable TE exon models for studying formation and kinetics of pre-mRNA building blocks required for splice-site selection and will be useful for fine-tuning auxilliary splicing motifs and exon and intron size constraints that govern aberrant splice-site activation.
Assuntos
Elementos de DNA Transponíveis , Sítios de Splice de RNA , Splicing de RNA , Alelos , Sequência de Bases , Éxons , Regulação da Expressão Gênica , Humanos , Íntrons , Mutação , Conformação de Ácido Nucleico , Análise de Sequência de RNA , Transcrição GênicaRESUMO
PUF60 is a splicing factor that binds uridine (U)-rich tracts and facilitates association of the U2 small nuclear ribonucleoprotein with primary transcripts. PUF60 deficiency (PD) causes a developmental delay coupled with intellectual disability and spinal, cardiac, ocular and renal defects, but PD pathogenesis is not understood. Using RNA-Seq, we identify human PUF60-regulated exons and show that PUF60 preferentially acts as their activator. PUF60-activated internal exons are enriched for Us upstream of their 3' splice sites (3'ss), are preceded by longer AG dinucleotide exclusion zones and more distant branch sites, with a higher probability of unpaired interactions across a typical branch site location as compared to control exons. In contrast, PUF60-repressed exons show U-depletion with lower estimates of RNA single-strandedness. We also describe PUF60-regulated, alternatively spliced isoforms encoding other U-bound splicing factors, including PUF60 partners, suggesting that they are co-regulated in the cell, and identify PUF60-regulated exons derived from transposed elements. PD-associated amino-acid substitutions, even within a single RNA recognition motif (RRM), altered selection of competing 3'ss and branch points of a PUF60-dependent exon and the 3'ss choice was also influenced by alternative splicing of PUF60. Finally, we propose that differential distribution of RNA processing steps detected in cells lacking PUF60 and the PUF60-paralog RBM39 is due to the RBM39 RS domain interactions. Together, these results provide new insights into regulation of exon usage by the 3'ss organization and reveal that germline mutation heterogeneity in RRMs can enhance phenotypic variability at the level of splice-site and branch-site selection.
Assuntos
Éxons , Mutação de Sentido Incorreto , Sítios de Splice de RNA , Fatores de Processamento de RNA/metabolismo , Proteínas Repressoras/metabolismo , Motivos de Aminoácidos , Células HEK293 , Células HeLa , Ribonucleoproteínas Nucleares Heterogêneas/metabolismo , Humanos , Proteínas Nucleares/metabolismo , Fatores de Processamento de RNA/química , Fatores de Processamento de RNA/deficiência , Proteínas de Ligação a RNA/metabolismo , Proteínas Repressoras/química , Proteínas Repressoras/deficiência , Ribonucleoproteína Nuclear Pequena U1/metabolismo , Análise de Sequência de RNA , Elementos Nucleotídeos Curtos e Dispersos , Fator de Processamento U2AFRESUMO
The auxiliary factor of U2 small nuclear ribonucleoprotein (U2AF) facilitates branch point (BP) recognition and formation of lariat introns. The gene for the 35-kD subunit of U2AF gives rise to two protein isoforms (termed U2AF35a and U2AF35b) that are encoded by alternatively spliced exons 3 and Ab, respectively. The splicing recognition sequences of exon 3 are less favorable than exon Ab, yet U2AF35a expression is higher than U2AF35b across tissues. We show that U2AF35b repression is facilitated by weak, closely spaced BPs next to a long polypyrimidine tract of exon Ab. Each BP lacked canonical uridines at position -2 relative to the BP adenines, with efficient U2 base-pairing interactions predicted only for shifted registers reminiscent of programmed ribosomal frameshifting. The BP cluster was compensated by interactions involving unpaired cytosines in an upstream, EvoFold-predicted stem loop (termed ESL) that binds FUBP1/2. Exon Ab inclusion correlated with predicted free energies of mutant ESLs, suggesting that the ESL operates as a conserved rheostat between long inverted repeats upstream of each exon. The isoform-specific U2AF35 expression was U2AF65-dependent, required interactions between the U2AF-homology motif (UHM) and the α6 helix of U2AF35, and was fine-tuned by exon Ab/3 variants. Finally, we identify tandem homologous exons regulated by U2AF and show that their preferential responses to U2AF65-related proteins and SRSF3 are associated with unpaired pre-mRNA segments upstream of U2AF-repressed 3'ss. These results provide new insights into tissue-specific subfunctionalization of duplicated exons in vertebrate evolution and expand the repertoire of exon repression mechanisms that control alternative splicing.
Assuntos
Processamento Alternativo , Éxons , Subunidades Proteicas/genética , Precursores de RNA/genética , Fatores de Processamento de Serina-Arginina/genética , Fator de Processamento U2AF/genética , Sequência de Aminoácidos , Sequência de Bases , Evolução Biológica , Células HEK293 , Células HeLa , Humanos , Íntrons , Subunidades Proteicas/metabolismo , Precursores de RNA/metabolismo , Sítios de Splice de RNA , Fatores de Processamento de Serina-Arginina/metabolismo , Fator de Processamento U2AF/metabolismoRESUMO
The selection of 3Î splice sites (3Îss) is an essential early step in mammalian RNA splicing reactions, but the processes involved are unknown. We have used single molecule methods to test whether the major components implicated in selection, the proteins U2AF35 and U2AF65 and the U2 snRNP, are able to recognize alternative candidate sites or are restricted to one pre-specified site. In the presence of adenosine triphosphate (ATP), all three components bind in a 1:1 stoichiometry with a 3Îss. Pre-mRNA molecules with two alternative 3Îss can be bound concurrently by two molecules of U2AF or two U2 snRNPs, so none of the components are restricted. However, concurrent occupancy inhibits splicing. Stoichiometric binding requires conditions consistent with coalescence of the 5Î and 3Î sites in a complex (I, initial), but if this cannot form the components show unrestricted and stochastic association. In the absence of ATP, when complex E forms, U2 snRNP association is unrestricted. However, if protein dephosphorylation is prevented, an I-like complex forms with stoichiometric association of U2 snRNPs and the U2 snRNA is base-paired to the pre-mRNA. Complex I differs from complex A in that the formation of complex A is associated with the loss of U2AF65 and 35.
Assuntos
Splicing de RNA , Spliceossomos/metabolismo , Fator de Processamento U2AF/metabolismo , Trifosfato de Adenosina/metabolismo , Linhagem Celular , Humanos , Íntrons , Modelos Biológicos , Complexos Multiproteicos/metabolismo , Monoéster Fosfórico Hidrolases/metabolismo , Ligação Proteica , Precursores de RNA/química , Precursores de RNA/genética , Precursores de RNA/metabolismo , Sítios de Splice de RNA , Ribonucleoproteína Nuclear Pequena U1/metabolismo , Proteína 2 de Sobrevivência do Neurônio Motor/metabolismo , Tropomiosina/metabolismoRESUMO
Gitelman syndrome (GS) is an autosomal recessive renal tubulopathy characterized by hypokalemic metabolic alkalosis with hypocalciuria and hypomagnesemia. GS clinical symptoms range from mild weakness to muscular cramps, paralysis or even sudden death as a result of cardiac arrhythmia. GS is caused by loss-of-function mutations in the solute carrier family 12 member 3 (SLC12A3) gene, but molecular mechanisms underlying such a wide range of symptoms are poorly understood. Here we report cryptic exon activation in SLC12A3 intron 12 in a clinically asymptomatic GS, resulting from an intronic mutation c.1669+297 T>G that created a new acceptor splice site. The cryptic exon was sandwiched between the L3 transposon upstream and a mammalian interspersed repeat downstream, possibly contributing to inclusion of the cryptic exon in mature transcripts. The mutation was identified by targeted next-generation sequencing of candidate genes in GS patients with missing pathogenic SLC12A3 alleles. Taken together, this work illustrates the power of next-generation sequencing to identify causal mutations in intronic regions in asymptomatic individuals at risk of developing potentially fatal disease complications, improving clinical management of these cases.
Assuntos
Síndrome de Gitelman/diagnóstico , Síndrome de Gitelman/genética , Túbulos Renais Distais/patologia , Sequência de Bases , Pré-Escolar , Éxons/genética , Feminino , Síndrome de Gitelman/patologia , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Íntrons/genética , Mutação/genética , Análise de Sequência de DNA , Membro 3 da Família 12 de Carreador de Soluto/genética , Membro 3 da Família 12 de Carreador de Soluto/metabolismoRESUMO
Alport syndrome-diffuse leiomyomatosis (AS-DL, OMIM: 308940) is a rare variant of the X-linked Alport syndrome that shows overgrowth of visceral smooth muscles in the gastrointestinal, respiratory and female reproductive tracts in addition to renal symptoms. AS-DL results from deletions that encompass the 5' ends of the COL4A5 and COL4A6 genes, but deletion breakpoints between COL4A5 and COL4A6 have been determined in only four cases. Here, we characterize deletion breakpoints in five AS-DL patients and show a contiguous COL4A6/COL4A5 deletion in each case. We also demonstrate that eight out of nine deletion alleles involved sequences homologous between COL4A5 and COL4A6. Most breakpoints took place in recognizable transposed elements, including long and short interspersed repeats, DNA transposons and long-terminal repeat retrotransposons. Because deletions involved the bidirectional promoter region in each case, we suggest that the occurrence of leiomyomatosis in AS-DL requires inactivation of both genes. Altogether, our study highlights the importance of homologous recombination involving multiple transposed elements for the development of this continuous gene syndrome and other atypical loss-of-function phenotypes.
Assuntos
Colágeno Tipo IV/genética , Deleção de Genes , Leiomiomatose/complicações , Leiomiomatose/genética , Nefrite Hereditária/complicações , Nefrite Hereditária/genética , Sequência de Bases , HumanosRESUMO
The auxiliary factor of U2 small nuclear RNA (U2AF) is a heterodimer consisting of 65- and 35-kD proteins that bind the polypyrimidine tract (PPT) and AG dinucleotides at the 3' splice site (3'ss). The gene encoding U2AF35 (U2AF1) is alternatively spliced, giving rise to two isoforms U2AF35a and U2AF35b. Here, we knocked down U2AF35 and each isoform and characterized transcriptomes of HEK293 cells with varying U2AF35/U2AF65 and U2AF35a/b ratios. Depletion of both isoforms preferentially modified alternative RNA processing events without widespread failure to recognize 3'ss or constitutive exons. Over a third of differentially used exons were terminal, resulting largely from the use of known alternative polyadenylation (APA) sites. Intronic APA sites activated in depleted cultures were mostly proximal whereas tandem 3'UTR APA was biased toward distal sites. Exons upregulated in depleted cells were preceded by longer AG exclusion zones and PPTs than downregulated or control exons and were largely activated by PUF60 and repressed by CAPERα. The U2AF(35) repression and activation was associated with a significant interchange in the average probabilities to form single-stranded RNA in the optimal PPT and branch site locations and sequences further upstream. Although most differentially used exons were responsive to both U2AF subunits and their inclusion correlated with U2AF levels, a small number of transcripts exhibited distinct responses to U2AF35a and U2AF35b, supporting the existence of isoform-specific interactions. These results provide new insights into function of U2AF and U2AF35 in alternative RNA processing.
Assuntos
Éxons , Proteínas Nucleares/genética , Splicing de RNA , Ribonucleoproteínas/genética , Análise de Sequência de RNA , Células HEK293 , Humanos , Fator de Processamento U2AFRESUMO
BACKGROUND: X-linked Alport syndrome (XLAS) is a progressive hereditary nephropathy caused by mutations in the type IV collagen alpha chain 5 gene (COL4A5). Although many COL4A5 mutations have previously been identified, pathogenic synonymous mutations have not yet been described. METHODS: A family with XLAS underwent mutational analyses of COL4A5 by PCR and direct sequencing, as well as transcript analysis of potential splice site mutations. In silico analysis was also conducted to predict the disruption of splicing factor binding sites. Immunohistochemistry (IHC) of kidney biopsies was used to detect α2 and α5 chain expression. RESULTS: We identified a hemizygous point mutation, c.876A>T, in exon 15 of COL4A5 in the proband and his brother, which is predicted to result in a synonymous amino acid change, p.(Gly292Gly). Transcript analysis showed that this mutation potentially altered splicing because it disrupted the splicing factor binding site. The kidney biopsy of the proband showed lamellation of the glomerular basement membrane (GBM), while IHC revealed negative α5(IV) staining in the GBM and Bowman's capsule, which is typical of XLAS. CONCLUSIONS: This is the first report of a synonymous COL4A5 substitution being responsible for XLAS. Our findings suggest that transcript analysis should be conducted for the future correct assessment of silent mutations.
Assuntos
Colágeno Tipo IV/genética , Nefrite Hereditária/genética , Mutação Puntual , Sítios de Splice de RNA , Biópsia , Cápsula Glomerular/química , Colágeno Tipo IV/metabolismo , Análise Mutacional de DNA , Progressão da Doença , Éxons , Feminino , Estudos de Associação Genética , Predisposição Genética para Doença , Membrana Basal Glomerular/química , Humanos , Imuno-Histoquímica , Falência Renal Crônica/diagnóstico , Falência Renal Crônica/genética , Falência Renal Crônica/metabolismo , Masculino , Pessoa de Meia-Idade , Nefrite Hereditária/diagnóstico , Nefrite Hereditária/metabolismo , Linhagem , Fenótipo , Adulto JovemRESUMO
Splice-switching oligonucleotides (SSOs) have been widely used to inhibit exon usage but antisense strategies that promote removal of entire introns to increase splicing-mediated gene expression have not been developed. Here we show reduction of INS intron 1 retention by SSOs that bind transcripts derived from a human haplotype expressing low levels of proinsulin. This haplotype is tagged by a polypyrimidine tract variant rs689 that decreases the efficiency of intron 1 splicing and increases the relative abundance of mRNAs with extended 5' untranslated region (5' UTR), which curtails translation. Co-expression of haplotype-specific reporter constructs with SSOs bound to splicing regulatory motifs and decoy splice sites in primary transcripts revealed a motif that significantly reduced intron 1-containing mRNAs. Using an antisense microwalk at a single nucleotide resolution, the optimal target was mapped to a splicing silencer containing two pseudoacceptor sites sandwiched between predicted RNA guanine (G) quadruplex structures. Circular dichroism spectroscopy and nuclear magnetic resonance of synthetic G-rich oligoribonucleotide tracts derived from this region showed formation of a stable parallel 2-quartet G-quadruplex on the 3' side of the antisense retention target and an equilibrium between quadruplexes and stable hairpin-loop structures bound by optimal SSOs. This region interacts with heterogeneous nuclear ribonucleoproteins F and H that may interfere with conformational transitions involving the antisense target. The SSO-assisted promotion of weak intron removal from the 5' UTR through competing noncanonical and canonical RNA structures may facilitate development of novel strategies to enhance gene expression.
Assuntos
Quadruplex G , Íntrons , Oligonucleotídeos Antissenso/química , Proinsulina/genética , Splicing de RNA , Regiões 5' não Traduzidas , Animais , Células COS , Linhagem Celular , Chlorocebus aethiops , RNA Helicases DEAD-box/antagonistas & inibidores , Humanos , RNA/química , Precursores de RNA/metabolismo , Sítios de Splice de RNA , RNA Mensageiro/metabolismo , Proteínas de Ligação a RNA/metabolismoRESUMO
Splice-site selection is controlled by secondary structure through sequestration or approximation of splicing signals in primary transcripts but the exact role of even the simplest and most prevalent structural motifs in exon recognition remains poorly understood. Here we took advantage of a single-hairpin exon that was activated in a mammalian-wide interspersed repeat (MIR) by a mutation stabilizing a terminal triloop, with splice sites positioned close to each other in a lower stem of the hairpin. We first show that the MIR exon inclusion in mRNA correlated inversely with hairpin stabilities. Employing a systematic manipulation of unpaired regions without altering splice-site configuration, we demonstrate a high correlation between exon inclusion of terminal tri- and tetraloop mutants and matching tri-/tetramers in splicing silencers/enhancers. Loop-specific exon inclusion levels and enhancer/silencer associations were preserved across primate cell lines, in 4 hybrid transcripts and also in the context of a distinct stem, but only if its loop-closing base pairs were shared with the MIR hairpin. Unlike terminal loops, splicing activities of internal loop mutants were predicted by their intramolecular Watson-Crick interactions with the antiparallel strand of the MIR hairpin rather than by frequencies of corresponding trinucleotides in splicing silencers/enhancers. We also show that splicing outcome of oligonucleotides targeting the MIR exon depend on the identity of the triloop adjacent to their antisense target. Finally, we identify proteins regulating MIR exon recognition and reveal a distinct requirement of adjacent exons for C-terminal extensions of Tra2α and Tra2ß RNA recognition motifs.
Assuntos
Éxons , Sequências Repetidas Invertidas , Mamíferos/genética , Conformação de Ácido Nucleico , RNA Mensageiro/metabolismo , Animais , Linhagem Celular Tumoral , Humanos , Splicing de RNARESUMO
BACKGROUND: Autosomal recessive Alport syndrome (ARAS) is a rare hereditary disease caused by homozygous or compound heterozygous mutations in either the COL4A3 or COL4A4 genes. Failure to diagnose ARAS cases is common, even if detailed clinical and pathological examinations are carried out. As the mutation detection rate for ARAS is unsatisfactory, we sought to develop more reliable diagnostic methods and provide a better description of the clinicopathological characteristics of this disorder. METHODS: A retrospective analysis of 30 genetically diagnosed patients with ARAS in 24 pedigrees was conducted. The mutation detection strategy comprised three steps: (1) genomic DNA analysis using polymerase chain reaction (PCR) and direct sequencing; (2) mRNA analysis using reverse transcription (RT)-PCR to detect RNA processing abnormalities; (3) semi-quantitative PCR using capillary electrophoresis to detect large heterozygous deletions. RESULTS: Using the three-step analysis, we identified homozygous or compound heterozygous mutations in all patients. Interestingly, 20% of our ARAS patients showed normal expression of α5 in kidney tissue. The median age of developing end-stage renal disease was 21 years. CONCLUSIONS: The strategy described in this study improves the diagnosis for ARAS families. Although immunohistochemical analysis of α5 can provide diagnostic information, normal distribution does not exclude the diagnosis of ARAS.
Assuntos
Nefrite Hereditária/diagnóstico , Nefrite Hereditária/genética , Adolescente , Adulto , Criança , Pré-Escolar , Análise Mutacional de DNA , Feminino , Estudos de Associação Genética , Humanos , Falência Renal Crônica/genética , Masculino , Reação em Cadeia da Polimerase , Estudos Retrospectivos , Adulto JovemRESUMO
Eukaryotic DNA codes not only for proteins but contains a wealth of information required for accurate splicing of messenger RNA precursors and inclusion of constitutively or alternatively spliced exons in mature transcripts. This "auxiliary" splicing code has been characterized as exonic splicing enhancers and silencers (ESE and ESS). The exact interplay between protein and splicing codes is, however, poorly understood. Here, we show that exons encoding copper-coordinating amino acids in human cuproproteins lack ESEs and/or have an excess of ESSs, yet RNA sequencing and expressed sequence tags data show that they are more efficiently included in mature transcripts by the splicing machinery than average exons. Their largely constitutive inclusion in messenger RNA is facilitated by stronger splice sites, including polypyrimidine tracts, consistent with an important role of the surrounding intron architecture in ensuring high expression of metal-binding residues during evolution. ESE/ESS profiles of codons and entire exons that code for copper-coordinating residues were very similar to those encoding residues that coordinate zinc but markedly different from those that coordinate calcium. Together, these results reveal how the traditional and auxiliary splicing motifs responded to constraints of metal coordination in proteins.
Assuntos
Cobre , Éxons , Splicing de RNA , Humanos , Éxons/genética , Cobre/metabolismo , Processamento Alternativo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Elementos Facilitadores Genéticos/genética , Proteínas de Transporte/genética , Proteínas de Transporte/metabolismoRESUMO
DBASS3 and DBASS5 provide comprehensive repositories of new exon boundaries that were induced by pathogenic mutations in human disease genes. Aberrant 5'- and 3'-splice sites were activated either by mutations in the consensus sequences of natural exon-intron junctions (cryptic sites) or elsewhere ('de novo' sites). DBASS3 and DBASS5 currently contain approximately 900 records of cryptic and de novo 3'- and 5'-splice sites that were produced by over a thousand different mutations in approximately 360 genes. DBASS3 and DBASS5 data can be searched by disease phenotype, gene, mutation, location of aberrant splice sites in introns and exons and their distance from authentic counterparts, by bibliographic references and by the splice-site strength estimated with several prediction algorithms. The user can also retrieve reference sequences of both aberrant and authentic splice sites with the underlying mutation. These data will facilitate identification of introns or exons frequently involved in aberrant splicing, mutation analysis of human disease genes and study of germline or somatic mutations that impair RNA processing. Finally, this resource will be useful for fine-tuning splice-site prediction algorithms, better definition of auxiliary splicing signals and design of new reporter assays. DBASS3 and DBASS5 are freely available at http://www.dbass.org.uk/.
Assuntos
Bases de Dados de Ácidos Nucleicos , Mutação , Sítios de Splice de RNA , Doença/genética , Humanos , Terminologia como Assunto , Interface Usuário-ComputadorRESUMO
GC 5' splice sites (5'ss) are present in â¼1% of human introns, but factors promoting their efficient selection are poorly understood. Here, we describe a case of X-linked agammaglobulinemia resulting from a GC 5'ss activated by a mutation in BTK intron 3. This GC 5'ss was intrinsically weak, yet it was selected in >90% primary transcripts in the presence of a strong and intact natural GT counterpart. We show that efficient selection of this GC 5'ss required a high density of GAA/CAA-containing splicing enhancers in the exonized segment and was promoted by SR proteins 9G8, Tra2ß and SC35. The GC 5'ss was efficiently inhibited by splice-switching oligonucleotides targeting either the GC 5'ss itself or the enhancer. Comprehensive analysis of natural GC-AG introns and previously reported pathogenic GC 5'ss showed that their efficient activation was facilitated by higher densities of splicing enhancers and lower densities of silencers than their GT 5'ss equivalents. Removal of the GC-AG introns was promoted to a minor extent by the splice-site strength of adjacent exons and inhibited by flanking Alu repeats, with the first downstream Alus located on average at a longer distance from the GC 5'ss than other transposable elements. These results provide new insights into the splicing code that governs selection of noncanonical splice sites.
Assuntos
Agamaglobulinemia/genética , Doenças Genéticas Ligadas ao Cromossomo X/genética , Sítios de Splice de RNA , Tirosina Quinase da Agamaglobulinemia , Linhagem Celular , Humanos , Sequências Repetitivas Dispersas , Íntrons , Oligonucleotídeos Antissenso , Mutação Puntual , Proteínas Tirosina Quinases/genética , Splicing de RNA , Sequências Reguladoras de Ácido RibonucleicoRESUMO
We describe a new program called cryptic splice finder (CSF) that can reliably identify cryptic splice sites (css), so providing a useful tool to help investigate splicing mutations in genetic disease. We report that many css are not entirely dormant and are often already active at low levels in normal genes prior to their enhancement in genetic disease. We also report a fascinating correlation between the positions of css and introns, whereby css within the exons of one species frequently match the exact position of introns in equivalent genes from another species. These results strongly indicate that many introns were inserted into css during evolution and they also imply that the splicing information that lies outside some introns can be independently recognized by the splicing machinery and was in place prior to intron insertion. This indicates that non-intronic splicing information had a key role in shaping the split structure of eukaryote genes.
Assuntos
Sítios de Splice de RNA , Software , Sequência de Bases , Sequência Consenso , Evolução Molecular , Etiquetas de Sequências Expressas/química , Genes , Doenças Genéticas Inatas/genética , Genômica/métodos , Humanos , Íntrons , Alinhamento de Sequência , Análise de Sequência de ProteínaRESUMO
Olduvai protein domains (also known as DUF1220 or NBPF) have undergone the greatest human-specific increase in the copy number of any coding region in the genome. Their repeat number was strongly associated with the evolutionary expansion of brain volumes, neuron counts and cognitive abilities, as well as with disorders of the autistic spectrum. Nevertheless, the domain function and cellular mechanisms underlying the positive selection of Olduvai DNA sequences in higher primates remain obscure. Here, I show that the inclusion of Olduvai exon doublets in mature transcripts is facilitated by a potent splicing enhancer that was created through duplication within the first exon. The enhancer is the strongest among the NBPF transcripts and further promotes the already high splicing activity of the unexpanded first exons of the two-exon domains, safeguarding the expanded Olduvai exon doublets in the mature transcriptome. The duplication also creates a predicted RNA guanine quadruplex that may regulate the access to spliceosomal components of the super-enhancer and influence the splicing of adjacent exons. Thus, positive Olduvai selection during primate evolution is likely to result from a combination of multiple targets in gene expression pathways, including RNA splicing.