Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
J Biol Chem ; 286(12): 10017-26, 2011 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-21285356

RESUMO

Although alternative DNA secondary structures (non-B DNA) can induce genomic rearrangements, their associated mutational spectra remain largely unknown. The helicase activity of WRN, which is absent in the human progeroid Werner syndrome, is thought to counteract this genomic instability. We determined non-B DNA-induced mutation frequencies and spectra in human U2OS osteosarcoma cells and assessed the role of WRN in isogenic knockdown (WRN-KD) cells using a supF gene mutation reporter system flanked by triplex- or Z-DNA-forming sequences. Although both non-B DNA and WRN-KD served to increase the mutation frequency, the increase afforded by WRN-KD was independent of DNA structure despite the fact that purified WRN helicase was found to resolve these structures in vitro. In U2OS cells, ∼70% of mutations comprised single-base substitutions, mostly at G·C base-pairs, with the remaining ∼30% being microdeletions. The number of mutations at G·C base-pairs in the context of NGNN/NNCN sequences correlated well with predicted free energies of base stacking and ionization potentials, suggesting a possible origin via oxidation reactions involving electron loss and subsequent electron transfer (hole migration) between neighboring bases. A set of ∼40,000 somatic mutations at G·C base pairs identified in a lung cancer genome exhibited similar correlations, implying that hole migration may also be involved. We conclude that alternative DNA conformations, WRN deficiency and lung tumorigenesis may all serve to increase the mutation rate by promoting, through diverse pathways, oxidation reactions that perturb the electron orbitals of neighboring bases. It follows that such "hole migration" is likely to play a much more widespread role in mutagenesis than previously anticipated.


Assuntos
DNA Forma Z/metabolismo , Exodesoxirribonucleases , Instabilidade Genômica , Neoplasias Pulmonares/metabolismo , RecQ Helicases , Deleção de Sequência , Linhagem Celular Tumoral , DNA Forma Z/genética , Técnicas de Silenciamento de Genes , Humanos , Neoplasias Pulmonares/genética , Helicase da Síndrome de Werner
2.
Hum Genomics ; 4(5): 289-301, 2010 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-20650818

RESUMO

The +1169A allele of the A/T single nucleotide polymorphism (SNP; rs2665802), located within intron 4 of the human growth hormone 1 ( GH1 ) gene, has been associated with reduced levels of circulating GH and insulin-like growth factor 1, a reduced risk of colorectal cancer and a predisposition to osteoporosis. Whether this intronic SNP is itself the functional polymorphism responsible for exerting a direct effect on GH1 gene expression, however, or whether it is instead in linkage disequilibrium with the functional SNP, has been an open question. The evolutionary conservation of the +1169T allele (and the surrounding intronic sequence) in the bovine genome, as well as in primate genomes, is, however, suggestive of its functionality. Although a potential alternative splice site spans the location of the +1169 SNP, polymerase chain reaction-based assays failed to yield any evidence for alternative splicing associated with either allele. To determine whether the +1169 SNP, in different allelic combinations with SNPs at -278 (G/T), -57 (T/G) and +2103 (C/T), exerts a direct effect on gene expression and/or GH secretion, we performed a series of transfections of various GH1 haplotype-expressing constructs into rat GC (somatotroph) cells. The results obtained provided evidence to support the contention that the +1169A allele contributes directly to the observed reduction in both GH1 gene expression and GH secretion. Part of the apparent influence of the +1169A-bearing allele on GH1 gene expression and GH secretion may still, however, be attributable to alleles of additional SNPs in cis to +1169A and located within either the promoter or the 3'-flanking region.


Assuntos
Hormônio do Crescimento Humano/genética , Íntrons/genética , Polimorfismo de Nucleotídeo Único/genética , Região 3'-Flanqueadora/genética , Alelos , Processamento Alternativo/genética , Animais , Sequência de Bases , Linhagem Celular , Sequência Conservada/genética , Evolução Molecular , Regulação da Expressão Gênica , Haplótipos/genética , Humanos , Desequilíbrio de Ligação/genética , Dados de Sequência Molecular , Regiões Promotoras Genéticas/genética , Sítios de Splice de RNA/genética , Ratos , Homologia de Sequência do Ácido Nucleico , Somatotrofos/metabolismo , População Branca/genética
3.
Hum Genomics ; 4(6): 406-10, 2010 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-20846930

RESUMO

The cytosine-guanine (CpG) dinucleotide has long been known to be a hotspot for pathological mutation in the human genome. This hypermutability is related to its role as the major site of cytosine methylation with the attendant risk of spontaneous deamination of 5-methylcytosine (5mC) to yield thymine. Cytosine methylation, however, also occurs in the context of CpNpG sites in the human genome, an unsurprising finding since the intrinsic symmetry of CpNpG renders it capable of supporting a semi-conservative model of replication of the methylation pattern. Recently, it has become clear that significant DNA methylation occurs in a CpHpG context (where H = A, C or T) in a variety of human somatic tissues. If we assume that CpHpG methylation also occurs in the germline, and that 5mC deamination can occur within a CpHpG context, then we might surmise that methylated CpHpG sites could also constitute mutation hotspots causing human genetic disease. To test this postulate, 54,625 missense and nonsense mutations from 2,113 genes causing inherited disease were retrieved from the Human Gene Mutation Database (http://www.hgmd.org). Some 18.2 per cent of these pathological lesions were found to be C → T and G → A transitions located in CpG dinucleotides (compatible with a model of methylation-mediated deamination of 5mC), an approximately ten-fold higher proportion than would have been expected by chance alone. The corresponding proportion for the CpHpG trinucleotide was 9.9 per cent, an approximately two-fold higher proportion than would have been expected by chance. We therefore estimate that ∼5 per cent of missense/nonsense mutations causing human inherited disease may be attributable to methylation-mediated deamination of 5mC within a CpHpG context.


Assuntos
5-Metilcitosina/metabolismo , Metilação de DNA/genética , Fosfatos de Dinucleosídeos/genética , Doenças Genéticas Inatas/genética , Mutação/genética , Repetições de Trinucleotídeos/genética , Bases de Dados de Ácidos Nucleicos , Desaminação , Humanos
4.
Hum Mutat ; 31(10): 1163-73, 2010 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-20725927

RESUMO

Nonallelic homologous recombination (NAHR) is responsible for the recurrent rearrangements that give rise to genomic disorders. Although meiotic NAHR has been investigated in multiple contexts, much less is known about mitotic NAHR despite its importance for tumorigenesis. Because type-2 NF1 microdeletions frequently result from mitotic NAHR, they represent a good model in which to investigate the features of mitotic NAHR. We have used microsatellite analysis and SNP arrays to distinguish between the various alternative recombinational possibilities, thereby ascertaining that 17 of 18 type-2 NF1 deletions, with breakpoints in the SUZ12 gene and its highly homologous pseudogene, originated via intrachromosomal recombination. This high proportion of intrachromosomal NAHR causing somatic type-2 NF1 deletions contrasts with the interchromosomal origin of germline type-1 NF1 microdeletions, whose breakpoints are located within the NF1-REPs (low-copy repeats located adjacent to the SUZ12 sequences). Further, meiotic NAHR causing type-1 NF1 deletions occurs within recombination hotspots characterized by high GC-content and DNA duplex stability, whereas the type-2 breakpoints associated with the mitotic NAHR events investigated here do not cluster within hotspots and are located within regions of significantly lower GC-content and DNA stability. Our findings therefore point to fundamental mechanistic differences between the determinants of mitotic and meiotic NAHR.


Assuntos
Cromossomos Humanos Par 17/genética , Mitose/genética , Neurofibromina 1/genética , Recombinação Genética , Deleção de Sequência , Proteínas de Transporte/genética , Biologia Computacional , Genes da Neurofibromatose 1 , Humanos , Repetições de Microssatélites/genética , Proteínas de Neoplasias , Neurofibromatose 1/genética , Proteínas Nucleares/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Complexo Repressor Polycomb 2 , Polimorfismo de Nucleotídeo Único , Fatores de Transcrição
5.
Hum Mutat ; 31(6): 742-51, 2010 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-20506354

RESUMO

Large microdeletions encompassing the neurofibromatosis type-1 (NF1) gene and its flanking regions at 17q11.2 belong to the group of genomic disorders caused by aberrant recombination between segmental duplications. The most common NF1 microdeletions (type-1) span 1.4-Mb and have breakpoints located within NF1-REPs A and C, low-copy repeats (LCRs) containing LRRC37-core duplicons. We have identified a novel type of recurrent NF1 deletion mediated by nonallelic homologous recombination (NAHR) between the highly homologous NF1-REPs B and C. The breakpoints of these approximately 1.0-Mb ("type-3") NF1 deletions were characterized at the DNA sequence level in three unrelated patients. Recombination regions, spanning 275, 180, and 109-bp, respectively, were identified within the LRRC37B-P paralogues of NF1-REPs B and C, and were found to contain sequences capable of non-B DNA formation. Both LCRs contain LRRC37-core duplicons, abundant and highly dynamic sequences in the human genome. NAHR between LRRC37-containing LCRs at 17q21.31 is known to have mediated the 970-kb polymorphic inversions of the MAPT-locus that occurred independently in different primate species, but also underlies the syndromes associated with recurrent 17q21.31 microdeletions and reciprocal microduplications. The novel NF1 microdeletions reported here provide further evidence for the unusually high recombinogenic potential of LRRC37-containing LCRs in the human genome.


Assuntos
Deleção Cromossômica , Cromossomos Humanos Par 17/genética , Neurofibromatose 1/genética , Neurofibromina 1/genética , Adulto , Animais , Criança , Mapeamento Cromossômico , Feminino , Deleção de Genes , Humanos , Células Híbridas , Hibridização in Situ Fluorescente , Masculino , Camundongos , Recombinação Genética , Duplicações Segmentares Genômicas/genética
6.
Am J Hum Genet ; 81(6): 1201-20, 2007 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-17999360

RESUMO

Approximately 5% of patients with neurofibromatosis type 1 (NF1) exhibit gross deletions that encompass the NF1 gene and its flanking regions. The breakpoints of the common 1.4-Mb (type 1) deletions are located within low-copy repeats (NF1-REPs) and cluster within a 3.4-kb hotspot of nonallelic homologous recombination (NAHR). Here, we present the first comprehensive breakpoint analysis of type 2 deletions, which are a second type of recurring NF1 gene deletion. Type 2 deletions span 1.2 Mb and are characterized by breakpoints located within the SUZ12 gene and its pseudogene, which closely flank the NF1-REPs. Breakpoint analysis of 13 independent type 2 deletions did not reveal any obvious hotspots of NAHR. However, an overrepresentation of polypyrimidine/polypurine tracts and triplex-forming sequences was noted in the breakpoint regions that could have facilitated NAHR. Intriguingly, all 13 type 2 deletions identified so far are characterized by somatic mosaicism, which indicates a positional preference for mitotic NAHR within the NF1 gene region. Indeed, whereas interchromosomal meiotic NAHR occurs between the NF1-REPs giving rise to type 1 deletions, NAHR during mitosis appears to occur intrachromosomally between the SUZ12 gene and its pseudogene, thereby generating type 2 deletions. Such a clear distinction between the preferred sites of mitotic versus meiotic NAHR is unprecedented in any other genomic disorder induced by the local genomic architecture. Additionally, 12 of the 13 mosaic type 2 deletions were found in females. The marked female preponderance among mosaic type 2 deletions contrasts with the equal sex distribution noted for type 1 and/or atypical NF1 deletions. Although an influence of chromatin structure was strongly suspected, no sex-specific differences in the methylation pattern exhibited by the SUZ12 gene were apparent that could explain the higher rate of mitotic recombination in females.


Assuntos
Mitose/genética , Neurofibromatose 1/genética , Neurofibromina 1/deficiência , Neurofibromina 1/genética , Deleção de Sequência , Adolescente , Adulto , Proteínas de Transporte/genética , Cromossomos Humanos Par 17 , Feminino , Humanos , Masculino , Mosaicismo , Proteínas de Neoplasias , Proteínas Nucleares/genética , Complexo Repressor Polycomb 2 , Recombinação Genética , Caracteres Sexuais , Fatores de Transcrição
7.
Chromosome Res ; 17(4): 469-83, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19475482

RESUMO

Submicroscopic inversions have contributed significantly to the genomic divergence between humans and chimpanzees over evolutionary time. Those microinversions which are flanked by segmental duplications (SDs) are presumed to have originated via non-allelic homologous recombination between SDs arranged in inverted orientation. However, the nature of the mechanisms underlying those inversions which are not flanked by SDs remains unclear. We have investigated 35 such inversions, ranging in size from 51-nt to 22056-nt, with the goal of characterizing the DNA sequences in the breakpoint-flanking regions. Using the macaque genome as an outgroup, we determined the lineage specificity of these inversions and noted that the majority (N = 31; 89%) were associated with deletions (of length between 1-nt and 6754-nt) immediately adjacent to one or both inversion breakpoints. Overrepresentations of both direct and inverted repeats, >or= 6-nt in length and capable of non-B DNA structure formation, were noted in the vicinity of breakpoint junctions suggesting that these repeats could have contributed to double strand breakage. Inverted repeats capable of cruciform structure formation were also found to be a common feature of the inversion breakpoint-flanking regions, consistent with these inversions having originated through the resolution of Holliday junction-like cruciforms. Sequences capable of non-B DNA structure formation have previously been implicated in promoting gross deletions and translocations causing human genetic disease. We conclude that non-B DNA forming sequences may also have promoted the occurrence of mutations in an evolutionary context, giving rise to at least some of the inversion/deletions which now serve to distinguish the human and chimpanzee genomes.


Assuntos
Inversão Cromossômica , DNA Cruciforme/genética , Genoma Humano , Sequências Repetidas Invertidas/genética , Pan troglodytes/genética , Animais , Sequência de Bases , Quebra Cromossômica , Biologia Computacional/métodos , Evolução Molecular , Humanos , Modelos Genéticos , Dados de Sequência Molecular , Recombinação Genética , Reprodutibilidade dos Testes , Análise de Sequência de DNA
8.
Hum Mutat ; 30(11): E956-73, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19634183

RESUMO

Mutational analysis of the GNPTAB gene was performed in 46 apparently unrelated patients with mucolipidosis IIalpha/beta or IIIalpha/beta, characterized by the mistargeting of multiple lysosomal enzymes as a consequence of a UDP-GlcNAc-1-phosphotransferase defect. The GNPTAB mutational spectrum comprised 25 distinct mutant alleles, 22 of which were novel, including 3 nonsense mutations (p.Q314X, p.R375X, p.Q507X), 5 missense mutations (p.I403T, p.C442Y, p.C461G, p.Q926P, p.L1001P), 6 microduplications (c.749dupA, c.857dupA, c.1191_1194dupGCTG, c.1206dupT, c.1331dupG, c.2220_2221dupGA) and 8 microdeletions (c.755_759delCCTCT, c.1399delG, c.1959_1962delTAGT, c.1965delC, c.2550_2554delGAAAA, c.3443_3446delTTTG, c.3487_3490delACAG, c.3523_3529delATGTTCC). All micro-duplications/deletions were predicted to result in the premature termination of translation. A novel exonic SNP (c.303G>A; E101E) was identified which is predicted to create an SFRS1 (SF2/ASF) binding site that may be of potential functional/clinical relevance. This study of mutations in the GNPTAB gene, the largest yet reported, extends our knowledge of the mutational heterogeneity evident in MLIIalpha/beta/MLIIIalpha/beta.


Assuntos
Mucolipidoses/genética , Mutação , Transferases (Outros Grupos de Fosfato Substituídos)/genética , Adolescente , Adulto , Animais , Células COS , Criança , Pré-Escolar , Chlorocebus aethiops , Códon sem Sentido , Análise Mutacional de DNA , Estudos de Associação Genética , Genótipo , Humanos , Lactente , Mutação de Sentido Incorreto , Deleção de Sequência
9.
Hum Mutat ; 30(6): 978-84, 2009 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-19370764

RESUMO

Mucolipidosis type III (MLIII) is an autosomal recessive disorder affecting lysosomal hydrolase trafficking. In a study of 10 patients from seven families with a clinical phenotype and enzymatic diagnosis of MLIII, six novel GNPTG gene mutations were identified. These included missense (p.T286M) and nonsense (p.W111X) mutations and a transition in the obligate AG-dinucleotide of the intron 8 acceptor splice site (c.610-2A>G). Three microdeletions were also identified, two of which (c.611delG and c.640_667del28) were located within the coding region whereas one (c.609+28_610-16del) was located entirely within intron 8. RT-PCR analysis of the c.610-2A>G transition demonstrated that the change altered splicing, leading to the production of two distinct aberrantly spliced forms, viz. the skipping of exon 9 (p.G204_K247del) or the retention of introns 8 and 9 (p.G204VfsX28). RT-PCR analysis, performed on a patient homozygous for the intronic deletion (c.609+28_610-16del), failed to detect any GNPTG RNA transcripts. To determine whether c.609+28_610-16del allele-derived transcripts were subject to nonsense-mediated mRNA decay (NMD), patient fibroblasts were incubated with the protein synthesis inhibitor anisomycin. An RT-PCR fragment retaining 43 bp of intron 8 was consistently detected suggesting that the 33-bp genomic deletion had elicited NMD. Quantitative real-time PCR and GNPTG western blot analysis confirmed that the homozygous microdeletion p.G204VfsX17 had elicited NMD resulting in failure to synthesize GNPTG protein. Analysis of the sequences surrounding the microdeletion breakpoints revealed either intrinsic repetitivity of the deleted region or short direct repeats adjacent to the breakpoint junctions. This is consistent with these repeats having mediated the microdeletions via replication slippage and supports the view that the mutational spectrum of the GNPTG gene is strongly influenced by the properties of the local DNA sequence environment.


Assuntos
Mucolipidoses/enzimologia , Mucolipidoses/genética , Mutação/genética , Subunidades Proteicas/genética , Transferases (Outros Grupos de Fosfato Substituídos)/genética , Adolescente , Adulto , Processamento Alternativo/genética , Sequência de Bases , Criança , Códon sem Sentido/genética , Feminino , Fibroblastos/enzimologia , Fibroblastos/patologia , Genótipo , Humanos , Masculino , Dados de Sequência Molecular , Mutação de Sentido Incorreto/genética , Sítios de Splice de RNA/genética , Estabilidade de RNA/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Deleção de Sequência
10.
Hum Mutat ; 29(8): 1037-47, 2008 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-18454449

RESUMO

Nonsense mutations account for approximately 11% of all described gene lesions causing human inherited disease and approximately 20% of disease-associated single-basepair substitutions affecting gene coding regions. Pathological nonsense mutations resulting in TGA (38.5%), TAG (40.4%), and TAA (21.1%) occur in different proportions to naturally occurring stop codons. Of the 23 different nucleotide substitutions giving rise to nonsense mutations, the most frequent are CGA --> TGA (21%; resulting from methylation-mediated deamination) and CAG --> TAG (19%). The differing nonsense mutation frequencies are largely explicable in terms of variable nucleotide substitution rates such that it is unnecessary to invoke differential translational termination efficiency or differential codon usage. Some genes are characterized by numerous nonsense mutations but relatively few if any missense mutations (e.g., CHM) whereas other genes exhibit many missense mutations but few if any nonsense mutations (e.g., PSEN1). Genes in the latter category have a tendency to encode proteins characterized by multimer formation. Consistent with the operation of a clinical selection bias, genes exhibiting an excess of nonsense mutations are also likely to display an excess of frameshift mutations. Tumor suppressor (TS) genes exhibit a disproportionate number of nonsense mutations while most mutations in oncogenes are missense. A total of 12% of somatic nonsense mutations in TS genes were found to occur recurrently in the hypermutable CpG dinucleotide. In a comparison of somatic and germline mutational spectra for 17 TS genes, approximately 43% of somatic nonsense mutations had counterparts in the germline (rising to 98% for CpG mutations). Finally, the proportion of disease-causing nonsense mutations predicted to elicit nonsense-mediated mRNA decay (NMD) is significantly higher (P=1.56 x 10(-9)) than among nonobserved (potential) nonsense mutations, implying that nonsense mutations that elicit NMD are more likely to come to clinical attention.


Assuntos
Códon sem Sentido , Bases de Dados Genéticas , Doenças Genéticas Inatas , Códon , Humanos , Estabilidade de RNA
11.
BMC Evol Biol ; 7: 51, 2007 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-17407546

RESUMO

BACKGROUND: Evolutionary processes in gene regulatory regions are major determinants of organismal evolution, but exceptionally challenging to study. We explored the possibilities of evolutionary analysis of phylogenetic footprints in 5'-noncoding sequences (NCS) from 27 ribulose-1,5-bisphosphate carboxylase small subunit (rbcS) genes, from three dicot families (Brassicaceae, Fabaceae and Solanaceae). RESULTS: Sequences of up to 400 bp encompassing proximal promoter and 5'-untranslated regions were analyzed. We conducted phylogenetic footprinting by several alternative methods: generalized Lempel-Ziv complexity (CLZ), multiple alignments with DIALIGN and ALIGN-M, and the MOTIF SAMPLER Gibbs sampling algorithm. These tools collectively defined 36 conserved blocks of mean length 12.8 bp. On average, 12.5 blocks were found in each 5'-NCS. The blocks occurred in arrays whose relative order was absolutely conserved, confirming the existence of 'conserved modular arrays' in promoters. Identities of half of the blocks confirmed past rbcS research, including versions of the I-box, G-box, and GT-1 sites such as Box II. Over 90% of blocks overlapped DNase-protected regions in tomato 5'-NCS. Regions characterized by low CLZ in sliding-window analyses were also frequently associated with DNase-protection. Blocks could be assigned to evolutionary hierarchies based on taxonomic distribution and estimated age. Lineage divergence dates implied that 13 blocks found in all three plant families were of Cretaceous antiquity, while other family-specific blocks were much younger. Blocks were also dated by formation of multigene families, using genome and coding sequence information. Dendrograms of evolutionary relations of the 5'-NCS were produced by several methods, including: cluster analysis using pairwise CLZ values; evolutionary trees of DIALIGN sequence alignments; and cladistic analysis of conserved blocks. CONCLUSION: Dicot 5'-NCS contain conserved modular arrays of recurrent sequence blocks, which are coincident with functional elements. These blocks are amenable to evolutionary interpretation as hierarchies in which ancient, taxonomically widespread blocks can be distinguished from more recent, taxon-specific ones.


Assuntos
Brassicaceae/genética , DNA de Plantas/genética , Fabaceae/genética , Genes de Plantas , Ribulose-Bifosfato Carboxilase/genética , Solanaceae/genética , Regiões 5' não Traduzidas/genética , Brassicaceae/enzimologia , Pegada de DNA , Evolução Molecular , Fabaceae/enzimologia , Filogenia , Regiões Promotoras Genéticas , Solanaceae/enzimologia
12.
Hum Mutat ; 26(3): 205-13, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16086312

RESUMO

In the Human Gene Mutation Database (www.hgmd.org), microdeletions and microinsertions causing inherited disease (both defined as involving < or = 20 bp of DNA) account for 8,399 (17%) and 3,345 (7%) logged mutations, in 940 and 668 genes, respectively. A positive correlation was noted between the microdeletion and microinsertion frequencies for 564 genes for which both microdeletions and microinsertions are reported in HGMD, consistent with the view that the propensity of a given gene/sequence to undergo microdeletion is related to its propensity to undergo microinsertion. While microdeletions and microinsertions of 1 bp constitute respectively 48% and 66% of the corresponding totals, the relative frequency of the remaining lesions correlates negatively with the length of the DNA sequence deleted or inserted. Many of the microdeletions and microinsertions of more than 1 bp are potentially explicable in terms of slippage mutagenesis, involving the addition or removal of one copy of a mono-, di-, or trinucleotide tandem repeat. The frequency of in-frame 3-bp and 6-bp microinsertions and microdeletions was, however, found to be significantly lower than that of mutations of other lengths, suggesting that some of these in-frame lesions may not have come to clinical attention. Various sequence motifs were found to be over-represented in the vicinity of both microinsertions and microdeletions, including the heptanucleotide CCCCCTG that shares homology with the complement of the 8-bp human minisatellite conserved sequence/chi-like element (GCWGGWGG). The previously reported indel hotspot GTAAGT and its complement ACTTAC were also found to be overrepresented in the vicinity of both microinsertions and microdeletions, thereby providing a first example of a mutational hotspot that is common to different types of gene lesion. Other motifs overrepresented in the vicinity of microdeletions and microinsertions included DNA polymerase pause sites and topoisomerase cleavage sites. Several novel microdeletion/microinsertion hotspots were noted and some of these exhibited sufficient similarity to one another to justify terming them "super-hotspot" motifs. Analysis of sequence complexity also demonstrated that a combination of slipped mispairing mediated by direct repeats, and secondary structure formation promoted by symmetric elements, can account for the majority of microdeletions and microinsertions. Thus, microinsertions and microdeletions exhibit strong similarities in terms of the characteristics of their flanking DNA sequences, implying that they are generated by very similar underlying mechanisms.


Assuntos
Doenças Genéticas Inatas/genética , Mutagênese , Análise de Sequência de DNA/métodos , Biologia Computacional/métodos , DNA Polimerase Dirigida por DNA/genética , Bases de Dados Genéticas , Deleção de Genes , Variação Genética , Humanos , Mutação , Sequências Repetitivas de Ácido Nucleico
13.
Hum Mutat ; 21(1): 28-44, 2003 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-12497629

RESUMO

A relatively rare type of mutation causing human genetic disease is the indel, a complex lesion that appears to represent a combination of micro-deletion and micro-insertion. In the absence of meta-analytical studies of indels, the mutational mechanisms underlying indel formation remain unclear. Data from the Human Gene Mutation Database (HGMD) were therefore used to compare and contrast 211 different indels underlying genetic disease in an attempt to deduce the processes responsible for their genesis. Each indel was treated as if it were the result of a two-step insertion/deletion process and was assessed in the context of 10 base-pairs DNA sequence flanking the lesion on either side. Several indel hotspots were noted and a GTAAGT motif was found to be significantly over-represented in the vicinity of the indels studied. Previously postulated mechanisms underlying micro-deletions and micro-insertions were initially explored in terms of local DNA sequence regularity as measured by its complexity. The change in complexity consequent to a mutation was found to be indicative of the type of repeat sequence involved in mediating the event, thereby providing clues as to the underlying mutational mechanism. Complexity analysis was then employed to examine the possible intermediates through which each indel could have occurred and to propose likely mechanisms and pathways for indel generation on an individual basis. Manual analysis served to confirm that the majority of indels (>90%) are explicable in terms of a two-step process involving established mutational mechanisms. Indels equivalent to double base-pair substitutions (22% of the total) were found to be mechanistically indistinguishable from the remainder and may therefore be regarded as a special type of indel. The observed correspondence between changes in local DNA sequence complexity and the involvement of specific mutational mechanisms in the insertion/deletion process, and the ability of generated models to account for both the number and identity of the bases deleted and/or inserted, makes this approach invaluable not only for the analysis of indel formation, but also for the study of other types of complex lesion.


Assuntos
Doenças Genéticas Inatas/genética , Predisposição Genética para Doença , Mutagênese , Mutação , Sequência de Bases , Análise Mutacional de DNA , Humanos , Modelos Genéticos , Mutagênese Insercional , Conformação de Ácido Nucleico , Sequências Repetitivas de Ácido Nucleico , Deleção de Sequência
14.
Evolution ; 56(2): 224-32, 2002 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-11926491

RESUMO

Complexity analysis is capable of highlighting those gross evolutionary changes in gene promoter regions (loosely termed "promoter shuffling") that are undetectable by conventional DNA sequence alignment. Complexity analysis was therefore used here to identify the modular components (blocks) of the orthologous beta-globin gene promoter sequences of 22 vertebrate species, from zebrafish to humans. Considerable variation between the beta-globin gene promoters was apparent in terms of block presence/absence, copy number, and relative location. Some sequence blocks appear to be ubiquitous, whereas others are restricted to a specific taxon. Block similarities were also evident between the promoters of the paralogous human beta-like globin genes. It may be inferred that a wide variety of different mutational mechanisms have operated upon the beta-globin gene promoter over evolutionary time. Because these include gross changes such as deletion, duplication, amplification, elongation, contraction, and fusion, as well as the steady accumulation of single base-pair substitutions, it is clear that some redefinition of the term "promoter shuffling" is required. This notwithstanding, and as previously described for the vertebrate growth hormone gene promoter, the modular structure of the beta-globin promoter region and those of its paralogous counterparts have continually been rearranged into new combinations through the alteration, or shuffling, of preexisting blocks. Some of these changes may have had no influence on promoter function, but others could have altered either the level of gene expression or the responsiveness of the promoter to external stimuli. The comparative study of vertebrate beta-globin gene promoter regions described here confirms the generality of the phenomenon of sequence block shuffling and thus supports the view that it could have played an important role in the evolution of differential gene expression.


Assuntos
Globinas/genética , Regiões Promotoras Genéticas , Vertebrados/classificação , Vertebrados/genética , Sequência de Aminoácidos , Animais , Sequência de Bases , Humanos , Mamíferos/genética , Dados de Sequência Molecular , Fragmentos de Peptídeos/química , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Especificidade da Espécie , TATA Box
15.
Curr Protoc Bioinformatics ; Chapter 1: Unit 1.13, 2006 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18428754

RESUMO

The Human Gene Mutation Database (HGMD) constitutes a comprehensive core collection of data on germ-line mutations in nuclear genes underlying or associated with human inherited disease (http://www.hgmd.org). Data cataloged include single base-pair substitutions in coding, regulatory, and splicing-relevant regions, microdeletions and microinsertions, indels, and triplet repeat expansions, as well as gross gene deletions, insertions, duplications, and complex rearrangements. Each mutation is entered into HGMD only once, in order to avoid confusion between recurrent and identical-by-descent lesions. By June 2005, the database contained in excess of 53,000 different lesions detected in 2029 different nuclear genes, with new entries currently accumulating at a rate in excess of 5000 per annum. HGMD includes cDNA reference sequences, now provided for more than 90% of the listed genes, splice junction data, disease-associated and functional polymorphisms, and links to data present in publicly available online locus-specific mutation databases.


Assuntos
Análise Mutacional de DNA/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Genoma Humano/genética , Mutação/genética , Polimorfismo de Nucleotídeo Único/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Gráficos por Computador , Humanos , Armazenamento e Recuperação da Informação , Dados de Sequência Molecular , Interface Usuário-Computador
16.
Appl Environ Microbiol ; 72(9): 5734-41, 2006 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16957188

RESUMO

A new computer program, called Mallard, is presented for screening entire 16S rRNA gene libraries of up to 1,000 sequences for chimeras and other artifacts. Written in the Java computer language and capable of running on all major operating systems, the program provides a novel graphical approach for visualizing phylogenetic relationships among 16S rRNA gene sequences. To illustrate its use, we analyzed most of the large libraries of cloned bacterial 16S rRNA gene sequences submitted to the public repository during 2005. Defining a large library as one containing 100 or more sequences of 1,200 bases or greater, we screened 25 of the 28 libraries and found that all but three contained substantial anomalies. Overall, 543 anomalous sequences were found. The average anomaly content per clone library was 9.0%, 4% higher than that previously estimated for the public repository overall. In addition, 90.8% of anomalies had characteristic chimeric patterns, a rise of 25.4% over that found previously. One library alone was found to contain 54 chimeras, representing 45.8% of its content. These figures far exceed previous estimates of artifacts within public repositories and further highlight the urgent need for all researchers to adequately screen their libraries prior to submission. Mallard is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.


Assuntos
Biblioteca Gênica , RNA Ribossômico 16S/genética , Software , Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , Sequência de Bases , Quimera/genética , Clonagem Molecular , DNA Arqueal/genética , DNA Bacteriano/genética , DNA Ribossômico/genética , Biologia Marinha , Dados de Sequência Molecular , RNA Arqueal/genética , RNA Bacteriano/genética
17.
In Silico Biol ; 6(1-2): 23-34, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16789908

RESUMO

DNA sequence features were sought that could be used for the in silico ascertainment of the likely functional consequences of single nucleotide changes in human gene promoter regions. To identify relevant features of the local DNA sequence context, we transformed into consensus tables the nucleotide composition of sequences flanking 101 promoter SNPs of type C<-->T or A<-->G, defined empirically as being either 'functional' or 'non-functional' on the basis of a standardised reporter gene assay. The similarity of a given sequence to these consensus tables was then measured by means of the Shapiro-Senapathy score. A decision rule with the potential to discriminate between empirically ascertained functional and non-functional SNPs was proposed that potentiated discrimination between functional and non-functional SNPs with a sensitivity of 80% and a specificity of 20%. Two further datasets (viz. disease-associated SNPs of types A<-->G and C<-->T (N = 75) and pathological promoter mutations (transitions, N = 114)) were retrieved from the Human Gene Mutation Database (HGMD; http://www.hgmd.org/) and analyzed using consensus tables derived from the functional and non-functional promoter SNPs; approximately 70% were correctly recognized as being of probable functional significance. Complexity analysis was also used to quantify the regularity of the local DNA sequence environment. Functional SNPs/mutations of type C<-->T were found to occur in DNA regions characterized by lower average sequence complexity as measured with respect to symmetric elements; complexity values increased gradually from functional SNPs and pathological mutations to functional disease-associated SNPs and non-functional SNPs. This may reflect the internal axial symmetry that frequently characterizes transcription factor binding sites.


Assuntos
Algoritmos , Predisposição Genética para Doença , Mutação , Polimorfismo de Nucleotídeo Único , Regiões Promotoras Genéticas , Análise de Sequência de DNA/métodos , Sequência de Bases , Sítios de Ligação , Bases de Dados de Ácidos Nucleicos , Genoma Humano , Humanos , Fatores de Transcrição/genética
18.
Am J Med Genet A ; 140(1): 62-9, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16333843

RESUMO

The SFTPB gene indel g.1549C > GAA (121ins2) accounts for about 2/3 of the mutant alleles underlying complete surfactant protein B deficiency. It is unclear, however, whether its prevalence is due to recurrent mutation or a founder effect. The underlying mutational mechanism was therefore sought through the analysis of local DNA sequence complexity. A relatively complex two-step process was proposed: the first step involving slipped mispairing mediated by a direct repeat and generating an AGAA micro-insertion, the second step involving hairpin loop resolution resulting in a CA micro-deletion. The possibility of a founder effect was then assessed by typing 8 intragenic SNPs in 17 independent 121ins2 chromosomes from 10 probands, with parental non-121ins2 chromosomes serving as controls. The 121ins2 chromosomes were assigned to three discrete haplotypes, whilst control chromosomes were distributed between 10 of the 11 observed parental haplotypes. The 121ins2 mutation was in strong and significant linkage disequilibrium (LD) with the tightly linked marker g.1580T/C (|D'| = 1; P approximately 0.024), although only moderate LD was found with the rest of the locus (|D'| approximately 0.54; P approximately 0.136). Data on haplotype structure and the locus LD pattern, obtained from 81 independent Western-European chromosomes, were consistent with the three mutation-bearing haplotypes having originated from a common ancestor by recombination. Interestingly, all families harboring the 121ins2 indel had ancestors from a region of Northwestern Europe populated by Frankish/Saxon migration. Taken together, these data are consistent with the view that an indel mutation occurred on a relatively common SFTPB haplotype and now accounts for the majority of (and possibly all) extant 121ins2 chromosomes.


Assuntos
Mutagênese Insercional , Proteína B Associada a Surfactante Pulmonar/deficiência , Proteína B Associada a Surfactante Pulmonar/genética , Sequência de Bases , Europa (Continente) , Feminino , Efeito Fundador , Frequência do Gene , Genótipo , Haplótipos/genética , Humanos , Íntrons/genética , Desequilíbrio de Ligação , Masculino , Dados de Sequência Molecular , Mutação , Polimorfismo de Nucleotídeo Único
19.
Appl Environ Microbiol ; 71(12): 7724-36, 2005 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-16332745

RESUMO

A new method for detecting chimeras and other anomalies within 16S rRNA sequence records is presented. Using this method, we screened 1,399 sequences from 19 phyla, as defined by the Ribosomal Database Project, release 9, update 22, and found 5.0% to harbor substantial errors. Of these, 64.3% were obvious chimeras, 14.3% were unidentified sequencing errors, and 21.4% were highly degenerate. In all, 11 phyla contained obvious chimeras, accounting for 0.8 to 11% of the records for these phyla. Many chimeras (43.1%) were formed from parental sequences belonging to different phyla. While most comprised two fragments, 13.7% were composed of at least three fragments, often from three different sources. A separate analysis of the Bacteroidetes phylum (2,739 sequences) also revealed 5.8% records to be anomalous, of which 65.4% were apparently chimeric. Overall, we conclude that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt. Our results support concerns recently expressed over the quality of the public repositories. With 16S rRNA sequence data increasingly playing a dominant role in bacterial systematics and environmental biodiversity studies, it is vital that steps be taken to improve screening of sequences prior to submission. To this end, we have implemented our method as a program with a simple-to-use graphic user interface that is capable of running on a range of computer platforms. The program is called Pintail, is released under the terms of the GNU General Public License open source license, and is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.


Assuntos
Bacteroides/genética , Bases de Dados de Ácidos Nucleicos/normas , RNA Ribossômico 16S/química , Bacteroides/classificação , Sequência de Bases , RNA Ribossômico 16S/genética , Reprodutibilidade dos Testes , Deleção de Sequência , Software
20.
Hum Genet ; 112(3): 303-9, 2003 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-12545275

RESUMO

Transfer of nucleic acid from cytoplasmic organelles to the nuclear genome is a well-established mechanism of evolutionary change in eukaryotes. Such transfers have occurred throughout evolution, but so far, none has been shown unequivocally to occur de novo to cause a heritable human disease. We have characterized a patient with a de novo nucleic acid transfer from the mitochondrial to the nuclear genome, a transfer that is responsible for a sporadic case of Pallister-Hall syndrome, a condition usually inherited in an autosomal dominant fashion. This mutation, a 72-bp insertion into exon 14 of the GLI3 gene, creates a premature stop codon and predicts a truncated protein product. Both the mechanism and the cause of the mitochondrial-nuclear transfer are unknown. Although the conception of this patient was temporally and geographically associated with high-level radioactive contamination following the Chernobyl accident, this case cannot, on its own, be used to establish a causal relationship between radiation exposure and this rare type of mutation. Thus, for the time being, it must be considered as an intriguing coincidence. Nevertheless, these data serve to demonstrate that de novo mitochondrial-nuclear transfer of nucleic acid is a novel mechanism of human inherited disease.


Assuntos
Núcleo Celular/genética , DNA Mitocondrial/genética , Proteínas de Ligação a DNA/genética , DNA/genética , Doenças Genéticas Inatas/genética , Proteínas do Tecido Nervoso/genética , Proteínas Repressoras , Fatores de Transcrição/genética , Proteínas de Xenopus , Adolescente , Alelos , Sequência de Aminoácidos , Sequência de Bases , DNA Complementar/análise , Hamartoma/genética , Humanos , Doenças Hipotalâmicas/genética , Fatores de Transcrição Kruppel-Like , Masculino , Dados de Sequência Molecular , Linhagem , Polidactilia/genética , Reação em Cadeia da Polimerase , Análise de Sequência de DNA , Síndrome , Proteína Gli3 com Dedos de Zinco
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa