Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Publication year range
1.
Hum Mutat ; 23(2): 205, 2004 Feb.
Article in English | MEDLINE | ID: mdl-14722925

ABSTRACT

Mutations in the serine protease inhibitor Kazal type 1 gene (SPINK1) encoding pancreatic secretory trypsin inhibitor (PSTI) have recently been found to be associated with chronic pancreatitis. Nevertheless, knowledge of severe mutations is particularly scarce, both in terms of number and in the extent of clinical information. The aim of this study was to expand the known spectrum of such mutations. 46 unrelated families, each including at least two pancreatitis patients and carrying neither cationic trypsinogen (PRSS1) mutations nor the frequent SPINK1 N34S mutation, participated in this study. The four exons and their flanking sequences of the SPINK1 gene were screened by denaturing high performance liquid chromatography analysis (DHPLC); and mutations were identified by direct sequencing. A heterozygous microdeletion mutation (c.27delC), which occurs within a symmetric element, was identified in two families. In one family, c.27delC showed segregation with the disease across two generations, with a penetrance of up to 75%. But in the other family, however, the same mutation manifested as a low-penetrance susceptibility factor. In addition, a novel heterozygous splicing mutation, c.87+1G>A (G>A substitution at nucleotide +1 of intron 2) was found in one family with familial pancreatitis. Our results also helped to resolve the sharply differing views about PSTI's role in pancreatitis.


Subject(s)
Mutation/genetics , Pancreatitis/genetics , Trypsin Inhibitor, Kazal Pancreatic/genetics , Adult , Child , Cohort Studies , Exons/genetics , Female , Gene Deletion , Heterozygote , Humans , Male , Middle Aged , Pedigree , RNA Splice Sites/genetics
2.
Mol Biol (Mosk) ; 35(6): 1015-22, 2001.
Article in Russian | MEDLINE | ID: mdl-11771125

ABSTRACT

It is now common practice to retrieve, by key words, highly specialized selections of sequences from general-purpose databases such as EMBL, GenBank, etc. The sequences included in a selection are often interconnected, which means that there are duplications, embeddings, intersections, homology, common structural elements. Knowledge of these interconnections is necessary for further processing of the sequences. We propose a rapid (single scan) method for identification of such interconnections by means of complexity analysis that generalizes the Lempel-Ziv approach. Analysis of a selection of 5'-flanking regions of vertebrate growth hormone genes from EMBL is presented as an example.


Subject(s)
Evolution, Molecular , Databases, Nucleic Acid , Repetitive Sequences, Nucleic Acid
3.
Hum Genet ; 107(4): 362-5, 2000 Oct.
Article in English | MEDLINE | ID: mdl-11129336

ABSTRACT

No general rules have been proposed to account for the functional consequences of gene regulatory mutations. In a first attempt to establish the nature of such rules, an analysis was performed of the DNA sequence context of 153 different single base-pair substitutions in the regulatory regions of 65 different human genes underlying inherited disease. Use of a recently proposed measure of DNA sequence complexity (taking into account the level of structural repetitiveness of a DNA sequence, rather than simply the oligonucleotide composition) has served to demonstrate that the concomitant change in local DNA sequence complexity surrounding a substituted nucleotide is related to the likelihood of a regulatory mutation coming to clinical attention. Mutations that led to an increase in complexity exhibited higher odds ratios in favour of pathological consequences than mutations that led to a decrease or left complexity unchanged. This relationship, however, was discernible only for pyrimidine-to-purine transversions. Odds ratios for other types of substitution were not found to be significantly associated with local changes in sequence complexity, even though a trend similar to that observed for Y-->R transversions was also apparent for transitions. These findings suggest that the maintenance of a defined level of DNA sequence complexity, or at least the avoidance of an increase in sequence complexity, is a critical prerequisite for the function of gene regulatory regions.


Subject(s)
DNA/genetics , Genes, Regulator , Mutation , Base Sequence , DNA Mutational Analysis , Humans , Odds Ratio , Phenotype , Sequence Analysis, DNA
4.
Gene ; 254(1-2): 9-18, 2000 Aug 22.
Article in English | MEDLINE | ID: mdl-10974531

ABSTRACT

Comparative studies of vertebrate gene promoter regions seldom detect gross rearrangements ('promoter shuffling') since such analyses usually employ relatively similar DNA sequences. Conversely, attempts to compare evolutionarily more divergent promoter sequences have been largely unsuccessful owing to the inability of conventional alignment procedures to deal with gross rearrangements. These limitations have been circumvented in the present study by using the novel technique of complexity analysis to identify modular components ('blocks') in the growth hormone (GH) gene promoter sequences of some 22 vertebrate species, from salmon to human. Significant rearrangement of blocks was found to have occurred, indicating that they have evolved as independent units. Some blocks appear to be ubiquitous, whereas others are restricted to a specific taxon. Considerable variation between orthologous GH gene promoters was apparent in terms of block length, copy number and relative location. It may be inferred that a wide variety of different mutational mechanisms have operated upon the GH gene promoter over evolutionary time. These include gross changes such as deletion, duplication, amplification, elongation, contraction, transposition, inversion and fusion, as well as the slow, steady accumulation of single base-pair substitutions. Thus the patchwork structure of the modular GH promoter region, and those of its paralogous GH2 and prolactin (PRL) counterparts, have continually been shuffled into new combinations through the rearrangement of pre-existing blocks. Although some of these changes may have had no influence on promoter function, others could have served to alter either the level of gene expression or the responsiveness of the promoter to external stimuli.


Subject(s)
Evolution, Molecular , Growth Hormone/genetics , Promoter Regions, Genetic/genetics , Animals , Base Sequence , Binding Sites , Conserved Sequence , DNA/genetics , DNA/metabolism , Databases, Factual , Gene Rearrangement , Humans , Molecular Sequence Data , Sequence Alignment , Species Specificity , Transcription Factors/metabolism , Transcription, Genetic , Vertebrates
5.
Hum Genet ; 107(5): 458-65, 2000 Nov.
Article in English | MEDLINE | ID: mdl-11140943

ABSTRACT

Regulatory DNA elements responsible for human protein C (PROC) gene expression have previously been identified in the upstream promoter region and first (untranslated) exon of the gene. Here we show that an additional sequence element located more than 500 bp downstream of the core promoter within intron 1 further enhances PROC promoter-driven reporter gene expression in human hepatoma cells. In common with core promoter constructs used in previous studies, the activity of this 3'-extended regulatory region is diminished by a naturally occurring promoter mutation. However, in contrast to constructs lacking intronic sequence, the promoter/intron regulatory region is repressed rather than activated by the transcription factor HNF-1. Using both conventional alignment procedures and complexity analysis to study the human and canine PROC sequences, we identified two conserved intronic regions, which were tested for their involvement in gene regulation. High-level gene expression from the intron-coupled promoter was dependent upon the integrity of a 142 bp sequence element, a duplicate copy of which is located in an upstream region of the PROC gene that possesses enhancer activity. These findings emphasise the potential importance of intragenic sequences for gene regulation and serve to illustrate that the results of PROC promoter/reporter gene experiments are critically dependent upon the sequence context. The identification of such intragenic elements is relevant to the analysis of human genetic disease since it will facilitate the detection and functional evaluation of regulatory mutations and polymorphisms.


Subject(s)
DNA-Binding Proteins , Gene Expression Regulation , Introns , Nuclear Proteins , Promoter Regions, Genetic , Protein C/genetics , Regulatory Sequences, Nucleic Acid , Base Sequence , Carcinoma, Hepatocellular , Conserved Sequence , DNA/chemistry , DNA/genetics , Genes, Reporter , Hepatocyte Nuclear Factor 1 , Hepatocyte Nuclear Factor 1-alpha , Hepatocyte Nuclear Factor 1-beta , Humans , Liver Neoplasms , Luciferases/genetics , Mutagenesis, Site-Directed , Transcription Factors/metabolism , Transfection , Tumor Cells, Cultured
6.
Gene ; 237(1): 143-51, 1999 Sep 03.
Article in English | MEDLINE | ID: mdl-10524245

ABSTRACT

The evolutionary relationship between the proximal growth hormone (GH) gene promoter sequences of 12 mammalian species was explored by comparison of their trinucleotide composition and by multiple sequence alignment. Both approaches yielded results that were consistent with the known fossil record-based phylogeny of the analysed sequences, suggesting that the two methods of tree reconstruction might be equally efficient and reliable. The pattern of evolution inferred for the mammalian GH gene promoters was found to vary both temporally and spatially. Thus, two distinct regions devoid of any evolutionary changes exist in primates, but only one of these 'gaps' is also observed in rodents, and neither is seen in ruminants. Furthermore, different evolutionary rates must have prevailed during different periods of evolutionary time and in different lineages, with a dramatic increase in evolutionary rate apparent in primates. Since a similar pattern of discontinuity has been previously noted for the evolution of the GH-coding regions, it may reflect the action of positive selection operating upon the GH gene as a single cohesive unit. Strong evidence for the action of gene conversion between primate GH gene promoters is provided by the fact that the human GH1 and GH2 sequences, which are thought to have diverged before the divergence of Old World monkeys from great apes, are more similar to one another than either is to the rhesus monkey GH2 promoter. Finally, it was noted that a number of nucleotide positions in the GH1 gene promoter that are polymorphic in humans appear to be highly conserved in mammals. This apparent conundrum, which could represent a caveat for the interpretation of phylogenetic footprinting studies, is potentially explicable in terms either of reduced genetic diversity in highly inbred animal species or insufficient population data from non-human species.


Subject(s)
Evolution, Molecular , Growth Hormone/genetics , Mammals/genetics , Promoter Regions, Genetic , Animals , Base Sequence , Human Growth Hormone/genetics , Humans , Molecular Sequence Data , Polymorphism, Genetic , Rabbits , Rats , Sequence Alignment , Sequence Homology, Nucleic Acid
7.
Bioinformatics ; 15(12): 994-9, 1999 Dec.
Article in English | MEDLINE | ID: mdl-10745989

ABSTRACT

MOTIVATION: It is well known that the regulatory regions of genomes are highly repetitive. They are rich in direct, symmetric and complemented repeats, and there is no doubt about the functional significance of these repeats. Among known measures of complexity, the Ziv-Lempel complexity measure reflects most adequately repeats occurring in the text. But this measure does not take into account isomorphic repeats. By isomorphic repeats we mean fragments that are identical (or symmetric) modulo some permutation of the alphabet letters. RESULTS: In this paper, two complexity measures of symbolic sequences are proposed that generalize the Ziv-Lempel complexity measure by taking into account any isomorphic repeats in the text (rather than just direct repeats as in Ziv-Lempel). The first of them, the complexity vector, is designed for small alphabets such as the alphabet of nucleotides. The second is based on a search for the longest isomorphic fragment in the history of sequence synthesis and can be used for alphabets of arbitrary cardinality. These measures have been used for recognition of structural regularities in DNA sequences. Some interesting structures related to the regulatory region of the human growth hormone are reported.


Subject(s)
Algorithms , Models, Genetic , Sequence Analysis, DNA/methods , Base Sequence , Growth Hormone/genetics , Humans , Information Storage and Retrieval/methods , Repetitive Sequences, Nucleic Acid , Sequence Homology, Amino Acid
8.
Bioinformatics ; 14(2): 139-43, 1998.
Article in English | MEDLINE | ID: mdl-9545445

ABSTRACT

MOTIVATION: Most of the existing methods for genetic sequence classification are based on a computer search for homologies in nucleotide or amino acid sequences. The standard sequence alignment programs scale very poorly as the number of sequences increases or the degree of sequence identity is <30%. Some new computationally inexpensive methods based on nucleotide or amino acid compositional analysis have been proposed, but prediction results are still unsatisfactory and depend on the features chosen to represent the sequences. RESULTS: In this paper, a feature selection method based on the Gamma (or near-neighbour) test is proposed. If there is a continuous or smooth map from feature space to the classification target values, the Gamma test gives an estimate for the mean-squared error of the classification, despite the fact that one has no a priori knowledge of the smooth mapping. We can search a large space of possible feature combinations for a combination which gives a smallest estimated mean-squared error using a genetic algorithm. The method was used for feature selection and classification of the large subunits of rRNA according to RDP (Ribosomal Database Project) phylogenetic classes. The sequences were represented by dinucleotide frequency distribution. The nearest-neighbour criterion has been used to estimate the predictive accuracy of the classification based on the selected features. For examples discussed, we found that the classification according to the first nearest neighbour is correct for 80% of the test samples. If we consider the set of the 10 nearest neighbours, then 94% of the test samples are classified correctly. AVAILABILITY: The principal novel component of this method is the Gamma test and this can be downloaded compiled for Unix Sun 4, Windows 95 and MS-DOS from http://www.cs.cf.ac.uk/ec/ CONTACT: s.margetts@cs.cf.ac.uk


Subject(s)
Sequence Alignment/methods , Algorithms , Animals , Computational Biology , Databases, Factual , Models, Genetic , Phylogeny , RNA, Ribosomal/genetics , Sequence Alignment/statistics & numerical data , Sequence Analysis, RNA/methods , Sequence Analysis, RNA/statistics & numerical data
SELECTION OF CITATIONS
SEARCH DETAIL
...