Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
1.
PLoS Genet ; 10(11): e1004790, 2014 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-25393762

RESUMO

Recombination is an essential process in eukaryotes, which increases diversity by disrupting genetic linkage between loci and ensures the proper segregation of chromosomes during meiosis. In the human genome, recombination events are clustered in hotspots, whose location is determined by the PRDM9 protein. There is evidence that the location of hotspots evolves rapidly, as a consequence of changes in PRDM9 DNA-binding domain. However, the reasons for these changes and the rate at which they occur are not known. In this study, we investigated the evolution of human hotspot loci and of PRDM9 target motifs, both in modern and archaic human lineages (Denisovan) to quantify the dynamic of hotspot turnover during the recent period of human evolution. We show that present-day human hotspots are young: they have been active only during the last 10% of the time since the divergence from chimpanzee, starting to be operating shortly before the split between Denisovans and modern humans. Surprisingly, however, our analyses indicate that Denisovan recombination hotspots did not overlap with modern human ones, despite sharing similar PRDM9 target motifs. We further show that high-affinity PRDM9 target motifs are subject to a strong self-destructive drive, known as biased gene conversion (BGC), which should lead to the loss of the majority of them in the next 3 MYR. This depletion of PRDM9 genomic targets is expected to decrease fitness, and thereby to favor new PRDM9 alleles binding different motifs. Our refined estimates of the age and life expectancy of human hotspots provide empirical evidence in support of the Red Queen hypothesis of recombination hotspots evolution.


Assuntos
Troca Genética , Evolução Molecular , Histona-Lisina N-Metiltransferase/genética , Recombinação Genética , Animais , Cromossomos/genética , Proteínas de Ligação a DNA , Conversão Gênica , Genoma Humano , Humanos , Meiose/genética , Pan troglodytes
2.
Mol Biol Evol ; 30(6): 1409-19, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23505044

RESUMO

GC-biased gene conversion (gBGC) is a process associated with recombination that favors the transmission of GC alleles over AT alleles during meiosis. gBGC plays a major role in genome evolution in many eukaryotes. However, the molecular mechanisms of gBGC are still unknown. Different steps of the recombination process could potentially cause gBGC: the formation of double-strand breaks (DSBs), the invasion of the homologous or sister chromatid, and the repair of mismatches in heteroduplexes. To investigate these models, we analyzed a genome-wide data set of crossovers (COs) and noncrossovers (NCOs) in Saccharomyces cerevisiae. We demonstrate that the overtransmission of GC alleles is specific to COs and that it occurs among conversion tracts in which all alleles are converted from the same donor haplotype. Thus, gBGC results from a process that leads to long-patch repair. We show that gBGC is associated with longer tracts and that it is driven by the nature (GC or AT) of the alleles located at the extremities of the tract. These observations invalidate the hypotheses that gBGC is due to the base excision repair machinery or to a bias in DSB formation and suggest that in S. cerevisiae, gBGC is caused by the mismatch repair (MMR) system. We propose that the presence of nicks on both DNA strands during CO resolution could be the cause of the bias in MMR activity. Our observations are consistent with the hypothesis that gBGC is a nonadaptive consequence of a selective pressure to limit the mutation rate in mitotic cells.


Assuntos
Composição de Bases , Evolução Molecular , Conversão Gênica , Genes Fúngicos , Saccharomyces cerevisiae/genética , Haplótipos , Meiose/genética , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Recombinação Genética , Saccharomyces cerevisiae/citologia
3.
Hum Mutat ; 32(2): 198-206, 2011 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-21120948

RESUMO

Although mutations that are detrimental to the fitness of organisms are expected to be rapidly purged from populations by natural selection, some disease-causing mutations are present at high frequencies in human populations. Several nonexclusive hypotheses have been proposed to account for this apparent paradox (high new mutation rate, genetic drift, overdominance, or recent changes in selective pressure). However, the factors ultimately responsible for the presence at high frequency of disease-causing mutations are still contentious. Here we establish the existence of an additional process that contributes to the spreading of deleterious mutations: GC-biased gene conversion (gBGC), a process associated with recombination that tends to favor the transmission of GC-alleles over AT-alleles. We show that the spectrum of amino acid-altering polymorphisms in human populations exhibits the footprints of gBGC. This pattern cannot be explained in terms of selection and is evident with all nonsynonymous mutations, including those predicted to be detrimental to protein structure and function, and those implicated in human genetic disease. We present simulations to illustrate the conditions under which gBGC can extend the persistence time of deleterious mutations in a finite population. These results indicate that gBGC meiotic drive contributes to the spreading of deleterious mutations in human populations.


Assuntos
Composição de Bases , Predisposição Genética para Doença , Meiose , Recombinação Genética , Doença/genética , Frequência do Gene , Humanos , Mutação , Polimorfismo de Nucleotídeo Único
4.
Biochimie ; 90(4): 563-9, 2008 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-17988782

RESUMO

Single nucleotide polymorphisms (SNPs), which are the most abundant form of genetic variations in numerous organisms, have emerged as important tools for the study of complex genetic traits and deciphering of genome evolution. High-throughput genome sequencing projects worldwide provide an unprecedented opportunity for whole-genome SNP analysis in a variety of species. To facilitate SNP discovery in vertebrates, we have developed a web-based, user-friendly, and fully automated application, DigiPINS, for genome-wide identification of exonic SNPs from EST data. Currently, the database can be used to the mining of exonic SNPs in six complete genomes (Homo sapiens, Mus musculus, Rattus norvegicus, Canis familiaris, Gallus gallus and Danio rerio). In addition to providing information on sequence conservation, DigiPINS allows compilation of comprehensive sets of polymorphisms within cancer candidate genes or identification of novel cancer markers, making it potentially useful for cancer association studies. The DigiPINS server is available via the internet at http://pbil.univ-lyon1.fr/gem/DigiPINS/query_DigiPINS.php.


Assuntos
Bases de Dados Genéticas , Neoplasias/genética , Polimorfismo de Nucleotídeo Único , Software , Sequência de Aminoácidos , Animais , Sequência de Bases , Biomarcadores , Etiquetas de Sequências Expressas , Genoma , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Dados de Sequência Molecular , Análise de Sequência de DNA , Interface Usuário-Computador
5.
PLoS Biol ; 3(1): e4, 2005 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-15630476

RESUMO

To help understand the evolution of suppressed recombination between sex chromosomes, and its consequences for evolution of the sequences of Y-linked genes, we have studied four X-Y gene pairs, including one gene not previously characterized, in plants in a group of closely related dioecious species of Silene which have an X-Y sex-determining system (S. latifolia, S. dioica, and S. diclinis). We used the X-linked copies to build a genetic map of the X chromosomes, with a marker in the pseudoautosomal region (PAR) to orient the map. The map covers a large part of the X chromosomes--at least 50 centimorgans. Except for a recent rearrangement in S. dioica, the gene order is the same in the X chromosomes of all three species. Silent site divergence between the DNA sequences of the X and Y copies of the different genes increases with the genes' distances from the PAR, suggesting progressive restriction of recombination between the X and Y chromosomes. This was confirmed by phylogenetic analyses of the four genes, which also revealed that the least-diverged X-Y pair could have ceased recombining independently in the dioecious species after their split. Analysis of amino acid replacements vs. synonymous changes showed that, with one possible exception, the Y-linked copies appear to be functional in all three species, but there are nevertheless some signs of degenerative processes affecting the genes that have been Y-linked for the longest times. Although the X-Y system evolved quite recently in Silene (less than 10 million years ago) compared to mammals (about 320 million years ago), our results suggest that similar processes have been at work in the evolution of sex chromosomes in plants and mammals, and shed some light on the molecular mechanisms suppressing recombination between X and Y chromosomes.


Assuntos
Cromossomos de Plantas , Evolução Molecular , Silene/genética , Animais , Mapeamento Cromossômico , DNA Complementar/metabolismo , Ligação Genética , Variação Genética , Dados de Sequência Molecular , Filogenia , Polimorfismo Genético , Análise de Sequência de DNA , Cromossomos Sexuais , Cromossomo X/ultraestrutura , Cromossomo Y/ultraestrutura
6.
BMC Bioinformatics ; 8: 154, 2007 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-17504516

RESUMO

BACKGROUND: SAGE has been used widely to study the expression of known transcripts, but much less to annotate new transcribed regions. LongSAGE produces tags that are sufficiently long to be reliably mapped to a whole-genome sequence. Here we used this property to study the position of human LongSAGE tags obtained from all public libraries. We focused mainly on tags that do not map to known transcripts. RESULTS: Using a published error rate in SAGE libraries, we first removed the tags likely to result from sequencing errors. We then observed that an unexpectedly large number of the remaining tags still did not match the genome sequence. Some of these correspond to parts of human mRNAs, such as polyA tails, junctions between two exons and polymorphic regions of transcripts. Another non-negligible proportion can be attributed to contamination by murine transcripts and to residual sequencing errors. After filtering out our data with these screens to ensure that our dataset is highly reliable, we studied the tags that map once to the genome. 31% of these tags correspond to unannotated transcripts. The others map to known transcribed regions, but many of them (nearly half) are located either in antisense or in new variants of these known transcripts. CONCLUSION: We performed a comprehensive study of all publicly available human LongSAGE tags, and carefully verified the reliability of these data. We found the potential origin of many tags that did not match the human genome sequence. The properties of the remaining tags imply that the level of sequencing error may have been under-estimated. The frequency of tags matching once the genome sequence but not in an annotated exon suggests that the human transcriptome is much more complex than shown by the current human genome annotations, with many new splicing variants and antisense transcripts. SAGE data is appropriate to map new transcripts to the genome, as demonstrated by the high rate of cross-validation of the corresponding tags using other methods.


Assuntos
Mapeamento Cromossômico/métodos , Biblioteca Gênica , Genoma Humano/genética , Sitios de Sequências Rotuladas , Humanos
7.
BMC Genomics ; 8: 2, 2007 Jan 03.
Artigo em Inglês | MEDLINE | ID: mdl-17201911

RESUMO

BACKGROUND: A promising application of the huge amounts of genetic data currently available lies in developing a better understanding of complex diseases, such as cancer. Analysis of publicly available databases can help identify potential candidates for genes or mutations specifically related to the cancer phenotype. In spite of their huge potential to affect gene function, no systematic attention has been paid so far to the changes that occur in untranslated regions of mRNA. RESULTS: In this study, we used Expressed Sequence Tag (EST) databases as a source for cancer-related sequence polymorphism discovery at the whole-genome level. Using a novel computational procedure, we focused on the identification of untranslated region (UTR)-localized non-coding Single Nucleotide Polymorphisms (UTR-SNPs) significantly associated with the tumoral state. To explore possible relationships between genetic mutation and phenotypic variation, bioinformatic tools were used to predict the potential impact of cancer-associated UTR-SNPs on mRNA secondary structure and UTR regulatory elements. We provide a comprehensive and unbiased description of cancer-associated UTR-SNPs that may be useful to define genotypic markers or to propose polymorphisms that can act to alter gene expression levels. Our results suggest that a fraction of cancer-associated UTR-SNPs may have functional consequences on mRNA stability and/or expression. CONCLUSION: We have undertaken a comprehensive effort to identify cancer-associated polymorphisms in untranslated regions of mRNA and to characterize putative functional UTR-SNPs. Alteration of translational control can change the expression of genes in tumor cells, causing an increase or decrease in the concentration of specific proteins. Through the description of testable candidates and the experimental validation of a number of UTR-SNPs discovered on the secreted protein acidic and rich in cysteine (SPARC) gene, this report illustrates the utility of a cross-talk between in silico transcriptomics and cancer genetics.


Assuntos
Genoma Humano/genética , Neoplasias/genética , Polimorfismo de Nucleotídeo Único , Regiões não Traduzidas/genética , Biologia Computacional/métodos , Bases de Dados Genéticas , Etiquetas de Sequências Expressas , Humanos , Leucemia Mieloide Aguda/genética , Conformação de Ácido Nucleico , Osteonectina/genética , RNA Mensageiro/química , RNA Mensageiro/genética , Regiões não Traduzidas/química
8.
Nucleic Acids Res ; 33(Database issue): D59-66, 2005 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-15608268

RESUMO

Processed pseudogenes result from reverse transcribed mRNAs. In general, because processed pseudogenes lack promoters, they are no longer functional from the moment they are inserted into the genome. Subsequently, they freely accumulate substitutions, insertions and deletions. Moreover, the ancestral structure of processed pseudogenes could be easily inferred using the sequence of their functional homologous genes. Owing to these characteristics, processed pseudogenes represent good neutral markers for studying genome evolution. Recently, there is an increasing interest for these markers, particularly to help gene prediction in the field of genome annotation, functional genomics and genome evolution analysis (patterns of substitution). For these reasons, we have developed a method to annotate processed pseudogenes in complete genomes. To make them useful to different fields of research, we stored them in a nucleic acid database after having annotated them. In this work, we screened both mouse and human complete genomes from ENSEMBL to find processed pseudogenes generated from functional genes with introns. We used a conservative method to detect processed pseudogenes in order to minimize the rate of false positive sequences. Within processed pseudogenes, some are still having a conserved open reading frame and some have overlapping gene locations. We designated as retroelements all reverse transcribed sequences and more strictly, we designated as processed pseudogenes, all retroelements not falling in the two former categories (having a conserved open reading or overlapping gene locations). We annotated 5823 retroelements (5206 processed pseudogenes) in the human genome and 3934 (3428 processed pseudogenes) in the mouse genome. Compared to previous estimations, the total number of processed pseudogenes was underestimated but the aim of this procedure was to generate a high-quality dataset. To facilitate the use of processed pseudogenes in studying genome structure and evolution, DNA sequences from processed pseudogenes, and their functional reverse transcribed homologs, are now stored in a nucleic acid database, HOPPSIGEN. HOPPSIGEN can be browsed on the PBIL (Pole Bioinformatique Lyonnais) World Wide Web server (http://pbil.univ-lyon1.fr/) or fully downloaded for local installation.


Assuntos
Bases de Dados de Ácidos Nucleicos , Pseudogenes , Animais , Humanos , Internet , Camundongos , Retroelementos , Transcrição Reversa
9.
Elife ; 62017 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-28826480

RESUMO

Synonymous codon usage (SCU) varies widely among human genes. In particular, genes involved in different functional categories display a distinct codon usage, which was interpreted as evidence that SCU is adaptively constrained to optimize translation efficiency in distinct cellular states. We demonstrate here that SCU is not driven by constraints on tRNA abundance, but by large-scale variation in GC-content, caused by meiotic recombination, via the non-adaptive process of GC-biased gene conversion (gBGC). Expression in meiotic cells is associated with a strong decrease in recombination within genes. Differences in SCU among functional categories reflect differences in levels of meiotic transcription, which is linked to variation in recombination and therefore in gBGC. Overall, the gBGC model explains 70% of the variance in SCU among genes. We argue that the strong heterogeneity of SCU induced by gBGC in mammalian genomes precludes any optimization of the tRNA pool to the demand in codon usage.


Assuntos
Códon , Conversão Gênica , Código Genético , Meiose , Modelos Genéticos , Composição de Bases , Variação Genética , Genoma Humano , Humanos , RNA de Transferência/genética , RNA de Transferência/metabolismo
10.
Oncogene ; 24(40): 6133-42, 2005 Sep 08.
Artigo em Inglês | MEDLINE | ID: mdl-15897869

RESUMO

Last decade has led to the accumulation of large amounts of data on cancer genetics, opening an unprecedented access to the mapping of cancer genes in the human genome. Single-nucleotide polymorphisms (SNPs), the most common form of DNA variation in humans, emerge as an invaluable tool for cancer association studies. These genotypic markers can be used to assay how alleles of candidate genes correlate with the malignant phenotype, and may provide new clues into the genetic modifications that characterize cancer onset. In this cancer-oriented study, we detail an SNP mining strategy based on the analysis of expressed sequence tags among publicly available databases. Our whole-genome approach provides a comprehensive and unbiased description of nonsynonymous SNPs (nsSNPs) in tumoral versus normal tissues. To gain further insights into the possible relationships between genetic variation and altered phenotype, locations of a subset of nsSNPs were mapped onto protein domains known to be critical for protein function. Computational methods were also used to predict the potential impact of these cancer-associated nsSNPs on protein structure and function. We illustrate our approach through the detailed biochemical and structural characterization of a previously unknown cancer-associated mutation (G79C) affecting the 8 kDa dynein light chain (DNCL1).


Assuntos
Dineínas/genética , Etiquetas de Sequências Expressas , Neoplasias/genética , Polimorfismo de Nucleotídeo Único , Biologia Computacional , Dineínas do Citoplasma , Análise Mutacional de DNA , Marcadores Genéticos , Genoma Humano , Humanos , Fenótipo
11.
BMC Genomics ; 7: 94, 2006 Apr 26.
Artigo em Inglês | MEDLINE | ID: mdl-16640784

RESUMO

BACKGROUND: Owing to the explosion of information generated by human genomics, analysis of publicly available databases can help identify potential candidate genes relevant to the cancerous phenotype. The aim of this study was to scan for such genes by whole-genome in silico subtraction using Expressed Sequence Tag (EST) data. METHODS: Genes differentially expressed in normal versus tumor tissues were identified using a computer-based differential display strategy. Bcl-xL, an anti-apoptotic member of the Bcl-2 family, was selected for confirmation by western blot analysis. RESULTS: Our genome-wide expression analysis identified a set of genes whose differential expression may be attributed to the genetic alterations associated with tumor formation and malignant growth. We propose complete lists of genes that may serve as targets for projects seeking novel candidates for cancer diagnosis and therapy. Our validation result showed increased protein levels of Bcl-xL in two different liver cancer specimens compared to normal liver. Notably, our EST-based data mining procedure indicated that most of the changes in gene expression observed in cancer cells corresponded to gene inactivation patterns. Chromosomes and chromosomal regions most frequently associated with aberrant expression changes in cancer libraries were also determined. CONCLUSION: Through the description of several candidates (including genes encoding extracellular matrix and ribosomal components, cytoskeletal proteins, apoptotic regulators, and novel tissue-specific biomarkers), our study illustrates the utility of in silico transcriptomics to identify tumor cell signatures, tumor-related genes and chromosomal regions frequently associated with aberrant expression in cancer.


Assuntos
Biologia Computacional/métodos , Etiquetas de Sequências Expressas , Regulação Neoplásica da Expressão Gênica , Neoplasias/metabolismo , Algoritmos , Western Blotting , Mapeamento Cromossômico , Cromossomos/ultraestrutura , Interpretação Estatística de Dados , Bases de Dados Genéticas , Regulação para Baixo , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Genoma Humano , Humanos , Queratinas/metabolismo , Neoplasias Hepáticas/metabolismo , Modelos Genéticos , Neoplasias/embriologia , Neoplasias/genética , Regulação para Cima , Proteína bcl-X/biossíntese , Proteína bcl-X/genética
12.
Genome Biol Evol ; 8(8): 2427-41, 2016 08 25.
Artigo em Inglês | MEDLINE | ID: mdl-27401173

RESUMO

Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences.


Assuntos
Sequência de Aminoácidos/genética , Códon/genética , Evolução Molecular , Seleção Genética , Substituição de Aminoácidos , Humanos , Modelos Genéticos , Modelos Estatísticos , Mutação , Nucleotídeos/genética
13.
Genetics ; 162(4): 1837-47, 2002 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-12524353

RESUMO

To understand the origin and evolution of isochores-the peculiar spatial distribution of GC content within mammalian genomes-we analyzed the synonymous substitution pattern in coding sequences from closely related species in different mammalian orders. In primate and cetartiodactyls, GC-rich genes are undergoing a large excess of GC --> AT substitutions over AT --> GC substitutions: GC-rich isochores are slowly disappearing from the genome of these two mammalian orders. In rodents, our analyses suggest both a decrease in GC content of GC-rich isochores and an increase in GC-poor isochores, but more data will be necessary to assess the significance of this pattern. These observations question the conclusions of previous works that assumed that base composition was at equilibrium. Analysis of allele frequency in human polymorphism data, however, confirmed that in the GC-rich parts of the genome, GC alleles have a higher probability of fixation than AT alleles. This fixation bias appears not strong enough to overcome the large excess of GC --> AT mutations. Thus, whatever the evolutionary force (neutral or selective) at the origin of GC-rich isochores, this force is no longer effective in mammals. We propose a model based on the biased gene conversion hypothesis that accounts for the origin of GC-rich isochores in the ancestral amniote genome and for their decline in present-day mammals.


Assuntos
Isocoros/química , Isocoros/genética , Mamíferos/genética , Alelos , Animais , Artiodáctilos/genética , Composição de Bases , Evolução Molecular , Conversão Gênica , Genoma , Genoma Humano , Humanos , Modelos Genéticos , Primatas/genética , Roedores/genética , Especificidade da Espécie
14.
BMC Bioinformatics ; 5: 143, 2004 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-15469608

RESUMO

BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a method of large-scale gene expression analysis that has the potential to generate the full list of mRNAs present within a cell population at a given time and their frequency. An essential step in SAGE library analysis is the unambiguous assignment of each 14 bp tag to the transcript from which it was derived. This process, called tag-to-gene mapping, represents a step that has to be improved in the analysis of SAGE libraries. Indeed, the existing web sites providing correspondence between tags and transcripts do not concern all species for which numerous EST and cDNA have already been sequenced. RESULTS: This is the reason why we designed and implemented a freely available tool called Identitag for tag identification that can be used in any species for which transcript sequences are available. Identitag is based on a relational database structure in order to allow rapid and easy storage and updating of data and, most importantly, in order to be able to precisely define identification parameters. This structure can be seen like three interconnected modules : the first one stores virtual tags extracted from a given list of transcript sequences, the second stores experimental tags observed in SAGE experiments, and the third allows the annotation of the transcript sequences used for virtual tag extraction. It therefore connects an observed tag to a virtual tag and to the sequence it comes from, and then to its functional annotation when available. Databases made from different species can be connected according to orthology relationship thus allowing the comparison of SAGE libraries between species. We successfully used Identitag to identify tags from our chicken SAGE libraries and for chicken to human SAGE tags interspecies comparison. Identitag sources are freely available on http://pbil.univ-lyon1.fr/software/identitag/ web site. CONCLUSIONS: Identitag is a flexible and powerful tool for tag identification in any single species and for interspecies comparison of SAGE libraries. It opens the way to comparative transcriptomic analysis, an emerging branch of biology.


Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Software , Animais , Galinhas/genética , DNA Complementar/genética , Etiquetas de Sequências Expressas , Humanos , Poli A/genética , Homologia de Sequência do Ácido Nucleico , Design de Software , Especificidade da Espécie
15.
Genome Biol Evol ; 4(3): 412-22, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22417915

RESUMO

Meiotic recombination is an important evolutionary force shaping the nucleotide landscape of genomes. For most vertebrates, the frequency of recombination varies slightly or considerably between the sexes (heterochiasmy). In humans, male, rather than female, recombination rate has been found to be more highly correlated with the guanine and cytosine (GC) content across the genome. In the present study, we review the results in human and extend the examination of the evolutionary impact of heterochiasmy beyond primates to include four additional eutherian mammals (mouse, dog, pig, and sheep), a metatherian mammal (opossum), and a bird (chicken). Specifically, we compared sex-specific recombination rates (RRs) with nucleotide substitution patterns evaluated in transposable elements. Our results, based on a comparative approach, reveal a great diversity in the relationship between heterochiasmy and nucleotide composition. We find that the stronger male impact on this relationship is a conserved feature of human, mouse, dog, and sheep. In contrast, variation in genomic GC content in pig and opossum is more strongly correlated with female, rather than male, RR. Moreover, we show that the sex-differential impact of recombination is mainly driven by the chromosomal localization of recombination events. Independent of sex, the higher the RR in a genomic region and the longer this recombination activity is conserved in time, the stronger the bias in nucleotide substitution pattern, through such mechanisms as biased gene conversion. Over time, this bias will increase the local GC content of the region.


Assuntos
Meiose/genética , Recombinação Genética/genética , Animais , Composição de Bases/genética , Cães , Evolução Molecular , Feminino , Conversão Gênica/genética , Humanos , Masculino , Camundongos , Ovinos , Suínos
16.
J Mol Evol ; 62(6): 745-52, 2006 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-16752212

RESUMO

Processed pseudogenes are generated by reverse transcription of a functional gene. They are generally nonfunctional after their insertion and, as a consequence, are no longer subjected to the selective constraints associated with functional genes. Because of this property they can be used as neutral markers in molecular evolution. In this work, we investigated the relationship between the evolution of GC content in recently inserted processed pseudogenes and the local recombination pattern in two mammalian genomes (human and mouse). We confirmed, using original markers, that recombination drives GC content in the human genome and we demonstrated that this is also true for the mouse genome despite lower recombination rates. Finally, we discussed the consequences on isochores evolution and the contrast between the human and the mouse pattern.


Assuntos
Composição de Bases , Evolução Molecular , Genoma , Pseudogenes , Recombinação Genética , Animais , Biologia Computacional , Bases de Dados Genéticas , Genoma Humano , Humanos , Camundongos , Fatores de Tempo
17.
Am J Physiol Renal Physiol ; 291(4): F750-60, 2006 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-16597615

RESUMO

Hypoxia has been suspected to trigger transdifferentiation of renal tubular cells into myofibroblasts in an epithelial-to-mesenchymal transition (EMT) process. To determine the functional networks potentially altered by hypoxia, rat renal tubule suspensions were incubated under three conditions of oxygenation ranging from normoxia (lactate uptake) to severe hypoxia (lactate production). Transcriptome changes after 4 h were analyzed on a high scale by restriction fragment differential display. Among 1,533 transcripts found, 42% were maximally expressed under severe hypoxia and 8% under mild hypoxia (Po(2) = 48 mmHg), suggesting two different levels of oxygen sensing. Normoxia was required for full expression of the proximal tubule-specific transcripts 25-hydroxyvitamin D 1-hydroxylase (Cyp27b1) and l-pyruvate kinase (Pklr), transcripts involved in tissue cohesion such as fibronectin (Fn1) and N-cadherin (Cdh2), and non-muscle-type myosin transcripts. Mild hypoxia increased myogenin transcript level. Conversely, severe hypoxia increased transcripts involved in extracellular matrix remodeling, those of muscle-type myosins, and others involved in creatine phosphate synthesis and lactate transport (Slc16a7). Accordingly, microscopy showed loss of tubule aggregation under hypoxia, without tubular disruption. Hypoxia also increased the levels of kidney-specific transcripts normally restricted to the less oxygenated medullary zone and others specific for the distal part of the nephron. We conclude that extensive oxygen supply to the kidney tubule favors expression of its differentiated functions specifically in the proximal tubule, whose embryonic origin is mesenchymal. The phenotype changes could potentially permit transient adaptation to hypoxia but also favor pathological processes such as tissue invasion.


Assuntos
Córtex Renal/fisiologia , Túbulos Renais Proximais/fisiologia , Consumo de Oxigênio , Animais , Hipóxia Celular , DNA Complementar/genética , Amplificação de Genes , Masculino , Reação em Cadeia da Polimerase/métodos , RNA/genética , RNA/isolamento & purificação , Ratos , Ratos Sprague-Dawley , Mucosa Respiratória/fisiologia , Transcrição Gênica
18.
Hum Mol Genet ; 14(3): 421-7, 2005 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-15590696

RESUMO

Mammalian chromosomes are characterized by large-scale variations of DNA base composition (the so-called isochores). In contradiction with previous studies, Lercher et al. (Hum. Mol. Genet., 12, 2411, 2003) recently reported a strong correlation between gene expression breadth and GC-content, suggesting that there might be a selective pressure favoring the concentration of housekeeping genes in GC-rich isochores. We reassessed this issue by examining in human and mouse the correlation between gene expression and GC-content, using different measures of gene expression (EST, SAGE and microarray) and different measures of GC-content. We show that correlations between GC-content and expression are very weak, and may vary according to the method used to measure expression. Such weak correlations have a very low predictive value. The strong correlations reported by Lercher et al. (2003) are because of the fact that they measured variables over neighboring genes windows. We show here that using gene windows artificially enhances the correlation. The assertion that the expression of a given gene depends on the GC-content of the region where it is located is therefore not supported by the data.


Assuntos
Sequência Rica em GC , Genoma , Animais , Composição de Bases , Etiquetas de Sequências Expressas , Expressão Gênica , Humanos , Isocoros/genética , Camundongos , Modelos Genéticos , Análise de Sequência com Séries de Oligonucleotídeos
19.
Bioinformatics ; 18(4): 631-3, 2002 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-12016061

RESUMO

RESULTS: CpGProD is an application for identifying mammalian promoter regions associated with CpG islands in large genomic sequences. Although it is strictly dedicated to this particular promoter class corresponding to approximately 50% of the genes, CpGProD exhibits a higher sensitivity and specificity than other tools used for promoter prediction. Notably, CpGProD uses different parameters according to species (human, mouse) studied. Moreover, CpGProD predicts the promoter orientation on the DNA strand. AVAILABILITY: http://pbil.univ-lyon1.fr/software/cpgprod.html SUPPLEMENTARY INFORMATION: http://pbil.univ-lyon1.fr/software/cpgprod.html


Assuntos
Ilhas de CpG/genética , Bases de Dados de Proteínas , Genoma Humano , Análise de Sequência de Proteína/métodos , Software , Sítio de Iniciação de Transcrição , Animais , Éxons , Humanos , Armazenamento e Recuperação da Informação/métodos , Íntrons , Camundongos , Dados de Sequência Molecular , Reconhecimento Automatizado de Padrão , Sensibilidade e Especificidade
20.
Genet Res ; 81(2): 79-87, 2003 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-12872909

RESUMO

Recombination is thought to have various evolutionary effects on genome evolution. In this study, we investigated the relationship between the base composition and recombination rate in the Drosophila melanogaster genome. Because of a current debate about the accuracy of the estimates of recombination rate in Drosophila, we used eight different measures of recombination rate from recent work. We confirmed that the G + C content of large introns and flanking regions is positively correlated with recombination rate, suggesting that recombination has a neutral effect on base composition in Drosophila. We also confirmed that this neutral effect of recombination is the main determinant of the correlation between synonymous codon usage bias and recombination rate in Drosophila.


Assuntos
Drosophila melanogaster/genética , Recombinação Genética , Animais , Composição de Bases
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA