RESUMEN
Numerous genes in diverse organisms have been shown to be under positive selection, especially genes involved in reproduction, adaptation to contrasting environments, hybrid inviability, and host-pathogen interactions. Looking for genes under positive selection in pathogens has been a priority in efforts to investigate coevolution dynamics and to develop vaccines or drugs. To elucidate the functions involved in host specialization, here we aimed at identifying candidate sequences that could have evolved under positive selection among closely related pathogens specialized on different hosts. For this goal, we sequenced c. 17,000-32,000 ESTs from each of four Microbotryum species, which are fungal pathogens responsible for anther smut disease on host plants in the Caryophyllaceae. Forty-two of the 372 predicted orthologous genes showed significant signal of positive selection, which represents a good number of candidate genes for further investigation. Sequencing 16 of these genes in 9 additional Microbotryum species confirmed that they have indeed been rapidly evolving in the pathogen species specialized on different hosts. The genes showing significant signals of positive selection were putatively involved in nutrient uptake from the host, secondary metabolite synthesis and secretion, respiration under stressful conditions and stress response, hyphal growth and differentiation, and regulation of expression by other genes. Many of these genes had transmembrane domains and may therefore also be involved in pathogen recognition by the host. Our approach thus revealed fruitful and should be feasible for many non-model organisms for which candidate genes for diversifying selection are needed.
Asunto(s)
Basidiomycota/genética , Interacciones Huésped-Patógeno/genética , Selección Genética , Caryophyllaceae/microbiología , Análisis por Conglomerados , ADN de Hongos/genética , Etiquetas de Secuencia Expresada , Biblioteca de Genes , Genes Fúngicos , Enfermedades de las Plantas/microbiología , Alineación de Secuencia , Análisis de Secuencia de ADN , Especificidad de la EspecieRESUMEN
Phylogenies involving nonmodel species are based on a few genes, mostly chosen following historical or practical criteria. Because gene trees are sometimes incongruent with species trees, the resulting phylogenies may not accurately reflect the evolutionary relationships among species. The increase in availability of genome sequences now provides large numbers of genes that could be used for building phylogenies. However, for practical reasons only a few genes can be sequenced for a wide range of species. Here we asked whether we can identify a few genes, among the single-copy genes common to most fungal genomes, that are sufficient for recovering accurate and well-supported phylogenies. Fungi represent a model group for phylogenomics because many complete fungal genomes are available. An automated procedure was developed to extract single-copy orthologous genes from complete fungal genomes using a Markov Clustering Algorithm (Tribe-MCL). Using 21 complete, publicly available fungal genomes with reliable protein predictions, 246 single-copy orthologous gene clusters were identified. We inferred the maximum likelihood trees using the individual orthologous sequences and constructed a reference tree from concatenated protein alignments. The topologies of the individual gene trees were compared to that of the reference tree using three different methods. The performance of individual genes in recovering the reference tree was highly variable. Gene size and the number of variable sites were highly correlated and significantly affected the performance of the genes, but the average substitution rate did not. Two genes recovered exactly the same topology as the reference tree, and when concatenated provided high bootstrap values. The genes typically used for fungal phylogenies did not perform well, which suggests that current fungal phylogenies based on these genes may not accurately reflect the evolutionary relationships among species. Analyses on subsets of species showed that the phylogenetic performance did not seem to depend strongly on the sample. We expect that the best-performing genes identified here will be very useful for phylogenetic studies of fungi, at least at a large taxonomic scale. Furthermore, we compare the method developed here for finding genes for building robust phylogenies with previous ones and we advocate that our method could be applied to other groups of organisms when more complete genomes are available.
Asunto(s)
Clasificación/métodos , Filogenia , Hongos/clasificación , Hongos/genética , Genes Fúngicos/genética , Funciones de Verosimilitud , Familia de MultigenesRESUMEN
A statistical method is presented for detecting quantitative trait loci (QTLs), based on the linear model. Unlike methods able to detect a few well separated QTLs and to estimate their effects and positions, this method considers the genome as a whole and enables the detection of chromosomal segments involved in the differences between two homozygous lines, and their backcross, doubled haploid, or F2 progenies, for a quantitative trait. Genetic markers must be codominant, but missing markers are accepted, provided they are missing independently from the experiment. Asymptotic properties, which are of practical use, are developed. This method does not rely on strong genetic hypotheses, and thus does not permit any precise genetic analysis of the trait under study, but it does assess which regions of the genome are involved, whatever the complexity of the genetic determinism (number, effects and interactions among QTLs). Simultaneous use of several methods, including this one, should lead to better efficiency in QTL detection.
Asunto(s)
Cromosomas , Marcadores Genéticos , Modelos Genéticos , Animales , Humanos , Modelos EstadísticosRESUMEN
A coding sequence is defined as a DNA sequence coding the primary structure of a protein (a polypeptide). Such a sequence must satisfy a specific constraint, which consists in coding a functional protein. As the genetic code is degenerated, there exists, for a given polypeptide, a set of synonymous sequences which would code the same polypeptide. Translation conditional models are being defined on such sets. The aim of this paper is to give a common formalism. Besides the codon bias model, a few other conditional models will be defined. Statistical estimators and comparison methods will be briefly presented. These models can be used for gene classification, or to find out, in a real sequence, remarkable features. An example will be presented on Escherichia coli genes.
Asunto(s)
Modelos Genéticos , Biosíntesis de Proteínas , Proteínas/genética , Proteínas Bacterianas/genética , Secuencia de Bases , Biometría , Codón/genética , ADN Bacteriano/genética , Escherichia coli/genética , Genes Bacterianos , Cadenas de MarkovRESUMEN
Ruthenium complexes have been shown to perform efficient transfer hydrogenation reactions between alcohols and alkenes; in combination with an in situ Wittig reaction, indirect formation of C-C bonds has been achieved from alcohols.
RESUMEN
DNA methylation is commonly thought of as a "molecular lock" that leads to permanent gene silencing. To investigate this notion, we tested 24 different histone deacetylase inhibitors (HDACi) on colon cancer cells that harbor a GFP locus stably integrated and silenced by a hypermethylated cytomegalovirus (CMV) promoter. We found that HDACi efficiently reactivated expression of GFP and many other endogenous genes silenced by DNA hypermethylation. After treatment, all promoters were marked with active chromatin, yet DNA hypermethylation did not change. Thus, DNA methylation could not prevent gene reactivation by drug-induced resetting of the chromatin state. In evaluating the relative contribution of DNA methylation and histone modifications to stable gene silencing, we followed expression levels of GFP and other genes silenced by DNA hypermethylation over time after treatment with HDACi or DNA-demethylating drugs. Reactivation of methylated loci by HDACi was detectable for only 2 weeks, whereas DNA-demethylating drugs induced permanent epigenetic reprogramming. Therefore, DNA methylation cannot be considered as a lock for gene expression but rather as a memory signal for long-term maintenance of gene silencing. These findings define chromatin as an important druggable target for cancer epigenetic therapy and suggest that removal of DNA methylation signals is required to achieve long-term gene reactivation.
Asunto(s)
Neoplasias del Colon/genética , Metilación de ADN , Regulación Neoplásica de la Expresión Génica , Silenciador del Gen , Línea Celular Tumoral , Ensamble y Desensamble de Cromatina , Metilación de ADN/efectos de los fármacos , Depsipéptidos/farmacología , Proteínas Fluorescentes Verdes/genética , Inhibidores de Histona Desacetilasas/farmacología , Humanos , Regiones Promotoras Genéticas , Activación Transcripcional/efectos de los fármacosRESUMEN
Thermolysis of [Ru(AsPh3)3(CO)H2] with the N-aryl heterocyclic carbenes (NHCs) IMes (1,3-bis(2,4,6-trimethylphenyl)imidazol-2-ylidene), IPr (1,3-bis(2,6-diisopropylphenyl)imidazol-2-ylidene) or the adduct SIPr.(C6F5)H (SIPr=1,3-bis(2,6-diisopropylphenyl)-4,5-dihydroimidazol-2-ylidene), followed by addition of CH2Cl2, affords the coordinatively unsaturated ruthenium hydride chloride complexes [Ru(NHC)2(CO)HCl] (NHC=IMes , IPr , SIPr ). These react with CO at room temperature to yield the corresponding 18-electron dicarbonyl complexes . Reduction of and [Ru(IMes)(PPh3)(CO)HCl] () with NaBH4 yields the isolable borohydride complexes [Ru(NHC)(L)(CO)H(eta2-BH4)] (, L=NHC, PPh3). Both the bis-IMes complex and the IMes-PPh3 species react with CO at low temperature to give the eta1-borohydride species [Ru(IMes)(L)(CO)2H(eta1-BH4)] (L=IMes , PPh3), which can be spectroscopically characterised. Upon warming to room temperature, further reaction with CO takes place to afford initially [Ru(IMes)(L)(CO)2H2] (L=IMes, L=PPh3) and, ultimately, [Ru(IMes)(L)(CO)3] (L=IMes , L=PPh3). Both and lose BH3 on addition of PMe2Ph to give [Ru(IMes)(L)(L')(CO)H2](L=L'=PMe2Ph; L=PPh3, L'=PMe2Ph). Compounds and have been tested as catalysts for the hydrogenation of aromatic ketones in the presence of (i)PrOH and H2. For the reduction of acetophenone, catalytic activity varies with the NHC present, decreasing in the order IPr>IMes>>SIMes.
RESUMEN
Natural populations are currently the basic material for studying forest tree breeding, but little is known on the genotypic spatial structures in these stands. The use of gene markers, such as isozymes, leads to the determination of part of the allelic constitution of individuals. A method is presented here to estimate the degree of genetic relationship between any pair of genotypes. A French Picea abies population is analyzed by these means and a slight but significant correlation between estimated genetic relationship and topographic distance is found.
RESUMEN
The genome length, in units of Morgans or centimorgans, is a fundamental feature of a species. It can be calculated from a complete linkage map. However, the genome size can be estimated with partial linkage data. Using linkage data obtained by the analysis of a two-dimensional electrophoresis of the proteins contained in an haploid organ, the megagametophyte, we suggest an estimation and a confidence interval of the genome length of a gymnosperm, the maritime pine (Pinus pinaster Ait.). The results indicate an important gap between the physical and the genetic maps.
RESUMEN
The first step in the construction of a linkage map involves the estimation and test for linkage between all possible pairs of markers. The lod score method is used in many linkage studies for the latter purpose. In contrast with classical statistical tests, this method does not rely on the choice of a first-type error level. We thus provide a comparison between the lod score and a χ (2) test on linkage data from a gymnosperm, the maritime pine. The lod score appears to be a very conservative test with the usual thresholds. Its severity depends on the type of data used.
RESUMEN
The oligomers constituted by association of different subunits can exist under multiple forms. In the case of the genetically variable proteins, such a multiplicity leads to numerous questions (i) on the enumerations: what is the number of active forms when a given subunit can make the oligomer inactive, or when the subunits are encoded by s alleles; (ii) on the subunit effects on biochemical properties: how to estimate these effects, are they equal, are there interactions between subunits, etc. Theoretical methods for the study of such oligomeric structures are developed, which mainly rely on linear model techniques. Peculiar properties examined are Vmax and Km, but also the quantities of the various oligomers, which depend on their association law. This approach is extended to the oligomers composed of different sets of subunits, as are for example some enzymes. These aspects are discussed from numerous bibliographic examples, with special reference to molecular interactions (protein complementation or molecular heterosis). Otherwise the genetic application of this theoretical approach is presented: it is possible to consider a genotype as an oligomer of alleles, and thus to study their effects and their interactions, in the one-locus case as well as in the several-loci case. The relevance of this generalization is discussed in connection with two other concepts, the "sequence space" used in molecular evolution and the regression of the genotypic values on the number of alleles used in quantitative genetics.
Asunto(s)
Proteínas , Enzimas , Genotipo , Cinética , Sustancias Macromoleculares , Matemática , Estadística como AsuntoRESUMEN
Proteins from haploid megagametophytes from 18 trees were studied by two-dimensional polyacrylamide gel electrophoresis (2-D PAGE). A total of 222 seeds, an average of 12 per tree, were analysed individually. 150 protein spots appeared to segregate on the polyacrylamide gels in at least tree. Genetic interpretations were made to define the number of loci responsible for the presence versus absence, staining differences or position variation of the segregating spots. The complete covariation observed between some spots could be the result of either the separation of a single gene product into two or more constituents, very close linkage, or the action of a pleiotropic gene. Human genetics techniques were used to map the 84 putative loci detected. Sixty-five loci were organised in 17 linkage groups, whereas 19 remained unlinked.
RESUMEN
Thermolysis of Ru(PPh3)3(CO)H2 with the N-heterocyclic carbene bis(1,3-(2,4,6-trimethylphenyl)imidazol-2-ylidene) (IMes) results in C-C activation of an Ar-CH3 bond in one of the mesityl rings of the carbene ligand. Upon addition of IMes to Ru(PPh3)3(CO)H2 at room temperature in the presence of an alkene, C-H bond activation is observed instead. The thermodynamics of these C-C and C-H cleavage reactions have been probed using density functional theory.
RESUMEN
Addition of 1 atm of H(2)S to [Ru(IMes)(2)(CO)(EtOH)H(2)] (IMes = 1,3-bis(2,4,6-trimethylphenyl)imidazol-2-ylidene) affords the air-stable hydrogen sulfide dihydride complex, [Ru(IMes)(2)(CO)(H(2)S)H(2)] 2. Treatment of 2 with excess H(2)S leads to formation of the 16-electron bis-hydrosulfido complex, [Ru(IMes)(2)(CO)(SH)(2)] 3. Preliminary studies show that 3 reacts with both H(2) and CO in solution as well as in the solid state. Both 2 and 3 have been structurally characterized