RESUMEN
Experimental validation of enzyme function is crucial for genome interpretation, but it remains challenging because it cannot be scaled up to accommodate the constant accumulation of genome sequences. We tackled this issue for the MetA and MetX enzyme families, phylogenetically unrelated families of acyl-L-homoserine transferases involved in L-methionine biosynthesis. Members of these families are prone to incorrect annotation because MetX and MetA enzymes are assumed to always use acetyl-CoA and succinyl-CoA, respectively. We determined the enzymatic activities of 100 enzymes from diverse species, and interpreted the results by structural classification of active sites based on protein structure modeling. We predict that >60% of the 10,000 sequences from these families currently present in databases are incorrectly annotated, and suggest that acetyl-CoA was originally the sole substrate of these isofunctional enzymes, which evolved to use exclusively succinyl-CoA in the most recent bacteria. We also uncovered a divergent subgroup of MetX enzymes in fungi that participate only in L-cysteine biosynthesis as O-succinyl-L-serine transferases.
Asunto(s)
Acetiltransferasas/metabolismo , Evolución Molecular , Metionina/biosíntesis , Acinetobacter/enzimología , Escherichia coli/enzimologíaRESUMEN
Our knowledge of species and functional composition of the human gut microbiome is rapidly increasing, but it is still based on very few cohorts and little is known about variation across the world. By combining 22 newly sequenced faecal metagenomes of individuals from four countries with previously published data sets, here we identify three robust clusters (referred to as enterotypes hereafter) that are not nation or continent specific. We also confirmed the enterotypes in two published, larger cohorts, indicating that intestinal microbiota variation is generally stratified, not continuous. This indicates further the existence of a limited number of well-balanced host-microbial symbiotic states that might respond differently to diet and drug intake. The enterotypes are mostly driven by species composition, but abundant molecular functions are not necessarily provided by abundant species, highlighting the importance of a functional analysis to understand microbial communities. Although individual host properties such as body mass index, age, or gender cannot explain the observed enterotypes, data-driven marker genes or functional modules can be identified for each of these host properties. For example, twelve genes significantly correlate with age and three functional modules with the body mass index, hinting at a diagnostic potential of microbial markers.
Asunto(s)
Bacterias/clasificación , Intestinos/microbiología , Metagenoma , Bacterias/genética , Técnicas de Tipificación Bacteriana , Biodiversidad , Biomarcadores/análisis , Europa (Continente) , Heces/microbiología , Femenino , Humanos , Masculino , Metagenómica , FilogeniaRESUMEN
Millions of protein database entries are not assigned reliable functions, preventing the full understanding of chemical diversity in living organisms. Here, we describe an integrated strategy for the discovery of various enzymatic activities catalyzed within protein families of unknown or little known function. This approach relies on the definition of a generic reaction conserved within the family, high-throughput enzymatic screening on representatives, structural and modeling investigations and analysis of genomic and metabolic context. As a proof of principle, we investigated the DUF849 Pfam family and unearthed 14 potential new enzymatic activities, leading to the designation of these proteins as ß-keto acid cleavage enzymes. We propose an in vivo role for four enzymatic activities and suggest key residues for guiding further functional annotation. Our results show that the functional diversity within a family may be largely underestimated. The extension of this strategy to other families will improve our knowledge of the enzymatic landscape.
Asunto(s)
Enzimas/metabolismo , Enzimas/química , Conformación ProteicaRESUMEN
The Périgord black truffle (Tuber melanosporum Vittad.) and the Piedmont white truffle dominate today's truffle market. The hypogeous fruiting body of T. melanosporum is a gastronomic delicacy produced by an ectomycorrhizal symbiont endemic to calcareous soils in southern Europe. The worldwide demand for this truffle has fuelled intense efforts at cultivation. Identification of processes that condition and trigger fruit body and symbiosis formation, ultimately leading to efficient crop production, will be facilitated by a thorough analysis of truffle genomic traits. In the ectomycorrhizal Laccaria bicolor, the expansion of gene families may have acted as a 'symbiosis toolbox'. This feature may however reflect evolution of this particular taxon and not a general trait shared by all ectomycorrhizal species. To get a better understanding of the biology and evolution of the ectomycorrhizal symbiosis, we report here the sequence of the haploid genome of T. melanosporum, which at approximately 125 megabases is the largest and most complex fungal genome sequenced so far. This expansion results from a proliferation of transposable elements accounting for approximately 58% of the genome. In contrast, this genome only contains approximately 7,500 protein-coding genes with very rare multigene families. It lacks large sets of carbohydrate cleaving enzymes, but a few of them involved in degradation of plant cell walls are induced in symbiotic tissues. The latter feature and the upregulation of genes encoding for lipases and multicopper oxidases suggest that T. melanosporum degrades its host cell walls during colonization. Symbiosis induces an increased expression of carbohydrate and amino acid transporters in both L. bicolor and T. melanosporum, but the comparison of genomic traits in the two ectomycorrhizal fungi showed that genetic predispositions for symbiosis-'the symbiosis toolbox'-evolved along different ways in ascomycetes and basidiomycetes.
Asunto(s)
Ascomicetos/genética , Evolución Molecular , Genoma Fúngico/genética , Simbiosis/genética , Carbohidratos , Elementos Transponibles de ADN/genética , Cuerpos Fructíferos de los Hongos/metabolismo , Genes Fúngicos/genética , Genómica , Haploidia , Datos de Secuencia Molecular , Análisis de Secuencia de ADN , Azufre/metabolismoRESUMEN
Brown algae (Phaeophyceae) are complex photosynthetic organisms with a very different evolutionary history to green plants, to which they are only distantly related. These seaweeds are the dominant species in rocky coastal ecosystems and they exhibit many interesting adaptations to these, often harsh, environments. Brown algae are also one of only a small number of eukaryotic lineages that have evolved complex multicellularity (Fig. 1). We report the 214 million base pair (Mbp) genome sequence of the filamentous seaweed Ectocarpus siliculosus (Dillwyn) Lyngbye, a model organism for brown algae, closely related to the kelps (Fig. 1). Genome features such as the presence of an extended set of light-harvesting and pigment biosynthesis genes and new metabolic processes such as halide metabolism help explain the ability of this organism to cope with the highly variable tidal environment. The evolution of multicellularity in this lineage is correlated with the presence of a rich array of signal transduction genes. Of particular interest is the presence of a family of receptor kinases, as the independent evolution of related molecules has been linked with the emergence of multicellularity in both the animal and green plant lineages. The Ectocarpus genome sequence represents an important step towards developing this organism as a model species, providing the possibility to combine genomic and genetic approaches to explore these and other aspects of brown algal biology further.
Asunto(s)
Proteínas Algáceas/genética , Evolución Biológica , Genoma/genética , Phaeophyceae/citología , Phaeophyceae/genética , Animales , Eucariontes , Evolución Molecular , Datos de Secuencia Molecular , Phaeophyceae/metabolismo , Filogenia , Pigmentos Biológicos/biosíntesis , Transducción de Señal/genéticaRESUMEN
Red seaweeds are key components of coastal ecosystems and are economically important as food and as a source of gelling agents, but their genes and genomes have received little attention. Here we report the sequencing of the 105-Mbp genome of the florideophyte Chondrus crispus (Irish moss) and the annotation of the 9,606 genes. The genome features an unusual structure characterized by gene-dense regions surrounded by repeat-rich regions dominated by transposable elements. Despite its fairly large size, this genome shows features typical of compact genomes, e.g., on average only 0.3 introns per gene, short introns, low median distance between genes, small gene families, and no indication of large-scale genome duplication. The genome also gives insights into the metabolism of marine red algae and adaptations to the marine environment, including genes related to halogen metabolism, oxylipins, and multicellularity (microRNA processing and transcription factors). Particularly interesting are features related to carbohydrate metabolism, which include a minimalistic gene set for starch biosynthesis, the presence of cellulose synthases acquired before the primary endosymbiosis showing the polyphyly of cellulose synthesis in Archaeplastida, and cellulases absent in terrestrial plants as well as the occurrence of a mannosylglycerate synthase potentially originating from a marine bacterium. To explain the observations on genome structure and gene content, we propose an evolutionary scenario involving an ancestral red alga that was driven by early ecological forces to lose genes, introns, and intergenetic DNA; this loss was followed by an expansion of genome size as a consequence of activity of transposable elements.
Asunto(s)
Chondrus/genética , Evolución Molecular , Genes de Plantas , Secuencia de Bases , MicroARNs/genética , Datos de Secuencia Molecular , Proteínas de Plantas/genética , ARN de Planta/genéticaRESUMEN
Root-knot nematodes are globally the most aggressive and damaging plant-parasitic nematodes. Chemical nematicides have so far constituted the most efficient control measures against these agricultural pests. Because of their toxicity for the environment and danger for human health, these nematicides have now been banned from use. Consequently, new and more specific control means, safe for the environment and human health, are urgently needed to avoid worldwide proliferation of these devastating plant-parasites. Mining the genomes of root-knot nematodes through an evolutionary and comparative genomics approach, we identified and analyzed 15,952 nematode genes conserved in genomes of plant-damaging species but absent from non target genomes of chordates, plants, annelids, insect pollinators and mollusks. Functional annotation of the corresponding proteins revealed a relative abundance of putative transcription factors in this parasite-specific set compared to whole proteomes of root-knot nematodes. This may point to important and specific regulators of genes involved in parasitism. Because these nematodes are known to secrete effector proteins in planta, essential for parasitism, we searched and identified 993 such effector-like proteins absent from non-target species. Aiming at identifying novel targets for the development of future control methods, we biologically tested the effect of inactivation of the corresponding genes through RNA interference. A total of 15 novel effector-like proteins and one putative transcription factor compatible with the design of siRNAs were present as non-redundant genes and had transcriptional support in the model root-knot nematode Meloidogyne incognita. Infestation assays with siRNA-treated M. incognita on tomato plants showed significant and reproducible reduction of the infestation for 12 of the 16 tested genes compared to control nematodes. These 12 novel genes, showing efficient reduction of parasitism when silenced, constitute promising targets for the development of more specific and safer control means.
Asunto(s)
Genes de Helminto/fisiología , Enfermedades de las Plantas/parasitología , Tylenchoidea/genética , Animales , Estudio de Asociación del Genoma Completo , Humanos , Interferencia de ARN , Tylenchoidea/metabolismoRESUMEN
The analysis of the first plant genomes provided unexpected evidence for genome duplication events in species that had previously been considered as true diploids on the basis of their genetics. These polyploidization events may have had important consequences in plant evolution, in particular for species radiation and adaptation and for the modulation of functional capacities. Here we report a high-quality draft of the genome sequence of grapevine (Vitis vinifera) obtained from a highly homozygous genotype. The draft sequence of the grapevine genome is the fourth one produced so far for flowering plants, the second for a woody species and the first for a fruit crop (cultivated for both fruit and beverage). Grapevine was selected because of its important place in the cultural heritage of humanity beginning during the Neolithic period. Several large expansions of gene families with roles in aromatic features are observed. The grapevine genome has not undergone recent genome duplication, thus enabling the discovery of ancestral traits and features of the genetic organization of flowering plants. This analysis reveals the contribution of three ancestral genomes to the grapevine haploid content. This ancestral arrangement is common to many dicotyledonous plants but is absent from the genome of rice, which is a monocotyledon. Furthermore, we explain the chronology of previously described whole-genome duplication events in the evolution of flowering plants.
Asunto(s)
Evolución Molecular , Genoma de Planta/genética , Poliploidía , Vitis/clasificación , Vitis/genética , Arabidopsis/genética , ADN Intergénico/genética , Exones/genética , Genes de Plantas/genética , Intrones/genética , Cariotipificación , MicroARNs/genética , Datos de Secuencia Molecular , Oryza/genética , Populus/genética , ARN de Planta/genética , ARN de Transferencia/genética , Análisis de Secuencia de ADNRESUMEN
The exponential increase in genome sequencing output has led to the accumulation of thousands of predicted genes lacking a proper functional annotation. Among this mass of hypothetical proteins, enzymes catalyzing new reactions or using novel ways to catalyze already known reactions might still wait to be identified. Here, we provide a structural and biochemical characterization of the 3-keto-5-aminohexanoate cleavage enzyme (Kce), an enzymatic activity long known as being involved in the anaerobic fermentation of lysine but whose catalytic mechanism has remained elusive so far. Although the enzyme shows the ubiquitous triose phosphate isomerase (TIM) barrel fold and a Zn(2+) cation reminiscent of metal-dependent class II aldolases, our results based on a combination of x-ray snapshots and molecular modeling point to an unprecedented mechanism that proceeds through deprotonation of the 3-keto-5-aminohexanoate substrate, nucleophilic addition onto an incoming acetyl-CoA, intramolecular transfer of the CoA moiety, and final retro-Claisen reaction leading to acetoacetate and 3-aminobutyryl-CoA. This model also accounts for earlier observations showing the origin of carbon atoms in the products, as well as the absence of detection of any covalent acyl-enzyme intermediate. Kce is the first representative of a large family of prokaryotic hypothetical proteins, currently annotated as the "domain of unknown function" DUF849.
Asunto(s)
Oxo-Ácido-Liasas/metabolismo , Catálisis , Cristalografía por Rayos X , Modelos Moleculares , Oxo-Ácido-Liasas/química , Conformación Proteica , Pliegue de Proteína , Especificidad por SustratoRESUMEN
Our knowledge of yeast genomes remains largely dominated by the extensive studies on Saccharomyces cerevisiae and the consequences of its ancestral duplication, leaving the evolution of the entire class of hemiascomycetes only partly explored. We concentrate here on five species of Saccharomycetaceae, a large subdivision of hemiascomycetes, that we call "protoploid" because they diverged from the S. cerevisiae lineage prior to its genome duplication. We determined the complete genome sequences of three of these species: Kluyveromyces (Lachancea) thermotolerans and Saccharomyces (Lachancea) kluyveri (two members of the newly described Lachancea clade), and Zygosaccharomyces rouxii. We included in our comparisons the previously available sequences of Kluyveromyces lactis and Ashbya (Eremothecium) gossypii. Despite their broad evolutionary range and significant individual variations in each lineage, the five protoploid Saccharomycetaceae share a core repertoire of approximately 3300 protein families and a high degree of conserved synteny. Synteny blocks were used to define gene orthology and to infer ancestors. Far from representing minimal genomes without redundancy, the five protoploid yeasts contain numerous copies of paralogous genes, either dispersed or in tandem arrays, that, altogether, constitute a third of each genome. Ancient, conserved paralogs as well as novel, lineage-specific paralogs were identified.
Asunto(s)
Genoma Fúngico , Genómica/métodos , Saccharomycetales/genética , Elementos Transponibles de ADN/genética , Elementos Transponibles de ADN/fisiología , Eremothecium/genética , Duplicación de Gen , Genes Fúngicos/genética , Inteínas/genética , Kluyveromyces/genética , Datos de Secuencia Molecular , Sistemas de Lectura Abierta/genética , Filogenia , ARN no Traducido/genética , Saccharomyces/genética , Empalmosomas/metabolismo , Zygosaccharomyces/genéticaRESUMEN
We performed high-throughput sequencing of DNA from fossilized faeces to evaluate this material as a source of information on the genome and diet of Pleistocene carnivores. We analysed coprolites derived from the extinct cave hyena (Crocuta crocuta spelaea), and sequenced 90 million DNA fragments from two specimens. The DNA reads enabled a reconstruction of the cave hyena mitochondrial genome with up to a 158-fold coverage. This genome, and those sequenced from extant spotted (Crocuta crocuta) and striped (Hyaena hyaena) hyena specimens, allows for the establishment of a robust phylogeny that supports a close relationship between the cave and the spotted hyena. We also demonstrate that high-throughput sequencing yields data for cave hyena multi-copy and single-copy nuclear genes, and that about 50 per cent of the coprolite DNA can be ascribed to this species. Analysing the data for additional species to indicate the cave hyena diet, we retrieved abundant sequences for the red deer (Cervus elaphus), and characterized its mitochondrial genome with up to a 3.8-fold coverage. In conclusion, we have demonstrated the presence of abundant ancient DNA in the coprolites surveyed. Shotgun sequencing of this material yielded a wealth of DNA sequences for a Pleistocene carnivore and allowed unbiased identification of diet.
Asunto(s)
Dieta/veterinaria , Heces/química , Genoma , Hyaenidae/genética , Hyaenidae/fisiología , Animales , ADN/genética , ADN/aislamiento & purificación , Fósiles , Datos de Secuencia Molecular , FilogeniaRESUMEN
MOTIVATION: Current computational approaches to function prediction are mostly based on protein sequence classification and transfer of annotation from known proteins to their closest homologous sequences relying on the orthology concept of function conservation. This approach suffers a major weakness: annotation reliability depends on global sequence similarity to known proteins and is poorly efficient for enzyme superfamilies that catalyze different reactions. Structural biology offers a different strategy to overcome the problem of annotation by adding information about protein 3D structures. This information can be used to identify amino acids located in active sites, focusing on detection of functional polymorphisms residues in an enzyme superfamily. Structural genomics programs are providing more and more novel protein structures at a high-throughput rate. However, there is still a huge gap between the number of sequences and available structures. Computational methods, such as homology modeling provides reliable approaches to bridge this gap and could be a new precise tool to annotate protein functions. RESULTS: Here, we present Active Sites Modeling and Clustering (ASMC) method, a novel unsupervised method to classify sequences using structural information of protein pockets. ASMC combines homology modeling of family members, structural alignment of modeled active sites and a subsequent hierarchical conceptual classification. Comparison of profiles obtained from computed clusters allows the identification of residues correlated to subfamily function divergence, called specificity determining positions. ASMC method has been validated on a benchmark of 42 Pfam families for which previous resolved holo-structures were available. ASMC was also applied to several families containing known protein structures and comprehensive functional annotations. We will discuss how ASMC improves annotation and understanding of protein families functions by giving some specific illustrative examples on nucleotidyl cyclases, protein kinases and serine proteases. AVAILABILITY: http://www.genoscope.fr/ASMC/.
Asunto(s)
Proteínas/clasificación , Análisis de Secuencia de Proteína/métodos , Dominio Catalítico , Análisis por Conglomerados , Biología Computacional/métodos , Enzimas/clasificación , Modelos Biológicos , Anotación de Secuencia Molecular , Liasas de Fósforo-Oxígeno/química , Proteínas Quinasas/química , Proteínas/química , Proteínas/metabolismo , Alineación de Secuencia , Serina Proteasas/químicaRESUMEN
Escherichia coli and Shigella O antigens can be inferred using the rfb-restriction fragment length polymorphism (RFLP) molecular test. We present herein a dynamic programming algorithm-based software to compare the rfb-RFLP patterns of clinical isolates with those in a database containing the 171 previously published patterns corresponding to all known E. coli/Shigella O antigens.
Asunto(s)
Dermatoglifia del ADN/métodos , ADN Bacteriano/genética , Escherichia coli/clasificación , Antígenos O/análisis , Polimorfismo de Longitud del Fragmento de Restricción , Shigella/clasificación , Disentería Bacilar/microbiología , Escherichia coli/genética , Infecciones por Escherichia coli/microbiología , Humanos , Serotipificación/métodos , Shigella/genética , Programas InformáticosRESUMEN
BACKGROUND: The Plasmodium falciparum genome (3D7 strain) published in 2002, revealed ~5,400 genes, mostly based on in silico predictions. Experimental data is therefore required for structural and functional assessments of P. falciparum genes and expression, and polymorphic data are further necessary to exploit genomic information to further qualify therapeutic target candidates. Here, we undertook a large scale analysis of a P. falciparum FcB1-schizont-EST library previously constructed by suppression subtractive hybridization (SSH) to study genes expressed during merozoite morphogenesis, with the aim of: 1) obtaining an exhaustive collection of schizont specific ESTs, 2) experimentally validating or correcting P. falciparum gene models and 3) pinpointing genes displaying protein polymorphism between the FcB1 and 3D7 strains. RESULTS: A total of 22,125 clones randomly picked from the SSH library were sequenced, yielding 21,805 usable ESTs that were then clustered on the P. falciparum genome. This allowed identification of 243 protein coding genes, including 121 previously annotated as hypothetical. Statistical analysis of GO terms, when available, indicated significant enrichment in genes involved in "entry into host-cells" and "actin cytoskeleton". Although most ESTs do not span full-length gene reading frames, detailed sequence comparison of FcB1-ESTs versus 3D7 genomic sequences allowed the confirmation of exon/intron boundaries in 29 genes, the detection of new boundaries in 14 genes and identification of protein polymorphism for 21 genes. In addition, a large number of non-protein coding ESTs were identified, mainly matching with the two A-type rRNA units (on chromosomes 5 and 7) and to a lower extent, two atypical rRNA loci (on chromosomes 1 and 8), TARE subtelomeric regions (several chromosomes) and the recently described telomerase RNA gene (chromosome 9). CONCLUSION: This FcB1-schizont-EST analysis confirmed the actual expression of 243 protein coding genes, allowing the correction of structural annotations for a quarter of these sequences. In addition, this analysis demonstrated the actual transcription of several remarkable non-protein coding loci: 2 atypical rRNA, TARE region and telomerase RNA gene. Together with other collections of P. falciparum ESTs, usually generated from mixed parasite stages, this collection of FcB1-schizont-ESTs provides valuable data to gain further insight into the P. falciparum gene structure, polymorphism and expression.
Asunto(s)
Etiquetas de Secuencia Expresada , Genoma de Protozoos , Plasmodium falciparum/genética , Animales , Exones , Biblioteca de Genes , Genes Protozoarios , Intrones , Modelos Genéticos , Datos de Secuencia Molecular , Polimorfismo Genético , Proteínas Protozoarias/genética , ARN Protozoario/genética , ARN Ribosómico/genética , Esquizontes/metabolismo , Alineación de Secuencia , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: Massively parallel DNA sequencing instruments are enabling the decoding of whole genomes at significantly lower cost and higher throughput than classical Sanger technology. Each of these technologies have been estimated to yield assemblies with more problematic features than the standard method. These problems are of a different nature depending on the techniques used. So, an appropriate mix of technologies may help resolve most difficulties, and eventually provide assemblies of high quality without requiring any Sanger-based input. RESULTS: We compared assemblies obtained using Sanger data with those from different inputs from New Sequencing Technologies. The assemblies were systematically compared with a reference finished sequence. We found that the 454 GSFLX can efficiently produce high continuity when used at high coverage. The potential to enhance continuity by scaffolding was tested using 454 sequences from circularized genomic fragments. Finally, we explore the use of Solexa-Illumina short reads to polish the genome draft by implementing a technique to correct 454 consensus errors. CONCLUSION: High quality drafts can be produced for small genomes without any Sanger data input. We found that 454 GSFLX and Solexa/Illumina show great complementarity in producing large contigs and supercontigs with a low error rate.
Asunto(s)
Genoma Bacteriano , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Biología Computacional/métodos , Mapeo Contig , Biblioteca de Genes , Análisis de Secuencia de ADN/instrumentaciónRESUMEN
Single nucleotide polymorphism (SNP) markers have been shown to be useful in genetic investigations of medically important parasites and their hosts. In this paper, we describe the prediction and validation of SNPs in ESTs of Schistosoma mansoni. We used 107,417 public sequences of S. mansoni and identified 15,614 high-quality candidate SNPs in 12,184 contigs. The presence of predicted SNPs was observed in well characterized antigens and vaccine candidates such as those coding for myosin; Sm14 and Sm23; cathepsin B and triosephosphate isomerase (TPI). Additionally, SNPs were experimentally validated for the cathepsin B. A comparative model of the S. mansoni cathepsin B was built for predicting the possible consequences of amino acid substitutions on the protein structure. An analysis of the substitutions indicated that the amino acids were mostly located on the surface of the molecule, and we found no evidence for a significant conformational change of the enzyme. However, at least one of the substitutions could result in a structural modification of an epitope.
Asunto(s)
Schistosoma mansoni/genética , Animales , Antígenos Helmínticos/genética , Catepsina B/química , Catepsina B/genética , Proteínas de Transporte de Ácidos Grasos/genética , Genes de Helminto/genética , Proteínas del Helminto/genética , Modelos Moleculares , Miosinas/genética , Polimorfismo de Nucleótido Simple , Triosa-Fosfato Isomerasa/genéticaRESUMEN
Balanced chromosomal rearrangements (BCR) are associated with abnormal phenotypes in approximately 6% of balanced translocations and 9.4% of balanced inversions. Abnormal phenotypes can be caused by disruption of genes at the breakpoints, deletions, or positional effects. Conventional cytogenetic techniques have a limited resolution and do not enable a thorough genetic investigation. Molecular techniques applied to BCR carriers can contribute to the characterization of this type of chromosomal rearrangement and to the phenotype-genotype correlation. Fifteen individuals among 35 with abnormal phenotypes and BCR were selected for further investigation by molecular techniques. Chromosomal rearrangements involved 11 reciprocal translocations, 3 inversions, and 1 balanced insertion. Array genomic hybridization (AGH) was performed and genomic imbalances were detected in 20% of the cases, 1 at a rearrangement breakpoint and 2 further breakpoints in other chromosomes. Alterations were further confirmed by FISH and associated with the phenotype of the carriers. In the analyzed cases not showing genomic imbalances by AGH, next-generation sequencing (NGS), using whole genome libraries, prepared following the Illumina TruSeq DNA PCR-Free protocol (Illumina®) and then sequenced on an Illumina HiSEQ 2000 as 150-bp paired-end reads, was done. The NGS results suggested breakpoints in 7 cases that were similar or near those estimated by karyotyping. The genes overlapping 6 breakpoint regions were analyzed. Follow-up of BCR carriers would improve the knowledge about these chromosomal rearrangements and their consequences.
RESUMEN
The cytidine analogues azacytidine and 5-aza-2'-deoxycytidine (decitabine) are commonly used to treat myelodysplastic syndromes, with or without a myeloproliferative component. It remains unclear whether the response to these hypomethylating agents results from a cytotoxic or an epigenetic effect. In this study, we address this question in chronic myelomonocytic leukaemia. We describe a comprehensive analysis of the mutational landscape of these tumours, combining whole-exome and whole-genome sequencing. We identify an average of 14±5 somatic mutations in coding sequences of sorted monocyte DNA and the signatures of three mutational processes. Serial sequencing demonstrates that the response to hypomethylating agents is associated with changes in DNA methylation and gene expression, without any decrease in the mutation allele burden, nor prevention of new genetic alteration occurence. Our findings indicate that cytosine analogues restore a balanced haematopoiesis without decreasing the size of the mutated clone, arguing for a predominantly epigenetic effect.