RESUMEN
BACKGROUND: Several studies suggested that cavefish populations of Astyanax mexicanus settled during the Late Pleistocene. This implies that the cavefish's most conspicuous phenotypic changes, blindness and depigmentation, and more cryptic characters important for cave life, evolved rapidly. RESULTS: Using the published genomes of 47 Astyanax cavefish from la Cueva de El Pachón, El Sótano de la Tinaja, La Cueva Chica and El Sótano de Molino, we searched for putative loss-of-function mutations in previously defined sets of genes, i.e., vision, circadian clock and pigmentation genes. Putative non-functional alleles for four vision genes were identified. Then, we searched genome-wide for putative non-functional alleles in these four cave populations. Among 512 genes with segregating putative non-functional alleles in cavefish that are absent in surface fish, we found an enrichment in visual perception genes. Among cavefish populations, different levels of shared putative non-functional alleles were found. Using a subset of 12 genes for which putative loss-of-function mutations were found, we extend the analysis of shared pseudogenes to 11 cave populations. Using a subset of six genes for which putative loss-of-function mutations were found in the El Sótano del Toro population, where extensive hybridization with surface fish occurs, we found a correlation between the level of eye regression and the amount of putative non-functional alleles. CONCLUSIONS: We confirm that very few putative non-functional alleles are present in a large set of vision genes, in accordance with the recent origin of Astyanax mexicanus cavefish. Furthermore, the genome-wide analysis indicates an enrichment of putative loss-of-function alleles in genes with vision-related GO-terms, suggesting that visual perception may be the function chiefly impacted by gene losses related to the shift from a surface to a cave environment. The geographic distribution of putative loss-of-function alleles newly suggests that cave populations from Sierra de Guatemala and Sierra de El Abra share a common origin, albeit followed by independent evolution for a long period. It also supports that populations from the Micos area have an independent origin. In El Sótano del Toro, the troglomorphic phenotype is maintained despite massive introgression of the surface genome.
Asunto(s)
Characidae , Animales , Alelos , Characidae/genética , Mutación , Ceguera/genética , Visión OcularRESUMEN
Background: The axolotl, Ambystoma mexicanum is a unique biological model for complete tissue regeneration. Is a neotenic endangered species and is highly susceptible to environmental stress, including infectious disease. In contrast to other amphibians, the axolotl is particularly vulnerable to certain viral infections. Like other salamanders, the axolotl genome is one of the largest (32 Gb) and the impact of genome size on Ig loci architecture is unknown. To better understand the immune response in axolotl, we aimed to characterize the immunoglobulin loci of A. mexicanum and compare it with other model vertebrates. Methods: The most recently published genome sequence of A. mexicanum (V6) was used for alignment-based annotation and manual curation using previously described axolotl Ig sequences or reference sequences from other vertebrates. Gene models were further curated using A. mexicanum spleen RNA-seq data. Human, Xenopus tropicalis, Danio rerio (zebrafish), and eight tetrapod reference genomes were used for comparison. Results: Canonical A. mexicanum heavy chain (IGH), lambda (IGL), sigma (IGS), and the putative surrogate light chain (SLC) loci were identified. No kappa locus was found. More than half of the IGHV genes and the IGHF gene are pseudogenes and there is no clan I IGHV genes. Although the IGH locus size is proportional to genome size, we found local size restriction in the IGHM gene and the V gene intergenic distances. In addition, there were V genes with abnormally large V-intron sizes, which correlated with loss of gene functionality. Conclusion: The A. mexicanum immunoglobulin loci share the same general genome architecture as most studied tetrapods. Consistent with its large genome, Ig loci are larger; however, local size restrictions indicate evolutionary constraints likely to be imposed by high transcriptional demand of certain Ig genes, as well as the V(D)J recombination over very long genomic distance ranges. The A. mexicanum has undergone an extensive process of Ig gene loss which partially explains a reduced potential repertoire diversity that may contribute to its impaired antibody response.
Asunto(s)
Ambystoma mexicanum , Inmunoglobulinas , Animales , Ambystoma mexicanum/genética , Genoma , Genómica , Inmunoglobulinas/genéticaRESUMEN
Whole-genome sequence analyses have significantly contributed to the understanding of virulence and evolution of the Mycobacterium tuberculosis complex (MTBC), the causative pathogens of tuberculosis. Most MTBC evolutionary studies are focused on single nucleotide polymorphisms and deletions, but rare studies have evaluated gene content, whereas none has comprehensively evaluated pseudogenes. Accordingly, we describe an extensive study focused on quantifying and predicting possible functions of MTBC and Mycobacterium canettii pseudogenes. Using NCBI's PGAP-detected pseudogenes, we analysed 25â837 pseudogenes from 158 MTBC and M. canetii strains and combined transcriptomics and proteomics of M. tuberculosis H37Rv to gain insights about pseudogenes' expression. Our results indicate significant variability concerning rate and conservancy of in silico predicted pseudogenes among different ecotypes and lineages of tuberculous mycobacteria and pseudogenization of important virulence factors and genes of the metabolism and antimicrobial resistance/tolerance. We show that in silico predicted pseudogenes contribute considerably to MTBC genetic diversity at the population level. Moreover, the transcription machinery of M. tuberculosis can fully transcribe most pseudogenes, indicating intact promoters and recent pseudogene evolutionary emergence. Proteomics of M. tuberculosis and close evaluation of mutational lesions driving pseudogenization suggest that few in silico predicted pseudogenes are likely capable of neofunctionalization, nonsense mutation reversal, or phase variation, contradicting the classical definition of pseudogenes. Such findings indicate that genome annotation should be accompanied by proteomics and protein function assays to improve its accuracy. While indels and insertion sequences are the main drivers of the observed mutational lesions in these species, population bottlenecks and genetic drift are likely the evolutionary processes acting on pseudogenes' emergence over time. Our findings unveil a new perspective on MTBC's evolution and genetic diversity.
Asunto(s)
Mycobacterium tuberculosis , Seudogenes , Antiinfecciosos , Codón sin Sentido , Elementos Transponibles de ADN , Mycobacterium tuberculosis/efectos de los fármacos , Mycobacterium tuberculosis/genética , Seudogenes/genética , Factores de Virulencia/genética , Farmacorresistencia Bacteriana/genéticaRESUMEN
This contribution endeavored to investigate the genetic structure and gene flow of the flood mosquito, Aedes vexans (Meigen, 1830). Using partial sequences of the mitochondrial COI gene, available from BOLD Systems and GenBank, the Haplotypic (Hd) and nucleotide (π) gene diversity, genetic structuring and gene flow of A. vexans at the global, continental, and country levels were calculated. In total, 1,184 sequences were obtained, distributed among America (88.60%; represented by EUA and Canada), Europe (7.35%), Asia (3.89%), and Africa (0.17%). From these, 395 haplotypes (H) without presence of pseudogenes (NUMTs) were detected. The cluster analyses grouped the haplotypes into six clades. Clade I includes haplotypes from countries in America and Europe, while clades II and III include haplotypes exclusively from Asia and Europe; clade IV grouped only one haplotype from Africa and clade V grouped haplotypes from America and Africa. The global Hd and π were 0.92 and 0.01, respectively. In addition, there is evidence of genetic structuring among continents (7.07%), countries (1.62%), and within countries (91.30%; FST = 0.08, p < 0.05) and no isolation by distance was detected (r = 0.003, p > 0.05). The genetic diversity of A. vexans was found to be greater in North America than in other continents. Although this provisional conclusion might be influenced by a sample bias, since 88.60% of the sequences are from America, is also plausible to consider that America corresponds to the ancestral distribution area of the flood mosquito. This hypothesis needs further testing, using a more comprehensive sample from other continents. Additionally, the six clusters found and their geographical distribution do not support previous proposals of splitting the genus into three subspecies confined to certain geographical areas.
Asunto(s)
Animales , Variación Genética , ADN Mitocondrial/análisis , Culicidae/clasificación , Culicidae/genética , FilogeografíaRESUMEN
Whole-genome sequence analyses have significantly contributed to the understanding of virulence and evolution of the Mycobacterium tuberculosis complex (MTBC), the causative pathogens of tuberculosis. Most MTBC evolutionary studies are focused on single nucleotide polymorphisms and deletions, but rare studies have evaluated gene content, whereas none has comprehensively evaluated pseudogenes. Accordingly, we describe an extensive study focused on quantifying and predicting possible functions of MTBC and Mycobacterium canettii pseudogenes. Using NCBI’s PGAP-detected pseudogenes, we analysed 25 837 pseudogenes from 158 MTBC and M. canetii strains and combined transcriptomics and proteomics of M. tuberculosis H37Rv to gain insights about pseudogenes' expression. Our results indicate significant variability concerning rate and conservancy of in silico predicted pseudogenes among different ecotypes and lineages of tuberculous mycobacteria and pseudogenization of important virulence factors and genes of the metabolism and antimicrobial resistance/tolerance. We show that in silico predicted pseudogenes contribute considerably to MTBC genetic diversity at the population level. Moreover, the transcription machinery of M. tuberculosis can fully transcribe most pseudogenes, indicating intact promoters and recent pseudogene evolutionary emergence. Proteomics of M. tuberculosis and close evaluation of mutational lesions driving pseudogenization suggest that few in silico predicted pseudogenes are likely capable of neofunctionalization, nonsense mutation reversal, or phase variation, contradicting the classical definition of pseudogenes. Such findings indicate that genome annotation should be accompanied by proteomics and protein function assays to improve its accuracy. While indels and insertion sequences are the main drivers of the observed mutational lesions in these species, population bottlenecks and genetic drift are likely the evolutionary processes acting on pseudogenes' emergence over time. Our findings unveil a new perspective on MTBC’s evolution and genetic diversity.
RESUMEN
ABSTRACT In Colombia Aedes aegypti is present in 80% of the country up to 2,300 m; however, little is known of its genetic relations within a country context and, hence, within a broader context, for example, America. The aforementioned, herein, analyzed the gene flow within a context of the Americas, its directionality and genetic diversity of the mitochondrial lineages in the A. aegypti populations for Colombia. This called for the use of the sequences for A. aegypti available of the mitochondrial ND4 gene in the GenBank for Colombia and the American continent. No presence was found of nuclear mitochondrial pseudogenes (NUMTs) for Colombia. It is estimated that in Colombia the gene flow of the A. aegypti populations is occurring from the southeast and northeast toward the center of the country. In comparison with the mitochondrial sequences for America, the vector's haplotypes in Colombia suggest connections between the populations of mosquitoes from the south with those from the north of the continent. The gene flow model at continental scale suggests bidirectional connections between the populations from the north of the continent with those from the south, while at South American scale it proposes the gene flow in all the directions with respect to the Colombian. The Colombian A. aegypti vector monitoring and control strategies must pay special attention to the vector's points of entry into Colombia related with Peru, Venezuela, Brazil, Mexico, and North America to avoid the entry of populations with characteristics like resistance to insecticides or vector competition.
RESUMEN
Os avanços metodológicos e instrumentais decorrentes do Projeto Genoma Humano formaram o arcabouço necessário para o surgimento das tecnologias de sequenciamento de DNA de Nova Geração, as quais se caracterizam por um custo reduzido, uma baixa demanda operacional e a produção de um grande volume de dados por experimento. Concomitantemente a isso, o aumento no poder de processamento computacional permitiu o desenvolvimento de análises genéticas em larga escala, de modo que, atualmente, é possível estudar características genômicas individualizadas e, até então, pouco ou nunca exploradas. Dentre essas características, aquelas relacionadas às variações estruturais em genomas têm recebido bastante atenção. Os pseudogenes processados, ou retrocópias, são variações estruturais causadas pela duplicação de genes codificadores mediante à transposição de seu RNA mensageiro maduro pela maquinaria enzimática de LINE- 1. As retrocópias podem estar fixadas, ou seja, presentes em todos os genomas de uma dada espécie, os quais são representados pela montagem modelo do genoma de referência, ou podem não estar fixadas, sendo polimórficas, germinativas ou somáticas. No entanto, o conhecimento acerca das retrocópias não fixadas ainda é limitado devido à falta de ferramentas de bioinformática dedicadas a sua identificação e anotação em dados de sequenciamento de DNA. Posto isso, este trabalho apresenta o sideRETRO um programa computacional especializado na detecção de pseudogenes processados ausentes do genoma de referência, mas presentes em dados de sequenciamento de genoma completo e exoma de outros indivíduos. Além de apontar para a presença de retrocópias não fixadas, o sideRETRO é capaz de anotar várias outras características relacionadas a esses evento, tais como: a coordenada genômica de inserção do pseudogene processado, a qual constitui o cromossomo, o ponto de inserção e a fita de DNA (líder or retardada); o contexto genômico do evento (exônico, intrônico ou intergênico); a genotipagem (presente ou ausente) e a haplotipagem (em homozigose ou heterozigose). Para atestar a eficiência da ferramenta, o sideRETRO foi executado para dados simulados e para dados reais validados experimentalmente por um grupo independente. Portanto, em resumo, nesta tese são descritos o desenvolvimento e o uso do sideRETRO uma ferramenta computacional robusta e eficiente, designada para identificar e anotar pseudogenes processados não fixados. Por fim, vale destacar que o sideRETRO preenche uma lacuna metodológica e possibilita novas hipóteses e investigações sistemáticas no campo de chamada de variantes estruturais
The methodological and instrumental advances resulting from the Human Genome Project have created the necessary framework to the emergence of Next Generation DNA sequencing technologies, which are characterized by a reduced cost, low operational demand and the generation of a large volume of data per experiment. Concomitantly with this, the increase in computational processing power has driven the development of large-scale genetic analyses, which allowed us to study individualized genomic traits little or never explored before. Among these characteristics, those related to structural variations in genomes have received much attention. Processed pseudogenes, or retrocopies, are structural variations caused by the duplication of coding genes through the transposition of their mature messenger RNA by the LINE-1 enzymatic machinery. Retrocopies can be fixed (i.e., present in all genomes of a given species and included into the assembly of the reference genome) or unfixed, being polymorphic, germinal or somatic. However, knowledge about unfixed retrocopies is still limited due to the lack of bioinformatics tools dedicated to their identification and annotation in DNA sequencing data. Therefore, this work presents sideRETRO a computer program specialized in the detection of processed pseudogenes absent from the reference genome, but present in whole genome and exome sequencing data from other individuals. In addition to pointing out the presence of unfixed retrocopies, sideRETRO is able to annotate several other characteristics related to these events, such as: the genomic coordinate of the processed pseudogene insetion, which constitutes the chromosome, the insertion point and the DNA strand (leader or retard); the genomic context of the event (exonic, intronic or intergenic); genotyping (present or absent) and haplotyping (homozygous or heterozygous). To certify the sideRETRO efficiency, it was run on simulated data and on real data experimentally validated by an independent group. Therefore, in summary, this thesis describes the development and use of sideRETRO a robust and efficient computational tool, designed to identify and annotate unfixed processed pseudogenes. Finally, it is worth noting that sideRETRO fills a methodological gap and allows new hypotheses and systematic investigations in the field of structural variant calling
Asunto(s)
Polimorfismo Genético/genética , Biología Computacional/clasificación , Biología Computacional/instrumentación , Costos y Análisis de Costo , Genómica/instrumentación , Análisis de Secuencia de ADN/instrumentación , Codificación ClínicaRESUMEN
DNA barcoding has become a standard method for species identification in taxonomically complex groups. An important step of the barcoding process is the construction of a library of voucher-based material that was properly identified by independent methods, free of inaccurate identification, and paralogs. We provide here a cytochrome oxidase I (mt-Co1) DNA barcode database for species of the genus Oligoryzomys, based on type material and karyotyped specimens, and anchored on the mitochondrial genome of one species of Oligoryzomys, O. stramineus. To evaluate the taxonomic determination of new COI sequences, we assessed species intra/interspecific genetic distances (barcode gap), performed the General Mixed Yule Coalescent method (GMYC) for lineages' delimitation, and identified diagnostic nucleotides for each species of Oligoryzomys. Phylogenetic analyses of Oligoryzomys were performed on 2 datasets including 14 of the 23 recognized species of this genus: a mt-Co1 only matrix, and a concatenated matrix including mt-Co1, cytochrome b (mt-Cytb), and intron 7 of the nuclear fibrinogen beta chain gene (i7Fgb). We recovered nuclear-mitochondrial translocated (Numts) pseudogenes on our samples and identified several published sequences that are cases of Numts. We analyzed the rate of non-synonymous and synonymous substitution, which were higher in Numts in comparison to mtDNA sequences. GMYC delimitations and DNA barcode gap results highlight the need for further work that integrate molecular, karyotypic, and morphological analyses, as well as additional sampling, to tackle persistent problems in the taxonomy of Oligoryzomys.
Asunto(s)
Arvicolinae/genética , Núcleo Celular/genética , Código de Barras del ADN Taxonómico , Bases de Datos Genéticas , Genoma Mitocondrial/genética , Dinámicas Mitocondriales/genética , Animales , Especificidad de la EspecieRESUMEN
BACKGROUND: Universal stress proteins (USPs) are present in all domains of life. Their expression is upregulated in response to a large variety of stress conditions. The functional diversity found in this protein family, paired with the sequence degeneration of the characteristic ATP-binding motif, suggests a complex evolutionary pattern for the paralogous USP-encoding genes. In this work, we investigated the origin, genomic organization, expression patterns and evolutionary history of the USP gene family in species of the phylum Platyhelminthes. RESULTS: Our data showed a cluster organization, a lineage-specific distribution, and the presence of several pseudogenes among the USP gene copies identified. The absence of a well conserved -CCAATCA- motif in the promoter region was positively correlated with low or null levels of gene expression, and with amino acid changes within the ligand binding motifs. Despite evidence of the pseudogenization of various USP genes, we detected an important functional divergence at several residues, mostly located near sites that are critical for ligand interaction. CONCLUSIONS: Our results provide a broad framework for the evolution of the USP gene family, based on the emergence of new paralogs that face very contrasting fates, including pseudogenization, subfunctionalization or neofunctionalization. This framework aims to explain the sequence and functional diversity of this gene family, providing a foundation for future studies in other taxa in which USPs occur.
Asunto(s)
Evolución Molecular , Proteínas de Choque Térmico/genética , Platelmintos/genética , Homología de Secuencia de Aminoácido , Secuencia de Aminoácidos , Animales , Duplicación de Gen , Regulación de la Expresión Génica , Variación Genética , Proteínas de Choque Térmico/química , Modelos Moleculares , Familia de Multigenes , Motivos de Nucleótidos/genética , Filogenia , Seudogenes , Selección GenéticaRESUMEN
Lynch syndrome (LS) is an autosomal dominant disorder, with high penetrance that affects approximately 3% of the cases of colorectal cancer. Affected individuals inherit germline mutations in genes responsible for DNA mismatch repair, mainly at MSH2, MLH1, MSH6 and PMS2. The molecular screening of these individuals is frequently costly and time consuming due to the large size of these genes. In addition, PMS2 mutation detection is often a challenge because there are 16 different pseudogenes identified until now. In the present work we evaluate a molecular screening strategy based in next generation sequencing (NGS) in order to optimize the mutation detection in LS patients. We established 16 multiplex PCRs for MSH2, MSH6 and MLH1 and 5 Long-Range PCRs for PMS2, coupled with NGS. The strategy was validated by screening 66 patients who filled Bethesda and Amsterdam criteria for LS from health institutions of Brazil. The mean depth of coverage for MSH2, MSH6, MLH1 and PMS2 genes was 7.988, 36.313, 11.899 and 4.772 times, respectively. Ninety-four variants were found in exons and flanking intron/exon regions for the four MMR genes. Twenty-five were pathogenic or VUS and found in 32 patients (7 in MSH2, 5 in MSH6, 12 in MLH1 e 1 in PMS2). All variants were confirmed by Sanger sequencing. The strategy was efficient to reduce time consuming and costs to identify genetic changes at these MMR genes, reducing in three times the number of PCR reactions performed per patient and was efficient in identifying variants at PMS2 gene.
Asunto(s)
Neoplasias Colorrectales Hereditarias sin Poliposis/genética , Reparación de la Incompatibilidad de ADN/genética , Pruebas Genéticas/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Mutación de Línea Germinal/genética , Humanos , Masculino , Persona de Mediana Edad , Reacción en Cadena de la Polimerasa/métodos , Adulto JovenRESUMEN
AIMS: Malignant tumours from the upper aerodigestive tract are grouped collectively in the class of head and neck squamous cell carcinoma (HNSCC). The head and neck tumours were responsible for more than 500 000 cancer cases in 2012, accounting for the sixth highest incidence rate and mortality worldwide among all tumour types. Laryngeal squamous cell carcinoma (LSCC) possesses the second highest incidence rate among all HNSCC. Despite significant advances in surgery and radiotherapy during the last few decades, no treatment has been shown to achieve a satisfactory therapeutic outcome and the mortality rate of LSCC is still high, with a 5-year survival rate of 64%. Therefore, further investigations are required to identify the pathogenesis of LSCC. METHODS AND RESULTS: In order to search for new LSCC biomarkers, we have analysed the expression of the HMGA family members, HMGA1 and HMGA2, by quantitative reverse transcription-polymerase chain reaction (qRT-PCR) and immunohistochemistry. HMGA proteins are usually absent in the healthy adult tissues. In contrast, their constitutive expression is a feature of several neoplasias, being associated with a highly malignant phenotype and reduced survival. Here, we report HMGA2 overexpression in larynx carcinomas. Conversely, HMGA1 does not show any differences in its expression between normal and carcinoma tissues. Interestingly, HMGA2 overexpression appears associated with that of two HMGA1-pseudogenes, HMGA1P6 and HMGA1P7, acting as a sponge for HMGA1- and HMGA2-targeting microRNAs and involved in several human cancers. CONCLUSIONS: Therefore, HMGA2 overexpression appears to be a strong feature of larynx carcinoma, supporting its detection as a valid tool for the diagnosis of these malignancies.
Asunto(s)
Carcinoma/genética , Regulación Neoplásica de la Expresión Génica , Proteína HMGA1a/genética , Proteína HMGA2/genética , Neoplasias Laríngeas/genética , Anciano , Anciano de 80 o más Años , Biomarcadores de Tumor/genética , Biomarcadores de Tumor/metabolismo , Carcinoma/metabolismo , Carcinoma/patología , Femenino , Proteína HMGA1a/metabolismo , Proteína HMGA2/metabolismo , Humanos , Neoplasias Laríngeas/metabolismo , Neoplasias Laríngeas/patología , Laringe/metabolismo , Laringe/patología , Masculino , MicroARNs/genética , MicroARNs/metabolismo , Persona de Mediana EdadRESUMEN
Pseudocorynosoma tepehuanesi n. sp., is described from the intestine of the ruddy duck Oxyura jamaicensis Gmelin, 1789 from single locality from northern Mexico. The new species is mainly distinguished morphologically from the other five described species of Pseudocorynosoma from the Americas (P. constrictum, type species, P. peposacae, P. anatarium, P. enrietti and P. iheringi) associated with waterfowl species by possessing a proboscis with 15 longitudinal rows with 7-8 hooks each, a trunk expanded anteriorly and by having smaller lemniscus. Partial sequences of the mitochondrial gene cytochrome c oxidase subunit I (cox 1) and the large subunit (LSU) of ribosomal DNA including the domains D2+D3 were used independently to corroborate the morphological distinction between the new species and other two congeneric species (P. constrictum and P. anatarium) from North America. The genetic divergence estimated among the new species and the other two species ranged from 15 to 18% for cox 1 and from 3.2 to 4% for LSU. The cox 1 alignment shows 24 sequences from P. anatarium with abnormalities, which were defined as pseudogenes due the presence of insertions, deletions and premature stop codons. Maximum likelihood and Bayesian inference analyses with each data set showed that the acanthocephalans from ruddy duck represent an independent clade with strong bootstrap support and posterior probabilities. The phylogenetic tree inferred with cox 1 gene placed all the pseudogenes from P. anatarium in single clade suggesting that those genes arose after speciation process within genus Pseudocorynosoma. The morphological evidence, plus the monophyly in both phylogenetic analyses indicate that the acanthocephalans collected from intestine of the ruddy duck from northern Mexico represent a new species.
Asunto(s)
Acantocéfalos/anatomía & histología , Acantocéfalos/genética , Enfermedades de las Aves/parasitología , Ciclooxigenasa 1/genética , Patos/parasitología , Helmintiasis Animal/parasitología , Seudogenes , Acantocéfalos/clasificación , Acantocéfalos/enzimología , Animales , Teorema de Bayes , ADN de Helmintos/genética , ADN Ribosómico , Parasitosis Intestinales/parasitología , Parasitosis Intestinales/veterinaria , Intestinos/parasitología , México , Filogenia , Alineación de SecuenciaRESUMEN
Anaplasma marginale (A. marginale), es una bacteria del orden de las Rickettsias que ocasiona la anaplasmosis bovina en regiones tropicales y subtropicales del mundo. Esta enfermedad, trasmitida principalmente por tábanos y garrapatas, se desarrolla típicamente en una etapa inicial aguda con manifestaciones clínicas caracterizadas principalmente por anemia y fiebre. Después de un par de meses, los animales recuperan su condición física y se hacen asintomáticos, siendo incapaces de eliminar completamente la bacteria, convirtiéndose en animales persistentemente infectados. Esto se debe a la capacidad de A. marginale para evadir el sistema inmune. En este sentido, se ha demostrado la existencia de un mecanismo de variación antigénica en las proteínas MSP1, MSP2 y MSP3 de la bacteria. Al evaluar la familia multigénica que codifica para la MSP2, se determinó que está conformada por dos regiones conservadas que flanquean una región central hipervariable. De esta manera, al expresarse cada una de las 52 variables de la MSP2, se expresa un epítope diferente. Cuando se describió el genoma completo de este hemotrópico, se encontró también la presencia de 16 pseudogenes msp2, los cuales pueden ser recombinados dentro del sitio de expresión del operón de la MSP2, constituyendo un segundo mecanismo de variación. Además de ello, los fragmentos hipervaribles y los pseudogenes se pueden combinar entre sí, en un proceso denominado conversión génica, creando nuevos epítopes recombinantes, confiriendo una capacidad de variabilidad antigénica casi infinita al A. marginale (tercer mecanismo). Un cuarto mecanismo de variación antigénica, lo constituye la dimerización de la MSP2 sobre la superficie del A. marginale, debido a que la expresión simultánea de variantes conforman epítopes únicos. En conclusión, la recombinación génica de la MSP2 y su dimerización en la membrana, constituye un mecanismo muy eficiente de variación antigénica para eludir el sistema inmunológico del hospedador.
Anaplasma marginale (A. marginale) is a bacterium of the Rickettsiales order that causes bovine anaplasmosis in tropical and subtropical regions worldwide. This disease, mainly transmitted by ticks and horseflies, typically develops in an initial acute stage, with clinical signs characterized by anemia and fever. After two months, animals recover their original physical condition and become asymptomatic, being unable to completely eliminate the bacterium, turning into persistently infected animals. This is due to the ability of A.marginale to evade the immune system. In this regard, the existence of a mechanism for antigenic variation in proteins of the bacterium, such as MSP1, MSP2, and MSP3, has been demonstrated. When assessing the multigenic family which encodes for MSP2, it was determined that it consists of two conserved regions flanking a central hypervariable region. Thus, when expressing each of the 52 MSP2 variables, a different epitope is also expressed. When the entire genome of this parasite was decoded, the presence of 16 pseudogenes for MSP2 was also discovered. These pseudogenes can be recombined within the operon expression site of MSP2, providing a second mechanism of variation. Moreover, both the hypervariable fragments and pseudogenes can combine among them, in a process called gene conversion, creating new recombinant epitopes, conferring the A. marginale with an almost infinite capacity for antigenic variability (third mechanism). A fourth mechanism of antigentic variation consists of the dimerization of MSP2 on the surface of A. marginale, because the simultaneous expression of variants creates unique epitopes. In conclusion, gene recombination of MSP2 along with the dimerization of MSP2 on the membrane provides a very efficient mechanism for antigenic variation for evading the hosts immune system.
RESUMEN
The Enteritidis and Dublin serovars of Salmonella enterica are closely related, yet they differ significantly in pathogenicity and epidemiology. S. Enteritidis is a broad host range serovar that commonly causes gastroenteritis and infrequently causes invasive disease in humans. S. Dublin mainly colonizes cattle but upon infecting humans often results in invasive disease.To gain a broader view of the extent of these differences we conducted microarray-based comparative genomics between several field isolates from each serovar. Genome degradation has been correlated with host adaptation in Salmonella, thus we also compared at whole genome scale the available genomic sequences of them to evaluate pseudogene composition within each serovar.Microarray analysis revealed 3771 CDS shared by both serovars while 33 were only present in Enteritidis and 87 were exclusive to Dublin. Pseudogene evaluation showed 177 inactive CDS in S. Dublin which correspond to active genes in S. Enteritidis, nine of which are also inactive in the host adapted S. Gallinarum and S. Choleraesuis serovars. Sequencing of these 9 CDS in several S. Dublin clinical isolates revealed that they are pseudogenes in all of them, indicating that this feature is not peculiar to the sequenced strain. Among these CDS, shdA (Peyer´s patch colonization factor) and mglA (galactoside transport ATP binding protein), appear also to be inactive in the human adapted S. Typhi and S. Paratyphi A, suggesting that functionality of these genes may be relevant for the capacity of certain Salmonella serovars to infect a broad range of hosts.