Pesquisa | BVS - MINISTÉRIO DA SAÚDE

Joint, multifaceted genomic analysis enables diagnosis of diverse, ultra-rare monogenic presentations.

Nadimpalli Kobren, Shilpa; Moldovan, Mikhail A; Reimers, Rebecca; Traviglia, Daniel; Li, Xinyun; Barnum, Danielle; Veit, Alexander; Willett, Julian; Berselli, Michele; Ronchetti, William; Sherwood, Richard; Krier, Joel; Kohane, Isaac S; Sunyaev, Shamil R.

bioRxiv ; 2024 Feb 16.

Artigo em Inglês | MEDLINE | ID: mdl-38405764

RESUMO

Genomics for rare disease diagnosis has advanced at a rapid pace due to our ability to perform "N-of-1" analyses on individual patients. The increasing sizes of ultra-rare, "N-of-1" disease cohorts internationally newly enables cohort-wide analyses for new discoveries, but well-calibrated statistical genetics approaches for jointly analyzing these patients are still under development.1,2 The Undiagnosed Diseases Network (UDN) brings multiple clinical, research and experimental centers under the same umbrella across the United States to facilitate and scale N-of-1 analyses. Here, we present the first joint analysis of whole genome sequencing data of UDN patients across the network. We apply existing and introduce new, well-calibrated statistical methods for prioritizing disease genes with de novo recurrence and compound heterozygosity. We also detect pathways enriched with candidate and known diagnostic genes. Our computational analysis, coupled with a systematic clinical review, recapitulated known diagnoses and revealed new disease associations. We make our gene-level findings and variant-level information across the cohort available in a public-facing browser (https://dbmi-bgm.github.io/udn-browser/). These results show that N-of-1 efforts should be supplemented by a joint genomic analysis across cohorts.

Nontriplet feature of genetic code in Euplotes ciliates is a result of neutral evolution.

Gaydukova, Sofya A; Moldovan, Mikhail A; Vallesi, Adriana; Heaphy, Stephen M; Atkins, John F; Gelfand, Mikhail S; Baranov, Pavel V.

Proc Natl Acad Sci U S A ; 120(22): e2221683120, 2023 05 30.

Artigo em Inglês | MEDLINE | ID: mdl-37216548

RESUMO

The triplet nature of the genetic code is considered a universal feature of known organisms. However, frequent stop codons at internal mRNA positions in Euplotes ciliates ultimately specify ribosomal frameshifting by one or two nucleotides depending on the context, thus posing a nontriplet feature of the genetic code of these organisms. Here, we sequenced transcriptomes of eight Euplotes species and assessed evolutionary patterns arising at frameshift sites. We show that frameshift sites are currently accumulating more rapidly by genetic drift than they are removed by weak selection. The time needed to reach the mutational equilibrium is several times longer than the age of Euplotes and is expected to occur after a several-fold increase in the frequency of frameshift sites. This suggests that Euplotes are at an early stage of the spread of frameshifting in expression of their genome. In addition, we find the net fitness burden of frameshift sites to be noncritical for the survival of Euplotes. Our results suggest that fundamental genome-wide changes such as a violation of the triplet character of genetic code can be introduced and maintained solely by neutral evolution.

Assuntos

Cilióforos , Euplotes , Euplotes/genética , Euplotes/metabolismo , Código Genético , Sequência de Bases , Códon de Terminação/genética , Códon de Terminação/metabolismo , Cilióforos/genética , Deriva Genética

Spread of endemic SARS-CoV-2 lineages in Russia before April 2021.

Klink, Galya V; Safina, Ksenia R; Garushyants, Sofya K; Moldovan, Mikhail; Nabieva, Elena; Komissarov, Andrey B; Lioznov, Dmitry; Bazykin, Georgii A.

PLoS One ; 17(7): e0270717, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35857745

RESUMO

In 2021, the COVID-19 pandemic was characterized by global spread of several lineages with evidence for increased transmissibility. Throughout the pandemic, Russia has remained among the countries with the highest number of confirmed COVID-19 cases, making it a potential hotspot for emergence of novel variants. Here, we show that among the globally significant variants of concern that have spread globally by late 2020, alpha (B.1.1.7), beta (B.1.351) or gamma (P.1), none have been sampled in Russia before the end of 2020. Instead, between summer 2020 and spring 2021, the epidemic in Russia has been characterized by the spread of two lineages that were rare in most other countries: B.1.1.317 and a sublineage of B.1.1 including B.1.1.397 (hereafter, B.1.1.397+). Their frequency has increased concordantly in different parts of Russia. On top of these lineages, in late December 2020, alpha (B.1.1.7) emerged in Russia, reaching a frequency of 17.4% (95% C.I.: 12.0%-24.4%) in March 2021. Additionally, we identify three novel distinct lineages, AT.1, B.1.1.524 and B.1.1.525, that have started to spread, together reaching the frequency of 11.8% (95% C.I.: 7.5%-18.1%) in March 2021. These lineages carry combinations of several notable mutations, including the S:E484K mutation of concern, deletions at a recurrent deletion region of the spike glycoprotein (S:Δ140-142, S:Δ144 or S:Δ136-144), and nsp6:Δ106-108 (also known as ORF1a:Δ3675-3677). Community-based PCR testing indicates that these variants have continued to spread in April 2021, with the frequency of B.1.1.7 reaching 21.7% (95% C.I.: 12.3%-35.6%), and the joint frequency of B.1.1.524 and B.1.1.525, 15.2% (95% C.I.: 7.6%-28.2%). Although these variants have been displaced by the onset of delta variant in May-June 2021, lineages B.1.1.317, B.1.1.397+, AT.1, B.1.1.524 and B.1.1.525 and the combinations of mutations comprising them that are found in other lineages merit monitoring.

Assuntos

COVID-19 , SARS-CoV-2 , COVID-19/epidemiologia , Humanos , Mutação , Pandemias , Federação Russa/epidemiologia , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus

A hierarchy in clusters of cephalopod mRNA editing sites.

Moldovan, Mikhail A; Chervontseva, Zoe S; Nogina, Daria S; Gelfand, Mikhail S.

Sci Rep ; 12(1): 3447, 2022 03 02.

Artigo em Inglês | MEDLINE | ID: mdl-35236910

RESUMO

RNA editing in the form of substituting adenine with inosine (A-to-I editing) is the most frequent type of RNA editing in many metazoan species. In most species, A-to-I editing sites tend to form clusters and editing at clustered sites depends on editing of the adjacent sites. Although functionally important in some specific cases, A-to-I editing usually is rare. The exception occurs in soft-bodied coleoid cephalopods, where tens of thousands of potentially important A-to-I editing sites have been identified, making coleoids an ideal model for studying of properties and evolution of A-to-I editing sites. Here, we apply several diverse techniques to demonstrate a strong tendency of coleoid RNA editing sites to cluster along the transcript. We show that clustering of editing sites and correlated editing substantially contribute to the transcriptome diversity that arises due to extensive RNA editing. Moreover, we identify three distinct types of editing site clusters, varying in size, and describe RNA structural features and mechanisms likely underlying formation of these clusters. In particular, these observations may explain sequence conservation at large distances around editing sites and the observed dependency of editing on mutations in the vicinity of editing sites.

Assuntos

Cefalópodes , Animais , Cefalópodes/genética , Cefalópodes/metabolismo , Inosina/metabolismo , RNA/genética , Edição de RNA , RNA Mensageiro/genética

Schistosome W-Linked Genes Inform Temporal Dynamics of Sex Chromosome Evolution and Suggest Candidate for Sex Determination.

Elkrewi, Marwan; Moldovan, Mikhail A; Picard, Marion A L; Vicoso, Beatriz.

Mol Biol Evol ; 38(12): 5345-5358, 2021 12 09.

Artigo em Inglês | MEDLINE | ID: mdl-34146097

RESUMO

Schistosomes, the human parasites responsible for snail fever, are female-heterogametic. Different parts of their ZW sex chromosomes have stopped recombining in distinct lineages, creating "evolutionary strata" of various ages. Although the Z-chromosome is well characterized at the genomic and molecular level, the W-chromosome has remained largely unstudied from an evolutionary perspective, as only a few W-linked genes have been detected outside of the model species Schistosoma mansoni. Here, we characterize the gene content and evolution of the W-chromosomes of S. mansoni and of the divergent species S. japonicum. We use a combined RNA/DNA k-mer based pipeline to assemble around 100 candidate W-specific transcripts in each of the species. About half of them map to known protein coding genes, the majority homologous to S. mansoni Z-linked genes. We perform an extended analysis of the evolutionary strata present in the two species (including characterizing a previously undetected young stratum in S. japonicum) to infer patterns of sequence and expression evolution of W-linked genes at different time points after recombination was lost. W-linked genes show evidence of degeneration, including high rates of protein evolution and reduced expression. Most are found in young lineage-specific strata, with only a few high expression ancestral W-genes remaining, consistent with the progressive erosion of nonrecombining regions. Among these, the splicing factor u2af2 stands out as a promising candidate for primary sex determination, opening new avenues for understanding the molecular basis of the reproductive biology of this group. Keywords: sex chromosomes, evolutionary strata, W-linked gene, sex determining gene, schistosome parasites.

Assuntos

Evolução Molecular , Cromossomos Sexuais , Animais , Feminino , Genoma , Genômica , Humanos , Schistosoma/genética , Cromossomos Sexuais/genética

Phospho-islands and the evolution of phosphorylated amino acids in mammals.

Moldovan, Mikhail; Gelfand, Mikhail S.

PeerJ ; 8: e10436, 2020.

Artigo em Inglês | MEDLINE | ID: mdl-33344082

RESUMO

BACKGROUND: Protein phosphorylation is the best studied post-translational modification strongly influencing protein function. Phosphorylated amino acids not only differ in physico-chemical properties from non-phosphorylated counterparts, but also exhibit different evolutionary patterns, tending to mutate to and originate from negatively charged amino acids (NCAs). The distribution of phosphosites along protein sequences is non-uniform, as phosphosites tend to cluster, forming so-called phospho-islands. METHODS: Here, we have developed a hidden Markov model-based procedure for the identification of phospho-islands and studied the properties of the obtained phosphorylation clusters. To check robustness of evolutionary analysis, we consider different models for the reconstructions of ancestral phosphorylation states. RESULTS: Clustered phosphosites differ from individual phosphosites in several functional and evolutionary aspects including underrepresentation of phosphotyrosines, higher conservation, more frequent mutations to NCAs. The spectrum of tissues, frequencies of specific phosphorylation contexts, and mutational patterns observed near clustered sites also are different.

Adaptive evolution at mRNA editing sites in soft-bodied cephalopods.

Moldovan, Mikhail; Chervontseva, Zoe; Bazykin, Georgii; Gelfand, Mikhail S.

PeerJ ; 8: e10456, 2020.

Artigo em Inglês | MEDLINE | ID: mdl-33312772

RESUMO

BACKGROUND: The bulk of variability in mRNA sequence arises due to mutation-change in DNA sequence which is heritable if it occurs in the germline. However, variation in mRNA can also be achieved by post-transcriptional modification including mRNA editing, changes in mRNA nucleotide sequence that mimic the effect of mutations. Such modifications are not inherited directly; however, as the processes affecting them are encoded in the genome, they have a heritable component, and therefore can be shaped by selection. In soft-bodied cephalopods, adenine-to-inosine RNA editing is very frequent, and much of it occurs at nonsynonymous sites, affecting the sequence of the encoded protein. METHODS: We study selection regimes at coleoid A-to-I editing sites, estimate the prevalence of positive selection, and analyze interdependencies between the editing level and contextual characteristics of editing site. RESULTS: Here, we show that mRNA editing of individual nonsynonymous sites in cephalopods originates in evolution through substitutions at regions adjacent to these sites. As such substitutions mimic the effect of the substitution at the edited site itself, we hypothesize that they are favored by selection if the inosine is selectively advantageous to adenine at the edited position. Consistent with this hypothesis, we show that edited adenines are more frequently substituted with guanine, an informational analog of inosine, in the course of evolution than their unedited counterparts, and for heavily edited adenines, these transitions are favored by positive selection. Our study shows that coleoid editing sites may enhance adaptation, which, together with recent observations on Drosophila and human editing sites, points at a general role of RNA editing in the molecular evolution of metazoans.

Comparative genomic analysis of fungal TPP-riboswitches.

Moldovan, Mikhail A; Petrova, Svetlana A; Gelfand, Mikhail S.

Fungal Genet Biol ; 114: 34-41, 2018 05.

Artigo em Inglês | MEDLINE | ID: mdl-29548845

RESUMO

Riboswitches are conserved RNA structures located in non-coding regions of mRNA and able to bind small molecules (e.g. metabolites) changing conformation upon binding. This feature enables them to function as regulators of gene expression. The thiamin pyrophosphate (TPP) riboswitch is the only type of riboswitches found not only in bacteria, but also in eukaryotes - in plants, green algae, protists, and fungi. Two main mechanisms of fungal TPP riboswitch action, involving alternative splicing, have been established so far. Here, we report a large-scale bioinformatic study of riboswitch structural features, action mechanisms, and distribution along the fungal taxonomy groups. For each putatively regulated gene, we reconstruct the riboswitch structure, identify other components of the regulation machinery, and establish mechanisms of riboswitch-mediated regulation. In addition to three genes known to be regulated by TPP riboswitches, thiazole synthase THI4, hydroxymethilpyrimidine-syntase NMT1, and putative transporter NCU01977, we identify two new genes, a putative thiamin transporter THI9 and a transporter of unknown specificity. While the riboswitch sequence and structure remain highly conserved in all species and genes, the mode of riboswitch-mediated regulation varies between regulated genes. The riboswitch usage varies strongly between fungal taxa, with the largest number of riboswitch-regulated genes found in Pezizomycotina and no riboswitch-mediated regulation established in Saccaromycotina.

Assuntos

Fungos/genética , Genoma Fúngico/genética , Riboswitch/genética , Tiamina Pirofosfato/genética , Processamento Alternativo , Fungos/fisiologia , Regulação Fúngica da Expressão Gênica/genética , Estudos de Associação Genética , Genômica , Filogenia , RNA Fúngico/genética , RNA Fúngico/metabolismo , Alinhamento de Sequência , Tiamina/metabolismo , Tiamina Pirofosfato/metabolismo

Pangenomic Definition of Prokaryotic Species and the Phylogenetic Structure of Prochlorococcus spp.

Moldovan, Mikhail A; Gelfand, Mikhail S.

Front Microbiol ; 9: 428, 2018.

Artigo em Inglês | MEDLINE | ID: mdl-29593678

RESUMO

The pangenome is the collection of all groups of orthologous genes (OGGs) from a set of genomes. We apply the pangenome analysis to propose a definition of prokaryotic species based on identification of lineage-specific gene sets. While being similar to the classical biological definition based on allele flow, it does not rely on DNA similarity levels and does not require analysis of homologous recombination. Hence this definition is relatively objective and independent of arbitrary thresholds. A systematic analysis of 110 accepted species with the largest numbers of sequenced strains yields results largely consistent with the existing nomenclature. However, it has revealed that abundant marine cyanobacteria Prochlorococcus marinus should be divided into two species. As a control we have confirmed the paraphyletic origin of Yersinia pseudotuberculosis (with embedded, monophyletic Y. pestis) and Burkholderia pseudomallei (with B. mallei). We also demonstrate that by our definition and in accordance with recent studies Escherichia coli and Shigella spp. are one species.

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA