Search | VHL Search Portal

1.

Comprehensive transcriptome analysis reveals altered mRNA splicing and post-transcriptional changes in the aged mouse brain.

Kumar, Nisha Hemandhar; Kluever, Verena; Barth, Emanuel; Krautwurst, Sebastian; Furlan, Mattia; Pelizzola, Mattia; Marz, Manja; Fornasiero, Eugenio F.

Nucleic Acids Res ; 52(6): 2865-2885, 2024 Apr 12.

Article in English | MEDLINE | ID: mdl-38471806

ABSTRACT

A comprehensive understanding of molecular changes during brain aging is essential to mitigate cognitive decline and delay neurodegenerative diseases. The interpretation of mRNA alterations during brain aging is influenced by the health and age of the animal cohorts studied. Here, we carefully consider these factors and provide an in-depth investigation of mRNA splicing and dynamics in the aging mouse brain, combining short- and long-read sequencing technologies with extensive bioinformatic analyses. Our findings encompass a spectrum of age-related changes, including differences in isoform usage, decreased mRNA dynamics and a module showing increased expression of neuronal genes. Notably, our results indicate a reduced abundance of mRNA isoforms leading to nonsense-mediated RNA decay and suggest a regulatory role for RNA-binding proteins, indicating that their regulation may be altered leading to the reshaping of the aged brain transcriptome. Collectively, our study highlights the importance of studying mRNA splicing events during brain aging.

Subject(s)

Alternative Splicing , Brain , RNA Splicing , Animals , Mice , Brain/metabolism , Gene Expression Profiling/methods , RNA Splicing/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Transcriptome/genetics

2.

Sequential disruption of SPLASH-identified vRNA-vRNA interactions challenges their role in influenza A virus genome packaging.

Jakob, Celia; Lovate, Gabriel L; Desirò, Daniel; Gießler, Lara; Smyth, Redmond P; Marquet, Roland; Lamkiewicz, Kevin; Marz, Manja; Schwemmle, Martin; Bolte, Hardin.

Nucleic Acids Res ; 51(12): 6479-6494, 2023 07 07.

Article in English | MEDLINE | ID: mdl-37224537

ABSTRACT

A fundamental step in the influenza A virus (IAV) replication cycle is the coordinated packaging of eight distinct genomic RNA segments (i.e. vRNAs) into a viral particle. Although this process is thought to be controlled by specific vRNA-vRNA interactions between the genome segments, few functional interactions have been validated. Recently, a large number of potentially functional vRNA-vRNA interactions have been detected in purified virions using the RNA interactome capture method SPLASH. However, their functional significance in coordinated genome packaging remains largely unclear. Here, we show by systematic mutational analysis that mutant A/SC35M (H7N7) viruses lacking several prominent SPLASH-identified vRNA-vRNA interactions involving the HA segment package the eight genome segments as efficiently as the wild-type virus. We therefore propose that the vRNA-vRNA interactions identified by SPLASH in IAV particles are not necessarily critical for the genome packaging process, leaving the underlying molecular mechanism elusive.

Subject(s)

Influenza A Virus, H7N7 Subtype , Viral Genome Packaging , Humans , Genome, Viral , Influenza A Virus, H7N7 Subtype/physiology , Influenza, Human/virology , RNA, Viral/metabolism , Virus Assembly

3.

Taxonomic classification of DNA sequences beyond sequence similarity using deep neural networks.

Mock, Florian; Kretschmer, Fleming; Kriese, Anton; Böcker, Sebastian; Marz, Manja.

Proc Natl Acad Sci U S A ; 119(35): e2122636119, 2022 08 30.

Article in English | MEDLINE | ID: mdl-36018838

ABSTRACT

Taxonomic classification, that is, the assignment to biological clades with shared ancestry, is a common task in genetics, mainly based on a genome similarity search of large genome databases. The classification quality depends heavily on the database, since representative relatives must be present. Many genomic sequences cannot be classified at all or only with a high misclassification rate. Here we present BERTax, a deep neural network program based on natural language processing to precisely classify the superkingdom and phylum of DNA sequences taxonomically without the need for a known representative relative from a database. We show BERTax to be at least on par with the state-of-the-art approaches when taxonomically similar species are part of the training data. For novel organisms, however, BERTax clearly outperforms any existing approach. Finally, we show that BERTax can also be combined with database approaches to further increase the prediction quality in almost all cases. Since BERTax is not based on similar entries in databases, it allows precise taxonomic classification of a broader range of genomic sequences, thus increasing the overall information gain.

Subject(s)

DNA Barcoding, Taxonomic , DNA , Deep Learning , Software , Algorithms , Base Sequence , DNA/classification , DNA/genetics , DNA Barcoding, Taxonomic/methods , Genome , Genomics

4.

Functional comparisons of the virus sensor RIG-I from humans, the microbat Myotis daubentonii, and the megabat Rousettus aegyptiacus, and their response to SARS-CoV-2 infection.

Schoen, Andreas; Hölzer, Martin; Müller, Marcel A; Wallerang, Kai B; Drosten, Christian; Marz, Manja; Lamp, Benjamin; Weber, Friedemann.

J Virol ; 97(10): e0020523, 2023 10 31.

Article in English | MEDLINE | ID: mdl-37728614

ABSTRACT

IMPORTANCE: A common hypothesis holds that bats (order Chiroptera) are outstanding reservoirs for zoonotic viruses because of a special antiviral interferon (IFN) system. However, functional studies about key components of the bat IFN system are rare. RIG-I is a cellular sensor for viral RNA signatures that activates the antiviral signaling chain to induce IFN. We cloned and functionally characterized RIG-I genes from two species of the suborders Yangochiroptera and Yinpterochiroptera. The bat RIG-Is were conserved in their sequence and domain organization, and similar to human RIG-I in (i) mediating virus- and IFN-activated gene expression, (ii) antiviral signaling, (iii) temperature dependence, and (iv) recognition of RNA ligands. Moreover, RIG-I of Rousettus aegyptiacus (suborder Yinpterochiroptera) and of humans were found to recognize SARS-CoV-2 infection. Thus, members of both bat suborders encode RIG-Is that are comparable to their human counterpart. The ability of bats to harbor zoonotic viruses therefore seems due to other features.

Subject(s)

Chiroptera , Receptors, Retinoic Acid , SARS-CoV-2 , Animals , Humans , Chiroptera/metabolism , COVID-19 , Receptors, Immunologic/chemistry , Receptors, Immunologic/genetics , Receptors, Immunologic/metabolism , SARS-CoV-2/physiology , Viruses , Receptors, Retinoic Acid/chemistry , Receptors, Retinoic Acid/genetics , Receptors, Retinoic Acid/metabolism

5.

Genome Structure, Life Cycle, and Taxonomy of Coronaviruses and the Evolution of SARS-CoV-2.

Lamkiewicz, Kevin; Esquivel Gomez, Luis Roger; Kühnert, Denise; Marz, Manja.

Curr Top Microbiol Immunol ; 439: 305-339, 2023.

Article in English | MEDLINE | ID: mdl-36592250

ABSTRACT

Coronaviruses have a broad host range and exhibit high zoonotic potential. In this chapter, we describe their genomic organization in terms of encoded proteins and provide an introduction to the peculiar discontinuous transcription mechanism. Further, we present evolutionary conserved genomic RNA secondary structure features, which are involved in the complex replication mechanism. With a focus on computational methods, we review the emergence of SARS-CoV-2 starting with the 2019 strains. In that context, we also discuss the debated hypothesis of whether SARS-CoV-2 was created in a laboratory. We focus on the molecular evolution and the epidemiological dynamics of this recently emerged pathogen and we explain how variants of concern are detected and characterised. COVID-19, the disease caused by SARS-CoV-2, can spread through different transmission routes and also depends on a number of risk factors. We describe how current computational models of viral epidemiology, or more specifically, phylodynamics, have facilitated and will continue to enable a better understanding of the epidemic dynamics of SARS-CoV-2.

Subject(s)

COVID-19 , SARS-CoV-2 , Animals , SARS-CoV-2/genetics , COVID-19/genetics , Genome, Viral , Genomics , Life Cycle Stages

6.

VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models.

Rangel-Pineros, Guillermo; Almeida, Alexandre; Beracochea, Martin; Sakharova, Ekaterina; Marz, Manja; Reyes Muñoz, Alejandro; Hölzer, Martin; Finn, Robert D.

PLoS Comput Biol ; 19(8): e1011422, 2023 08.

Article in English | MEDLINE | ID: mdl-37639475

ABSTRACT

The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them.

Subject(s)

Eukaryota , Microbiota , Humans , Eukaryotic Cells , Genome, Viral/genetics , Metagenome/genetics

7.

De novo genome assembly resolving repetitive structures enables genomic analysis of 35 European Mycoplasmopsis bovis strains.

Triebel, Sandra; Sachse, Konrad; Weber, Michael; Heller, Martin; Diezel, Celia; Hölzer, Martin; Schnee, Christiane; Marz, Manja.

BMC Genomics ; 24(1): 548, 2023 Sep 16.

Article in English | MEDLINE | ID: mdl-37715127

ABSTRACT

Mycoplasmopsis (M.) bovis, the agent of mastitis, pneumonia, and arthritis in cattle, harbors a small genome of approximately 1 Mbp. Combining data from Illumina and Nanopore technologies, we sequenced and assembled the genomes of 35 European strains and isolate DL422_88 from Cuba. While the high proportion of repetitive structures in M. bovis genomes represent a particular challenge, implementation of our own pipeline Mycovista (available on GitHub www.github.com/sandraTriebel/mycovista ) in a hybrid approach enabled contiguous assembly of the genomes and, consequently, improved annotation rates considerably. To put our European strain panel in a global context, we analyzed the new genome sequences together with 175 genome assemblies from public databases. Construction of a phylogenetic tree based on core genes of these 219 strains revealed a clustering pattern according to geographical origin, with European isolates positioned on clades 4 and 5. Genomic data allowing assignment of strains to tissue specificity or certain disease manifestations could not be identified. Seven strains isolated from cattle with systemic circular condition (SCC), still a largely unknown manifestation of M. bovis disease, were located on both clades 4 and 5. Pairwise association analysis revealed 108 genomic elements associated with a particular clade of the phylogenetic tree. Further analyzing these hits, 25 genes are functionally annotated and could be linked to a M. bovis protein, e.g. various proteases and nucleases, as well as ten variable surface lipoproteins (Vsps) and other surface proteins. These clade-specific genes could serve as useful markers in epidemiological and clinical surveys.

Subject(s)

Genomics , Mycoplasma bovis , Female , Animals , Cattle , Phylogeny , Cluster Analysis , Databases, Factual , Endonucleases , Mycoplasma bovis/genetics

8.

Genomic analysis of 61 Chlamydia psittaci strains reveals extensive divergence associated with host preference.

Sachse, Konrad; Hölzer, Martin; Vorimore, Fabien; Barf, Lisa-Marie; Sachse, Carsten; Laroucau, Karine; Marz, Manja; Lamkiewicz, Kevin.

BMC Genomics ; 24(1): 288, 2023 May 29.

Article in English | MEDLINE | ID: mdl-37248517

ABSTRACT

BACKGROUND: Chlamydia (C.) psittaci, the causative agent of avian chlamydiosis and human psittacosis, is a genetically heterogeneous species. Its broad host range includes parrots and many other birds, but occasionally also humans (via zoonotic transmission), ruminants, horses, swine and rodents. To assess whether there are genetic markers associated with host tropism we comparatively analyzed whole-genome sequences of 61 C. psittaci strains, 47 of which carrying a 7.6-kbp plasmid. RESULTS: Following clean-up, reassembly and polishing of poorly assembled genomes from public databases, phylogenetic analyses using C. psittaci whole-genome sequence alignment revealed four major clades within this species. Clade 1 represents the most recent lineage comprising 40/61 strains and contains 9/10 of the psittacine strains, including type strain 6BC, and 10/13 of human isolates. Strains from different non-psittacine hosts clustered in Clades 2- 4. We found that clade membership correlates with typing schemes based on SNP types, ompA genotypes, multilocus sequence types as well as plasticity zone (PZ) structure and host preference. Genome analysis also revealed that i) sequence variation in the major outer membrane porin MOMP can result in 3D structural changes of immunogenic domains, ii) past host change of Clade 3 and 4 strains could be associated with loss of MAC/perforin in the PZ, rather than the large cytotoxin, iii) the distinct phylogeny of atypical strains (Clades 3 and 4) is also reflected in their repertoire of inclusion proteins (Inc family) and polymorphic membrane proteins (Pmps). CONCLUSIONS: Our study identified a number of genomic features that can be correlated with the phylogeny and host preference of C. psittaci strains. Our data show that intra-species genomic divergence is associated with past host change and includes deletions in the plasticity zone, structural variations in immunogenic domains and distinct repertoires of virulence factors.

Subject(s)

Chlamydia , Chlamydophila psittaci , Psittacosis , Animals , Humans , Horses , Swine , Chlamydophila psittaci/genetics , Psittacosis/veterinary , Phylogeny , Chlamydia/genetics , Birds , Genomics

9.

Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing.

Erkes, Annett; Grove, René P; Zarkovic, Milena; Krautwurst, Sebastian; Koebnik, Ralf; Morgan, Richard D; Wilson, Geoffrey G; Hölzer, Martin; Marz, Manja; Boch, Jens; Grau, Jan.

BMC Genomics ; 24(1): 151, 2023 Mar 27.

Article in English | MEDLINE | ID: mdl-36973643

ABSTRACT

BACKGROUND: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. RESULTS: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. CONCLUSIONS: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.

Subject(s)

Nanopore Sequencing , Xanthomonas , Transcription Activator-Like Effectors/genetics , Xanthomonas/genetics , Genome

10.

Comparative Study of Ten Thogotovirus Isolates and Their Distinct In Vivo Characteristics.

Fuchs, Jonas; Lamkiewicz, Kevin; Kolesnikova, Larissa; Hölzer, Martin; Marz, Manja; Kochs, Georg.

J Virol ; 96(5): e0155621, 2022 03 09.

Article in English | MEDLINE | ID: mdl-35019718

ABSTRACT

Thogotoviruses are tick-borne arboviruses that comprise a unique genus within the Orthomyxoviridae family. Infections with thogotoviruses primarily cause disease in livestock with occasional reports of human infections suggesting a zoonotic potential. In the past, multiple genetically distinct thogotoviruses were isolated mostly from collected ticks. However, many aspects regarding their phylogenetic relationships, morphological characteristics, and virulence in mammals remain unclear. For the present comparative study, we used a collection of 10 different thogotovirus isolates from different geographic areas. Next-generation sequencing and subsequent phylogenetic analyses revealed a distinct separation of these viruses into two major clades, the Thogoto-like and Dhori-like viruses. Electron microscopy demonstrated a heterogeneous morphology with spherical and filamentous particles being present in virus preparations. To study their pathogenicity, we analyzed the viruses in a small animal model system. In intraperitoneally infected C57BL/6 mice, all isolates showed a tropism for liver, lung, and spleen. Importantly, we did not observe horizontal transmission to uninfected, highly susceptible contact mice. The isolates enormously differed in their capacity to induce disease, ranging from subclinical to fatal outcomes. In vivo multistep passaging experiments of two low-pathogenic isolates showed no increased virulence and sequence analyses of the passaged viruses indicated a high stability of the viral genomes after 10 mouse passages. In summary, our analysis demonstrates the broad genetic and phenotypic variability within the thogotovirus genus. Moreover, thogotoviruses are well adapted to mammals but their horizontal transmission seems to depend on ticks as their vectors. IMPORTANCE Since their discovery over 60 years ago, 15 genetically distinct members of the thogotovirus genus have been isolated. These arboviruses belong to the Orthomyxovirus family and share many features with influenza viruses. However, numerous of these isolates have not been characterized in depth. In the present study, we comparatively analyzed a collection of 10 different thogotovirus isolates to answer basic questions about their phylogenetic relationships, morphology, and pathogenicity in mice. Our results highlight shared and unique characteristics of this diverse genus. Taken together, these observations provide a framework for the phylogenic classification and phenotypic characterization of newly identified thogotovirus isolates that could potentially cause severe human infections as exemplified by the recently reported, fatal Bourbon virus cases in the United States.

Subject(s)

Orthomyxoviridae Infections , Thogotovirus , Animals , Disease Models, Animal , Genetic Variation , Genome, Viral/genetics , Genomic Instability , Mice , Mice, Inbred C57BL , Microscopy, Electron , Orthomyxoviridae Infections/transmission , Orthomyxoviridae Infections/virology , Phylogeny , Thogotovirus/classification , Thogotovirus/genetics , Thogotovirus/pathogenicity , Thogotovirus/ultrastructure , Ticks/virology

11.

A genome-wide CRISPR activation screen reveals Hexokinase 1 as a critical factor in promoting resistance to multi-kinase inhibitors in hepatocellular carcinoma cells.

Sofer, Summer; Lamkiewicz, Kevin; Armoza Eilat, Shir; Partouche, Shirly; Marz, Manja; Moskovits, Neta; Stemmer, Salomon M; Shlomai, Amir; Sklan, Ella H.

FASEB J ; 36(3): e22191, 2022 03.

Article in English | MEDLINE | ID: mdl-35147243

ABSTRACT

Hepatocellular carcinoma (HCC) is often diagnosed at an advanced stage and is, therefore, treated with systemic drugs, such as tyrosine-kinase inhibitors (TKIs). These drugs, however, offer only modest survival benefits due to the rapid development of drug resistance. To identify genes implicated in TKI resistance, a cluster of regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 activation screen was performed in hepatoma cells treated with regorafenib, a TKI used as second-line therapy for advanced HCC. The screen results show that Hexokinase 1 (HK1), catalyzing the first step in glucose metabolism, is a top candidate for conferring TKI resistance. Compatible with this, HK1 was upregulated in regorafenib-resistant cells. Using several experimental approaches, both in vitro and in vivo, we show that TKI resistance correlates with HK1 expression. Furthermore, an HK inhibitor resensitized resistant cells to TKI treatment. Together, our data indicate that HK1 may function as a critical factor modulating TKI resistance in hepatoma cells and, therefore, may serve as a biomarker for treatment success.

Subject(s)

Carcinoma, Hepatocellular/metabolism , Drug Resistance, Neoplasm , Hexokinase/metabolism , Liver Neoplasms/metabolism , Animals , Antineoplastic Agents/therapeutic use , Carcinoma, Hepatocellular/drug therapy , Cell Line, Tumor , Cells, Cultured , Hexokinase/genetics , Humans , Liver Neoplasms/drug therapy , Male , Mice , Mice, Inbred NOD , Mutation , Protein Kinase Inhibitors/therapeutic use , Up-Regulation

12.

Streptomyces marispadix sp. nov., isolated from marine beach sediment.

Santos, José Diogo Neves Dos; Vitorino, Inês Rosado; Kallscheuer, Nicolai; Srivastava, Akash; Krautwurst, Sebastian; Marz, Manja; Jogler, Christian; Lobo-da-Cunha, Alexandre; Catita, José; Gonçalves, Hugo; González, Ignacio; Reyes, Fernando; Lage, Olga Maria.

Int J Syst Evol Microbiol ; 73(7)2023 Jul.

Article in English | MEDLINE | ID: mdl-37489568

ABSTRACT

A novel actinomycetal strain, designated M600PL45_2T, was isolated from marine sediments obtained from Ingleses beach, Porto, on the Northern Coast of Portugal and was subjected to a polyphasic taxonomic characterisation study. The here described Gram-reaction-positive strain is characterised by the production of a brown pigment in both solid and liquid medium and forms typical helical hyphae that differentiate into smooth spores. The results of a phylogenetic analysis based on the 16S rRNA gene sequence indicated that M600PL45_2T has a high similarity to two members of the genus Streptomyces, Streptomyces bathyalis ASO4wetT (98.51â%) and Streptomyces daqingensis NEAU ZJC8T (98.44â%). The genome of M600PL45_2T has a size of 6â¯695â¯159 bp, a DNA G+C content of 70.71 mol% and 5538 coding sequences. M600PL45_2T grows at 15-37 °C and with a maximal growth rate between 25 °C and 30 °C. Growth at pH 6.0 to 9.0 with the optimal range between 6.0 and 7.5 was observed. M600PL45_2T showed a high salinity tolerance, growing with 0-10â% (w/v) NaCl, with best growth with 1-3% (w/v) NaCl. Major cellular fatty acids are iso-C15:0 (25.03â%), anteiso-C15:0 (17.70) and iso-C16:0 (26.90â%). The novel isolate was able to grow in media containing a variety of nitrogen and carbon sources. An antimicrobial activity screening indicated that an extract of M600PL45_2T has inhibitory activity against Staphylococcus aureus. On the basis of the polyphasic data, M600PL45_2T (= CECT 30365T = DSM 114036T) is introduced as the type strain of a novel species, that we named Streptomyces marispadix sp. nov.

Subject(s)

Fatty Acids , Sodium Chloride , Base Composition , Fatty Acids/chemistry , Phylogeny , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA , DNA, Bacterial/genetics , Bacterial Typing Techniques , Geologic Sediments

13.

Rfam 14: expanded coverage of metagenomic, viral and microRNA families.

Kalvari, Ioanna; Nawrocki, Eric P; Ontiveros-Palacios, Nancy; Argasinska, Joanna; Lamkiewicz, Kevin; Marz, Manja; Griffiths-Jones, Sam; Toffano-Nioche, Claire; Gautheret, Daniel; Weinberg, Zasha; Rivas, Elena; Eddy, Sean R; Finn, Robert D; Bateman, Alex; Petrov, Anton I.

Nucleic Acids Res ; 49(D1): D192-D200, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33211869

ABSTRACT

Rfam is a database of RNA families where each of the 3444 families is represented by a multiple sequence alignment of known RNA sequences and a covariance model that can be used to search for additional members of the family. Recent developments have involved expert collaborations to improve the quality and coverage of Rfam data, focusing on microRNAs, viral and bacterial RNAs. We have completed the first phase of synchronising microRNA families in Rfam and miRBase, creating 356 new Rfam families and updating 40. We established a procedure for comprehensive annotation of viral RNA families starting with Flavivirus and Coronaviridae RNAs. We have also increased the coverage of bacterial and metagenome-based RNA families from the ZWD database. These developments have enabled a significant growth of the database, with the addition of 759 new families in Rfam 14. To facilitate further community contribution to Rfam, expert users are now able to build and submit new families using the newly developed Rfam Cloud family curation system. New Rfam website features include a new sequence similarity search powered by RNAcentral, as well as search and visualisation of families with pseudoknots. Rfam is freely available at https://rfam.org.

Subject(s)

Databases, Nucleic Acid , Metagenome , MicroRNAs/genetics , RNA, Bacterial/genetics , RNA, Untranslated/genetics , RNA, Viral/genetics , Bacteria/genetics , Bacteria/metabolism , Base Pairing , Base Sequence , Humans , Internet , MicroRNAs/classification , MicroRNAs/metabolism , Molecular Sequence Annotation , Nucleic Acid Conformation , RNA, Bacterial/classification , RNA, Bacterial/metabolism , RNA, Untranslated/classification , RNA, Untranslated/metabolism , RNA, Viral/classification , RNA, Viral/metabolism , Sequence Alignment , Sequence Analysis, RNA , Software , Viruses/genetics , Viruses/metabolism

14.

Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis.

Viehweger, Adrian; Krautwurst, Sebastian; Lamkiewicz, Kevin; Madhugiri, Ramakanth; Ziebuhr, John; Hölzer, Martin; Marz, Manja.

Genome Res ; 29(9): 1545-1554, 2019 09.

Article in English | MEDLINE | ID: mdl-31439691

ABSTRACT

Sequence analyses of RNA virus genomes remain challenging owing to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called "quasispecies." Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (1) RNA virus genomes and (2) subgenome-length (sg) RNAs composed of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. By using DRS, we were able to map the longest (â¼26-kb) contiguous read to the viral reference genome. By combining Illumina and Oxford Nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV)-229E genome (27.3 kb). Furthermore, by using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by showing the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that DRS may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.

Subject(s)

Coronavirus/genetics , Nanopore Sequencing/methods , Sequence Analysis, RNA/methods , Cell Line , Evolution, Molecular , Genetic Variation , Genome Size , Humans , Methylation , Quasispecies

15.

Comparative transcriptomics identifies candidate genes involved in the evolutionary transition from dehiscent to indehiscent fruits in Lepidium (Brassicaceae).

Gramzow, Lydia; Klupsch, Katharina; Fernández-Pozo, Noé; Hölzer, Martin; Marz, Manja; Rensing, Stefan A; Theißen, Günter.

BMC Plant Biol ; 22(1): 340, 2022 Jul 14.

Article in English | MEDLINE | ID: mdl-35836106

ABSTRACT

BACKGROUND: Fruits are the seed-bearing structures of flowering plants and are highly diverse in terms of morphology, texture and maturation. Dehiscent fruits split open upon maturation to discharge their seeds while indehiscent fruits are dispersed as a whole. Indehiscent fruits evolved from dehiscent fruits several times independently in the crucifer family (Brassicaceae). The fruits of Lepidium appelianum, for example, are indehiscent while the fruits of the closely related L. campestre are dehiscent. Here, we investigate the molecular and genetic mechanisms underlying the evolutionary transition from dehiscent to indehiscent fruits using these two Lepidium species as model system. RESULTS: We have sequenced the transcriptomes and small RNAs of floral buds, flowers and fruits of L. appelianum and L. campestre and analyzed differentially expressed genes (DEGs) and differently differentially expressed genes (DDEGs). DEGs are genes that show significantly different transcript levels in the same structures (buds, flowers and fruits) in different species, or in different structures in the same species. DDEGs are genes for which the change in expression level between two structures is significantly different in one species than in the other. Comparing the two species, the highest number of DEGs was found in flowers, followed by fruits and floral buds while the highest number of DDEGs was found in fruits versus flowers followed by flowers versus floral buds. Several gene ontology terms related to cell wall synthesis and degradation were overrepresented in different sets of DEGs highlighting the importance of these processes for fruit opening. Furthermore, the fruit valve identity genes FRUITFULL and YABBY3 were among the DEGs identified. Finally, the microRNA miR166 as well as the TCP transcription factors BRANCHED1 (BRC1) and TCP FAMILY TRANSCRIPTION FACTOR 4 (TCP4) were found to be DDEGs. CONCLUSIONS: Our study reveals differences in gene expression between dehiscent and indehiscent fruits and uncovers miR166, BRC1 and TCP4 as candidate genes for the evolutionary transition from dehiscent to indehiscent fruits in Lepidium.

Subject(s)

Brassicaceae , Lepidium , Brassicaceae/genetics , Brassicaceae/metabolism , Flowers/genetics , Fruit/genetics , Fruit/metabolism , Gene Expression Regulation, Plant , Lepidium/genetics , Transcriptome

16.

PoSeiDon: a Nextflow pipeline for the detection of evolutionary recombination events and positive selection.

Hölzer, Martin; Marz, Manja.

Bioinformatics ; 37(7): 1018-1020, 2021 05 17.

Article in English | MEDLINE | ID: mdl-32735310

ABSTRACT

SUMMARY: PoSeiDon is an easy-to-use pipeline that helps researchers to find recombination events and sites under positive selection in protein-coding sequences. By entering homologous sequences, PoSeiDon builds an alignment, estimates a best-fitting substitution model and performs a recombination analysis followed by the construction of all corresponding phylogenies. Finally, significantly positive selected sites are detected according to different models for the full alignment and possible recombination fragments. The results of PoSeiDon are summarized in a user-friendly HTML page providing all intermediate results and the graphical representation of recombination events and positively selected sites. AVAILABILITY AND IMPLEMENTATION: PoSeiDon is freely available at https://github.com/hoelzer/poseidon. The pipeline is implemented in Nextflow with Docker support and processes the output of various tools.

Subject(s)

Recombination, Genetic , Software , Phylogeny

17.

VIDHOP, viral host prediction with deep learning.

Mock, Florian; Viehweger, Adrian; Barth, Emanuel; Marz, Manja.

Bioinformatics ; 37(3): 318-325, 2021 04 20.

Article in English | MEDLINE | ID: mdl-32777818

ABSTRACT

MOTIVATION: Zoonosis, the natural transmission of infections from animals to humans, is a far-reaching global problem. The recent outbreaks of Zikavirus, Ebolavirus and Coronavirus are examples of viral zoonosis, which occur more frequently due to globalization. In case of a virus outbreak, it is helpful to know which host organism was the original carrier of the virus to prevent further spreading of viral infection. Recent approaches aim to predict a viral host based on the viral genome, often in combination with the potential host genome and arbitrarily selected features. These methods are limited in the number of different hosts they can predict or the accuracy of the prediction. RESULTS: Here, we present a fast and accurate deep learning approach for viral host prediction, which is based on the viral genome sequence only. We tested our deep neural network (DNN) on three different virus species (influenza A virus, rabies lyssavirus and rotavirus A). We achieved for each virus species an AUC between 0.93 and 0.98, allowing highly accurate predictions while using only fractions (100-400 bp) of the viral genome sequences. We show that deep neural networks are suitable to predict the host of a virus, even with a limited amount of sequences and highly unbalanced available data. The trained DNNs are the core of our virus-host prediction tool VIrus Deep learning HOst Prediction (VIDHOP). VIDHOP also allows the user to train and use models for other viruses. AVAILABILITY AND IMPLEMENTATION: VIDHOP is freely available under https://github.com/flomock/vidhop. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Deep Learning , Viruses , Genome, Viral , Humans , Neural Networks, Computer

18.

EpiDope: a deep neural network for linear B-cell epitope prediction.

Collatz, Maximilian; Mock, Florian; Barth, Emanuel; Hölzer, Martin; Sachse, Konrad; Marz, Manja.

Bioinformatics ; 37(4): 448-455, 2021 05 01.

Article in English | MEDLINE | ID: mdl-32915967

ABSTRACT

MOTIVATION: By binding to specific structures on antigenic proteins, the so-called epitopes, B-cell antibodies can neutralize pathogens. The identification of B-cell epitopes is of great value for the development of specific serodiagnostic assays and the optimization of medical therapy. However, identifying diagnostically or therapeutically relevant epitopes is a challenging task that usually involves extensive laboratory work. In this study, we show that the time, cost and labor-intensive process of epitope detection in the lab can be significantly reduced using in silico prediction. RESULTS: Here, we present EpiDope, a python tool which uses a deep neural network to detect linear B-cell epitope regions on individual protein sequences. With an area under the curve between 0.67 ± 0.07 in the receiver operating characteristic curve, EpiDope exceeds all other currently used linear B-cell epitope prediction tools. Our software is shown to reliably predict linear B-cell epitopes of a given protein sequence, thus contributing to a significant reduction of laboratory experiments and costs required for the conventional approach. AVAILABILITYAND IMPLEMENTATION: EpiDope is available on GitHub (http://github.com/mcollatz/EpiDope). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Epitopes, B-Lymphocyte , Software , Amino Acid Sequence , Computer Simulation , Epitope Mapping , Neural Networks, Computer

19.

HIV-1 Lethality and Loss of Env Protein Expression Induced by Single Synonymous Substitutions in the Virus Genome Intronic-Splicing Silencer.

Jordan-Paiz, Ana; Nevot, Maria; Lamkiewicz, Kevin; Lataretu, Marie; Franco, Sandra; Marz, Manja; Martinez, Miguel Angel.

J Virol ; 94(21)2020 10 14.

Article in English | MEDLINE | ID: mdl-32817222

ABSTRACT

Synonymous genome recoding has been widely used to study different aspects of virus biology. Codon usage affects the temporal regulation of viral gene expression. In this study, we performed synonymous codon mutagenesis to investigate whether codon usage affected HIV-1 Env protein expression and virus viability. We replaced the codons AGG, GAG, CCU, ACU, CUC, and GGG of the HIV-1 env gene with the synonymous codons CGU, GAA, CCG, ACG, UUA, and GGA, respectively. We found that recoding the Env protein gp120 coding region (excluding the Rev response element [RRE]) did not significantly affect virus replication capacity, even though we introduced 15 new CpG dinucleotides. In contrast, changing a single codon (AGG to CGU) located in the gp41 coding region (HXB2 env position 2125 to 2127), which was included in the intronic splicing silencer (ISS), completely abolished virus replication and Env expression. Computational analyses of this mutant revealed a severe disruption in the ISS RNA secondary structure. A variant that restored ISS secondary RNA structure also reestablished Env production and virus viability. Interestingly, this codon variant prevented both virus replication and Env translation in a eukaryotic expression system. These findings suggested that disrupting mRNA splicing was not the only means of inhibiting translation. Our findings indicated that synonymous gp120 recoding was not always deleterious to HIV-1 replication. Importantly¸ we found that disrupting an external ISS loop strongly affected HIV-1 replication and Env translation.IMPORTANCE Synonymous substitutions can influence virus phenotype, replication capacity, and virulence. In this study, we explored how synonymous codon mutations impacted HIV-1 Env protein expression and virus replication capacity. We changed a single codon, AGG to CGU, which was located in the gp41 coding region (env nucleotide residues 2125 to 2127) and was included in the HIV-1 intronic splicing silencer. This change completely abolished virus replication and Env expression. We also found that changing codon usage in the gp120 region by including an increased number of CpG dinucleotides did not significantly affect Env expression or virus viability. Our findings showed that synonymous recoding was useful for altering viral phenotype and exploring virus biology.

Subject(s)

Genome, Viral , HIV-1/genetics , Silent Mutation , Virus Replication/genetics , env Gene Products, Human Immunodeficiency Virus/chemistry , Base Pairing , Base Sequence , CD4-Positive T-Lymphocytes/metabolism , CD4-Positive T-Lymphocytes/virology , Cell Line , Codon , Exons , HEK293 Cells , HIV-1/metabolism , Humans , Introns , RNA Folding , RNA Splicing , Structure-Activity Relationship , Thermodynamics , env Gene Products, Human Immunodeficiency Virus/genetics , env Gene Products, Human Immunodeficiency Virus/metabolism

20.

A marine Chlamydomonas sp. emerging as an algal model.

Carrasco Flores, David; Fricke, Markus; Wesp, Valentin; Desirò, Daniel; Kniewasser, Anja; Hölzer, Martin; Marz, Manja; Mittag, Maria.

J Phycol ; 57(1): 54-69, 2021 02.

Article in English | MEDLINE | ID: mdl-33043442

ABSTRACT

The freshwater microalga Chlamydomonas reinhardtii, which lives in wet soil, has served for decades as a model for numerous biological processes, and many tools have been introduced for this organism. Here, we have established a stable nuclear transformation for its marine counterpart, Chlamydomonas sp. SAG25.89, by fusing specific cis-acting elements from its Actin gene with the gene providing hygromycin resistance and using an elaborated electroporation protocol. Like C. reinhardtii, Chlamydomonas sp. has a high GC content, allowing reporter genes and selection markers to be applicable in both organisms. Chlamydomonas sp. grows purely photoautotrophically and requires ammonia as a nitrogen source because its nuclear genome lacks some of the genes required for nitrogen metabolism. Interestingly, it can grow well under both low and very high salinities (up to 50 g · L-1 ) rendering it as a model for osmotolerance. We further show that Chlamydomonas sp. grows well from 15 to 28°C, but halts its growth at 32°C. The genome of Chlamydomonas sp. contains some gene homologs the expression of which is regulated according to the ambient temperatures and/or confer thermal acclimation in C. reinhardtii. Thus, knowledge of temperature acclimation can now be compared to the marine species. Furthermore, Chlamydomonas sp. can serve as a model for studying marine microbial interactions and for comparing mechanisms in freshwater and marine environments. Chlamydomonas sp. was previously shown to be immobilized rapidly by a cyclic lipopeptide secreted from the antagonistic bacterium Pseudomonas protegens PF-5, which deflagellates C. reinhardtii.

Subject(s)

Chlamydomonas reinhardtii , Chlamydomonas , Acclimatization , Chlamydomonas reinhardtii/genetics , Pseudomonas

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL