Your browser doesn't support javascript.
loading
smORFer: a modular algorithm to detect small ORFs in prokaryotes.
Bartholomäus, Alexander; Kolte, Baban; Mustafayeva, Ayten; Goebel, Ingrid; Fuchs, Stephan; Benndorf, Dirk; Engelmann, Susanne; Ignatova, Zoya.
Affiliation
  • Bartholomäus A; GFZ German Research Centre for Geosciences, Section Geomicrobiology, 14473 Potsdam, Germany.
  • Kolte B; Inst. Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, 20146 Hamburg, Germany.
  • Mustafayeva A; Inst. Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, 20146 Hamburg, Germany.
  • Goebel I; Helmholtz Center for Infection Research, Microbial Proteomics, 38124 Braunschweig, Germany.
  • Fuchs S; Inst. Microbiology, TU Braunschweig, Braunschweig, Germany.
  • Benndorf D; Inst. Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, 20146 Hamburg, Germany.
  • Engelmann S; Robert Koch Institute, Berlin, Germany.
  • Ignatova Z; Otto von Guericke University, Bioprocess Engineering, 39106 Magdeburg, Germany.
Nucleic Acids Res ; 49(15): e89, 2021 09 07.
Article in En | MEDLINE | ID: mdl-34125903
ABSTRACT
Emerging evidence places small proteins (≤50 amino acids) more centrally in physiological processes. Yet, their functional identification and the systematic genome annotation of their cognate small open-reading frames (smORFs) remains challenging both experimentally and computationally. Ribosome profiling or Ribo-Seq (that is a deep sequencing of ribosome-protected fragments) enables detecting of actively translated open-reading frames (ORFs) and empirical annotation of coding sequences (CDSs) using the in-register translation pattern that is characteristic for genuinely translating ribosomes. Multiple identifiers of ORFs that use the 3-nt periodicity in Ribo-Seq data sets have been successful in eukaryotic smORF annotation. They have difficulties evaluating prokaryotic genomes due to the unique architecture (e.g. polycistronic messages, overlapping ORFs, leaderless translation, non-canonical initiation etc.). Here, we present a new algorithm, smORFer, which performs with high accuracy in prokaryotic organisms in detecting putative smORFs. The unique feature of smORFer is that it uses an integrated approach and considers structural features of the genetic sequence along with in-frame translation and uses Fourier transform to convert these parameters into a measurable score to faithfully select smORFs. The algorithm is executed in a modular way, and dependent on the data available for a particular organism, different modules can be selected for smORF search.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Ribosomes / Protein Biosynthesis / Open Reading Frames / Genome Type of study: Prognostic_studies Language: En Journal: Nucleic Acids Res Year: 2021 Document type: Article Affiliation country: Germany

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Ribosomes / Protein Biosynthesis / Open Reading Frames / Genome Type of study: Prognostic_studies Language: En Journal: Nucleic Acids Res Year: 2021 Document type: Article Affiliation country: Germany