Your browser doesn't support javascript.
loading
Information theoretic perspective on genome clustering.
Veluchamy, Alaguraj; Mehta, Preeti; Srividhya, K V; Vikram, Hirendra; Govind, M K; Gupta, Ramneek; Aziz Bin Dukhyil, Abdul; Abdullah Alharbi, Raed; Abdullah Aloyuni, Saleh; Hassan, Mohamed M; Krishnaswamy, S.
Afiliação
  • Veluchamy A; Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai 625021, India.
  • Mehta P; Department of Computational Biology, St. Jude Children's Research Hospital, Danny Thomas Place, Memphis 38105, Tennesse, United States of America.
  • Srividhya KV; Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai 625021, India.
  • Vikram H; Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai 625021, India.
  • Govind MK; Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai 625021, India.
  • Gupta R; Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai 625021, India.
  • Aziz Bin Dukhyil A; Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai 625021, India.
  • Abdullah Alharbi R; Department of Medical Laboratory Sciences, College of Applied Medical Sciences, Majmaah University, Al Majmaah 11952, Saudi Arabia.
  • Abdullah Aloyuni S; Department of Public Health, College of Applied Medical Sciences, Majmaah University, Al Majmaah 11952, Saudi Arabia.
  • Hassan MM; Department of Public Health, College of Applied Medical Sciences, Majmaah University, Al Majmaah 11952, Saudi Arabia.
  • Krishnaswamy S; Department of Biology, College of Science, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia.
Saudi J Biol Sci ; 28(3): 1867-1889, 2021 Mar.
Article em En | MEDLINE | ID: mdl-33732074
ABSTRACT
Shannon's information theoretic perspective of communication helps one to understand the storage and processing of information in one-dimensional sequences. An information theoretic analysis of 937 available completely sequenced prokaryotic genomes and 238 eukaryotic chromosomes is presented. Information content (Id) values were used to cluster these chromosomes. Chargaff's second parity rule i.e compositional self-complementarity, an empirical fact is observed in all the genomes, except for the proteobacteria Candidatus Hodgkinia cicadicola. High information content, arising out of biased base composition in all the 14 chromosomes of Plasmodium falciparum is found among two other genomes of prokaryotes viz. Buchnera aphidicola str. Cc (Cinara cedri) and Candidatus Carsonella ruddii PV. Despite size and compositional variations, both prokaryotic and eukaryotic genomes do not deviate significantly from an equiprobable and random situation. Eukaryotic chromosomes of an organism tend to have similar informational restraints as seen when a simple distance based method is used to cluster them. In eukaryotes, in certain cases, Id values are also similar for the two arms (p and q arm) of the chromosomes. The results of this current study confirm that the information content can provide insights into the clustering of genomes and the evolution of messaging strategies of the genomes. An efficient and robust Perl CGI standalone tool is created based on this information theory algorithm for the analysis of the whole genomes and is made available at https//github.com/AlagurajVeluchamy/InformationTheory.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2021 Tipo de documento: Article