Your browser doesn't support javascript.
loading
CAIM: Coverage-based Analysis for Identification of Microbiome.
Acheampong, Daniel A; Jenjaroenpun, Piroon; Wongsurawat, Thidathip; Krulilung, Alongkorn; Pomyen, Yotsawat; Kandel, Sangam; Kunadirek, Pattapon; Chuaypen, Natthaya; Kusonmano, Kanthida; Nookaew, Intawat.
Affiliation
  • Acheampong DA; Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.
  • Jenjaroenpun P; Stowers Institute for Medical Research, Kansas City, MO, USA.
  • Wongsurawat T; Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.
  • Krulilung A; Division of Medical Bioinformatics, Department of Research, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand.
  • Pomyen Y; Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.
  • Kandel S; Division of Medical Bioinformatics, Department of Research, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand.
  • Kunadirek P; Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.
  • Chuaypen N; Translational Research Unit, Chulabhorn Research Institute, Bangkok, 10210, Thailand.
  • Kusonmano K; Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.
  • Nookaew I; Center of Excellence in Hepatitis and Liver Cancer, Department of Biochemistry, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
bioRxiv ; 2024 May 15.
Article in En | MEDLINE | ID: mdl-38746391
ABSTRACT
Accurate taxonomic profiling of microbial taxa in a metagenomic sample is vital to gain insights into microbial ecology. Recent advancements in sequencing technologies have contributed tremendously toward understanding these microbes at species resolution through a whole shotgun metagenomic (WMS) approach. In this study, we developed a new bioinformatics tool, CAIM, for accurate taxonomic classification and quantification within both long- and short-read metagenomic samples using an alignment-based method. CAIM depends on two different containment techniques to identify species in metagenomic samples using their genome coverage information to filter out false positives rather than the traditional approach of relative abundance. In addition, we propose a nucleotide-count based abundance estimation, which yield lesser root mean square error than the traditional read-count approach. We evaluated the performance of CAIM on 28 metagenomic mock communities and 2 synthetic datasets by comparing it with other top-performing tools. CAIM maintained a consitently good performance across datasets in identifying microbial taxa and in estimating relative abundances than other tools. CAIM was then applied to a real dataset sequenced on both Nanopore (with and without amplification) and Illumina sequencing platforms and found high similality of taxonomic profiles between the sequencing platforms. Lastly, CAIM was applied to fecal shotgun metagenomic datasets of 232 colorectal cancer patients and 229 controls obtained from 4 different countries and primary 44 liver cancer patients and 76 controls. The predictive performance of models using the genome-coverage cutoff was better than those using the relative-abundance cutoffs in discriminating colorectal cancer and primary liver cancer patients from healthy controls with a highly confident species markers.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: BioRxiv Year: 2024 Document type: Article Affiliation country: Estados Unidos

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: BioRxiv Year: 2024 Document type: Article Affiliation country: Estados Unidos