Your browser doesn't support javascript.
loading
Phylogenomics from Whole Genome Sequences Using aTRAM.
Allen, Julie M; Boyd, Bret; Nguyen, Nam-Phuong; Vachaspati, Pranjal; Warnow, Tandy; Huang, Daisie I; Grady, Patrick G S; Bell, Kayce C; Cronk, Quentin C B; Mugisha, Lawrence; Pittendrigh, Barry R; Leonardi, M Soledad; Reed, David L; Johnson, Kevin P.
Afiliação
  • Allen JM; Illinois Natural History Survey, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Boyd B; Illinois Natural History Survey, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Nguyen NP; Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA.
  • Vachaspati P; Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Warnow T; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Huang DI; Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Grady PGS; Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Bell KC; Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Cronk QCB; Biodiversity Research Centre, University of British Columbia, Vancouver V6T 1Z4, Canada.
  • Mugisha L; Illinois Natural History Survey, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
  • Pittendrigh BR; Department of Biology and Museum of Southwestern Biology, University of New Mexico, Albuquerque, NM 87131, USA.
  • Leonardi MS; Biodiversity Research Centre, University of British Columbia, Vancouver V6T 1Z4, Canada.
  • Reed DL; Conservation & Ecosystem Health Alliance (CEHA), Kampala, Uganda.
  • Johnson KP; College of Veterinary Medicine, Animal Resources & Biosecurity (COVAB), Makerere University, Uganda.
Syst Biol ; 66(5): 786-798, 2017 Sep 01.
Article em En | MEDLINE | ID: mdl-28123117
ABSTRACT
Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes ($<$1000 Mbp) it is feasible to sequence the entire genome at modest coverage ($10-30\times$). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice (Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close- to distantly related taxa at high to low levels of coverage.Both the concatenated analysis and the coalescent-based analysis produced the same tree topology, which was consistent with previously published results and resolved weakly supported nodes. These results demonstrate that this approach is successful at developing phylogenomic data sets from raw genome sequencing reads. Further, we found that with coverages above $5-10\times$, aTRAM was successful at assembling 80-90% of the contigs for both close and distantly related taxa. As sequencing costs continue to decline, we expect full genome sequencing will become more feasible for a wider array of organisms, and aTRAM will enable mining of these genomic data sets for an extensive variety of applications, including phylogenomics. [aTRAM; gene assembly; genome sequencing; phylogenomics.].
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Filogenia / Classificação / Genômica Idioma: En Ano de publicação: 2017 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Filogenia / Classificação / Genômica Idioma: En Ano de publicação: 2017 Tipo de documento: Article