Your browser doesn't support javascript.
loading
Inference and visualization of DNA damage patterns using a grade of membership model.
Al-Asadi, Hussein; Dey, Kushal K; Novembre, John; Stephens, Matthew.
Afiliación
  • Al-Asadi H; Committee on Evolutionary Biology, University of Chicago, Chicago, IL, USA.
  • Dey KK; Department of Statistics, University of Chicago, Chicago, IL, USA.
  • Novembre J; Department of Statistics, University of Chicago, Chicago, IL, USA.
  • Stephens M; Committee on Evolutionary Biology, University of Chicago, Chicago, IL, USA.
Bioinformatics ; 35(8): 1292-1298, 2019 04 15.
Article en En | MEDLINE | ID: mdl-30192911
ABSTRACT
MOTIVATION Quality control plays a major role in the analysis of ancient DNA (aDNA). One key step in this quality control is assessment of DNA damage aDNA contains unique signatures of DNA damage that distinguish it from modern DNA, and so analyses of damage patterns can help confirm that DNA sequences obtained are from endogenous aDNA rather than from modern contamination. Predominant signatures of DNA damage include a high frequency of cytosine to thymine substitutions (C-to-T) at the ends of fragments, and elevated rates of purines (A & G) before the 5' strand-breaks. Existing QC procedures help assess damage by simply plotting for each sample, the C-to-T mismatch rate along the read and the composition of bases before the 5' strand-breaks. Here we present a more flexible and comprehensive model-based approach to infer and visualize damage patterns in aDNA, implemented in an R package aRchaic. This approach is based on a 'grade of membership' model (also known as 'admixture' or 'topic' model) in which each sample has an estimated grade of membership in each of K damage profiles that are estimated from the data.

RESULTS:

We illustrate aRchaic on data from several aDNA studies and modern individuals from 1000 Genomes Project Consortium (2012). Here, aRchaic clearly distinguishes modern from ancient samples irrespective of DNA extraction, lab and sequencing protocols. Additionally, through an in-silico contamination experiment, we show that the aRchaic grades of membership reflect relative levels of exogenous modern contamination. Together, the outputs of aRchaic provide a concise visual summary of DNA damage patterns, as well as other processes generating mismatches in the data. AVAILABILITY AND IMPLEMENTATION aRchaic is available for download from https//www.github.com/kkdey/aRchaic. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Daño del ADN / Genoma Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Daño del ADN / Genoma Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: Estados Unidos