Your browser doesn't support javascript.
loading
InPhaDel: integrative shotgun and proximity-ligation sequencing to phase deletions with single nucleotide polymorphisms.
Patel, Anand; Edge, Peter; Selvaraj, Siddarth; Bansal, Vikas; Bafna, Vineet.
Afiliación
  • Patel A; Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA adp002@ucsd.edu.
  • Edge P; Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA.
  • Selvaraj S; Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA.
  • Bansal V; Department of Pediatrics, School of Medicine, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA.
  • Bafna V; Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA.
Nucleic Acids Res ; 44(12): e111, 2016 07 08.
Article en En | MEDLINE | ID: mdl-27105843
ABSTRACT
Phasing of single nucleotide (SNV), and structural variations into chromosome-wide haplotypes in humans has been challenging, and required either trio sequencing or restricting phasing to population-based haplotypes. Selvaraj et al demonstrated single individual SNV phasing is possible with proximity ligated (HiC) sequencing. Here, we demonstrate HiC can phase structural variants into phased scaffolds of SNVs. Since HiC data is noisy, and SV calling is challenging, we applied a range of supervised classification techniques, including Support Vector Machines and Random Forest, to phase deletions. Our approach was demonstrated on deletion calls and phasings on the NA12878 human genome. We used three NA12878 chromosomes and simulated chromosomes to train model parameters. The remaining NA12878 chromosomes withheld from training were used to evaluate phasing accuracy. Random Forest had the highest accuracy and correctly phased 86% of the deletions with allele-specific read evidence. Allele-specific read evidence was found for 76% of the deletions. HiC provides significant read evidence for accurately phasing 33% of the deletions. Also, eight of eight top ranked deletions phased by only HiC were validated using long range polymerase chain reaction and Sanger. Thus, deletions from a single individual can be accurately phased using a combination of shotgun and proximity ligation sequencing. InPhaDel software is available at http//l337x911.github.io/inphadel/.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Análisis de Secuencia de ADN / Polimorfismo de Nucleótido Simple / Variación Estructural del Genoma / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Humans Idioma: En Revista: Nucleic Acids Res Año: 2016 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Análisis de Secuencia de ADN / Polimorfismo de Nucleótido Simple / Variación Estructural del Genoma / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Humans Idioma: En Revista: Nucleic Acids Res Año: 2016 Tipo del documento: Article País de afiliación: Estados Unidos