Your browser doesn't support javascript.
loading
Performance and precision of double digestion RAD (ddRAD) genotyping in large multiplexed datasets of marine fish species.
Maroso, F; Hillen, J E J; Pardo, B G; Gkagkavouzis, K; Coscia, I; Hermida, M; Franch, R; Hellemans, B; Van Houdt, J; Simionati, B; Taggart, J B; Nielsen, E E; Maes, G; Ciavaglia, S A; Webster, L M I; Volckaert, F A M; Martinez, P; Bargelloni, L; Ogden, R.
Affiliation
  • Maroso F; Department of Compared Biomedicine and Food Science, University of Padova, 35020 Legnaro, Italy. Electronic address: francesco.maroso@studenti.unipd.it.
  • Hillen JEJ; Laboratory of Biodiversity and Evolutionary Genomics, University of Leuven, Ch. de Bériotstraat 32 Box 2439, B-3000 Leuven, Belgium.
  • Pardo BG; Departmento de Zoología, Genética y Antropología Física, Universidade de Santiago de Compostela, 27002 Lugo, Spain.
  • Gkagkavouzis K; Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.
  • Coscia I; Laboratory of Biodiversity and Evolutionary Genomics, University of Leuven, Ch. de Bériotstraat 32 Box 2439, B-3000 Leuven, Belgium; School of Environmental and Life Science, Rm 332, Peel Building, University of Salford, Salford M5 4WT, UK.
  • Hermida M; Departmento de Zoología, Genética y Antropología Física, Universidade de Santiago de Compostela, 27002 Lugo, Spain.
  • Franch R; Department of Compared Biomedicine and Food Science, University of Padova, 35020 Legnaro, Italy.
  • Hellemans B; Laboratory of Biodiversity and Evolutionary Genomics, University of Leuven, Ch. de Bériotstraat 32 Box 2439, B-3000 Leuven, Belgium.
  • Van Houdt J; Department of Human Genetics, University of Leuven, O&N I Herestraat 49 - Box 602, B-3000 Leuven, Belgium.
  • Simionati B; BMR Genomics, Via Redipuglia 21a, Padova, Italy.
  • Taggart JB; Division of Environmental and Evolutionary Biology, School of Biology and Biochemistry, The Queen's University of Belfast, Belfast BT7 INN, UK.
  • Nielsen EE; National Institute of Aquatic Resources, Technical University of Denmark, Vejlsøvej 39, 8600 Silkeborg, Denmark.
  • Maes G; Laboratory of Biodiversity and Evolutionary Genomics, University of Leuven, Ch. de Bériotstraat 32 Box 2439, B-3000 Leuven, Belgium; Department of Human Genetics, University of Leuven, O&N I Herestraat 49 - Box 602, B-3000 Leuven, Belgium; Centre for Sustainable Tropical Fisheries and Aquacultur
  • Ciavaglia SA; Science and Advice for Scottish Agriculture, Roddinglaw Road, Edinburgh EH12 9FJ, UK.
  • Webster LMI; Science and Advice for Scottish Agriculture, Roddinglaw Road, Edinburgh EH12 9FJ, UK.
  • Volckaert FAM; Laboratory of Biodiversity and Evolutionary Genomics, University of Leuven, Ch. de Bériotstraat 32 Box 2439, B-3000 Leuven, Belgium.
  • Martinez P; Departmento de Zoología, Genética y Antropología Física, Universidade de Santiago de Compostela, 27002 Lugo, Spain.
  • Bargelloni L; Department of Compared Biomedicine and Food Science, University of Padova, 35020 Legnaro, Italy.
  • Ogden R; Royal (Dick) School of Veterinary Studies and the Roslin Institute, University of Edinburgh, Edinburgh EH25 9RG, UK.
Mar Genomics ; 39: 64-72, 2018 Jun.
Article in En | MEDLINE | ID: mdl-29496460
ABSTRACT
The development of Genotyping-By-Sequencing (GBS) technologies enables cost-effective analysis of large numbers of Single Nucleotide Polymorphisms (SNPs), especially in "non-model" species. Nevertheless, as such technologies enter a mature phase, biases and errors inherent to GBS are becoming evident. Here, we evaluated the performance of double digest Restriction enzyme Associated DNA (ddRAD) sequencing in SNP genotyping studies including high number of samples. Datasets of sequence data were generated from three marine teleost species (>5500 samples, >2.5 × 1012 bases in total), using a standardized protocol. A common bioinformatics pipeline based on STACKS was established, with and without the use of a reference genome. We performed analyses throughout the production and analysis of ddRAD data in order to explore (i) the loss of information due to heterogeneous raw read number across samples; (ii) the discrepancy between expected and observed tag length and coverage; (iii) the performances of reference based vs. de novo approaches; (iv) the sources of potential genotyping errors of the library preparation/bioinformatics protocol, by comparing technical replicates. Our results showed use of a reference genome and a posteriori genotype correction improved genotyping precision. Individual read coverage was a key variable for reproducibility; variance in sequencing depth between loci in the same individual was also identified as an important factor and found to correlate to tag length. A comparison of downstream analysis carried out with ddRAD vs single SNP allele specific assay genotypes provided information about the levels of genotyping imprecision that can have a significant impact on allele frequency estimations and population assignment. The results and insights presented here will help to select and improve approaches to the analysis of large datasets based on RAD-like methodologies.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Genome / Sequence Analysis, DNA / Computational Biology / Genotyping Techniques / Fishes Type of study: Prognostic_studies Limits: Animals Language: En Journal: Mar Genomics Year: 2018 Document type: Article

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Genome / Sequence Analysis, DNA / Computational Biology / Genotyping Techniques / Fishes Type of study: Prognostic_studies Limits: Animals Language: En Journal: Mar Genomics Year: 2018 Document type: Article