Tracing the breeding farm of domesticated pig using feature selection (Sus scrofa).
Kwon, Taehyung; Yoon, Joon; Heo, Jaeyoung; Lee, Wonseok; Kim, Heebal.
Affiliation
  • Kwon T; Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Korea.
  • Yoon J; Interdisciplinary Program in Bioinformatics Department of Natural Science, Seoul National University, Seoul 08826, Korea.
  • Heo J; International Agricultural Development and Cooperation Center, Chonbuk National University, Jeonju 54896, Korea.
  • Lee W; Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Korea.
  • Kim H; Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Korea.
Asian-Australas J Anim Sci ; 30(11): 1540-1549, 2017 Nov.
Article in En | MEDLINE | ID: mdl-29073733
ABSTRACT

OBJECTIVE:

Increasing food safety demands in the animal product market have created a need for a system to trace the food distribution process, from the manufacturer to the retailer, and genetic traceability is an effective method to trace the origin of animal products. In this study, we successfully achieved the farm tracing of 6,018 multi-breed pigs, using single nucleotide polymorphism (SNP) markers strictly selected through least absolute shrinkage and selection operator (LASSO) feature selection.

METHODS:

We performed farm tracing of domesticated pig (Sus scrofa) from SNP markers and selected the most relevant features for accurate prediction. Considering the multi-breed composition of our data, we performed feature selection using LASSO penalization on 4,002 SNPs that are shared between breeds, which also include 179 SNPs with small between-breed differences. The 100 highest-scoring features were extracted from iterative simulations and then evaluated using machine-learning-based classifiers.
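
As a rough illustration of the kind of procedure described here, the sketch below fits a LASSO-penalized model on repeated random subsamples and scores each SNP by how often its coefficient stays non-zero. It is not the authors' pipeline: the data are simulated, the model is a squared-error LASSO on a binary farm indicator solved by coordinate descent, and every name and parameter (`simulate`, `selection_frequency`, `penalty=0.1`, the allele frequencies, the 75% subsample) is an assumption made for the example.

```python
import random

def soft_threshold(value, penalty):
    """Shrink value toward zero by penalty; return exactly 0.0 inside the band."""
    if value > penalty:
        return value - penalty
    if value < -penalty:
        return value + penalty
    return 0.0

def standardize(column):
    """Center a column and scale it to unit variance (constant columns become zeros)."""
    n = len(column)
    mean = sum(column) / n
    sd = (sum((v - mean) ** 2 for v in column) / n) ** 0.5
    sd = sd if sd > 0 else 1.0
    return [(v - mean) / sd for v in column]

def lasso_coordinate_descent(columns, y, penalty, sweeps=20):
    """Minimize (1/2n)||y - Xb||^2 + penalty*||b||_1 for standardized columns."""
    n = len(y)
    beta = [0.0] * len(columns)
    residual = list(y)
    for _ in range(sweeps):
        for j, col in enumerate(columns):
            # Covariance of feature j with the residual, adding back its own fit.
            rho = sum(c * r for c, r in zip(col, residual)) / n + beta[j]
            new = soft_threshold(rho, penalty)
            if new != beta[j]:
                delta = new - beta[j]
                residual = [r - delta * c for r, c in zip(residual, col)]
                beta[j] = new
    return beta

def selection_frequency(genotypes, labels, penalty, rounds=10, fraction=0.75, seed=0):
    """Count, per SNP, the subsampling rounds in which LASSO keeps it non-zero."""
    rng = random.Random(seed)
    n, p = len(genotypes), len(genotypes[0])
    counts = [0] * p
    for _ in range(rounds):
        idx = rng.sample(range(n), int(n * fraction))
        y_mean = sum(labels[i] for i in idx) / len(idx)
        y = [labels[i] - y_mean for i in idx]
        columns = [standardize([genotypes[i][j] for i in idx]) for j in range(p)]
        beta = lasso_coordinate_descent(columns, y, penalty)
        for j in range(p):
            if beta[j] != 0.0:
                counts[j] += 1
    return counts

def top_features(counts, k):
    """Indices of the k highest-scoring SNPs (ties broken by lower index)."""
    return sorted(range(len(counts)), key=lambda j: -counts[j])[:k]

def simulate(n_per_farm=100, n_snps=20, informative=(3, 7), seed=1):
    """Hypothetical data: two farms, genotypes coded 0/1/2, two farm-specific SNPs."""
    rng = random.Random(seed)
    genotypes, labels = [], []
    for farm in (0, 1):
        for _ in range(n_per_farm):
            row = []
            for j in range(n_snps):
                freq = 0.5
                if j in informative:
                    freq = 0.8 if farm == 1 else 0.2
                row.append(sum(rng.random() < freq for _ in range(2)))
            genotypes.append(row)
            labels.append(float(farm))
    return genotypes, labels

genotypes, labels = simulate()
counts = selection_frequency(genotypes, labels, penalty=0.1)
selected = top_features(counts, k=2)
print("selection counts:", counts)
print("top SNP indices:", sorted(selected))
```

Scoring by selection frequency across subsamples is one simple way to rank features from repeated LASSO fits; the abstract does not specify how the iterative scores were computed, so other scoring rules would fit the description equally well.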

RESULTS:

We selected 1,341 SNPs from over 45,000 SNPs through iterative LASSO feature selection, to minimize between-breed differences. We subsequently selected the 100 highest-scoring SNPs from iterative scoring, and observed high statistical measures in cross-validated classification of breeding farms using only these SNPs.
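
To make the cross-validation step concrete, here is a minimal sketch of k-fold evaluation of a farm classifier restricted to a small SNP panel. The abstract does not name the classifiers used, so a nearest-centroid rule stands in for them; the simulated genotypes, the allele frequencies, and all function names (`simulate_farms`, `cross_validated_accuracy`) are assumptions for illustration only.

```python
import random

def simulate_farms(n_per_farm=100, n_snps=5, seed=2):
    """Hypothetical panel: every SNP differs in allele frequency between two farms."""
    rng = random.Random(seed)
    rows = []
    for farm in (0, 1):
        freq = 0.8 if farm == 1 else 0.2
        for _ in range(n_per_farm):
            genotype = [sum(rng.random() < freq for _ in range(2)) for _ in range(n_snps)]
            rows.append((genotype, farm))
    return rows

def centroids(train):
    """Mean genotype vector per farm, computed from the training rows."""
    sums, counts = {}, {}
    for genotype, farm in train:
        acc = sums.setdefault(farm, [0.0] * len(genotype))
        for j, g in enumerate(genotype):
            acc[j] += g
        counts[farm] = counts.get(farm, 0) + 1
    return {farm: [s / counts[farm] for s in acc] for farm, acc in sums.items()}

def predict(model, genotype):
    """Assign the farm whose centroid is closest in squared Euclidean distance."""
    def sq_dist(center):
        return sum((g - c) ** 2 for g, c in zip(genotype, center))
    return min(model, key=lambda farm: sq_dist(model[farm]))

def cross_validated_accuracy(rows, folds=5, seed=0):
    """Shuffle once, hold out each fold in turn, and pool the correct predictions."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    correct = 0
    for k in range(folds):
        test = rows[k::folds]
        train = [r for i, r in enumerate(rows) if i % folds != k]
        model = centroids(train)
        correct += sum(predict(model, g) == farm for g, farm in test)
    return correct / len(rows)

accuracy = cross_validated_accuracy(simulate_farms())
print(f"5-fold accuracy: {accuracy:.2f}")
```

In a real analysis the feature selection itself would need to be repeated inside each training fold; selecting SNPs on the full data and then cross-validating on the same samples tends to overstate accuracy.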

CONCLUSION:

The study represents a successful application of LASSO feature selection to multi-breed pig SNP data for tracing the farm of origin, and it provides a valuable method and a basis for further research on genetic traceability.

Full text: 1 Database: MEDLINE Type of study: Prognostic_studies Language: En Year: 2017 Type: Article
