RESUMO
Establishing maximum-residue limits (MRLs) for veterinary medicine helps to protect the human food supply. Guidelines for establishing MRLs are outlined by regulatory authorities that drug sponsors follow in each country. During the drug approval process, residue limits are targeted for specific animal species and matrices. Therefore, MRLs are commonly not established for other species. This study demonstrates unestablished MRLs can be reliably predicted for under-represented food commodity groups using machine learning (ML). Classification methods with imbalanced data were used to analyze MRL data from multiple countries by implementing resampling techniques in different ML classifiers. Afterward, we developed and evaluated a data-mining method for predicting unestablished MRLs. Seven different ML classifiers such as support vector classifier, multi-layer perceptron (MLP), random forest, decision tree, k-neighbors, Gaussian NB, and AdaBoost have been selected in this baseline study. Among these, the neural network MLP classifier reliably scored the highest average-weighted F1 score (accuracy >99% with markers and ≈88% without markets) in predicting unestablished MRLs. This provides the first study to apply ML algorithms in regulatory food animal medicine. By predicting and estimating MRLs, we can potentially decrease the use and cost of live animals and the overall research burden of determining new MRLs.