RESUMO
Yearly, cannabis belongs to the most seized drugs worldwide. During judicial investigations, illicit cannabis profiling can be performed to compare seized herbal material. However, comparison is challenging because of the natural heterogeneity of the psychoactive crop. Gas chromatography-mass spectrometry (GC-MS) profiles, consisting of eight cannabinoids, were used to study the intra-location (within) and inter-location (between) variabilities. Decision thresholds were derived from the 95% and 99% confidence limits, applying Pearson correlation coefficients for the intra-location samples. The false negatives and false positives (FPs) determined the discriminative power of different pretreatments applied to obtain the lowest FP error rate possible. Initially, a 97 samples data set was used and with log transformation as pretreatment, a decrease in FPs from 38% and 45% FPs to 17% and 22%, for both confidence limits, respectively, was seen relative to internal standard normalization that was used as reference. An additional intra-plantation variability study with 130 samples verified whether the initial model contained sufficient within-location information, but this was not the case. Hence, a combined data matrix was constructed with all seized samples. Log transformation provided the best FP results for both limits, that is, an improvement from 58% and 64% to 21% and 26%, respectively, was seen. The representativeness of these 'linked' thresholds was demonstrated using both cross-validation and an external set, for which similar FP results as for the calibration set were obtained. By applying data pretreatment, a significant improvement was observed to distinguish seized samples. However, the FP rate is still not at an acceptable level to defend in court.
RESUMO
Cannabis sativa L. is widely used as recreational illegal drugs. Illicit Cannabis profiling, comparing seized samples, is challenging due to natural Cannabis heterogeneity. The aim of this study was to use GC-FID and GC-MS herbal fingerprints for intra (within)- and inter (between)-location variability evaluation. This study focused on finding an acceptable threshold to link seized samples. Through Pearson correlation-coefficient calculations between intra-location samples, 'linked' thresholds were derived using 95% and 99% confidence limits. False negative (FN) and false positive (FP) error rate calculations, aiming at obtaining the lowest possible FP value, were performed for different data pre-treatments. Fingerprint-alignment parameters were optimized using Automated Correlation-Optimized Warping (ACOW) or Design of Experiments (DoE), which presented similar results. Hence, ACOW data, as reference, showed 54% and 65% FP values (95 and 99% confidence, respectively). An additional fourth root normalization pre-treatment provided the best results for both the GC-FID and GC-MS datasets. For GC-FID, which showed the best improved FP error rate, 54 and 65% FP for the reference data decreased to 24 and 32%, respectively, after fourth root transformation. Cross-validation showed FP values similar as the entire calibration set, indicating the representativeness of the thresholds. A noteworthy improvement in discrimination between seized Cannabis samples could be concluded.
Assuntos
Cannabis/química , Cromatografia Gasosa , Drogas Ilícitas/análise , Drogas Ilícitas/química , Área Sob a Curva , Cromatografia Gasosa/métodos , Análise de Dados , Cromatografia Gasosa-Espectrometria de Massas , Curva ROC , Reprodutibilidade dos TestesRESUMO
Selected-Ion Flow-Tube Mass Spectrometry (SIFT-MS) has been applied in a clinical context as diagnostic tool for breath samples using target biomarkers. Exhaled breath sampling is non-invasive and therefore much more patient friendly compared to bronchoscopy, which is the golden standard for evaluating airway inflammation. In the actual pilot study, 55 exhaled breath samples of children with asthma, cystic-fibrosis and healthy individuals were included. Rather than focusing on the analysis of target biomarkers or on the identification of biomarkers, different data analysis strategies, including a variety of pretreatment, classification and discrimination techniques, are evaluated regarding their capacity to distinguish the three classes based on subtle differences in their full scan SIFT-MS spectra. Proper data-analysis strategies are required because these full scan spectra contain much external, i.e. unwanted, variation. Each SIFT-MS analysis generates three spectra resulting from ion-molecule reactions of analyte molecules with H3O+, NO+ and O2+. Models were built with Linear Discriminant Analysis, Quadratic Discriminant Analysis, Soft Independent Modelling by Class Analogy, Partial Least Squares - Discriminant Analysis, K-Nearest Neighbours, and Classification and Regression Trees. Perfect models, concerning overall sensitivity and specificity (100% for both) were found using Direct Orthogonal Signal Correction (DOSC) pretreatment. Given the uncertainty related to the classification models associated with DOSC pretreatments (i.e. good classification found also for random classes), other models are built applying other preprocessing approaches. A Partial Least Squares - Discriminant Analysis model with a combined pre-processing method considering single value imputation results in 100% sensitivity and specificity for calibration, but was less good predictive. Pareto scaling prior to Quadratic Discriminant Analysis resulted in 41/55 correctly classified samples for calibration and 34/55 for cross-validation. In future, the uncertainty with DOSC and the applicability of the promising preprocessing methods and models must be further studied applying a larger representative data set with a more extensive number of samples for each class. Nevertheless, this pilot study showed already some potential for the untargeted SIFT-MS application as a rapid pattern-recognition technique, useful in the diagnosis of clinical breath samples.