RESUMO
This study introduces a model selection technique based on Bayesian information criteria for estimating the number of components in a mixture during Diffusion-Ordered Spectroscopy (DOSY) Nuclear Magnetic Resonance (NMR) data analysis. As the accuracy of this technique is dependent on the efficiency of parameter estimators, we further investigate the performance of the Weighted Least Squares (WLS) and Maximum a Posteriori (MAP) estimators. The WLS method, enhanced with meticulously tuned L2-regularization, effectively detects components when the difference in self-diffusion coefficients is more than two-fold, especially when the component with the smaller coefficient has a larger weight ratio. The MAP method, strengthened by a substantial database of prior information, exhibits outstanding precision, decreasing this threshold to 1.5 times. Both estimators provide weight ratio estimates with standard deviations of approximately around 1 percentage point, although the MAP method tends to overestimate the component with a larger self-diffusion coefficient. Deviations from the expected values can exceed 10 percentage points, often due to inaccuracies in component detection. The error estimates are determined using data resampling techniques derived from a large-scale 1000-point experiment and an additional five measurements from a single-component mixture. This approach allowed us to thoroughly examine data distribution characteristics, thereby laying a robust groundwork for future refinement efforts.