Generating realistic data through modeling and parametric probability for the numerical evaluation of data processing algorithms in two-dimensional chromatography.

Milani, Nino B L; García-Cicourel, Alan Rodrigo; Blomberg, Jan; Edam, Rob; Samanipour, Saer; Bos, Tijmen S; Pirok, Bob W J

Affiliation

Milani NBL; Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands. Electronic address: n.b.l.milani@uva.nl.
García-Cicourel AR; Shell Global Solutions International B.V., Grasweg 31, 1031 HW, Amsterdam, the Netherlands.
Blomberg J; Shell Global Solutions International B.V., Grasweg 31, 1031 HW, Amsterdam, the Netherlands.
Edam R; Shell Global Solutions International B.V., Grasweg 31, 1031 HW, Amsterdam, the Netherlands.
Samanipour S; Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands.
Bos TS; Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands.
Pirok BWJ; Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands. Electronic address: B.W.J.Pirok@uva.nl.

Anal Chim Acta; 1312: 342724, 2024 Jul 11.

Article in English | MEDLINE | ID: mdl-38834259

ABSTRACT

BACKGROUND:

Comprehensive two-dimensional chromatography generates complex data sets, and numerous baseline correction and noise removal algorithms have been proposed in the past decade to address this challenge. However, their performance cannot currently be evaluated objectively, because objective reference data are lacking.

RESULTS:

To tackle this issue, we introduce a versatile platform that models and reconstructs single-trace two-dimensional chromatography data while preserving peak parameters. This approach balances real experimental data with synthetic data for precise comparisons. We achieve this by representing each peak with a Skewed Lorentz-Normal model and by creating probability distributions from which the relevant parameters are sampled. The model's performance was demonstrated on two-dimensional gas chromatography data, where it produced a data set of 458 peaks with an RMSE of 0.0048 or lower and minimal residuals relative to the original data. The same process was also demonstrated on liquid chromatography data.
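The workflow described above (model each peak, fit probability distributions to the peak parameters, sample new peaks, and compare traces by RMSE) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not give the Skewed Lorentz-Normal formula, so `peak` uses a simple stand-in (a Lorentzian/Gaussian blend with side-dependent width), the log-normal width distribution is an arbitrary choice, and all function names, parameter ranges, and the example widths are hypothetical.

```python
import math
import random


def peak(t, height, center, width, skew=0.0, mix=0.5):
    """Illustrative asymmetric peak (stand-in, NOT the paper's exact model).

    Blends a Lorentzian and a Gaussian; skew > 0 widens the trailing side
    and narrows the leading side of the peak.
    """
    w = width * (1.0 + skew) if t > center else width * (1.0 - skew)
    z = (t - center) / w
    gauss = math.exp(-0.5 * z * z)
    lorentz = 1.0 / (1.0 + z * z)
    return height * (mix * lorentz + (1.0 - mix) * gauss)


def fit_lognormal(values):
    """Estimate log-normal parameters (mu, sigma) from positive values."""
    logs = [math.log(v) for v in values]
    mu = sum(logs) / len(logs)
    var = sum((x - mu) ** 2 for x in logs) / (len(logs) - 1)
    return mu, math.sqrt(var)


def rmse(a, b):
    """Root-mean-square error between two equally long traces."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))


def synthesize(n_peaks, width_params, t_axis, rng):
    """Sample peak parameters and build a noise-free trace with known truth."""
    mu, sigma = width_params
    truth = []
    for _ in range(n_peaks):
        truth.append({
            "height": rng.uniform(0.1, 1.0),        # hypothetical range
            "center": rng.uniform(t_axis[0], t_axis[-1]),
            "width": rng.lognormvariate(mu, sigma),  # sampled from the fit
            "skew": rng.uniform(0.0, 0.3),           # hypothetical range
        })
    signal = [sum(peak(t, **p) for p in truth) for t in t_axis]
    return truth, signal


rng = random.Random(0)
observed_widths = [0.8, 1.1, 0.9, 1.3, 1.0]  # made-up "measured" widths
params = fit_lognormal(observed_widths)
t_axis = [i * 0.1 for i in range(600)]
truth, clean = synthesize(20, params, t_axis, rng)
noisy = [y + rng.gauss(0.0, 0.01) for y in clean]  # add known noise
error = rmse(noisy, clean)
```

Because `truth` and `clean` are known exactly, a baseline-correction or denoising algorithm applied to `noisy` could be scored directly against them, which is the kind of quantitative assessment the abstract refers to.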

SIGNIFICANCE:

Data analysis is an integral component of any analytical method. The development of new data processing strategies is of paramount importance to tackle the complex signals generated by state-of-the-art separation technology. Through the use of probability distributions, quantitative assessment of the performance of new algorithms is now possible, creating new opportunities for faster, more accurate, and simpler development of data analysis methods.

Keywords

Data synthesis; Peak detection; Peak modeling; Two-dimensional chromatography
