RESUMO
This paper aims to provide a comprehensive and innovative 12-lead electrocardiogram (ECG) dataset tailored to understand the unique needs of professional football players. Other ECG datasets are available but collected from common people, normally with diseases confirmed, while it is well known that ECG characteristics change in athletes and elite players as a result of their intense long-term physical training. This initiative is part of a broader research project employing machine learning (ML) to analyse ECG data in this athlete population and explore them according to the International criteria for ECG interpretation in athletes. The dataset is generated through the establishment of a prospective observational cohort consisting of 54 male football players from La Liga, representing a UEFA Pro-level team. Named the Pro-Football 12-lead Resting Electrocardiogram Database (PF12RED), it comprises 163 10-s ECG recordings, offering a detailed examination of the at-rest heart activity of professional football athletes. Data collection spans five phases over multiple seasons, including the 2018-2019 postseason, the 2019-20 preseason, the 2020-21 preseason, and the 2021-22 preseason. Athletes undergo medical evaluations that include a 10-s resting 12-lead ECG performed with General Electric's USB-CAM 14 module (https://co.services.gehealthcare.com/gehcstorefront/p/900995-002), with data saved using General Electric's CardioSoft V6.73 12SL V21 ECG Software. (https://www.gehealthcare.es/products/cardiosoft-v7) The data collection adheres to ethical principles, with clearance granted by the Autonomous Community of Andalusia Ethics Committee (Spain) under protocol number 1573-N-19 in December 2019. Participants provide informed consent, and data sharing is permitted following anonymization. The study aligns with the Declaration of Helsinki and adheres to the recommendations of the International Committee of Medical Journal Editors (ICMJE). The generated dataset serves as a valuable resource for research in sports cardiology and cardiac health. Its potential for reuse encompasses:1.International Comparison: Enabling cross-regional comparisons of cardiac characteristics among elite football players, enriching international studies.2.ML Model Development: Facilitating the development and refinement of machine learning models for arrhythmia detection, serving as a benchmark dataset.3.Validation of Diagnostic Methods: Allowing the validation of automatic diagnostic methods, contributing to enhanced accuracy in detecting cardiac conditions.4.Research in Sports Cardiology: Supporting future investigations into specific cardiac adaptations in elite athletes and their relation to cardiovascular health.5.Reference for Athlete Protection Policies: Influencing athlete protection policies by providing data on cardiac health and suggesting guidelines for medical assessments.6.Health Professionals Training: Serving as a training resource for health professionals interested in interpreting ECGs in sports contexts.7.Tool and Application Development: Facilitating the development of tools and applications related to the visualization, simulation and analysis of ECG signals in athletes.