Búsqueda | Portal de Búsqueda de la BVS

1.

A dataset for predicting cloud cover over Europe.

Svennevik, Hanna; Hicks, Steven A; Riegler, Michael A; Storelvmo, Trude; Hammer, Hugo L.

Sci Data ; 11(1): 245, 2024 Feb 27.

Artículo en Inglés | MEDLINE | ID: mdl-38413601

RESUMEN

Clouds are important factors when projecting future climate. Unfortunately, future cloud fractional cover (the portion of the sky covered by clouds) is associated with significant uncertainty, making climate projections difficult. In this paper, we present the European Cloud Cover dataset, which can be used to learn statistical relations between cloud cover and other environmental variables, to potentially improve future climate projections. The dataset was created using a novel technique called Area Weighting Regridding Scheme to map satellite observations to cloud fractional cover on the same grid as the other variables in the dataset. Baseline experiments using autoregressive models document that it is possible to use the dataset to predict cloud fractional cover.

2.

A large-scale multivariate soccer athlete health, performance, and position monitoring dataset.

Midoglu, Cise; Kjæreng Winther, Andreas; Boeker, Matthias; Dahl Pettersen, Susann; Pedersen, Sigurd; Ragab, Nourhan; Kupka, Tomas; Hicks, Steven A; Bredsgaard Randers, Morten; Jain, Ramesh; Dagenborg, Håvard J; Pettersen, Svein Arne; Johansen, Dag; Riegler, Michael A; Halvorsen, Pål.

Sci Data ; 11(1): 553, 2024 May 30.

Artículo en Inglés | MEDLINE | ID: mdl-38816403

RESUMEN

Data analysis for athletic performance optimization and injury prevention is of tremendous interest to sports teams and the scientific community. However, sports data are often sparse and hard to obtain due to legal restrictions, unwillingness to share, and lack of personnel resources to be assigned to the tedious process of data curation. These constraints make it difficult to develop automated systems for analysis, which require large datasets for learning. We therefore present SoccerMon, the largest soccer athlete dataset available today containing both subjective and objective metrics, collected from two different elite women's soccer teams over two years. Our dataset contains 33,849 subjective reports and 10,075 objective reports, the latter including over six billion GPS position measurements. SoccerMon can not only play a valuable role in developing better analysis and prediction systems for soccer, but also inspire similar data collection activities in other domains which can benefit from subjective athlete reports, GPS position information, and/or time-series data in general.

Asunto(s)

Rendimiento Atlético , Fútbol , Humanos , Femenino , Sistemas de Información Geográfica , Atletas

3.

Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges.

Jha, Debesh; Sharma, Vanshali; Banik, Debapriya; Bhattacharya, Debayan; Roy, Kaushiki; Hicks, Steven A; Tomar, Nikhil Kumar; Thambawita, Vajira; Krenzer, Adrian; Ji, Ge-Peng; Poudel, Sahadev; Batchkala, George; Alam, Saruar; Ahmed, Awadelrahman M A; Trinh, Quoc-Huy; Khan, Zeshan; Nguyen, Tien-Phat; Shrestha, Shruti; Nathan, Sabari; Gwak, Jeonghwan; Jha, Ritika K; Zhang, Zheyuan; Schlaefer, Alexander; Bhattacharjee, Debotosh; Bhuyan, M K; Das, Pradip K; Fan, Deng-Ping; Parasa, Sravanthi; Ali, Sharib; Riegler, Michael A; Halvorsen, Pål; de Lange, Thomas; Bagci, Ulas.

Med Image Anal ; 99: 103307, 2024 Sep 05.

Artículo en Inglés | MEDLINE | ID: mdl-39303447

RESUMEN

Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue leading to a high polyp miss-rate. Therefore, there is a need for an automated system that can flag missed polyps during the examination and improve patient care. Deep learning has emerged as a promising solution to this challenge as it can assist endoscopists in detecting and classifying overlooked polyps and abnormalities in real time, improving the accuracy of diagnosis and enhancing treatment. In addition to the algorithm's accuracy, transparency and interpretability are crucial to explaining the whys and hows of the algorithm's prediction. Further, conclusions based on incorrect decisions may be fatal, especially in medicine. Despite these pitfalls, most algorithms are developed in private data, closed source, or proprietary software, and methods lack reproducibility. Therefore, to promote the development of efficient and transparent methods, we have organized the "Medico automatic polyp segmentation (Medico 2020)" and "MedAI: Transparency in Medical Image Segmentation (MedAI 2021)" competitions. The Medico 2020 challenge received submissions from 17 teams, while the MedAI 2021 challenge also gathered submissions from another 17 distinct teams in the following year. We present a comprehensive summary and analyze each contribution, highlight the strength of the best-performing methods, and discuss the possibility of clinical translations of such methods into the clinic. Our analysis revealed that the participants improved dice coefficient metrics from 0.8607 in 2020 to 0.8993 in 2021 despite adding diverse and challenging frames (containing irregular, smaller, sessile, or flat polyps), which are frequently missed during a routine clinical examination. For the instrument segmentation task, the best team obtained a mean Intersection over union metric of 0.9364. For the transparency task, a multi-disciplinary team, including expert gastroenterologists, accessed each submission and evaluated the team based on open-source practices, failure case analysis, ablation studies, usability and understandability of evaluations to gain a deeper understanding of the models' credibility for clinical deployment. The best team obtained a final transparency score of 21 out of 25. Through the comprehensive analysis of the challenge, we not only highlight the advancements in polyp and surgical instrument segmentation but also encourage subjective evaluation for building more transparent and understandable AI-based colonoscopy systems. Moreover, we discuss the need for multi-center and out-of-distribution testing to address the current limitations of the methods to reduce the cancer burden and improve patient care.

4.

Sperm motility assessed by deep convolutional neural networks into WHO categories.

Haugen, Trine B; Witczak, Oliwia; Hicks, Steven A; Björndahl, Lars; Andersen, Jorunn M; Riegler, Michael A.

Sci Rep ; 13(1): 14777, 2023 09 07.

Artículo en Inglés | MEDLINE | ID: mdl-37679484

RESUMEN

Semen analysis is central in infertility investigation. Manual assessment of sperm motility according to the WHO recommendations is the golden standard, and extensive training is a requirement for accurate and reproducible results. Deep convolutional neural networks (DCNN) are especially suitable for image classification. In this study, we evaluated the performance of the DCNN ResNet-50 in predicting the proportion of sperm in the WHO motility categories. Two models were evaluated using tenfold cross-validation with 65 video recordings of wet semen preparations from an external quality assessment programme for semen analysis. The corresponding manually assessed data was obtained from several of the reference laboratories, and the mean values were used for training of the DCNN models. One model was trained to predict the three categories progressive motility, non-progressive motility, and immotile spermatozoa. Another model was used in predicting four categories, where progressive motility was differentiated into rapid and slow. The resulting average mean absolute error (MAE) was 0.05 and 0.07, and the average ZeroR baseline was 0.09 and 0.10 for the three-category and the four-category model, respectively. Manual and DCNN-predicted motility was compared by Pearson's correlation coefficient and by difference plots. The strongest correlation between the mean manually assessed values and DCNN-predicted motility was observed for % progressively motile spermatozoa (Pearson's r = 0.88, p < 0.001) and % immotile spermatozoa (r = 0.89, p < 0.001). For rapid progressive motility, the correlation was moderate (Pearson's r = 0.673, p < 0.001). The median difference between manual and predicted progressive motility was 0 and 2 for immotile spermatozoa. The largest bias was observed at high and low percentages of progressive and immotile spermatozoa. The DCNN-predicted value was within the range of the interlaboratory variation of the results for most of the samples. In conclusion, DCNN models were able to predict the proportion of spermatozoa into the WHO motility categories with significantly lower error than the baseline. The best correlation between the manual and the DCNN-predicted motility values was found for the categories progressive and immotile. Of note, there was considerable variation between the mean motility values obtained for each category by the reference laboratories, especially for rapid progressive motility, which impacts the training of the DCNN models.

Asunto(s)

Semen , Motilidad Espermática , Masculino , Humanos , Análisis de Semen , Redes Neurales de la Computación , Organización Mundial de la Salud

5.

Usefulness of Heat Map Explanations for Deep-Learning-Based Electrocardiogram Analysis.

Storås, Andrea M; Andersen, Ole Emil; Lockhart, Sam; Thielemann, Roman; Gnesin, Filip; Thambawita, Vajira; Hicks, Steven A; Kanters, Jørgen K; Strümke, Inga; Halvorsen, Pål; Riegler, Michael A.

Diagnostics (Basel) ; 13(14)2023 Jul 11.

Artículo en Inglés | MEDLINE | ID: mdl-37510089

RESUMEN

Deep neural networks are complex machine learning models that have shown promising results in analyzing high-dimensional data such as those collected from medical examinations. Such models have the potential to provide fast and accurate medical diagnoses. However, the high complexity makes deep neural networks and their predictions difficult to understand. Providing model explanations can be a way of increasing the understanding of "black box" models and building trust. In this work, we applied transfer learning to develop a deep neural network to predict sex from electrocardiograms. Using the visual explanation method Grad-CAM, heat maps were generated from the model in order to understand how it makes predictions. To evaluate the usefulness of the heat maps and determine if the heat maps identified electrocardiogram features that could be recognized to discriminate sex, medical doctors provided feedback. Based on the feedback, we concluded that, in our setting, this mode of explainable artificial intelligence does not provide meaningful information to medical doctors and is not useful in the clinic. Our results indicate that improved explanation techniques that are tailored to medical data should be developed before deep neural networks can be applied in the clinic for diagnostic purposes.

6.

VISEM-Tracking, a human spermatozoa tracking dataset.

Thambawita, Vajira; Hicks, Steven A; Storås, Andrea M; Nguyen, Thu; Andersen, Jorunn M; Witczak, Oliwia; Haugen, Trine B; Hammer, Hugo L; Halvorsen, Pål; Riegler, Michael A.

Sci Data ; 10(1): 260, 2023 05 09.

Artículo en Inglés | MEDLINE | ID: mdl-37156762

RESUMEN

A manual assessment of sperm motility requires microscopy observation, which is challenging due to the fast-moving spermatozoa in the field of view. To obtain correct results, manual evaluation requires extensive training. Therefore, computer-aided sperm analysis (CASA) has become increasingly used in clinics. Despite this, more data is needed to train supervised machine learning approaches in order to improve accuracy and reliability in the assessment of sperm motility and kinematics. In this regard, we provide a dataset called VISEM-Tracking with 20 video recordings of 30 seconds (comprising 29,196 frames) of wet semen preparations with manually annotated bounding-box coordinates and a set of sperm characteristics analyzed by experts in the domain. In addition to the annotated data, we provide unlabeled video clips for easy-to-use access and analysis of the data via methods such as self- or unsupervised learning. As part of this paper, we present baseline sperm detection performances using the YOLOv5 deep learning (DL) model trained on the VISEM-Tracking dataset. As a result, we show that the dataset can be used to train complex DL models to analyze spermatozoa.

Asunto(s)

Semen , Motilidad Espermática , Espermatozoides , Humanos , Masculino , Reproducibilidad de los Resultados , Grabación en Video

7.

On evaluation metrics for medical applications of artificial intelligence.

Hicks, Steven A; Strümke, Inga; Thambawita, Vajira; Hammou, Malek; Riegler, Michael A; Halvorsen, Pål; Parasa, Sravanthi.

Sci Rep ; 12(1): 5979, 2022 04 08.

Artículo en Inglés | MEDLINE | ID: mdl-35395867

RESUMEN

Clinicians and software developers need to understand how proposed machine learning (ML) models could improve patient care. No single metric captures all the desirable properties of a model, which is why several metrics are typically reported to summarize a model's performance. Unfortunately, these measures are not easily understandable by many clinicians. Moreover, comparison of models across studies in an objective manner is challenging, and no tool exists to compare models using the same performance metrics. This paper looks at previous ML studies done in gastroenterology, provides an explanation of what different metrics mean in the context of binary classification in the presented studies, and gives a thorough explanation of how different metrics should be interpreted. We also release an open source web-based tool that may be used to aid in calculating the most relevant metrics presented in this paper so that other researchers and clinicians may easily incorporate them into their research.

Asunto(s)

Inteligencia Artificial , Benchmarking , Humanos , Aprendizaje Automático , Programas Informáticos

8.

SinGAN-Seg: Synthetic training data generation for medical image segmentation.

Thambawita, Vajira; Salehi, Pegah; Sheshkal, Sajad Amouei; Hicks, Steven A; Hammer, Hugo L; Parasa, Sravanthi; Lange, Thomas de; Halvorsen, Pål; Riegler, Michael A.

PLoS One ; 17(5): e0267976, 2022.

Artículo en Inglés | MEDLINE | ID: mdl-35500005

RESUMEN

Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Therefore, artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the data used to train them. Large amounts of data can be difficult to obtain in medicine due to privacy reasons, expensive and time-consuming annotations, and a general lack of data samples for infrequent lesions. In this study, we present a novel synthetic data generation pipeline, called SinGAN-Seg, to produce synthetic medical images with corresponding masks using a single training image. Our method is different from the traditional generative adversarial networks (GANs) because our model needs only a single image and the corresponding ground truth to train. We also show that the synthetic data generation pipeline can be used to produce alternative artificial segmentation datasets with corresponding ground truth masks when real datasets are not allowed to share. The pipeline is evaluated using qualitative and quantitative comparisons between real data and synthetic data to show that the style transfer technique used in our pipeline significantly improves the quality of the generated data and our method is better than other state-of-the-art GANs to prepare synthetic images when the size of training datasets are limited. By training UNet++ using both real data and the synthetic data generated from the SinGAN-Seg pipeline, we show that the models trained on synthetic data have very close performances to those trained on real data when both datasets have a considerable amount of training data. In contrast, we show that synthetic data generated from the SinGAN-Seg pipeline improves the performance of segmentation models when training datasets do not have a considerable amount of data. All experiments were performed using an open dataset and the code is publicly available on GitHub.

Asunto(s)

Aprendizaje Profundo , Procesamiento de Imagen Asistido por Computador , Algoritmos , Inteligencia Artificial , Procesamiento de Imagen Asistido por Computador/métodos , Redes Neurales de la Computación

9.

Impact of Image Resolution on Deep Learning Performance in Endoscopy Image Classification: An Experimental Study Using a Large Dataset of Endoscopic Images.

Thambawita, Vajira; Strümke, Inga; Hicks, Steven A; Halvorsen, Pål; Parasa, Sravanthi; Riegler, Michael A.

Diagnostics (Basel) ; 11(12)2021 Nov 24.

Artículo en Inglés | MEDLINE | ID: mdl-34943421

RESUMEN

Recent trials have evaluated the efficacy of deep convolutional neural network (CNN)-based AI systems to improve lesion detection and characterization in endoscopy. Impressive results are achieved, but many medical studies use a very small image resolution to save computing resources at the cost of losing details. Today, no conventions between resolution and performance exist, and monitoring the performance of various CNN architectures as a function of image resolution provides insights into how subtleties of different lesions on endoscopy affect performance. This can help set standards for image or video characteristics for future CNN-based models in gastrointestinal (GI) endoscopy. This study examines the performance of CNNs on the HyperKvasir dataset, consisting of 10,662 images from 23 different findings. We evaluate two CNN models for endoscopic image classification under quality distortions with image resolutions ranging from 32 × 32 to 512 × 512 pixels. The performance is evaluated using two-fold cross-validation and F1-score, maximum Matthews correlation coefficient (MCC), precision, and sensitivity as metrics. Increased performance was observed with higher image resolution for all findings in the dataset. MCC was achieved at image resolutions between 512 × 512 pixels for classification for the entire dataset after including all subclasses. The highest performance was observed with an MCC value of 0.9002 when the models were trained on the highest resolution and tested on the same resolution. Different resolutions and their effect on CNNs are explored. We show that image resolution has a clear influence on the performance which calls for standards in the field in the future.

10.

DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine.

Thambawita, Vajira; Isaksen, Jonas L; Hicks, Steven A; Ghouse, Jonas; Ahlberg, Gustav; Linneberg, Allan; Grarup, Niels; Ellervik, Christina; Olesen, Morten Salling; Hansen, Torben; Graff, Claus; Holstein-Rathlou, Niels-Henrik; Strümke, Inga; Hammer, Hugo L; Maleckar, Mary M; Halvorsen, Pål; Riegler, Michael A; Kanters, Jørgen K.

Sci Rep ; 11(1): 21896, 2021 11 09.

Artículo en Inglés | MEDLINE | ID: mdl-34753975

RESUMEN

Recent global developments underscore the prominent role big data have in modern medical science. But privacy issues constitute a prevalent problem for collecting and sharing data between researchers. However, synthetic data generated to represent real data carrying similar information and distribution may alleviate the privacy issue. In this study, we present generative adversarial networks (GANs) capable of generating realistic synthetic DeepFake 10-s 12-lead electrocardiograms (ECGs). We have developed and compared two methods, named WaveGAN* and Pulse2Pulse. We trained the GANs with 7,233 real normal ECGs to produce 121,977 DeepFake normal ECGs. By verifying the ECGs using a commercial ECG interpretation program (MUSE 12SL, GE Healthcare), we demonstrate that the Pulse2Pulse GAN was superior to the WaveGAN* to produce realistic ECGs. ECG intervals and amplitudes were similar between the DeepFake and real ECGs. Although these synthetic ECGs mimic the dataset used for creation, the ECGs are not linked to any individuals and may thus be used freely. The synthetic dataset will be available as open access for researchers at OSF.io and the DeepFake generator available at the Python Package Index (PyPI) for generating synthetic ECGs. In conclusion, we were able to generate realistic synthetic ECGs using generative adversarial neural networks on normal ECGs from two population studies, thereby addressing the relevant privacy issues in medical datasets.

Asunto(s)

Electrocardiografía , Redes Neurales de la Computación , Simulación por Computador , Conjuntos de Datos como Asunto , Humanos , Privacidad

11.

Explaining deep neural networks for knowledge discovery in electrocardiogram analysis.

Hicks, Steven A; Isaksen, Jonas L; Thambawita, Vajira; Ghouse, Jonas; Ahlberg, Gustav; Linneberg, Allan; Grarup, Niels; Strümke, Inga; Ellervik, Christina; Olesen, Morten Salling; Hansen, Torben; Graff, Claus; Holstein-Rathlou, Niels-Henrik; Halvorsen, Pål; Maleckar, Mary M; Riegler, Michael A; Kanters, Jørgen K.

Sci Rep ; 11(1): 10949, 2021 05 26.

Artículo en Inglés | MEDLINE | ID: mdl-34040033

RESUMEN

Deep learning-based tools may annotate and interpret medical data more quickly, consistently, and accurately than medical doctors. However, as medical doctors are ultimately responsible for clinical decision-making, any deep learning-based prediction should be accompanied by an explanation that a human can understand. We present an approach called electrocardiogram gradient class activation map (ECGradCAM), which is used to generate attention maps and explain the reasoning behind deep learning-based decision-making in ECG analysis. Attention maps may be used in the clinic to aid diagnosis, discover new medical knowledge, and identify novel features and characteristics of medical tests. In this paper, we showcase how ECGradCAM attention maps can unmask how a novel deep learning model measures both amplitudes and intervals in 12-lead electrocardiograms, and we show an example of how attention maps may be used to develop novel ECG features.

Asunto(s)

Aprendizaje Profundo , Electrocardiografía , Descubrimiento del Conocimiento , Modelos Cardiovasculares , Adulto , Anciano , Algoritmos , Cardiólogos , Exactitud de los Datos , Diagnóstico por Computador , Femenino , Cardiopatías/diagnóstico , Cardiopatías/fisiopatología , Humanos , Masculino , Persona de Mediana Edad , Análisis para Determinación del Sexo

12.

Kvasir-Capsule, a video capsule endoscopy dataset.

Smedsrud, Pia H; Thambawita, Vajira; Hicks, Steven A; Gjestang, Henrik; Nedrejord, Oda Olsen; Næss, Espen; Borgli, Hanna; Jha, Debesh; Berstad, Tor Jan Derek; Eskeland, Sigrun L; Lux, Mathias; Espeland, Håvard; Petlund, Andreas; Nguyen, Duc Tien Dang; Garcia-Ceja, Enrique; Johansen, Dag; Schmidt, Peter T; Toth, Ervin; Hammer, Hugo L; de Lange, Thomas; Riegler, Michael A; Halvorsen, Pål.

Sci Data ; 8(1): 142, 2021 05 27.

Artículo en Inglés | MEDLINE | ID: mdl-34045470

RESUMEN

Artificial intelligence (AI) is predicted to have profound effects on the future of video capsule endoscopy (VCE) technology. The potential lies in improving anomaly detection while reducing manual labour. Existing work demonstrates the promising benefits of AI-based computer-assisted diagnosis systems for VCE. They also show great potential for improvements to achieve even better results. Also, medical data is often sparse and unavailable to the research community, and qualified medical personnel rarely have time for the tedious labelling work. We present Kvasir-Capsule, a large VCE dataset collected from examinations at a Norwegian Hospital. Kvasir-Capsule consists of 117 videos which can be used to extract a total of 4,741,504 image frames. We have labelled and medically verified 47,238 frames with a bounding box around findings from 14 different classes. In addition to these labelled images, there are 4,694,266 unlabelled frames included in the dataset. The Kvasir-Capsule dataset can play a valuable role in developing better algorithms in order to reach true potential of VCE technology.

Asunto(s)

Endoscopía Capsular , Enfermedades Intestinales/patología , Intestino Delgado/patología , Aprendizaje Automático , Humanos

13.

Machine Learning-Based Analysis of Sperm Videos and Participant Data for Male Fertility Prediction.

Hicks, Steven A; Andersen, Jorunn M; Witczak, Oliwia; Thambawita, Vajira; Halvorsen, Pål; Hammer, Hugo L; Haugen, Trine B; Riegler, Michael A.

Sci Rep ; 9(1): 16770, 2019 11 14.

Artículo en Inglés | MEDLINE | ID: mdl-31727961

RESUMEN

Methods for automatic analysis of clinical data are usually targeted towards a specific modality and do not make use of all relevant data available. In the field of male human reproduction, clinical and biological data are not used to its fullest potential. Manual evaluation of a semen sample using a microscope is time-consuming and requires extensive training. Furthermore, the validity of manual semen analysis has been questioned due to limited reproducibility, and often high inter-personnel variation. The existing computer-aided sperm analyzer systems are not recommended for routine clinical use due to methodological challenges caused by the consistency of the semen sample. Thus, there is a need for an improved methodology. We use modern and classical machine learning techniques together with a dataset consisting of 85 videos of human semen samples and related participant data to automatically predict sperm motility. Used techniques include simple linear regression and more sophisticated methods using convolutional neural networks. Our results indicate that sperm motility prediction based on deep learning using sperm motility videos is rapid to perform and consistent. Adding participant data did not improve the algorithms performance. In conclusion, machine learning-based automatic analysis may become a valuable tool in male infertility investigation and research.

Asunto(s)

Infertilidad Masculina/diagnóstico , Análisis de Semen/métodos , Espermatozoides/fisiología , Humanos , Aprendizaje Automático , Masculino , Microscopía por Video , Redes Neurales de la Computación , Reproducibilidad de los Resultados , Motilidad Espermática

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA