RESUMEN
BACKGROUND: Urogenital schistosomiasis is considered a Neglected Tropical Disease (NTD) by the World Health Organization (WHO). It is estimated to affect 150 million people worldwide, with a high relevance in resource-poor settings of the African continent. The gold-standard diagnosis is still direct observation of Schistosoma haematobium eggs in urine samples by optical microscopy. Novel diagnostic techniques based on digital image analysis by Artificial Intelligence (AI) tools are a suitable alternative for schistosomiasis diagnosis. METHODOLOGY: Digital images of 24 urine sediment samples were acquired in non-endemic settings. S. haematobium eggs were manually labeled in digital images by laboratory professionals and used for training YOLOv5 and YOLOv8 models, which would achieve automatic detection and localization of the eggs. Urine sediment images were also employed to perform binary classification of images to detect erythrocytes/leukocytes with the MobileNetv3Large, EfficientNetv2, and NasNetLarge models. A robotized microscope system was employed to automatically move the slide through the X-Y axis and to auto-focus the sample. RESULTS: A total number of 1189 labels were annotated in 1017 digital images from urine sediment samples. YOLOv5x training demonstrated a 99.3% precision, 99.4% recall, 99.3% F-score, and 99.4% mAP0.5 for S. haematobium detection. NasNetLarge has an 85.6% accuracy for erythrocyte/leukocyte detection with the test dataset. Convolutional neural network training and comparison demonstrated that YOLOv5x for the detection of eggs and NasNetLarge for the binary image classification to detect erythrocytes/leukocytes were the best options for our digital image database. CONCLUSIONS: The development of low-cost novel diagnostic techniques based on the detection and identification of S. haematobium eggs in urine by AI tools would be a suitable alternative to conventional microscopy in non-endemic settings. This technical proof-of-principle study allows laying the basis for improving the system, and optimizing its implementation in the laboratories.
RESUMEN
In a clinical context, conventional optical microscopy is commonly used for the visualization of biological samples for diagnosis. However, the availability of molecular techniques and rapid diagnostic tests are reducing the use of conventional microscopy, and consequently the number of experienced professionals starts to decrease. Moreover, the continuous visualization during long periods of time through an optical microscope could affect the final diagnosis results due to induced human errors and fatigue. Therefore, microscopy automation is a challenge to be achieved and address this problem. The aim of the study is to develop a low-cost automated system for the visualization of microbiological/parasitological samples by using a conventional optical microscope, and specially designed for its implementation in resource-poor settings laboratories. A 3D-prototype to automate the majority of conventional optical microscopes was designed. Pieces were built with 3D-printing technology and polylactic acid biodegradable material with Tinkercad/Ultimaker Cura 5.1 slicing softwares. The system's components were divided into three subgroups: microscope stage pieces, storage/autofocus-pieces, and smartphone pieces. The prototype is based on servo motors, controlled by Arduino open-source electronic platform, to emulate the X-Y and auto-focus (Z) movements of the microscope. An average time of 27.00 ± 2.58 seconds is required to auto-focus a single FoV. Auto-focus evaluation demonstrates a mean average maximum Laplacian value of 11.83 with tested images. The whole automation process is controlled by a smartphone device, which is responsible for acquiring images for further diagnosis via convolutional neural networks. The prototype is specially designed for resource-poor settings, where microscopy diagnosis is still a routine process. The coalescence between convolutional neural network predictive models and the automation of the movements of a conventional optical microscope confer the system a wide range of image-based diagnosis applications. The accessibility of the system could help improve diagnostics and provide new tools to laboratories worldwide.
Asunto(s)
Microscopía , Microscopía/métodos , Microscopía/instrumentación , Microscopía/economía , Humanos , Impresión Tridimensional/instrumentación , Programas Informáticos , Robótica/instrumentación , Teléfono Inteligente , Automatización , Imagenología Tridimensional/métodosRESUMEN
Introduction: Malaria is one of the most prevalent infectious diseases in sub-Saharan Africa, with 247 million cases reported worldwide in 2021 according to the World Health Organization. Optical microscopy remains the gold standard technique for malaria diagnosis, however, it requires expertise, is time-consuming and difficult to reproduce. Therefore, new diagnostic techniques based on digital image analysis using artificial intelligence tools can improve diagnosis and help automate it. Methods: In this study, a dataset of 2571 labeled thick blood smear images were created. YOLOv5x, Faster R-CNN, SSD, and RetinaNet object detection neural networks were trained on the same dataset to evaluate their performance in Plasmodium parasite detection. Attention modules were applied and compared with YOLOv5x results. To automate the entire diagnostic process, a prototype of 3D-printed pieces was designed for the robotization of conventional optical microscopy, capable of auto-focusing the sample and tracking the entire slide. Results: Comparative analysis yielded a performance for YOLOv5x on a test set of 92.10% precision, 93.50% recall, 92.79% F-score, and 94.40% mAP0.5 for leukocyte, early and mature Plasmodium trophozoites overall detection. F-score values of each category were 99.0% for leukocytes, 88.6% for early trophozoites and 87.3% for mature trophozoites detection. Attention modules performance show non-significant statistical differences when compared to YOLOv5x original trained model. The predictive models were integrated into a smartphone-computer application for the purpose of image-based diagnostics in the laboratory. The system can perform a fully automated diagnosis by the auto-focus and X-Y movements of the robotized microscope, the CNN models trained for digital image analysis, and the smartphone device. The new prototype would determine whether a Giemsa-stained thick blood smear sample is positive/negative for Plasmodium infection and its parasite levels. The whole system was integrated into the iMAGING smartphone application. Conclusion: The coalescence of the fully-automated system via auto-focus and slide movements and the autonomous detection of Plasmodium parasites in digital images with a smartphone software and AI algorithms confers the prototype the optimal features to join the global effort against malaria, neglected tropical diseases and other infectious diseases.
RESUMEN
Malaria is an infectious disease caused by parasites of the genus Plasmodium spp. It is transmitted to humans by the bite of an infected female Anopheles mosquito. It is the most common disease in resource-poor settings, with 241 million malaria cases reported in 2020 according to the World Health Organization. Optical microscopy examination of blood smears is the gold standard technique for malaria diagnosis; however, it is a time-consuming method and a well-trained microscopist is needed to perform the microbiological diagnosis. New techniques based on digital imaging analysis by deep learning and artificial intelligence methods are a challenging alternative tool for the diagnosis of infectious diseases. In particular, systems based on Convolutional Neural Networks for image detection of the malaria parasites emulate the microscopy visualization of an expert. Microscope automation provides a fast and low-cost diagnosis, requiring less supervision. Smartphones are a suitable option for microscopic diagnosis, allowing image capture and software identification of parasites. In addition, image analysis techniques could be a fast and optimal solution for the diagnosis of malaria, tuberculosis, or Neglected Tropical Diseases in endemic areas with low resources. The implementation of automated diagnosis by using smartphone applications and new digital imaging technologies in low-income areas is a challenge to achieve. Moreover, automating the movement of the microscope slide and image autofocusing of the samples by hardware implementation would systemize the procedure. These new diagnostic tools would join the global effort to fight against pandemic malaria and other infectious and poverty-related diseases.
RESUMEN
Modern organizations typically store their data in a raw format in data lakes. These data are then processed and usually stored under hybrid layouts, because they allow projection and selection operations. Thus, they allow (when required) to read less data from the disk. However, this is not very well exploited by distributed processing frameworks (e.g., Hadoop, Spark) when analytical queries are posed. These frameworks divide the data into multiple partitions and then process each partition in a separate task, consequently creating tasks based on the total file size and not the actual size of the data to be read. This typically leads to launching more tasks than needed, which, in turn, increases the query execution time and induces significant waste of computing resources. To allow a more efficient use of resources and reduce the query execution time, we propose a method that decides the number of tasks based on the data being read. To this end, we first propose a cost-based model for estimating the size of data read in hybrid layouts. Next, we use the estimated reading size in a multi-objective optimization method to decide the number of tasks and computational resources to be used. We prototyped our solution for Apache Parquet and Spark and found that our estimations are highly correlated (0.96) with the real executions. Further, using TPC-H we show that our recommended configurations are only 5.6% away from the Pareto front and provide 2.1 × speedup compared with default solutions.