Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 79
Filtrar
1.
Artigo em Inglês | MEDLINE | ID: mdl-38698163

RESUMO

PURPOSE: Informative image selection in laryngoscopy has the potential for improving automatic data extraction alone, for selective data storage and a faster review process, or in combination with other artificial intelligence (AI) detection or diagnosis models. This paper aims to demonstrate the feasibility of AI in providing automatic informative laryngoscopy frame selection also capable of working in real-time providing visual feedback to guide the otolaryngologist during the examination. METHODS: Several deep learning models were trained and tested on an internal dataset (n = 5147 images) and then tested on an external test set (n = 646 images) composed of both white light and narrow band images. Four videos were used to assess the real-time performance of the best-performing model. RESULTS: ResNet-50, pre-trained with the pretext strategy, reached a precision = 95% vs. 97%, recall = 97% vs, 89%, and the F1-score = 96% vs. 93% on the internal and external test set respectively (p = 0.062). The four testing videos are provided in the supplemental materials. CONCLUSION: The deep learning model demonstrated excellent performance in identifying diagnostically relevant frames within laryngoscopic videos. With its solid accuracy and real-time capabilities, the system is promising for its development in a clinical setting, either autonomously for objective quality control or in conjunction with other algorithms within a comprehensive AI toolset aimed at enhancing tumor detection and diagnosis.

3.
Comput Biol Med ; 174: 108430, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38613892

RESUMO

BACKGROUND: To investigate the effectiveness of contrastive learning, in particular SimClr, in reducing the need for large annotated ultrasound (US) image datasets for fetal standard plane identification. METHODS: We explore SimClr advantage in the cases of both low and high inter-class variability, considering at the same time how classification performance varies according to different amounts of labels used. This evaluation is performed by exploiting contrastive learning through different training strategies. We apply both quantitative and qualitative analyses, using standard metrics (F1-score, sensitivity, and precision), Class Activation Mapping (CAM), and t-Distributed Stochastic Neighbor Embedding (t-SNE). RESULTS: When dealing with high inter-class variability classification tasks, contrastive learning does not bring a significant advantage; whereas it results to be relevant for low inter-class variability classification, specifically when initialized with ImageNet weights. CONCLUSIONS: Contrastive learning approaches are typically used when a large number of unlabeled data is available, which is not representative of US datasets. We proved that SimClr either as pre-training with backbone initialized via ImageNet weights or used in an end-to-end dual-task may impact positively the performance over standard transfer learning approaches, under a scenario in which the dataset is small and characterized by low inter-class variability.


Assuntos
Ultrassonografia Pré-Natal , Humanos , Ultrassonografia Pré-Natal/métodos , Gravidez , Feminino , Aprendizado de Máquina , Feto/diagnóstico por imagem , Algoritmos , Interpretação de Imagem Assistida por Computador/métodos , Processamento de Imagem Assistida por Computador/métodos
4.
Comput Med Imaging Graph ; 113: 102350, 2024 04.
Artigo em Inglês | MEDLINE | ID: mdl-38340574

RESUMO

Recent advances in medical imaging have highlighted the critical development of algorithms for individual vertebral segmentation on computed tomography (CT) scans. Essential for diagnostic accuracy and treatment planning in orthopaedics, neurosurgery and oncology, these algorithms face challenges in clinical implementation, including integration into healthcare systems. Consequently, our focus lies in exploring the application of knowledge distillation (KD) methods to train shallower networks capable of efficiently segmenting vertebrae in CT scans. This approach aims to reduce segmentation time, enhance suitability for emergency cases, and optimize computational and memory resource efficiency. Building upon prior research in the field, a two-step segmentation approach was employed. Firstly, the spine's location was determined by predicting a heatmap, indicating the probability of each voxel belonging to the spine. Subsequently, an iterative segmentation of vertebrae was performed from the top to the bottom of the CT volume over the located spine, using a memory instance to record the already segmented vertebrae. KD methods were implemented by training a teacher network with performance similar to that found in the literature, and this knowledge was distilled to a shallower network (student). Two KD methods were applied: (1) using the soft outputs of both networks and (2) matching logits. Two publicly available datasets, comprising 319 CT scans from 300 patients and a total of 611 cervical, 2387 thoracic, and 1507 lumbar vertebrae, were used. To ensure dataset balance and robustness, effective data augmentation methods were applied, including cleaning the memory instance to replicate the first vertebra segmentation. The teacher network achieved an average Dice similarity coefficient (DSC) of 88.22% and a Hausdorff distance (HD) of 7.71 mm, showcasing performance similar to other approaches in the literature. Through knowledge distillation from the teacher network, the student network's performance improved, with an average DSC increasing from 75.78% to 84.70% and an HD decreasing from 15.17 mm to 8.08 mm. Compared to other methods, our teacher network exhibited up to 99.09% fewer parameters, 90.02% faster inference time, 88.46% shorter total segmentation time, and 89.36% less associated carbon (CO2) emission rate. Regarding our student network, it featured 75.00% fewer parameters than our teacher, resulting in a 36.15% reduction in inference time, a 33.33% decrease in total segmentation time, and a 42.96% reduction in CO2 emissions. This study marks the first exploration of applying KD to the problem of individual vertebrae segmentation in CT, demonstrating the feasibility of achieving comparable performance to existing methods using smaller neural networks.


Assuntos
Dióxido de Carbono , Tomografia Computadorizada por Raios X , Humanos , Tomografia Computadorizada por Raios X/métodos , Redes Neurais de Computação , Algoritmos , Vértebras Lombares
5.
JMIR Aging ; 7: e50537, 2024 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-38386279

RESUMO

BACKGROUND: The rise in life expectancy is associated with an increase in long-term and gradual cognitive decline. Treatment effectiveness is enhanced at the early stage of the disease. Therefore, there is a need to find low-cost and ecological solutions for mass screening of community-dwelling older adults. OBJECTIVE: This work aims to exploit automatic analysis of free speech to identify signs of cognitive function decline. METHODS: A sample of 266 participants older than 65 years were recruited in Italy and Spain and were divided into 3 groups according to their Mini-Mental Status Examination (MMSE) scores. People were asked to tell a story and describe a picture, and voice recordings were used to extract high-level features on different time scales automatically. Based on these features, machine learning algorithms were trained to solve binary and multiclass classification problems by using both mono- and cross-lingual approaches. The algorithms were enriched using Shapley Additive Explanations for model explainability. RESULTS: In the Italian data set, healthy participants (MMSE score≥27) were automatically discriminated from participants with mildly impaired cognitive function (20≤MMSE score≤26) and from those with moderate to severe impairment of cognitive function (11≤MMSE score≤19) with accuracy of 80% and 86%, respectively. Slightly lower performance was achieved in the Spanish and multilanguage data sets. CONCLUSIONS: This work proposes a transparent and unobtrusive assessment method, which might be included in a mobile app for large-scale monitoring of cognitive functionality in older adults. Voice is confirmed to be an important biomarker of cognitive decline due to its noninvasive and easily accessible nature.


Assuntos
Disfunção Cognitiva , Fala , Humanos , Idoso , Feminino , Masculino , Disfunção Cognitiva/diagnóstico , Estudos Transversais , Itália/epidemiologia , Idoso de 80 Anos ou mais , Fala/fisiologia , Espanha/epidemiologia , Testes de Estado Mental e Demência , Aprendizado de Máquina , Algoritmos
6.
Laryngoscope ; 134(6): 2826-2834, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38174772

RESUMO

OBJECTIVE: To investigate the potential of deep learning for automatically delineating (segmenting) laryngeal cancer superficial extent on endoscopic images and videos. METHODS: A retrospective study was conducted extracting and annotating white light (WL) and Narrow-Band Imaging (NBI) frames to train a segmentation model (SegMENT-Plus). Two external datasets were used for validation. The model's performances were compared with those of two otolaryngology residents. In addition, the model was tested on real intraoperative laryngoscopy videos. RESULTS: A total of 3933 images of laryngeal cancer from 557 patients were used. The model achieved the following median values (interquartile range): Dice Similarity Coefficient (DSC) = 0.83 (0.70-0.90), Intersection over Union (IoU) = 0.83 (0.73-0.90), Accuracy = 0.97 (0.95-0.99), Inference Speed = 25.6 (25.1-26.1) frames per second. The external testing cohorts comprised 156 and 200 images. SegMENT-Plus performed similarly on all three datasets for DSC (p = 0.05) and IoU (p = 0.07). No significant differences were noticed when separately analyzing WL and NBI test images on DSC (p = 0.06) and IoU (p = 0.78) and when analyzing the model versus the two residents on DSC (p = 0.06) and IoU (Senior vs. SegMENT-Plus, p = 0.13; Junior vs. SegMENT-Plus, p = 1.00). The model was then tested on real intraoperative laryngoscopy videos. CONCLUSION: SegMENT-Plus can accurately delineate laryngeal cancer boundaries in endoscopic images, with performances equal to those of two otolaryngology residents. The results on the two external datasets demonstrate excellent generalization capabilities. The computation speed of the model allowed its application on videolaryngoscopies simulating real-time use. Clinical trials are needed to evaluate the role of this technology in surgical practice and resection margin improvement. LEVEL OF EVIDENCE: III Laryngoscope, 134:2826-2834, 2024.


Assuntos
Aprendizado Profundo , Neoplasias Laríngeas , Laringoscopia , Imagem de Banda Estreita , Humanos , Laringoscopia/métodos , Imagem de Banda Estreita/métodos , Neoplasias Laríngeas/diagnóstico por imagem , Neoplasias Laríngeas/cirurgia , Neoplasias Laríngeas/patologia , Estudos Retrospectivos , Gravação em Vídeo , Masculino , Feminino , Pessoa de Meia-Idade , Luz , Idoso
7.
Int J Comput Assist Radiol Surg ; 19(3): 481-492, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38066354

RESUMO

PURPOSE: In twin-to-twin transfusion syndrome (TTTS), abnormal vascular anastomoses in the monochorionic placenta can produce uneven blood flow between the two fetuses. In the current practice, TTTS is treated surgically by closing abnormal anastomoses using laser ablation. This surgery is minimally invasive and relies on fetoscopy. Limited field of view makes anastomosis identification a challenging task for the surgeon. METHODS: To tackle this challenge, we propose a learning-based framework for in vivo fetoscopy frame registration for field-of-view expansion. The novelties of this framework rely on a learning-based keypoint proposal network and an encoding strategy to filter (i) irrelevant keypoints based on fetoscopic semantic image segmentation and (ii) inconsistent homographies. RESULTS: We validate our framework on a dataset of six intraoperative sequences from six TTTS surgeries from six different women against the most recent state-of-the-art algorithm, which relies on the segmentation of placenta vessels. CONCLUSION: The proposed framework achieves higher performance compared to the state of the art, paving the way for robust mosaicking to provide surgeons with context awareness during TTTS surgery.


Assuntos
Transfusão Feto-Fetal , Terapia a Laser , Gravidez , Feminino , Humanos , Fetoscopia/métodos , Transfusão Feto-Fetal/diagnóstico por imagem , Transfusão Feto-Fetal/cirurgia , Placenta/cirurgia , Placenta/irrigação sanguínea , Terapia a Laser/métodos , Algoritmos
8.
Med Image Anal ; 92: 103066, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38141453

RESUMO

Fetoscopy laser photocoagulation is a widely adopted procedure for treating Twin-to-Twin Transfusion Syndrome (TTTS). The procedure involves photocoagulation pathological anastomoses to restore a physiological blood exchange among twins. The procedure is particularly challenging, from the surgeon's side, due to the limited field of view, poor manoeuvrability of the fetoscope, poor visibility due to amniotic fluid turbidity, and variability in illumination. These challenges may lead to increased surgery time and incomplete ablation of pathological anastomoses, resulting in persistent TTTS. Computer-assisted intervention (CAI) can provide TTTS surgeons with decision support and context awareness by identifying key structures in the scene and expanding the fetoscopic field of view through video mosaicking. Research in this domain has been hampered by the lack of high-quality data to design, develop and test CAI algorithms. Through the Fetoscopic Placental Vessel Segmentation and Registration (FetReg2021) challenge, which was organized as part of the MICCAI2021 Endoscopic Vision (EndoVis) challenge, we released the first large-scale multi-center TTTS dataset for the development of generalized and robust semantic segmentation and video mosaicking algorithms with a focus on creating drift-free mosaics from long duration fetoscopy videos. For this challenge, we released a dataset of 2060 images, pixel-annotated for vessels, tool, fetus and background classes, from 18 in-vivo TTTS fetoscopy procedures and 18 short video clips of an average length of 411 frames for developing placental scene segmentation and frame registration for mosaicking techniques. Seven teams participated in this challenge and their model performance was assessed on an unseen test dataset of 658 pixel-annotated images from 6 fetoscopic procedures and 6 short clips. For the segmentation task, overall baseline performed was the top performing (aggregated mIoU of 0.6763) and was the best on the vessel class (mIoU of 0.5817) while team RREB was the best on the tool (mIoU of 0.6335) and fetus (mIoU of 0.5178) classes. For the registration task, overall the baseline performed better than team SANO with an overall mean 5-frame SSIM of 0.9348. Qualitatively, it was observed that team SANO performed better in planar scenarios, while baseline was better in non-planner scenarios. The detailed analysis showed that no single team outperformed on all 6 test fetoscopic videos. The challenge provided an opportunity to create generalized solutions for fetoscopic scene understanding and mosaicking. In this paper, we present the findings of the FetReg2021 challenge, alongside reporting a detailed literature review for CAI in TTTS fetoscopy. Through this challenge, its analysis and the release of multi-center fetoscopic data, we provide a benchmark for future research in this field.


Assuntos
Transfusão Feto-Fetal , Placenta , Feminino , Humanos , Gravidez , Algoritmos , Transfusão Feto-Fetal/diagnóstico por imagem , Transfusão Feto-Fetal/cirurgia , Transfusão Feto-Fetal/patologia , Fetoscopia/métodos , Feto , Placenta/diagnóstico por imagem
9.
iScience ; 26(12): 108349, 2023 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-38058310

RESUMO

Pesticide exposure, even at low doses, can have detrimental effects on ecosystems. This study aimed at validating the use of machine learning for recognizing motor anomalies, produced by minimal insecticide exposure on a model insect species. The Mediterranean fruit fly, Ceratitis capitata (Diptera: Tephritidae), was exposed to food contaminated with low concentrations of Carlina acaulis essential oil (EO). A deep learning approach enabled fly pose estimation on video recordings in a custom-built arena. Five machine learning algorithms were trained on handcrafted features, extracted from the predicted pose, to distinguish treated individuals. Random Forest and K-Nearest Neighbor algorithms best performed, with an area under the receiver operating characteristic (ROC) curve of 0.75 and 0.73, respectively. Both algorithms achieved an accuracy of 0.71. Results show the machine learning potential for detecting sublethal effects arising from insecticide exposure on fly motor behavior, which could also affect other organisms and environmental health.

10.
Artigo em Inglês | MEDLINE | ID: mdl-38082565

RESUMO

Vocal folds motility evaluation is paramount in both the assessment of functional deficits and in the accurate staging of neoplastic disease of the glottis. Diagnostic endoscopy, and in particular videoendoscopy, is nowadays the method through which the motility is estimated. The clinical diagnosis, however, relies on the examination of the videoendoscopic frames, which is a subjective and professional-dependent task. Hence, a more rigorous, objective, reliable, and repeatable method is needed. To support clinicians, this paper proposes a machine learning (ML) approach for vocal cords motility classification. From the endoscopic videos of 186 patients with both vocal cords preserved motility and fixation, a dataset of 558 images relative to the two classes was extracted. Successively, a number of features was retrieved from the images and used to train and test four well-grounded ML classifiers. From test results, the best performance was achieved using XGBoost, with precision = 0.82, recall = 0.82, F1 score = 0.82, and accuracy = 0.82. After comparing the most relevant ML models, we believe that this approach could provide precise and reliable support to clinical evaluation.Clinical Relevance- This research represents an important advancement in the state-of-the-art of computer-assisted otolaryngology, to develop an effective tool for motility assessment in the clinical practice.


Assuntos
Endoscopia , Prega Vocal , Humanos , Prega Vocal/diagnóstico por imagem , Glote , Gravação de Videoteipe , Aprendizado de Máquina
11.
Artigo em Inglês | MEDLINE | ID: mdl-38082662

RESUMO

Pesticides are still abused in modern agriculture. The effects of their exposure to even sub-lethal doses can be detrimental to ecosystem stability and human health. This work aims to validate the use of machine learning techniques for recognizing motor abnormalities and to assess any effect post-exposure to a minimal dosage of these substances on a model organism, gaining insights into potential risks for human health. The test subject was the Mediterranean fruit fly, Ceratitis capitata (Wiedemann) (Diptera: Tephritidae), exposed to food contaminated with the LC30 of Carlina acaulis essential oil. A deep learning approach enabled the pose estimation within an arena. Statistical analysis highlighted the most significant features between treated and untreated groups. Based on this analysis, two learning-based algorithms, Random Forest (RF) and XGBoost were employed. The results were compared through different metrics. RF algorithm generated a model capable of distinguishing treated subjects with an area under the receiver operating characteristic curve of 0.75 and an accuracy of 0.71. Through an image-based analysis, this study revealed acute effects due to minimal pesticide doses. So, even small amounts of these biocides drifted far from distribution areas may negatively affect the environment and humans.


Assuntos
Ceratitis capitata , Praguicidas , Animais , Humanos , Ceratitis capitata/efeitos dos fármacos , Relação Dose-Resposta a Droga , Ecossistema , Praguicidas/toxicidade , Tephritidae
12.
Artigo em Inglês | MEDLINE | ID: mdl-38083260

RESUMO

Amyloidosis refers to a range of medical conditions in which misshapen proteins accumulate in various organs and tissues, forming insoluble fibrils. Cardiac amyloidosis is frequently linked to the buildup of misfolded transthyretin (TTR) or immunoglobulin light chains (AL). Delayed diagnosis, due to lack of disease awareness, results in a poor prognosis, especially in patients with AL amyloidosis. Early identification is therefore a key factor to improve patient outcomes. This study investigates the use of supervised machine-learning algorithms to support clinicians in classifying amyloidosis and control subjects. The aim of this work is to foster model interpretability reporting the most important risk factors in predicting the presence of cardiac amyloidosis. We analyzed electronic health records (EHRs) of 418 participants acquired in a time window of 12 years as part of a case-control study conducted in Fondazione Toscana Gabriele Monasterio (Italy) clinical practice. This work paves the way for the creation of digital health solutions that can aid in amyloidosis screening. The effective handling, analysis, and interpretation of these solutions can have a transformative effect on modern healthcare, offering new opportunities for improved patient care.


Assuntos
Amiloidose , Cardiomiopatias , Humanos , Estudos de Casos e Controles , Registros Eletrônicos de Saúde , Cardiomiopatias/diagnóstico , Amiloidose/diagnóstico , Amiloidose/metabolismo , Aprendizado de Máquina , Eletrônica
13.
Artigo em Inglês | MEDLINE | ID: mdl-38083494

RESUMO

The identification of fetal-head standard planes (FHSPs) from ultrasound (US) images is of fundamental importance to visualize cerebral structures and diagnose neural anomalies during gestation in a standardized way. To support the activity of healthcare operators, deep-learning algorithms have been proposed to classify these planes. To date, the translation of such algorithms in clinical practice is hampered by several factors, including the lack of large annotated datasets to train robust and generalizable algorithms. This paper proposes an approach to generate synthetic FHSP images with conditional generative adversarial network (cGAN), using class activation maps (CAMs) obtained from FHSP classification algorithms as cGAN conditional prior. Using the largest publicly available FHSP dataset, we generated realistic images of the three common FHSPs: trans-cerebellum, trans-thalamic and trans-ventricular. The evaluation through t-SNE shows the potential of the proposed approach to attenuate the problem of limited availability of annotated FHSP images.


Assuntos
Algoritmos , Encéfalo , Feminino , Gravidez , Humanos , Encéfalo/diagnóstico por imagem , Ultrassonografia Pré-Natal/métodos , Cerebelo , Feto
14.
Artigo em Inglês | MEDLINE | ID: mdl-38083694

RESUMO

Spinal muscular atrophy (SMA) is a rare neuromuscular disease which may cause impairments in oro-facial musculature. Most of the individuals with SMA present bulbar signs such as flaccid dysarthria which mines their abilities to speak and, as consequence, their psychic balance. To support clinicians, recent work has demonstrated the feasibility of video-based techniques for assessing the oro-facial functions in patients with neurological disorders such as amyotrophic lateral sclerosis. However, no work has so far focused on automatic and quantitative monitoring of dysarthria in SMA. To overcome limitations this work's aim is to propose a cloud-based store-and-forward telemonitoring system for automatic and quantitative evaluation of oro-facial muscles in individuals with SMA. The system integrates a convolutional neural network (CNN) aimed at identifying the position of facial landmarks from video recordings acquired via a web application by an SMA patient.Clinical relevance- The proposed work is in the preliminary stage, but it represents the first step towards a better understanding of the bulbar-functions' evolution in patients with SMA.


Assuntos
Esclerose Lateral Amiotrófica , Atrofia Muscular Espinal , Humanos , Disartria/diagnóstico , Disartria/etiologia , Autocuidado , Atrofia Muscular Espinal/complicações , Atrofia Muscular Espinal/diagnóstico , Esclerose Lateral Amiotrófica/complicações , Doenças Raras
16.
Int J Comput Assist Radiol Surg ; 18(12): 2349-2356, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37587389

RESUMO

PURPOSE: Fetoscopic laser photocoagulation of placental anastomoses is the most effective treatment for twin-to-twin transfusion syndrome (TTTS). A robust mosaic of placenta and its vascular network could support surgeons' exploration of the placenta by enlarging the fetoscope field-of-view. In this work, we propose a learning-based framework for field-of-view expansion from intra-operative video frames. METHODS: While current state of the art for fetoscopic mosaicking builds upon the registration of anatomical landmarks which may not always be visible, our framework relies on learning-based features and keypoints, as well as robust transformer-based image-feature matching, without requiring any anatomical priors. We further address the problem of occlusion recovery and frame relocalization, relying on the computed features and their descriptors. RESULTS: Experiments were conducted on 10 in-vivo TTTS videos from two different fetal surgery centers. The proposed framework was compared with several state-of-the-art approaches, achieving higher [Formula: see text] on 7 out of 10 videos and a success rate of [Formula: see text] in occlusion recovery. CONCLUSION: This work introduces a learning-based framework for placental mosaicking with occlusion recovery from intra-operative videos using a keypoint-based strategy and features. The proposed framework can compute the placental panorama and recover even in case of camera tracking loss where other methods fail. The results suggest that the proposed framework has large potential to pave the way to creating a surgical navigation system for TTTS by providing robust field-of-view expansion.


Assuntos
Transfusão Feto-Fetal , Fetoscopia , Feminino , Humanos , Gravidez , Transfusão Feto-Fetal/cirurgia , Fetoscopia/métodos , Fotocoagulação , Placenta/cirurgia
17.
Front Cardiovasc Med ; 10: 1151705, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37424918

RESUMO

Aims: Diagnosis of myocardial fibrosis is commonly performed with late gadolinium contrast-enhanced (CE) cardiac magnetic resonance (CMR), which might be contraindicated or unavailable. Coronary computed tomography (CCT) is emerging as an alternative to CMR. We sought to evaluate whether a deep learning (DL) model could allow identification of myocardial fibrosis from routine early CE-CCT images. Methods and results: Fifty consecutive patients with known left ventricular (LV) dysfunction (LVD) underwent both CE-CMR and (early and late) CE-CCT. According to the CE-CMR patterns, patients were classified as ischemic (n = 15, 30%) or non-ischemic (n = 35, 70%) LVD. Delayed enhancement regions were manually traced on late CE-CCT using CE-CMR as reference. On early CE-CCT images, the myocardial sectors were extracted according to AHA 16-segment model and labeled as with scar or not, based on the late CE-CCT manual tracing. A DL model was developed to classify each segment. A total of 44,187 LV segments were analyzed, resulting in accuracy of 71% and area under the ROC curve of 76% (95% CI: 72%-81%), while, with the bull's eye segmental comparison of CE-CMR and respective early CE-CCT findings, an 89% agreement was achieved. Conclusions: DL on early CE-CCT acquisition may allow detection of LV sectors affected with myocardial fibrosis, thus without additional contrast-agent administration or radiational dose. Such tool might reduce the user interaction and visual inspection with benefit in both efforts and time.

18.
Acta Otorhinolaryngol Ital ; 43(4): 283-290, 2023 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-37488992

RESUMO

Objective: To achieve instance segmentation of upper aerodigestive tract (UADT) neoplasms using a deep learning (DL) algorithm, and to identify differences in its diagnostic performance in three different sites: larynx/hypopharynx, oral cavity and oropharynx. Methods: A total of 1034 endoscopic images from 323 patients were examined under narrow band imaging (NBI). The Mask R-CNN algorithm was used for the analysis. The dataset split was: 935 training, 48 validation and 51 testing images. Dice Similarity Coefficient (Dsc) was the main outcome measure. Results: Instance segmentation was effective in 76.5% of images. The mean Dsc was 0.90 ± 0.05. The algorithm correctly predicted 77.8%, 86.7% and 55.5% of lesions in the larynx/hypopharynx, oral cavity, and oropharynx, respectively. The mean Dsc was 0.90 ± 0.05 for the larynx/hypopharynx, 0.60 ± 0.26 for the oral cavity, and 0.81 ± 0.30 for the oropharynx. The analysis showed inferior diagnostic results in the oral cavity compared with the larynx/hypopharynx (p < 0.001). Conclusions: The study confirms the feasibility of instance segmentation of UADT using DL algorithms and shows inferior diagnostic results in the oral cavity compared with other anatomic areas.


Assuntos
Laringe , Neoplasias , Humanos , Boca , Hipofaringe , Algoritmos
19.
Comput Biol Med ; 163: 107194, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37421736

RESUMO

BACKGROUND AND OBJECTIVES: Patients suffering from neurological diseases may develop dysarthria, a motor speech disorder affecting the execution of speech. Close and quantitative monitoring of dysarthria evolution is crucial for enabling clinicians to promptly implement patients' management strategies and maximizing effectiveness and efficiency of communication functions in term of restoring, compensating or adjusting. In the clinical assessment of orofacial structures and functions, at rest condition or during speech and non-speech movements, a qualitative evaluation is usually performed, throughout visual observation. METHODS: To overcome limitations posed by qualitative assessments, this work presents a store-and-forward self-service telemonitoring system that integrates, within its cloud architecture, a convolutional neural network (CNN) for analyzing video recordings acquired by individuals with dysarthria. This architecture - called facial landmark Mask RCNN - aims at locating facial landmarks as a prior for assessing the orofacial functions related to speech and examining dysarthria evolution in neurological diseases. RESULTS: When tested on the Toronto NeuroFace dataset, a publicly available annotated dataset of video recordings from patients with amyotrophic lateral sclerosis (ALS) and stroke, the proposed CNN achieved a normalized mean error equal to 1.79 on localizing the facial landmarks. We also tested our system in a real-life scenario on 11 bulbar-onset ALS subjects, obtaining promising outcomes in terms of facial landmark position estimation. DISCUSSION AND CONCLUSIONS: This preliminary study represents a relevant step towards the use of remote tools to support clinicians in monitoring the evolution of dysarthria.


Assuntos
Esclerose Lateral Amiotrófica , Disartria , Humanos , Disartria/diagnóstico , Computação em Nuvem , Fala , Gravação em Vídeo
20.
Otolaryngol Head Neck Surg ; 169(4): 811-829, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37051892

RESUMO

OBJECTIVE: The endoscopic and laryngoscopic examination is paramount for laryngeal, oropharyngeal, nasopharyngeal, nasal, and oral cavity benign lesions and cancer evaluation. Nevertheless, upper aerodigestive tract (UADT) endoscopy is intrinsically operator-dependent and lacks objective quality standards. At present, there has been an increased interest in artificial intelligence (AI) applications in this area to support physicians during the examination, thus enhancing diagnostic performances. The relative novelty of this research field poses a challenge both for the reviewers and readers as clinicians often lack a specific technical background. DATA SOURCES: Four bibliographic databases were searched: PubMed, EMBASE, Cochrane, and Google Scholar. REVIEW METHODS: A structured review of the current literature (up to September 2022) was performed. Search terms related to topics of AI, machine learning (ML), and deep learning (DL) in UADT endoscopy and laryngoscopy were identified and queried by 3 independent reviewers. Citations of selected studies were also evaluated to ensure comprehensiveness. CONCLUSIONS: Forty-one studies were included in the review. AI and computer vision techniques were used to achieve 3 fundamental tasks in this field: classification, detection, and segmentation. All papers were summarized and reviewed. IMPLICATIONS FOR PRACTICE: This article comprehensively reviews the latest developments in the application of ML and DL in UADT endoscopy and laryngoscopy, as well as their future clinical implications. The technical basis of AI is also explained, providing guidance for nonexpert readers to allow critical appraisal of the evaluation metrics and the most relevant quality requirements.


Assuntos
Inteligência Artificial , Médicos , Humanos , Endoscopia , Laringoscopia , Aprendizado de Máquina
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...