Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
IEEE Trans Biomed Eng ; PP2024 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-38557627

RESUMEN

OBJECTIVES: Data scarcity and domain shifts lead to biased training sets that do not accurately represent deployment conditions. A related practical problem is cross-modal image segmentation, where the objective is to segment unlabelled images using previously labelled datasets from other imaging modalities. METHODS: We propose a cross-modal segmentation method based on conventional image synthesis boosted by a new data augmentation technique called Generative Blending Augmentation (GBA). GBA leverages a SinGAN model to learn representative generative features from a single training image to diversify realistically tumor appearances. This way, we compensate for image synthesis errors, subsequently improving the generalization power of a downstream segmentation model. The proposed augmentation is further combined to an iterative self-training procedure leveraging pseudo labels at each pass. RESULTS: The proposed solution ranked first for vestibular schwannoma (VS) segmentation during the validation and test phases of the MICCAI CrossMoDA 2022 challenge, with best mean Dice similarity and average symmetric surface distance measures. CONCLUSION AND SIGNIFICANCE: Local contrast alteration of tumor appearances and iterative self-training with pseudo labels are likely to lead to performance improvements in a variety of segmentation contexts.

2.
Comput Med Imaging Graph ; 113: 102349, 2024 04.
Artículo en Inglés | MEDLINE | ID: mdl-38330635

RESUMEN

Autosomal-dominant polycystic kidney disease is a prevalent genetic disorder characterized by the development of renal cysts, leading to kidney enlargement and renal failure. Accurate measurement of total kidney volume through polycystic kidney segmentation is crucial to assess disease severity, predict progression and evaluate treatment effects. Traditional manual segmentation suffers from intra- and inter-expert variability, prompting the exploration of automated approaches. In recent years, convolutional neural networks have been employed for polycystic kidney segmentation from magnetic resonance images. However, the use of Transformer-based models, which have shown remarkable performance in a wide range of computer vision and medical image analysis tasks, remains unexplored in this area. With their self-attention mechanism, Transformers excel in capturing global context information, which is crucial for accurate organ delineations. In this paper, we evaluate and compare various convolutional-based, Transformers-based, and hybrid convolutional/Transformers-based networks for polycystic kidney segmentation. Additionally, we propose a dual-task learning scheme, where a common feature extractor is followed by per-kidney decoders, towards better generalizability and efficiency. We extensively evaluate various architectures and learning schemes on a heterogeneous magnetic resonance imaging dataset collected from 112 patients with polycystic kidney disease. Our results highlight the effectiveness of Transformer-based models for polycystic kidney segmentation and the relevancy of exploiting dual-task learning to improve segmentation accuracy and mitigate data scarcity issues. A promising ability in accurately delineating polycystic kidneys is especially shown in the presence of heterogeneous cyst distributions and adjacent cyst-containing organs. This work contribute to the advancement of reliable delineation methods in nephrology, paving the way for a broad spectrum of clinical applications.


Asunto(s)
Quistes , Enfermedades Renales Poliquísticas , Riñón Poliquístico Autosómico Dominante , Humanos , Riñón/diagnóstico por imagen , Riñón Poliquístico Autosómico Dominante/diagnóstico por imagen , Riñón Poliquístico Autosómico Dominante/patología , Enfermedades Renales Poliquísticas/patología , Imagen por Resonancia Magnética/métodos , Quistes/patología
3.
Comput Med Imaging Graph ; 110: 102308, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-37918328

RESUMEN

Multi-modal medical image segmentation is a crucial task in oncology that enables the precise localization and quantification of tumors. The aim of this work is to present a meta-analysis of the use of multi-modal medical Transformers for medical image segmentation in oncology, specifically focusing on multi-parametric MR brain tumor segmentation (BraTS2021), and head and neck tumor segmentation using PET-CT images (HECKTOR2021). The multi-modal medical Transformer architectures presented in this work exploit the idea of modality interaction schemes based on visio-linguistic representations: (i) single-stream, where modalities are jointly processed by one Transformer encoder, and (ii) multiple-stream, where the inputs are encoded separately before being jointly modeled. A total of fourteen multi-modal architectures are evaluated using different ranking strategies based on dice similarity coefficient (DSC) and average symmetric surface distance (ASSD) metrics. In addition, cost indicators such as the number of trainable parameters and the number of multiply-accumulate operations (MACs) are reported. The results demonstrate that multi-path hybrid CNN-Transformer-based models improve segmentation accuracy when compared to traditional methods, but come at the cost of increased computation time and potentially larger model size.


Asunto(s)
Benchmarking , Tomografía Computarizada por Tomografía de Emisión de Positrones , Procesamiento de Imagen Asistido por Computador
4.
Med Biol Eng Comput ; 55(12): 2123-2141, 2017 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-28550413

RESUMEN

The visual examination of the vibration patterns of the vocal folds is an essential method to understand the phonation process and diagnose voice disorders. However, a detailed analysis of the phonation based on this technique requires a manual or a semi-automatic segmentation of the glottal area, which is difficult and time consuming. The present work presents a cuasi-automatic framework to accurately segment the glottal area introducing several techniques not explored before in the state of the art. The method takes advantage of the possibility of a minimal user intervention for those cases where the automatic computation fails. The presented method shows a reliable delimitation of the glottal gap, achieving an average improvement of 13 and 18% with respect to two other approaches found in the literature, while reducing the error of wrong detection of total closure instants. Additionally, the results suggest that the set of validation guidelines proposed can be used to standardize the criteria of accuracy and efficiency of the segmentation algorithms.


Asunto(s)
Procesamiento de Imagen Asistido por Computador/métodos , Fonación/fisiología , Pliegues Vocales/fisiología , Adulto , Anciano de 80 o más Años , Algoritmos , Fenómenos Biomecánicos , Femenino , Glotis/diagnóstico por imagen , Glotis/fisiología , Humanos , Masculino , Persona de Mediana Edad , Modelos Biológicos , Vibración , Grabación en Video , Pliegues Vocales/diagnóstico por imagen
5.
Biomed Res Int ; 2015: 259239, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26557656

RESUMEN

Disordered voices are frequently assessed by speech pathologists using perceptual evaluations. This might lead to problems caused by the subjective nature of the process and due to the influence of external factors which compromise the quality of the assessment. In order to increase the reliability of the evaluations, the design of automatic evaluation systems is desirable. With that in mind, this paper presents an automatic system which assesses the Grade and Roughness level of the speech according to the GRBAS perceptual scale. Two parameterization methods are used: one based on the classic Mel-Frequency Cepstral Coefficients, which has already been used successfully in previous works, and other derived from modulation spectra. For the latter, a new group of parameters has been proposed, named Modulation Spectra Morphological Parameters: MSC, DRB, LMR, MSH, MSW, CIL, PALA, and RALA. In methodology, PCA and LDA are employed to reduce the dimensionality of feature space, and GMM classifiers to evaluate the ability of the proposed features on distinguishing the different levels. Efficiencies of 81.6% and 84.7% are obtained for Grade and Roughness, respectively, using modulation spectra parameters, while MFCCs performed 80.5% and 77.7%. The obtained results suggest the usefulness of the proposed Modulation Spectra Morphological Parameters for automatic evaluation of Grade and Roughness in the speech.


Asunto(s)
Procesamiento de Señales Asistido por Computador , Espectrografía del Sonido/métodos , Trastornos de la Voz/clasificación , Trastornos de la Voz/diagnóstico , Adulto , Femenino , Humanos , Masculino , Persona de Mediana Edad , Reproducibilidad de los Resultados , Voz , Adulto Joven
6.
Biomed Eng Online ; 14: 100, 2015 Oct 29.
Artículo en Inglés | MEDLINE | ID: mdl-26510707

RESUMEN

BACKGROUND: The image-based analysis of the vocal folds vibration plays an important role in the diagnosis of voice disorders. The analysis is based not only on the direct observation of the video sequences, but also in an objective characterization of the phonation process by means of features extracted from the recorded images. However, such analysis is based on a previous accurate identification of the glottal gap, which is the most challenging step for a further automatic assessment of the vocal folds vibration. METHODS: In this work, a complete framework to automatically segment and track the glottal area (or glottal gap) is proposed. The algorithm identifies a region of interest that is adapted along time, and combine active contours and watershed transform for the final delineation of the glottis and also an automatic procedure for synthesize different videokymograms is proposed. RESULTS: Thanks to the ROI implementation, our technique is robust to the camera shifting and also the objective test proved the effectiveness and performance of the approach in the most challenging scenarios that it is when exist an inappropriate closure of the vocal folds. CONCLUSIONS: The novelties of the proposed algorithm relies on the used of temporal information for identify an adaptive ROI and the use of watershed merging combined with active contours for the glottis delimitation. Additionally, an automatic procedure for synthesize multiline VKG by the identification of the glottal main axis is developed.


Asunto(s)
Endoscopía , Procesamiento de Imagen Asistido por Computador/métodos , Pliegues Vocales , Automatización , Humanos , Fonación , Factores de Tiempo , Pliegues Vocales/fisiología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA