mid-DeepLabv3+: A Novel Approach for Image Semantic Segmentation Applied to African Food Dietary Assessments.

Baban A Erep, Thierry Roland; Chaari, Lotfi

Affiliation

Baban A Erep TR; Toulouse INP, University of Toulouse, Institut de Recherche en Informatique de Toulouse, 31400 Toulouse, France.
Chaari L; Toulouse INP, University of Toulouse, Institut de Recherche en Informatique de Toulouse, 31400 Toulouse, France.

Sensors (Basel); 24(1), 2023 Dec 29.

Article in En | MEDLINE | ID: mdl-38203070

ABSTRACT

Recent decades have witnessed the development of vision-based dietary assessment (VBDA) systems. These systems generally consist of three main stages: food image analysis, portion estimation, and nutrient derivation. The effectiveness of the first stage depends heavily on accurate segmentation and image recognition models and on the availability of high-quality training datasets. Food image segmentation still faces various challenges, and most existing research focuses on Asian and Western food images. For this reason, this study is based on food images from sub-Saharan Africa, which pose their own problems, such as inter-class similarity and dishes that mix several food classes. This work focuses on the first stage of VBDA systems, where we make two contributions. First, we propose mid-DeepLabv3+, an enhanced food image segmentation model based on DeepLabv3+ with a ResNet50 backbone. Our approach adds a middle layer in the decoder path and applies SimAM after each extracted backbone feature layer. Second, we present CamerFood10, the first food image dataset specifically designed for sub-Saharan African food segmentation. It includes 10 classes of the most consumed food items in Cameroon. On our dataset, mid-DeepLabv3+ outperforms benchmark convolutional neural network models for semantic image segmentation, with an mIoU (mean Intersection over Union) of 65.20%, representing a +10.74% improvement over DeepLabv3+ with the same backbone.
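The reported metric, mIoU, averages the per-class ratio of intersection to union between predicted and ground-truth pixel labels. The following is a minimal sketch of that calculation in plain Python, not the authors' evaluation code. The function name `mean_iou`, the flat label lists, and the choice to skip classes absent from both prediction and ground truth are assumptions for illustration; the abstract does not state how such cases were handled.

```python
from collections import Counter


def mean_iou(pred, target, num_classes):
    """Mean Intersection over Union for two flat sequences of class labels.

    Assumption: a class that appears in neither sequence is skipped
    rather than counted as an IoU of 0 or 1.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    pred_count = Counter(pred)      # pixels predicted as each class
    target_count = Counter(target)  # pixels labelled as each class
    intersection = Counter()        # pixels where both agree, per class
    for p, t in zip(pred, target):
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union == 0:
            continue  # class absent from both sequences
        ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else float("nan")


# Toy example with 3 classes and 6 pixels (illustrative values only).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
print(mean_iou(pred, target, num_classes=3))
```

In the toy example the per-class IoU values are 1/3, 2/3, and 1/2. In practice the two sequences would be the flattened prediction and annotation masks accumulated over a whole test set.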

Subjects

Nutrition Assessment; Semantics; Food; Diet; Nutrients

Keywords

CNN; CamerFood10 dataset; food segmentation; semantic segmentation


Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Semantics / Nutrition Assessment Study type: Prognostic_studies Language: En Publication year: 2023 Document type: Article
