Your browser doesn't support javascript.
loading
BRSET: A Brazilian Multilabel Ophthalmological Dataset of Retina Fundus Photos.
Nakayama, Luis Filipe; Restrepo, David; Matos, João; Ribeiro, Lucas Zago; Malerbi, Fernando Korn; Celi, Leo Anthony; Regatieri, Caio Saito.
Affiliation
  • Nakayama LF; Department of Ophthalmology, São Paulo Federal University, São Paulo, São Paulo, Brazil.
  • Restrepo D; Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
  • Matos J; Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
  • Ribeiro LZ; Telematics Department, University of Cauca, Popayán, Cauca, Colombia.
  • Malerbi FK; Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
  • Celi LA; Faculty of Engineering of University of Porto, Porto, Portugal.
  • Regatieri CS; Department of Ophthalmology, São Paulo Federal University, São Paulo, São Paulo, Brazil.
PLOS Digit Health ; 3(7): e0000454, 2024 Jul.
Article in En | MEDLINE | ID: mdl-38991014
ABSTRACT

INTRODUCTION:

The Brazilian Multilabel Ophthalmological Dataset (BRSET) addresses the scarcity of publicly available ophthalmological datasets in Latin America. BRSET comprises 16,266 color fundus retinal photos from 8,524 Brazilian patients, aiming to enhance data representativeness, serving as a research and teaching tool. It contains sociodemographic information, enabling investigations into differential model performance across demographic groups.

METHODS:

Data from three São Paulo outpatient centers yielded demographic and medical information from electronic records, including nationality, age, sex, clinical history, insulin use, and duration of diabetes diagnosis. A retinal specialist labeled images for anatomical features (optic disc, blood vessels, macula), quality control (focus, illumination, image field, artifacts), and pathologies (e.g., diabetic retinopathy). Diabetic retinopathy was graded using International Clinic Diabetic Retinopathy and Scottish Diabetic Retinopathy Grading. Validation used a ConvNext model trained during 50 epochs using a weighted cross entropy loss to avoid overfitting, with 70% training (20% validation), and 30% testing subsets. Performance metrics included area under the receiver operating curve (AUC) and Macro F1-score. Saliency maps were calculated for interpretability.

RESULTS:

BRSET comprises 65.1% Canon CR2 and 34.9% Nikon NF5050 images. 61.8% of the patients are female, and the average age is 57.6 (± 18.26) years. Diabetic retinopathy affected 15.8% of patients, across a spectrum of disease severity. Anatomically, 20.2% showed abnormal optic discs, 4.9% abnormal blood vessels, and 28.8% abnormal macula. A ConvNext V2 model was trained and evaluated BRSET in four prediction tasks "binary diabetic retinopathy diagnosis (Normal vs Diabetic Retinopathy)" (AUC 97, F1 89); "3 class diabetic retinopathy diagnosis (Normal, Proliferative, Non-Proliferative)" (AUC 97, F1 82); "diabetes diagnosis" (AUC 91, F1 83); "sex classification" (AUC 87, F1 70).

DISCUSSION:

BRSET is the first multilabel ophthalmological dataset in Brazil and Latin America. It provides an opportunity for investigating model biases by evaluating performance across demographic groups. The model performance of three prediction tasks demonstrates the value of the dataset for external validation and for teaching medical computer vision to learners in Latin America using locally relevant data sources.

Full text: 1 Collection: 01-internacional Database: MEDLINE Country/Region as subject: America do sul / Brasil Language: En Journal: PLOS Digit Health Year: 2024 Document type: Article Affiliation country: Brasil

Full text: 1 Collection: 01-internacional Database: MEDLINE Country/Region as subject: America do sul / Brasil Language: En Journal: PLOS Digit Health Year: 2024 Document type: Article Affiliation country: Brasil