Your browser doesn't support javascript.
loading
Designing and Evaluating a Hierarchical Framework for Matching Food Outlets across Multi-sourced Geospatial Datasets: a Case Study of San Diego County.
Cao, Yanjia; Yang, Jiue-An; Nara, Atsushi; Jankowska, Marta M.
Afiliação
  • Cao Y; Department of Geography, The University of Hong Kong, Pok Fu Lam, Hong Kong. yanjiac@hku.hk.
  • Yang JA; Department of Population Sciences, Beckman Research Institute, City of Hope, Duarte, CA, USA.
  • Nara A; Department of Geography, San Diego State University, San Diego, CA, USA.
  • Jankowska MM; Department of Population Sciences, Beckman Research Institute, City of Hope, Duarte, CA, USA.
J Urban Health ; 101(1): 155-169, 2024 Feb.
Article em En | MEDLINE | ID: mdl-38167974
ABSTRACT
Research on retail food environment (RFE) relies on data availability and accuracy. However, the discrepancies in RFE datasets may lead to imprecision when measuring association with health outcomes. In this research, we present a two-tier hierarchical point of interest (POI) matching framework to compare and triangulate food outlets across multiple geospatial data sources. Two matching parameters were used including the geodesic distance between businesses and the similarity of business names according to Levenshtein distance (LD) and Double Metaphone (DM). Sensitivity analysis was conducted to determine thresholds of matching parameters. Our Tier 1 matching used more restricted parameters to generate high confidence-matched POIs, whereas in Tier 2 we opted for relaxed matching parameters and applied a weighted multi-attribute model on the previously unmatched records. Our case study in San Diego County, California used government, commercial, and crowdsourced data and returned 20.2% matched records from Tier 1 and 18.6% matched from Tier 2. Our manual validation shows a 100% matching rate for Tier 1 and up to 30.6% for Tier 2. Matched and unmatched records from Tier 1 were further analyzed for spatial patterns and categorical differences. Our hierarchical POI matching framework generated highly confident food POIs by conflating datasets and identified some food POIs that are unique to specific data sources. Triangulating RFE data can reduce uncertain and invalid POI listings when representing food environment using multiple data sources. Studies investigating associations between food environment and health outcomes may benefit from improved quality of RFE.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Meio Ambiente / Abastecimento de Alimentos Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Meio Ambiente / Abastecimento de Alimentos Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article