Basic level scene understanding: categories, attributes and structures.

Xiao, Jianxiong; Hays, James; Russell, Bryan C; Patterson, Genevieve; Ehinger, Krista A; Torralba, Antonio; Oliva, Aude

Xiao, Jianxiong; Hays, James; Russell, Bryan C; Patterson, Genevieve; Ehinger, Krista A; Torralba, Antonio; Oliva, Aude.

Afiliação

Xiao J; Computer Science, Princeton University Princeton, NJ, USA.

Front Psychol ; 4: 506, 2013.

Article em En | MEDLINE | ID: mdl-24009590

ABSTRACT

ABSTRACT

A longstanding goal of computer vision is to build a system that can automatically understand a 3D scene from a single image. This requires extracting semantic concepts and 3D information from 2D images which can depict an enormous variety of environments that comprise our visual world. This paper summarizes our recent efforts toward these goals. First, we describe the richly annotated SUN database which is a collection of annotated images spanning 908 different scene categories with object, attribute, and geometric labels for many scenes. This database allows us to systematically study the space of scenes and to establish a benchmark for scene and object recognition. We augment the categorical SUN database with 102 scene attributes for every image and explore attribute recognition. Finally, we present an integrated system to extract the 3D structure of the scene and objects depicted in an image.

Palavras-chave

3D context; SUN database; basic level scene understanding; geometry recognition; scene attributes; scene recognition

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2013 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2013 Tipo de documento: Article