Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
IEEE Trans Pattern Anal Mach Intell ; 31(5): 919-30, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-19299864

RESUMO

Particle filtering is frequently used for visual tracking problems since it provides a general framework for estimating and propagating probability density functions for nonlinear and non-Gaussian dynamic systems. However, this algorithm is based on a Monte Carlo approach and the cost of sampling and measurement is a problematic issue, especially for high-dimensional problems. We describe an alternative to the classical particle filter in which the underlying density function has an analytic representation for better approximation and effective propagation. The techniques of density interpolation and density approximation are introduced to represent the likelihood and the posterior densities with Gaussian mixtures, where all relevant parameters are automatically determined. The proposed analytic approach is shown to perform more efficiently in sampling in high-dimensional space. We apply the algorithm to real-time tracking problems and demonstrate its performance on real video sequences as well as synthetic examples.


Assuntos
Algoritmos , Inteligência Artificial , Interpretação de Imagem Assistida por Computador/métodos , Reconhecimento Automatizado de Padrão/métodos , Processamento de Sinais Assistido por Computador , Técnica de Subtração , Teorema de Bayes , Aumento da Imagem/métodos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
2.
IEEE Trans Pattern Anal Mach Intell ; 30(7): 1186-97, 2008 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-18550902

RESUMO

Visual features are commonly modeled with probability density functions in computer vision problems, but current methods such as a mixture of Gaussians and kernel density estimation suffer from either the lack of flexibility, by fixing or limiting the number of Gaussian components in the mixture, or large memory requirement, by maintaining a non-parametric representation of the density. These problems are aggravated in real-time computer vision applications since density functions are required to be updated as new data becomes available. We present a novel kernel density approximation technique based on the mean-shift mode finding algorithm, and describe an efficient method to sequentially propagate the density modes over time. While the proposed density representation is memory efficient, which is typical for mixture densities, it inherits the flexibility of non-parametric methods by allowing the number of components to be variable. The accuracy and compactness of the sequential kernel density approximation technique is illustrated by both simulations and experiments. Sequential kernel density approximation is applied to on-line target appearance modeling for visual tracking, and its performance is demonstrated on a variety of videos.


Assuntos
Algoritmos , Inteligência Artificial , Interpretação de Imagem Assistida por Computador/métodos , Reconhecimento Automatizado de Padrão/métodos , Técnica de Subtração , Simulação por Computador , Sistemas Computacionais , Aumento da Imagem/métodos , Modelos Estatísticos , Movimento (Física) , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Distribuições Estatísticas
3.
IEEE Trans Vis Comput Graph ; 22(10): 2300-2314, 2016 10.
Artigo em Inglês | MEDLINE | ID: mdl-26685252

RESUMO

4D film is an immersive entertainment system that presents various physical effects with a film in order to enhance viewers' experiences. Despite the recent emergence of 4D theaters, production of 4D effects relies on manual authoring. In this paper, we present algorithms that synthesize three classes of motion effects from the audiovisual content of a film. The first class of motion effects is those responding to fast camera motion to enhance the immersiveness of point-of-view shots, delivering fast and dynamic vestibular feedback. The second class moves viewers as closely as possible to the trajectory of slowly moving camera. Such motion provides an illusional effect of observing the scene from a distance while moving slowly within the scene. For these two classes, our algorithms compute the relative camera motion and then map it to a motion command to the 4D chair using appropriate motion mapping algorithms. The last class is for special effects, such as explosions, and our algorithm uses sound for the synthesis of impulses and vibrations. We assessed the subjective quality of our algorithms by user experiments, and results indicated that our algorithms can provide compelling motion effects.

4.
IEEE Trans Pattern Anal Mach Intell ; 38(7): 1411-24, 2016 07.
Artigo em Inglês | MEDLINE | ID: mdl-26452250

RESUMO

We propose a novel algorithm to cluster and annotate a set of input images jointly, where the images are clustered into several discriminative groups and each group is identified with representative labels automatically. For these purposes, each input image is first represented by a distribution of candidate labels based on its similarity to images in a labeled reference image database. A set of these label-based representations are then refined collectively through a non-negative matrix factorization with sparsity and orthogonality constraints; the refined representations are employed to cluster and annotate the input images jointly. The proposed approach demonstrates performance improvements in image clustering over existing techniques, and illustrates competitive image labeling accuracy in both quantitative and qualitative evaluation. In addition, we extend our joint clustering and labeling framework to solving the weakly-supervised image classification problem and obtain promising results.

5.
IEEE Trans Pattern Anal Mach Intell ; 34(5): 1017-23, 2012 May.
Artigo em Inglês | MEDLINE | ID: mdl-22156099

RESUMO

Background modeling and subtraction is a natural technique for object detection in videos captured by a static camera, and also a critical preprocessing step in various high-level computer vision applications. However, there have not been many studies concerning useful features and binary segmentation algorithms for this problem. We propose a pixelwise background modeling and subtraction technique using multiple features, where generative and discriminative techniques are combined for classification. In our algorithm, color, gradient, and Haar-like features are integrated to handle spatio-temporal variations for each pixel. A pixelwise generative background model is obtained for each feature efficiently and effectively by Kernel Density Approximation (KDA). Background subtraction is performed in a discriminative manner using a Support Vector Machine (SVM) over background likelihood vectors for a set of features. The proposed algorithm is robust to shadow, illumination changes, spatial variations of background. We compare the performance of the algorithm with other density-based methods using several different feature combinations and modeling techniques, both quantitatively and qualitatively.

6.
AMIA Annu Symp Proc ; 2012: 485-94, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23304320

RESUMO

Children with unilateral cleft lip and palate (UCLP) suffer from negative public perceptions. A better treatment strategy should be established to help them live an ordinary life with improved perceptions. To do that, it is important to understand the relationship between physical facial features and perceptual judgment. In this paper, we present FaceReview, a new visualization system to support interactive exploration of a heterogeneous multidimensional dataset with facial measurement data and subjective judgment data. To seamlessly link the two data, we design FaceReview based on information visualization techniques that are proven to be useful and therefore commonly used, such as brushing and linking, small multiples, and dynamic query. Our design decisions successfully support exploratory tasks of our collaborators. We present a case study to show the efficacy of FaceReview.


Assuntos
Recursos Audiovisuais , Fenda Labial , Fissura Palatina , Face/anatomia & histologia , Criança , Estética , Humanos , Variações Dependentes do Observador , Fotografação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA