Pesquisa | Biblioteca Virtual em Saúde

DO-Conv: Depthwise Over-Parameterized Convolutional Layer.

Cao, Jinming; Li, Yangyan; Sun, Mingchao; Chen, Ying; Lischinski, Dani; Cohen-Or, Daniel; Chen, Baoquan; Tu, Changhe.

IEEE Trans Image Process ; 31: 3726-3736, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35594231

RESUMO

Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel. The composition of the two convolutions constitutes an over-parameterization, since it adds learnable parameters, while the resulting linear operation can be expressed by a single convolution layer. We refer to this depthwise over-parameterized convolutional layer as DO-Conv, which is a novel way of over-parameterization. We show with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation. Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, reducing the computation to be exactly equivalent to that of a convolutional layer without over-parameterization. As DO-Conv introduces performance gains without incurring any computational complexity increase for inference, we advocate it as an alternative to the conventional convolutional layer. We open sourced an implementation of DO-Conv in Tensorflow, PyTorch and GluonCV at https://github.com/yangyanli/DO-Conv.

Palettailor: Discriminable Colorization for Categorical Data.

Lu, Kecheng; Feng, Mi; Chen, Xin; Sedlmair, Michael; Deussen, Oliver; Lischinski, Dani; Cheng, Zhanglin; Wang, Yunhai.

IEEE Trans Vis Comput Graph ; 27(2): 475-484, 2021 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-33048720

RESUMO

We present an integrated approach for creating and assigning color palettes to different visualizations such as multi-class scatterplots, line, and bar charts. While other methods separate the creation of colors from their assignment, our approach takes data characteristics into account to produce color palettes, which are then assigned in a way that fosters better visual discrimination of classes. To do so, we use a customized optimization based on simulated annealing to maximize the combination of three carefully designed color scoring functions: point distinctness, name difference, and color discrimination. We compare our approach to state-of-the-art palettes with a controlled user study for scatterplots and line charts, furthermore we performed a case study. Our results show that Palettailor, as a fully-automated approach, generates color palettes with a higher discrimination quality than existing approaches. The efficiency of our optimization allows us also to incorporate user modifications into the color selection process.

Liver tumors segmentation from CTA images using voxels classification and affinity constraint propagation.

Freiman, Moti; Cooper, Ofir; Lischinski, Dani; Joskowicz, Leo.

Int J Comput Assist Radiol Surg ; 6(2): 247-55, 2011 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-20574799

RESUMO

OBJECTIVE: We present a method and a validation study for the nearly automatic segmentation of liver tumors in CTA scans. MATERIALS AND METHODS: Our method inputs a liver CTA scan and a small number of user-defined seeds. It first classifies the liver voxels into tumor and healthy tissue classes with an SVM classification engine from which a new set of high- quality seeds is generated. Next, an energy function describing the propagation of these seeds is defined over the 3D image. The functional consists of a set of linear equations that are optimized with the conjugate gradients method. The result is a continuous segmentation map that is thresholded to obtain a binary segmentation. RESULTS: A retrospective study on a validated clinical dataset consisting of 20 tumors from nine patients' CTA scans from the MICCAI'08 3D Liver Tumors Segmentation Challenge Workshop yielded an average aggregate score of 67, an average symmetric surface distance of 1.76 mm (SD = 0.61 mm) which is better than the 2.0 mm of other methods on the same database, and a comparable volumetric overlap error (33.8 vs. 32.6%). The advantage of our method is that it requires less user interaction compared to other methods. CONCLUSION: Our results indicate that our method is accurate, efficient, and robust to wide variety of tumor types and is comparable or superior to other semi-automatic segmentation methods, with much less user interaction.

Assuntos

Imageamento Tridimensional , Neoplasias Hepáticas/diagnóstico por imagem , Reconhecimento Automatizado de Padrão/métodos , Interpretação de Imagem Radiográfica Assistida por Computador/métodos , Algoritmos , Humanos , Estudos Retrospectivos

Spectral matting.

Levin, Anat; Rav-Acha, Alex; Lischinski, Dani.

IEEE Trans Pattern Anal Mach Intell ; 30(10): 1699-712, 2008 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-18703825

RESUMO

We present spectral matting: a new approach to natural image matting that automatically computes a basis set of fuzzy matting components from the smallest eigenvectors of a suitably defined Laplacian matrix. Thus, our approach extends spectral segmentation techniques, whose goal is to extract hard segments, to the extraction of soft matting components. These components may then be used as building blocks to easily construct semantically meaningful foreground mattes, either in an unsupervised fashion, or based on a small amount of user input.

Assuntos

Algoritmos , Inteligência Artificial , Aumento da Imagem/métodos , Interpretação de Imagem Assistida por Computador/métodos , Imageamento Tridimensional/métodos , Reconhecimento Automatizado de Padrão/métodos , Processamento de Sinais Assistido por Computador , Técnica de Subtração , Reprodutibilidade dos Testes , Sensibilidade e Especificidade

A closed-form solution to natural image matting.

Levin, Anat; Lischinski, Dani; Weiss, Yair.

IEEE Trans Pattern Anal Mach Intell ; 30(2): 228-42, 2008 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-18084055

RESUMO

Interactive digital matting, the process of extracting a foreground object from an image based on limited user input, is an important task in image and video editing. From a computer vision perspective, this task is extremely challenging because it is massively ill-posed -- at each pixel we must estimate the foreground and the background colors, as well as the foreground opacity ("alpha matte") from a single color measurement. Current approaches either restrict the estimation to a small part of the image, estimating foreground and background colors based on nearby pixels where they are known, or perform iterative nonlinear estimation by alternating foreground and background color estimation with alpha estimation. In this paper we present a closed-form solution to natural image matting. We derive a cost function from local smoothness assumptions on foreground and background colors, and show that in the resulting expression it is possible to analytically eliminate the foreground and background colors to obtain a quadratic cost function in alpha. This allows us to find the globally optimal alpha matte by solving a sparse linear system of equations. Furthermore, the closed-form formula allows us to predict the properties of the solution by analyzing the eigenvectors of a sparse matrix, closely related to matrices used in spectral image segmentation algorithms. We show that high quality mattes for natural images may be obtained from a small amount of user input.

Dynamosaicing: mosaicing of dynamic scenes.

Rav-Acha, Alex; Pritch, Yael; Lischinski, Dani; Peleg, Shmuel.

IEEE Trans Pattern Anal Mach Intell ; 29(10): 1789-801, 2007 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-17699923

RESUMO

This paper explores the manipulation of time in video editing, enabling to control the chronological time of events. These time manipulations include slowing down (or postponing) some dynamic events while speeding up (or advancing) others. When a video camera scans a scene, aligning all the events to a single time interval will result in a panoramic movie. Time manipulations are obtained by first constructing an aligned space-time volume from the input video, and then sweeping a continuous 2D slice (time front) through that volume, generating a new sequence of images. For dynamic scenes, aligning the input video frames poses an important challenge. We propose to align dynamic scenes using a new notion of "dynamics constancy", which is more appropriate for this task than the traditional assumption of "brightness constancy". Another challenge is to avoid visual seams inside moving objects and other visual artifacts resulting from sweeping the space-time volumes with time fronts of arbitrary geometry. To avoid such artifacts, we formulate the problem of finding optimal time front geometry as one of finding a minimal cut in a 4D graph, and solve it using max-flow methods.

Assuntos

Algoritmos , Inteligência Artificial , Aumento da Imagem/métodos , Interpretação de Imagem Assistida por Computador/métodos , Reconhecimento Automatizado de Padrão/métodos , Fotografação/métodos , Técnica de Subtração , Gravação em Vídeo/métodos , Armazenamento e Recuperação da Informação/métodos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA