Search | VHL Regional Portal

1.

Tosi, Fabio; Aleotti, Filippo; Ramirez, Pierluigi Zama; Poggi, Matteo; Salti, Samuele; Mattoccia, Stefano; Stefano, Luigi Di.

IEEE Trans Pattern Anal Mach Intell ; PP, 2024 Jun 07.

Article in English | MEDLINE | ID: mdl-38848234

ABSTRACT

We propose a framework that combines traditional, hand-crafted algorithms and recent advances in deep learning to obtain high-quality, high-resolution disparity maps from stereo images. By casting the refinement process as a continuous feature sampling strategy, our neural disparity refinement network can estimate an enhanced disparity map at any output resolution. Our solution can process any disparity map produced by classical stereo algorithms, as well as those predicted by modern stereo networks or even by different depth-from-images approaches, such as the COLMAP structure-from-motion pipeline. Nonetheless, when deployed in the former configuration, our framework performs at its best in terms of zero-shot generalization from synthetic to real images. Moreover, its continuous formulation makes it easy to handle the unbalanced stereo setup that is widespread in mobile phones.
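As a rough illustration of what sampling at continuous coordinates makes possible, the sketch below resamples a small disparity grid to an arbitrary output size with plain bilinear interpolation. It is not the authors' network: the function names `bilinear_sample` and `resample` are hypothetical, the grid is a nested Python list, and disparity values are left unscaled.

```python
def bilinear_sample(grid, x, y):
    """Sample a 2D list `grid` at continuous (x, y), clamping to the border."""
    h, w = len(grid), len(grid[0])
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    ax, ay = x - x0, y - y0
    # Interpolate along x on the two neighbouring rows, then along y.
    top = (1 - ax) * grid[y0][x0] + ax * grid[y0][x1]
    bottom = (1 - ax) * grid[y1][x0] + ax * grid[y1][x1]
    return (1 - ay) * top + ay * bottom


def resample(grid, out_h, out_w):
    """Query the grid on an out_h x out_w lattice of continuous coordinates."""
    h, w = len(grid), len(grid[0])
    out = []
    for i in range(out_h):
        # Map each output row/column to a continuous input coordinate
        # so that the corners of input and output coincide.
        y = i * (h - 1) / (out_h - 1) if out_h > 1 else 0.0
        row = []
        for j in range(out_w):
            x = j * (w - 1) / (out_w - 1) if out_w > 1 else 0.0
            row.append(bilinear_sample(grid, x, y))
        out.append(row)
    return out


coarse = [[0.0, 2.0],
          [4.0, 6.0]]
fine = resample(coarse, 3, 3)
```

Because the output lattice is only a set of query points, the same function serves any target resolution; a learned refinement model would replace the fixed interpolation weights with a prediction made from features sampled at those points.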

2.

Self-supervised depth super-resolution with contrastive multiview pre-training.

Qiao, Xin; Ge, Chenyang; Zhao, Chaoqiang; Tosi, Fabio; Poggi, Matteo; Mattoccia, Stefano.

Neural Netw ; 168: 223-236, 2023 Nov.

Article in English | MEDLINE | ID: mdl-37769459

ABSTRACT

Many low-level vision tasks, including guided depth super-resolution (GDSR), struggle with the issue of insufficient paired training data. Self-supervised learning is a promising solution, but it remains challenging to upsample depth maps without the explicit supervision of high-resolution target images. To alleviate this problem, we propose a self-supervised depth super-resolution method with contrastive multiview pre-training. Unlike existing contrastive learning methods for classification or segmentation tasks, our strategy can be applied to regression tasks even when trained on a small-scale dataset and can reduce information redundancy by extracting unique features from the guide. Furthermore, we propose a novel mutual modulation scheme that can effectively compute the local spatial correlation between cross-modal features. Exhaustive experiments demonstrate that our method attains superior performance with respect to state-of-the-art GDSR methods and exhibits good generalization to other modalities.

Subject(s)

Neural Networks, Computer

3.

Depth Restoration in Under-Display Time-of-Flight Imaging.

Qiao, Xin; Ge, Chenyang; Deng, Pengchao; Wei, Hao; Poggi, Matteo; Mattoccia, Stefano.

IEEE Trans Pattern Anal Mach Intell ; 45(5): 5668-5683, 2023 May.

Article in English | MEDLINE | ID: mdl-36155477

ABSTRACT

Under-display imaging has recently received considerable attention in both academia and industry. As a variation of this technique, under-display ToF (UD-ToF) cameras enable depth sensing for full-screen devices. However, this setup also causes image blurring and reduces both signal-to-noise ratio and ranging accuracy. To address these issues, we propose a cascaded deep network to improve the quality of UD-ToF depth maps. The network comprises two subnets: the first uses a complex-valued network in the raw domain to jointly denoise, deblur, and enhance the raw measurements, while the second refines depth maps in the depth domain based on the proposed multi-scale depth enhancement block (MSDEB). To enable training, we establish a data acquisition device and construct a real UD-ToF dataset by collecting real paired ToF raw data. We also build a large-scale synthetic UD-ToF dataset through noise analysis. Quantitative and qualitative evaluation results on public datasets and ours demonstrate that the presented network outperforms state-of-the-art algorithms and can further promote full-screen devices in practical applications.

4.

On the Confidence of Stereo Matching in a Deep-Learning Era: A Quantitative Evaluation.

Poggi, Matteo; Kim, Seungryong; Tosi, Fabio; Kim, Sunok; Aleotti, Filippo; Min, Dongbo; Sohn, Kwanghoon; Mattoccia, Stefano.

IEEE Trans Pattern Anal Mach Intell ; 44(9): 5293-5313, 2022 Sep.

Article in English | MEDLINE | ID: mdl-33798066

ABSTRACT

Stereo matching is one of the most popular techniques to estimate dense depth maps by finding the disparity between matching pixels on two synchronized and rectified images. Alongside the development of more accurate algorithms, the research community has focused on finding good strategies to estimate the reliability, i.e., the confidence, of estimated disparity maps. This information proves to be a powerful cue to naively find wrong matches as well as to improve the overall effectiveness of a variety of stereo algorithms according to different strategies. In this paper, we review more than ten years of developments in the field of confidence estimation for stereo matching. We extensively discuss and evaluate existing confidence measures and their variants, from hand-crafted ones to the most recent, state-of-the-art learning-based methods. We study the different behaviors of each measure when applied to a pool of different stereo algorithms and, for the first time in the literature, when paired with a state-of-the-art deep stereo network. Our experiments, carried out on five different standard datasets, provide a comprehensive overview of the field, highlighting in particular both strengths and limitations of learning-based strategies.
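To make the idea of a hand-crafted confidence measure concrete, here is a minimal sketch of a naive peak-ratio score computed from one pixel's matching-cost curve. The abstract does not list specific measures, so this choice is an assumption; the function name `wta_with_peak_ratio` and the `eps` guard are hypothetical, and the second-best cost is taken as the second-smallest value rather than the second local minimum.

```python
def wta_with_peak_ratio(costs, eps=1e-6):
    """Return (winner-take-all disparity, naive peak-ratio confidence).

    `costs[d]` is the matching cost of disparity hypothesis d (lower is better).
    The confidence is the second-smallest cost divided by the smallest one:
    values near 1 mean an ambiguous match, larger values a distinct minimum.
    """
    if len(costs) < 2:
        raise ValueError("need at least two disparity hypotheses")
    order = sorted(range(len(costs)), key=lambda d: costs[d])
    best, second = order[0], order[1]
    confidence = (costs[second] + eps) / (costs[best] + eps)
    return best, confidence


# A distinct minimum at d=1 versus two nearly tied hypotheses.
d_clear, c_clear = wta_with_peak_ratio([5.0, 1.0, 4.0, 6.0])
d_ambig, c_ambig = wta_with_peak_ratio([2.0, 2.1, 5.0])
```

Thresholding such a score is one simple way to discard likely wrong matches; learning-based measures replace the fixed ratio with a model trained to predict match correctness.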

5.

On the Synergies Between Machine Learning and Binocular Stereo for Depth Estimation From Images: A Survey.

Poggi, Matteo; Tosi, Fabio; Batsos, Konstantinos; Mordohai, Philippos; Mattoccia, Stefano.

IEEE Trans Pattern Anal Mach Intell ; 44(9): 5314-5334, 2022 Sep.

Article in English | MEDLINE | ID: mdl-33819150

ABSTRACT

Stereo matching is one of the longest-standing problems in computer vision with close to 40 years of studies and research. Throughout the years the paradigm has shifted from local, pixel-level decision to various forms of discrete and continuous optimization to data-driven, learning-based methods. Recently, the rise of machine learning and the rapid proliferation of deep learning enhanced stereo matching with new exciting trends and applications unthinkable until a few years ago. Interestingly, the relationship between these two worlds is two-way. While machine, and especially deep, learning advanced the state-of-the-art in stereo matching, stereo itself enabled new ground-breaking methodologies such as self-supervised monocular depth estimation based on deep networks. In this paper, we review recent research in the field of learning-based depth estimation from single and binocular images highlighting the synergies, the successes achieved so far and the open challenges the community is going to face in the immediate future.

Subject(s)

Algorithms, Machine Learning

6.

Computer Vision for 3D Perception and Applications.

Poggi, Matteo; Moeslund, Thomas B.

Sensors (Basel) ; 21(12), 2021 Jun 08.

Article in English | MEDLINE | ID: mdl-34201036

ABSTRACT

Effective 3D perception of an observed scene greatly enriches the knowledge about the surrounding environment and is crucial to effectively develop high-level applications for various purposes [...].

Subject(s)

Computers, Perception

7.

Continual Adaptation for Deep Stereo.

Poggi, Matteo; Tonioni, Alessio; Tosi, Fabio; Mattoccia, Stefano; Di Stefano, Luigi.

IEEE Trans Pattern Anal Mach Intell ; PP, 2021 Apr 28.

Article in English | MEDLINE | ID: mdl-33909558

ABSTRACT

Depth estimation from stereo images is carried out with unmatched results by convolutional neural networks trained end-to-end to regress dense disparities. As for most tasks, this is possible if large amounts of labelled samples are available for training, possibly covering the whole data distribution encountered at deployment time. Since this assumption is systematically unmet in real applications, the capacity to adapt to any unseen setting becomes of paramount importance. To this end, we propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments. We design a lightweight and modular architecture, Modularly ADaptive Network (MADNet), and formulate Modular ADaptation algorithms (MAD, MAD++) which permit efficient optimization of independent sub-portions of the entire network. In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms. With both sources, no data other than the input images gathered at deployment time are needed. Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression.
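The self-supervision signal obtained from right-to-left warping can be sketched on a single rectified scanline: each left pixel x with disparity d is reconstructed from the right image at x - d, and the photometric error between the left row and the reconstruction acts as the loss. This is an illustrative assumption rather than the MADNet implementation; the function names and the plain L1 error are hypothetical choices.

```python
def warp_right_to_left(right_row, disparity_row):
    """Reconstruct the left scanline by sampling the right one at x - d."""
    w = len(right_row)
    warped = []
    for x, d in enumerate(disparity_row):
        xs = min(max(x - d, 0.0), w - 1.0)   # clamp to the image border
        x0 = int(xs)
        x1 = min(x0 + 1, w - 1)
        a = xs - x0
        # Linear interpolation handles sub-pixel disparities.
        warped.append((1 - a) * right_row[x0] + a * right_row[x1])
    return warped


def photometric_l1(left_row, right_row, disparity_row):
    """Mean absolute difference between the left row and the warped right row."""
    warped = warp_right_to_left(right_row, disparity_row)
    return sum(abs(l - r) for l, r in zip(left_row, warped)) / len(left_row)


left = [10.0, 20.0, 30.0, 40.0, 50.0]
right = [20.0, 30.0, 40.0, 50.0, 60.0]   # the same scene shifted by one pixel
loss_correct = photometric_l1(left, right, [1.0] * 5)
loss_wrong = photometric_l1(left, right, [0.0] * 5)
```

The correct disparity yields the lower error, so minimizing this quantity pushes a network toward better disparities without ground-truth depth. The residual error at the left border comes from pixels that have no counterpart in the right image.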

8.

Real-Time Single Image Depth Perception in the Wild with Handheld Devices.

Aleotti, Filippo; Zaccaroni, Giulio; Bartolomei, Luca; Poggi, Matteo; Tosi, Fabio; Mattoccia, Stefano.

Sensors (Basel) ; 21(1), 2020 Dec 22.

Article in English | MEDLINE | ID: mdl-33375010

ABSTRACT

Depth perception is paramount for tackling real-world problems, ranging from autonomous driving to consumer applications. For the latter, depth estimation from a single image would represent the most versatile solution since a standard camera is available on almost any handheld device. Nonetheless, two main issues limit the practical deployment of monocular depth estimation methods on such devices: (i) the low reliability when deployed in the wild and (ii) the resources needed to achieve real-time performance, often not compatible with low-power embedded systems. Therefore, in this paper, we investigate these issues in depth, showing how both can be addressed by adopting appropriate network design and training strategies. Moreover, we outline how to map the resulting networks on handheld devices to achieve real-time performance. Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications. To further support this evidence, we report experimental results concerning real-time, depth-aware augmented reality and image blurring with smartphones in the wild.

9.

Unsupervised Domain Adaptation for Depth Prediction from Images.

Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Stefano, Luigi Di.

IEEE Trans Pattern Anal Mach Intell ; 42(10): 2396-2409, 2020 Oct.

Article in English | MEDLINE | ID: mdl-31514127

ABSTRACT

State-of-the-art approaches to infer dense depth measurements from images rely on CNNs trained end-to-end on a vast amount of data. However, these approaches suffer a drastic drop in accuracy when dealing with environments much different in appearance and/or context from those observed at training time. This domain shift issue is usually addressed by fine-tuning on smaller sets of images from the target domain annotated with depth labels. Unfortunately, relying on such supervised labeling is seldom feasible in practical settings. Therefore, we propose an unsupervised domain adaptation technique which does not require ground-truth labels. Our method relies only on image pairs and leverages classical stereo algorithms to produce disparity measurements, alongside confidence estimators to assess their reliability. We propose to fine-tune both depth-from-stereo and depth-from-mono architectures with a novel confidence-guided loss function that handles the measured disparities as noisy labels weighted according to the estimated confidence. Extensive experimental results based on standard datasets and evaluation protocols show that our technique can effectively address the domain shift issue with both stereo and monocular depth prediction architectures and outperforms other state-of-the-art unsupervised loss functions that may alternatively be deployed to pursue domain adaptation.
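One plausible reading of a loss that treats measured disparities as noisy labels weighted by confidence is a confidence-weighted L1 term, sketched below. The exact formulation in the paper may differ (for example, it may add further terms); the function name `confidence_weighted_l1`, the optional `threshold`, and the normalization by the sum of weights are assumptions made for illustration.

```python
def confidence_weighted_l1(pred, noisy_labels, confidence, threshold=0.0):
    """Confidence-weighted mean absolute error over per-pixel disparities.

    pred         : disparities predicted by the network
    noisy_labels : disparities measured by a classical stereo algorithm
    confidence   : per-pixel reliability in [0, 1] of each noisy label
    threshold    : labels with confidence below this value are ignored
    """
    weighted_error = 0.0
    total_weight = 0.0
    for p, t, c in zip(pred, noisy_labels, confidence):
        if c < threshold:
            continue
        weighted_error += c * abs(p - t)
        total_weight += c
    # With no reliable label at all, the term contributes nothing.
    return weighted_error / total_weight if total_weight > 0 else 0.0


pred = [10.0, 20.0, 30.0]
labels = [10.0, 25.0, 90.0]        # the last label is a gross outlier
conf = [1.0, 0.5, 0.0]             # and the estimator marks it unreliable
loss = confidence_weighted_l1(pred, labels, conf)
```

The outlier label carries zero weight and therefore does not influence the loss, which is the practical benefit of weighting noisy supervision by estimated confidence.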
