Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
IEEE Trans Image Process ; 33: 1162-1174, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38300776

RESUMO

Hashing and quantization have greatly succeeded by benefiting from deep learning for large-scale image retrieval. Recently, deep product quantization methods have attracted wide attention. However, representation capability of codewords needs to be further improved. Moreover, since the number of codewords in the codebook depends on experience, representation capability of codewords is usually imbalanced, which leads to redundancy or insufficiency of codewords and reduces retrieval performance. Therefore, in this paper, we propose a novel deep product quantization method, named Entropy Optimized deep Weighted Product Quantization (EOWPQ), which not only encodes samples into the weighted codewords in a new flexible manner but also balances the codeword assignment, improving while balancing representation capability of codewords. Specifically, we encode samples using the linear weighted sum of codewords instead of a single codeword as traditionally. Meanwhile, we establish the linear relationship between the weighted codewords and semantic labels, which effectively maintains semantic information of codewords. Moreover, in order to balance the codeword assignment, that is, avoiding some codewords representing most samples or some codewords representing very few samples, we maximize the entropy of the coding probability distribution and obtain the optimal coding probability distribution of samples by utilizing optimal transport theory, which achieves the optimal assignment of codewords and balances representation capability of codewords. The experimental results on three benchmark datasets show that EOWPQ can achieve better retrieval performance and also show the improvement of representation capability of codewords and the balance of codeword assignment.

2.
IEEE Trans Neural Netw Learn Syst ; 34(11): 8195-8209, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-34982704

RESUMO

In this article, we present a new pansharpening method, a zero-reference generative adversarial network (ZeRGAN), which fuses low spatial resolution multispectral (LR MS) and high spatial resolution panchromatic (PAN) images. In the proposed method, zero-reference indicates that it does not require paired reduced-scale images or unpaired full-scale images for training. To obtain accurate fusion results, we establish an adversarial game between a set of multiscale generators and their corresponding discriminators. Through multiscale generators, the fused high spatial resolution MS (HR MS) images are progressively produced from LR MS and PAN images, while the discriminators aim to distinguish the differences of spatial information between the HR MS images and the PAN images. In other words, the HR MS images are generated from LR MS and PAN images after the optimization of ZeRGAN. Furthermore, we construct a nonreference loss function, including an adversarial loss, spatial and spectral reconstruction losses, a spatial enhancement loss, and an average constancy loss. Through the minimization of the total loss, the spatial details in the HR MS images can be enhanced efficiently. Extensive experiments are implemented on datasets acquired by different satellites. The results demonstrate that the effectiveness of the proposed method compared with the state-of-the-art methods. The source code is publicly available at https://github.com/RSMagneto/ZeRGAN.

3.
Sensors (Basel) ; 22(8)2022 Apr 14.
Artigo em Inglês | MEDLINE | ID: mdl-35458986

RESUMO

Eye movement has become a new behavioral feature for biometric authentication. In the eye movement-based authentication methods that use temporal features and artificial design features, the required duration of eye movement recordings are too long to be applied. Therefore, this study aims at using eye movement recordings with shorter duration to realize authentication. And we give out a reasonable eye movement recording duration that should be less than 12 s, referring to the changing pattern of the deviation degree between the gaze point and the stimulus point on the screen. In this study, the temporal motion features of the gaze points and the spatial distribution features of the saccade are using to represent the personal identity. Two datasets are constructed for the experiments, including 5 s and 12 s of eye movement recordings. On the datasets constructed in this paper, the open-set authentication results show that the Equal Error Rate of our proposed methods can reach 10.62% when recording duration is 12 s and 12.48% when recording duration is 5 s. The closed-set authentication results show that the Equal Error Rate of our proposed methods can reach 5.25% when recording duration is 12 s and 7.82% when recording duration is 5 s. It demonstrates that the proposed method provides a reference for the eye movements data-based identity authentication.


Assuntos
Identificação Biométrica , Movimentos Oculares , Identificação Biométrica/métodos , Fixação Ocular , Movimento (Física) , Movimento , Movimentos Sacádicos
4.
Biomed Opt Express ; 11(7): 3585-3600, 2020 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-33014553

RESUMO

Typical optical coherence tomographic angiography (OCTA) acquisition areas on commercial devices are 3×3- or 6×6-mm. Compared to 3×3-mm angiograms with proper sampling density, 6×6-mm angiograms have significantly lower scan quality, with reduced signal-to-noise ratio and worse shadow artifacts due to undersampling. Here, we propose a deep-learning-based high-resolution angiogram reconstruction network (HARNet) to generate enhanced 6×6-mm superficial vascular complex (SVC) angiograms. The network was trained on data from 3×3-mm and 6×6-mm angiograms from the same eyes. The reconstructed 6×6-mm angiograms have significantly lower noise intensity, stronger contrast and better vascular connectivity than the original images. The algorithm did not generate false flow signal at the noise level presented by the original angiograms. The image enhancement produced by our algorithm may improve biomarker measurements and qualitative clinical assessment of 6×6-mm OCTA.

5.
Neural Netw ; 126: 132-142, 2020 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-32217354

RESUMO

Cross-modal retrieval has recently attracted much interest along with the rapid development of multimodal data, and effectively utilizing the complementary relationship of different modal data and eliminating the heterogeneous gap as much as possible are the two key challenges. In this paper, we present a novel network model termed cross-modal Dual Subspace learning with Adversarial Network (DSAN). The main contributions are as follows: (1) Dual subspaces (visual subspace and textual subspace) are proposed, which can better mine the underlying structure information of different modalities as well as modality-specific information. (2) An improved quadruplet loss is proposed, which takes into account the relative distance and absolute distance between positive and negative samples, together with the introduction of the idea of hard sample mining. (3) Intra-modal constrained loss is proposed to maximize the distance of the most similar cross-modal negative samples and their corresponding cross-modal positive samples. In particular, feature preserving and modality classification act as two antagonists. DSAN tries to narrow the heterogeneous gap between different modalities, and distinguish the original modality of random samples in dual subspaces. Comprehensive experimental results demonstrate that, DSAN significantly outperforms 9 state-of-the-art methods on four cross-modal datasets.


Assuntos
Teoria dos Jogos , Aprendizado de Máquina , Redes Neurais de Computação , Mídias Sociais , Humanos , Aprendizado de Máquina/tendências , Estimulação Luminosa/métodos , Mídias Sociais/tendências
6.
Int J Neural Syst ; 29(6): 1950002, 2019 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-30880525

RESUMO

Event-related potentials (ERPs) especially P300 are popular effective features for brain-computer interface (BCI) systems based on electroencephalography (EEG). Traditional ERP-based BCI systems may perform poorly for small training samples, i.e. the undersampling problem. In this study, the ERP classification problem was investigated, in particular, the ERP classification in the high-dimensional setting with the number of features larger than the number of samples was studied. A flexible group sparse discriminative analysis algorithm based on Moreau-Yosida regularization was proposed for alleviating the undersampling problem. An optimization problem with the group sparse criterion was presented, and the optimal solution was proposed by using the regularized optimal scoring method. During the alternating iteration procedure, the feature selection and classification were performed simultaneously. Two P300-based BCI datasets were used to evaluate our proposed new method and compare it with existing standard methods. The experimental results indicated that the features extracted via our proposed method are efficient and provide an overall better P300 classification accuracy compared with several state-of-the-art methods.


Assuntos
Interfaces Cérebro-Computador , Análise Discriminante , Eletroencefalografia/métodos , Potenciais Evocados P300/fisiologia , Algoritmos , Humanos
7.
Neural Comput ; 29(7): 1902-1918, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-28562218

RESUMO

Graph-based clustering methods perform clustering on a fixed input data graph. Thus such clustering results are sensitive to the particular graph construction. If this initial construction is of low quality, the resulting clustering may also be of low quality. We address this drawback by allowing the data graph itself to be adaptively adjusted in the clustering procedure. In particular, our proposed weight adaptive Laplacian (WAL) method learns a new data similarity matrix that can adaptively adjust the initial graph according to the similarity weight in the input data graph. We develop three versions of these methods based on the L2-norm, fuzzy entropy regularizer, and another exponential-based weight strategy, that yield three new graph-based clustering objectives. We derive optimization algorithms to solve these objectives. Experimental results on synthetic data sets and real-world benchmark data sets exhibit the effectiveness of these new graph-based clustering methods.

8.
Front Hum Neurosci ; 11: 626, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-29375339

RESUMO

Neural disruptions during emotion regulation are common of generalized anxiety disorder (GAD). Identifying distinct functional and effective connectivity patterns in GAD may provide biomarkers for their diagnoses. This study aims to investigate the differences of features of brain network connectivity between GAD patients and healthy controls (HC), and to assess whether those differences can serve as biomarkers to distinguish GAD from controls. Independent component analysis (ICA) with hierarchical partner matching (HPM-ICA) was conducted on resting-state functional magnetic resonance imaging data collected from 20 GAD patients with medicine-free and 20 matched HC, identifying nine highly reproducible and significantly different functional brain connectivity patterns across diagnostic groups. We then utilized Granger causality (GC) to study the effective connectivity between the regions that identified by HPM-ICA. The linear discriminant analysis was finally used to distinguish GAD from controls with these measures of neural connectivity. The GAD patients showed stronger functional connectivity in amygdala, insula, putamen, thalamus, and posterior cingulate cortex, but weaker in frontal and temporal cortex compared with controls. Besides, the effective connectivity in GAD was decreased from the cortex to amygdala and basal ganglia. Applying the ICA and GC features to the classifier led to a classification accuracy of 87.5%, with a sensitivity of 90.0% and a specificity of 85.0%. These findings suggest that the presence of emotion dysregulation circuits may contribute to the pathophysiology of GAD, and these aberrant brain features may serve as robust brain biomarkers for GAD.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA