Pesquisa | Portal Regional da BVS

1.

Distributional Drift Adaptation With Temporal Conditional Variational Autoencoder for Multivariate Time Series Forecasting.

He, Hui; Zhang, Qi; Yi, Kun; Shi, Kaize; Niu, Zhendong; Cao, Longbing.

IEEE Trans Neural Netw Learn Syst ; PP2024 Apr 29.

Artigo em Inglês | MEDLINE | ID: mdl-38683706

RESUMO

Due to the nonstationary nature, the distribution of real-world multivariate time series (MTS) changes over time, which is known as distribution drift. Most existing MTS forecasting models greatly suffer from distribution drift and degrade the forecasting performance over time. Existing methods address distribution drift via adapting to the latest arrived data or self-correcting per the meta knowledge derived from future data. Despite their great success in MTS forecasting, these methods hardly capture the intrinsic distribution changes, especially from a distributional perspective. Accordingly, we propose a novel framework temporal conditional variational autoencoder (TCVAE) to model the dynamic distributional dependencies over time between historical observations and future data in MTSs and infer the dependencies as a temporal conditional distribution to leverage latent variables. Specifically, a novel temporal Hawkes attention (THA) mechanism represents temporal factors that subsequently fed into feedforward networks to estimate the prior Gaussian distribution of latent variables. The representation of temporal factors further dynamically adjusts the structures of Transformer-based encoder and decoder to distribution changes by leveraging a gated attention mechanism (GAM). Moreover, we introduce conditional continuous normalization flow (CCNF) to transform the prior Gaussian to a complex and form-free distribution to facilitate flexible inference of the temporal conditional distribution. Extensive experiments conducted on six real-world MTS datasets demonstrate the TCVAE's superior robustness and effectiveness over the state-of-the-art MTS forecasting baselines. We further illustrate the TCVAE applicability through multifaceted case studies and visualization in real-world scenarios.

2.

eVAE: Evolutionary Variational Autoencoder.

Wu, Zhangkai; Cao, Longbing; Qi, Lei.

IEEE Trans Neural Netw Learn Syst ; PP2024 Mar 28.

Artigo em Inglês | MEDLINE | ID: mdl-38546992

RESUMO

Variational autoencoders (VAEs) are challenged by the imbalance between representation inference and task fitting caused by surrogate loss. To address this issue, existing methods adjust their balance by directly tuning their coefficients. However, these methods suffer from a tradeoff uncertainty, i.e., nondynamic regulation over iterations and inflexible hyperparameters for learning tasks. Accordingly, we make the first attempt to introduce an evolutionary VAE (eVAE), building on the variational information bottleneck (VIB) theory and integrative evolutionary neural learning. eVAE integrates a variational genetic algorithm (VGA) into VAE with variational evolutionary operators, including variational mutation (V-mutation), crossover, and evolution. Its training mechanism synergistically and dynamically addresses and updates the learning tradeoff uncertainty in the evidence lower bound (ELBO) without additional constraints and hyperparameter tuning. Furthermore, eVAE presents an evolutionary paradigm to tune critical factors of VAEs and addresses the premature convergence and random search problem in integrating evolutionary optimization into deep learning. Experiments show that eVAE addresses the KL-vanishing problem for text generation with low reconstruction loss, generates all the disentangled factors with sharp images, and improves image generation quality. eVAE achieves better disentanglement, generation performance, and generation-inference balance than its competitors. Code available at: https://github.com/amasawa/eVAE.

3.

Differentiating behavioral impact with or without vaccination certification under mass vaccination and non-pharmaceutical interventions on mitigating COVID-19.

Cao, Hu; Cao, Longbing.

Sci Rep ; 14(1): 707, 2024 01 06.

Artigo em Inglês | MEDLINE | ID: mdl-38184669

RESUMO

As COVID-19 vaccines became widely available worldwide, many countries implemented vaccination certification, also known as a "green pass", to promote and expedite vaccination on containing virus spread from the latter half of 2021. This policy allowed those vaccinated to have more freedom in public activities compared to more constraints on the unvaccinated in addition to existing non-pharmaceutical interventions (NPIs). Accordingly, the vaccination certification also induced heterogeneous behaviors of unvaccinated and vaccinated groups. This makes it essential yet challenging to model the behavioral impact of vaccination certification on the two groups and the transmission dynamics of COVID-19 within and between the groups. Very limited quantitative work is available for addressing these purposes. Here we propose an extended epidemiological model SEIQRD[Formula: see text] to effectively distinguish the behavioral impact of vaccination certification on unvaccinated and vaccinated groups through incorporating two contrastive transmission chains. SEIQRD[Formula: see text] also quantifies the impact of the green pass policy. With the resurgence of COVID-19 in Greece, Austria, and Israel in 2021, our simulation results indicate that their implementation of vaccination certification brought about more than a 14-fold decrease in the total number of infections and deaths as compared to a scenario with no such a policy. Additionally, a green pass policy may offer a reasonable practical solution to strike the balance between public health and individual's freedom during the pandemic.

Assuntos

COVID-19 , Vacinação em Massa , Humanos , Vacinas contra COVID-19 , COVID-19/epidemiologia , COVID-19/prevenção & controle , Vacinação , Certificação

4.

Gray Learning From Non-IID Data With Out-of-Distribution Samples.

Zhao, Zhilin; Cao, Longbing; Wang, Chang-Dong.

IEEE Trans Neural Netw Learn Syst ; PP2023 Nov 14.

Artigo em Inglês | MEDLINE | ID: mdl-37962995

RESUMO

The integrity of training data, even when annotated by experts, is far from guaranteed, especially for non-independent and identically distributed (non-IID) datasets comprising both in-and out-of-distribution samples. In an ideal scenario, the majority of samples would be in-distribution, while samples that deviate semantically would be identified as out-of-distribution and excluded during the annotation process. However, experts may erroneously classify these out-of-distribution samples as in-distribution, assigning them labels that are inherently unreliable. This mixture of unreliable labels and varied data types makes the task of learning robust neural networks notably challenging. We observe that both in-and out-of-distribution samples can almost invariably be ruled out from belonging to certain classes, aside from those corresponding to unreliable ground-truth labels. This opens the possibility of utilizing reliable complementary labels that indicate the classes to which a sample does not belong. Guided by this insight, we introduce a novel approach, termed gray learning (GL), which leverages both ground-truth and complementary labels. Crucially, GL adaptively adjusts the loss weights for these two label types based on prediction confidence levels. By grounding our approach in statistical learning theory, we derive bounds for the generalization error, demonstrating that GL achieves tight constraints even in non-IID settings. Extensive experimental evaluations reveal that our method significantly outperforms alternative approaches grounded in robust statistics.

5.

Supervision Adaptation Balancing In-Distribution Generalization and Out-of-Distribution Detection.

Zhao, Zhilin; Cao, Longbing; Lin, Kun-Yu.

IEEE Trans Pattern Anal Mach Intell ; 45(12): 15743-15758, 2023 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-37792646

RESUMO

The discrepancy between in-distribution (ID) and out-of-distribution (OOD) samples can lead to distributional vulnerability in deep neural networks, which can subsequently lead to high-confidence predictions for OOD samples. This is mainly due to the absence of OOD samples during training, which fails to constrain the network properly. To tackle this issue, several state-of-the-art methods include adding extra OOD samples to training and assign them with manually-defined labels. However, this practice can introduce unreliable labeling, negatively affecting ID classification. The distributional vulnerability presents a critical challenge for non-IID deep learning, which aims for OOD-tolerant ID classification by balancing ID generalization and OOD detection. In this paper, we introduce a novel supervision adaptation approach to generate adaptive supervision information for OOD samples, making them more compatible with ID samples. First, we measure the dependency between ID samples and their labels using mutual information, revealing that the supervision information can be represented in terms of negative probabilities across all classes. Second, we investigate data correlations between ID and OOD samples by solving a series of binary regression problems, with the goal of refining the supervision information for more distinctly separable ID classes. Our extensive experiments on four advanced network architectures, two ID datasets, and eleven diversified OOD datasets demonstrate the efficacy of our supervision adaptation approach in improving both ID classification and OOD detection capabilities.

6.

Copula Variational LSTM for High-Dimensional Cross-Market Multivariate Dependence Modeling.

Xu, Jia; Cao, Longbing.

IEEE Trans Neural Netw Learn Syst ; PP2023 Jul 28.

Artigo em Inglês | MEDLINE | ID: mdl-37506022

RESUMO

We address a challenging problem-modeling high-dimensional, long-range dependencies between nonnormal multivariates, which is important for demanding applications such as cross-market modeling (CMM). With heterogeneous indicators and markets, CMM aims to capture between-market financial couplings and influence over time and within-market interactions between financial variables. We make the first attempt to integrate deep variational sequential learning with copula-based statistical dependence modeling and characterize both temporal dependence degrees and structures between hidden variables representing nonnormal multivariates. Our copula variational learning network weighted partial regular vine copula-based variational long short-term memory (WPVC-VLSTM) integrates variational long short-term memory (LSTM) networks and regular vine copula to model variational sequential dependence degrees and structures. The regular vine copula models nonnormal distributional dependence degrees and structures. VLSTM captures variational long-range dependencies coupling high-dimensional dynamic hidden variables without strong hypotheses and multivariate constraints. WPVC-VLSTM outperforms benchmarks, including linear models, stochastic volatility models, deep neural networks, and variational recurrent networks in terms of both technical significance and portfolio forecasting performance. WPVC-VLSTM shows a step-forward for CMM and deep variational learning.

7.

Lactobacillus gasseri LGV03 isolated from the cervico-vagina of HPV-cleared women modulates epithelial innate immune responses and suppresses the growth of HPV-positive human cervical cancer cells.

Gao, Qiong; Fan, Tao; Luo, Siying; Zheng, Jieting; Zhang, Lin; Cao, Longbing; Zhang, Zikang; Li, Li; Huang, Zhu; Zhang, Huifen; Huang, Liuxuan; Xiao, Qing; Qiu, Feng.

Transl Oncol ; 35: 101714, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-37331103

RESUMO

Persistent human papillomavirus (HPV) infections is necessary for the development of cervical cancers. An increasing number of retrospective studies have found the depletion of Lactobacillus microbiota in the cervico-vagina facilitate HPV infection and might be involved in viral persistence and cancer development. However, there have been no reports confirming the immunomodulatory effects of Lactobacillus microbiota isolated from cervico-vaginal samples of HPV clearance in women. Using cervico-vaginal samples from HPV persistent infection and clearance in women, this study investigated the local immune properties in cervical mucosa. As expected, type I interferons, such as IFN-α and IFN-ß, and TLR3 globally downregulated in HPV+ persistence group. Luminex cytokine/chemokine panel analysis revealed that L. jannaschii LJV03, L. vaginalis LVV03, L. reuteri LRV03, and L. gasseri LGV03 isolated from cervicovaginal samples of HPV clearance in women altered the host's epithelial immune response, particularly L. gasseri LGV03. Furthermore, L. gasseri LGV03 enhanced the poly (I:C)-induced production of IFN by modulating the IRF3 pathway and attenuating poly (I:C)-induced production of proinflammatory mediators by regulating the NF-κB pathway in Ect1/E6E7 cells, indicating that L. gasseri LGV03 keeps the innate system alert to potential pathogens and reduces the inflammatory effects during persistent pathogen infection. L. gasseri LGV03 also markedly inhibited the proliferation of Ect1/E6E7 cells in a zebrafish xenograft model, which may be attributed to an increased immune response mediated by L. gasseri LGV03.

8.

Out-of-Distribution Detection by Cross-Class Vicinity Distribution of In-Distribution Data.

Zhao, Zhilin; Cao, Longbing; Lin, Kun-Yu.

IEEE Trans Neural Netw Learn Syst ; PP2023 May 26.

Artigo em Inglês | MEDLINE | ID: mdl-37235465

RESUMO

Deep neural networks for image classification only learn to map in-distribution inputs to their corresponding ground-truth labels in training without differentiating out-of-distribution samples from in-distribution ones. This results from the assumption that all samples are independent and identically distributed (IID) without distributional distinction. Therefore, a pretrained network learned from in-distribution samples treats out-of-distribution samples as in-distribution and makes high-confidence predictions on them in the test phase. To address this issue, we draw out-of-distribution samples from the vicinity distribution of training in-distribution samples for learning to reject the prediction on out-of-distribution inputs. A cross-class vicinity distribution is introduced by assuming that an out-of-distribution sample generated by mixing multiple in-distribution samples does not share the same classes of its constituents. We, thus, improve the discriminability of a pretrained network by finetuning it with out-of-distribution samples drawn from the cross-class vicinity distribution, where each out-of-distribution input corresponds to a complementary label. Experiments on various in-/out-of-distribution datasets show that the proposed method significantly outperforms the existing methods in improving the capacity of discriminating between in-and out-of-distribution samples.

9.

AI and data science for smart emergency, crisis and disaster resilience.

Cao, Longbing.

Int J Data Sci Anal ; 15(3): 231-246, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37035277

RESUMO

The uncertain world has seen increasing emergencies, crises and disasters (ECDs), such as the COVID-19 pandemic, hurricane Ian, global financial inflation and recession, misinformation disaster, and cyberattacks. AI for smart disaster resilience (AISDR) transforms classic reactive and scripted disaster management to digital proactive and intelligent resilience across ECD ecosystems. A systematic overview of diverse ECDs, classic ECD management, ECD data complexities, and an AISDR research landscape are presented in this article. Translational disaster AI is essential to enable smart disaster resilience.

10.

Revealing the Distributional Vulnerability of Discriminators by Implicit Generators.

Zhao, Zhilin; Cao, Longbing; Lin, Kun-Yu.

IEEE Trans Pattern Anal Mach Intell ; 45(7): 8888-8901, 2023 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-37015685

RESUMO

In deep neural learning, a discriminator trained on in-distribution (ID) samples may make high-confidence predictions on out-of-distribution (OOD) samples. This triggers a significant matter for robust, trustworthy and safe deep learning. The issue is primarily caused by the limited ID samples observable in training the discriminator when OOD samples are unavailable. We propose a general approach for fine-tuning discriminators by implicit generators (FIG). FIG is grounded on information theory and applicable to standard discriminators without retraining. It improves the ability of a standard discriminator in distinguishing ID and OOD samples by generating and penalizing its specific OOD samples. According to the Shannon entropy, an energy-based implicit generator is inferred from a discriminator without extra training costs. Then, a Langevin dynamic sampler draws specific OOD samples for the implicit generator. Lastly, we design a regularizer fitting the design principle of the implicit generator to induce high entropy on those generated OOD samples. The experiments on different networks and datasets demonstrate that FIG achieves the state-of-the-art OOD detection performance.

11.

BiT-MAC: Mortality prediction by bidirectional time and multi-feature attention coupled network on multivariate irregular time series.

Wang, Qinfen; Chen, Geng; Jin, Xuting; Ren, Siyuan; Wang, Gang; Cao, Longbing; Xia, Yong.

Comput Biol Med ; 155: 106586, 2023 03.

Artigo em Inglês | MEDLINE | ID: mdl-36774888

RESUMO

Mortality prediction is crucial to evaluate the severity of illness and assist in improving the prognosis of patients. In clinical settings, one way is to analyze the multivariate time series (MTSs) of patients based on their medical data, such as heart rates and invasive mean arterial blood pressure. However, this suffers from sparse, irregularly sampled, and incomplete data issues. These issues can compromise the performance of follow-up MTS-based analytic applications. Plenty of existing methods try to deal with such irregular MTSs with missing values by capturing the temporal dependencies within a time series, yet in-depth research on modeling inter-MTS couplings remains rare and lacks model interpretability. To this end, we propose a bidirectional time and multi-feature attention coupled network (BiT-MAC) to capture the temporal dependencies (i.e., intra-time series coupling) and the hidden relationships among variables (i.e., inter-time series coupling) with a bidirectional recurrent neural network and multi-head attention, respectively. The resulting intra- and inter-time series coupling representations are then fused to estimate the missing values for a more robust MTS-based prediction. We evaluate BiT-MAC by applying it to the missing-data corrupted mortality prediction on two real-world clinical datasets, i.e., PhysioNet'2012 and COVID-19. Extensive experiments demonstrate the superiority of BiT-MAC over cutting-edge models, verifying the great value of the deep and hidden relations captured by MTSs. The interpretability of features is further demonstrated through a case study.

Assuntos

COVID-19 , Humanos , Fatores de Tempo , Frequência Cardíaca , Redes Neurais de Computação

12.

An Efficient Method for Modeling Nonoccurring Behaviors by Negative Sequential Patterns With Loose Constraints.

Qiu, Ping; Gong, Yongshun; Zhao, Yuhai; Cao, Longbing; Zhang, Chengqi; Dong, Xiangjun.

IEEE Trans Neural Netw Learn Syst ; 34(4): 1864-1878, 2023 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-33729957

RESUMO

The sequence analysis handles sequential discrete events and behaviors, which can be represented by temporal point processes (TPPs). However, TPP models only occurring events and behaviors. This article explores an efficient method for the negative sequential pattern (NSP) mining to leverage TPP in modeling both frequently occurring and nonoccurring events and behaviors. NSP mining is good at the challenging modeling of nonoccurrences of events and behaviors and their combinations with occurring events, with existing methods built on incorporating various constraints into NSP representations, e.g., simplifying NSP formulations and reducing computational costs. Such constraints restrict the flexibility of NSPs, and nonoccurring behaviors (NOBs) cannot be comprehensively exposed. This article addresses this issue by loosening some inflexible constraints in NSP mining and solves a series of consequent challenges. First, we provide a new definition of negative containment with the set theory according to the loose constraints. Second, an efficient method quickly calculates the supports of negative sequences. Our method only uses the information about the corresponding positive sequential patterns (PSPs) and avoids additional database scans. Finally, a novel and efficient algorithm, NegI-NSP, is proposed to efficiently identify highly valuable NSPs. Theoretical analyses, comparisons, and experiments on four synthetic and two real-life data sets clearly show that NegI-NSP can efficiently discover more useful NOBs.

13.

Explicit and Implicit Pattern Relation Analysis for Discovering Actionable Negative Sequences.

Wang, Wei; Cao, Longbing.

IEEE Trans Neural Netw Learn Syst ; PP2022 Oct 10.

Artigo em Inglês | MEDLINE | ID: mdl-36215383

RESUMO

Real-life events, behaviors, and interactions produce sequential data. An important but rarely explored problem is to analyze those nonoccurring (also called negative) yet important sequences, forming negative sequence analysis (NSA). A typical NSA area is to discover negative sequential patterns (NSPs) consisting of important nonoccurring and occurring elements and patterns. The limited existing work on NSP mining relies on frequentist and downward closure property-based pattern selection, producing large and highly redundant NSPs, nonactionable for business decision-making. This work makes the first attempt for actionable NSP discovery. It builds an NSP graph representation, quantifies both explicit occurrence and implicit nonoccurrence-based element and pattern relations, and then discovers significant, diverse, and informative NSPs in the NSP graph to represent the entire NSP set for discovering actionable NSPs. A DPP-based NSP representation and actionable NSP discovery method, EINSP, introduces novel and significant contributions to NSA and sequence analysis: 1) it represents NSPs by a determinantal point process (DPP)-based graph; 2) it quantifies actionable NSPs in terms of their statistical significance, diversity, and strength of explicit/implicit element/pattern relations; and 3) it models and measures both explicit and implicit element/pattern relations in the DPP-based NSP graph to represent direct and indirect couplings between NSP items, elements, and patterns. We substantially analyze the effectiveness of EINSP in terms of various theoretical and empirical aspects, including complexity, item/pattern coverage, pattern size and diversity, implicit pattern relation strength, and data factors.

14.

AI in Combating the COVID-19 Pandemic.

Cao, Longbing.

IEEE Intell Syst ; 37(2): 3-13, 2022 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-35935446

RESUMO

The SARS-CoV-2 virus, the COVID-19 disease, and the resulting pandemic have reshaped the entire world in an unprecedented manner. Massive efforts have been made by AI communities to combat the pandemic. What roles has AI played in tackling COVID-19? How has AI performed in the battle against COVID-19? Where are the gaps and opportunities? What lessons can we learn to enhance the ability of AI to battle future pandemics? These questions, despite being fundamental, are yet to be answered in full or systematically. They need to be addressed by AI communities as a priority despite the easing of the omicron infectiousness and threat. This article reviews these issues with reflections on global AI research and the literature on tackling COVID-19. It is envisaged that the demand and priority of developing "pandemic AI" will increase over time, with smart global epidemic early warning systems to be developed by a global collaborative AI effort.

15.

Modeling time evolving COVID-19 uncertainties with density dependent asymptomatic infections and social reinforcement.

Liu, Qing; Cao, Longbing.

Sci Rep ; 12(1): 5891, 2022 04 07.

Artigo em Inglês | MEDLINE | ID: mdl-35393500

RESUMO

The COVID-19 pandemic has posed significant challenges in modeling its complex epidemic transmissions, infection and contagion, which are very different from known epidemics. The challenges in quantifying COVID-19 complexities include effectively modeling its process and data uncertainties. The uncertainties are embedded in implicit and high-proportional undocumented infections, asymptomatic contagion, social reinforcement of infections, and various quality issues in the reported data. These uncertainties become even more apparent in the first 2 months of the COVID-19 pandemic, when the relevant knowledge, case reporting and testing were all limited. Here we introduce a novel hybrid approach SUDR by expanding the foundational compartmental epidemic Susceptible-Infected-Recovered (SIR) model with two compartments to a Susceptible-Undocumented infected-Documented infected-Recovered (SUDR) model. First, SUDR (1) characterizes and distinguishes Undocumented (U) and Documented (D) infections commonly seen during COVID-19 incubation periods and asymptomatic infections. Second, SUDR characterizes the probabilistic density of infections by capturing exogenous processes like clustering contagion interactions, superspreading, and social reinforcement. Lastly, SUDR approximates the density likelihood of COVID-19 prevalence over time by incorporating Bayesian inference into SUDR. Different from existing COVID-19 models, SUDR characterizes the undocumented infections during unknown transmission processes. To capture the uncertainties of temporal transmission and social reinforcement during COVID-19 contagion, the transmission rate is modeled by a time-varying density function of undocumented infectious cases. By sampling from the mean-field posterior distribution with reasonable priors, SUDR handles the randomness, noise and sparsity of COVID-19 observations widely seen in the public COVID-19 case data. The results demonstrate a deeper quantitative understanding of the above uncertainties, in comparison with classic SIR, time-dependent SIR, and probabilistic SIR models.

Assuntos

COVID-19 , Infecções Assintomáticas/epidemiologia , Teorema de Bayes , COVID-19/epidemiologia , Humanos , Pandemias , Reforço Social , SARS-CoV-2

16.

Personalized next-best action recommendation with multi-party interaction learning for automated decision-making.

Cao, Longbing; Zhu, Chengzhang.

PLoS One ; 17(1): e0263010, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35085347

RESUMO

Automated next-best action recommendation for each customer in a sequential, dynamic and interactive context has been widely needed in natural, social and business decision-making. Personalized next-best action recommendation must involve past, current and future customer demographics and circumstances (states) and behaviors, long-range sequential interactions between customers and decision-makers, multi-sequence interactions between states, behaviors and actions, and their reactions to their counterpart's actions. No existing modeling theories and tools, including Markovian decision processes, user and behavior modeling, deep sequential modeling, and personalized sequential recommendation, can quantify such complex decision-making on a personal level. We take a data-driven approach to learn the next-best actions for personalized decision-making by a reinforced coupled recurrent neural network (CRN). CRN represents multiple coupled dynamic sequences of a customer's historical and current states, responses to decision-makers' actions, decision rewards to actions, and learns long-term multi-sequence interactions between parties (customer and decision-maker). Next-best actions are then recommended on each customer at a time point to change their state for an optimal decision-making objective. Our study demonstrates the potential of personalized deep learning of multi-sequence interactions and automated dynamic intervention for personalized decision-making in complex systems.

Assuntos

Tomada de Decisões Assistida por Computador , Modelos Teóricos , Redes Neurais de Computação

17.

Unsupervised Heterogeneous Coupling Learning for Categorical Representation.

Zhu, Chengzhang; Cao, Longbing; Yin, Jianping.

IEEE Trans Pattern Anal Mach Intell ; 44(1): 533-549, 2022 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-32750827

RESUMO

Complex categorical data is often hierarchically coupled with heterogeneous relationships between attributes and attribute values and the couplings between objects. Such value-to-object couplings are heterogeneous with complementary and inconsistent interactions and distributions. Limited research exists on unlabeled categorical data representations, ignores the heterogeneous and hierarchical couplings, underestimates data characteristics and complexities, and overuses redundant information, etc. The deep representation learning of unlabeled categorical data is challenging, overseeing such value-to-object couplings, complementarity and inconsistency, and requiring large data, disentanglement, and high computational power. This work introduces a shallow but powerful UNsupervised heTerogeneous couplIng lEarning (UNTIE) approach for representing coupled categorical data by untying the interactions between couplings and revealing heterogeneous distributions embedded in each type of couplings. UNTIE is efficiently optimized w.r.t. a kernel k-means objective function for unsupervised representation learning of heterogeneous and hierarchical value-to-object couplings. Theoretical analysis shows that UNTIE can represent categorical data with maximal separability while effectively represent heterogeneous couplings and disclose their roles in categorical data. The UNTIE-learned representations make significant performance improvement against the state-of-the-art categorical representations and deep representation models on 25 categorical data sets with diversified characteristics.

18.

Neural Time-Aware Sequential Recommendation by Jointly Modeling Preference Dynamics and Explicit Feature Couplings.

Zhang, Qi; Cao, Longbing; Shi, Chongyang; Niu, Zhendong.

IEEE Trans Neural Netw Learn Syst ; 33(10): 5125-5137, 2022 10.

Artigo em Inglês | MEDLINE | ID: mdl-33852391

RESUMO

In recommendation, both stationary and dynamic user preferences on items are embedded in the interactions between users and items (e.g., rating or clicking) within their contexts. Sequential recommender systems (SRSs) need to jointly involve such context-aware user-item interactions in terms of the couplings between the user and item features and sequential user actions on items over time. However, such joint modeling is non-trivial and significantly challenges the existing work on preference modeling, which either only models user-item interactions by latent factorization models but ignores user preference dynamics or only captures sequential user action patterns without involving user/item features and context factors and their coupling and influence on user actions. We propose a neural time-aware recommendation network (TARN) with a temporal context to jointly model 1) stationary user preferences by a feature interaction network and 2) user preference dynamics by a tailored convolutional network. The feature interaction network factorizes the pairwise couplings between non-zero features of users, items, and temporal context by the inner product of their feature embeddings while alleviating data sparsity issues. In the convolutional network, we introduce a convolutional layer with multiple filter widths to capture multi-fold sequential patterns, where an attentive average pooling (AAP) obtains significant and large-span feature combinations. To learn the preference dynamics, a novel temporal action embedding represents user actions by incorporating the embeddings of items and temporal context as the inputs of the convolutional network. The experiments on typical public data sets demonstrate that TARN outperforms state-of-the-art methods and show the necessity and contribution of involving time-aware preference dynamics and explicit user/item feature couplings in modeling and interpreting evolving user preferences.

Assuntos

Aprendizagem , Redes Neurais de Computação

19.

Table2Vec-automated universal representation learning of enterprise data DNA for benchmarkable and explainable enterprise data science.

Cao, Longbing; Zhu, Chengzhang.

Sci Rep ; 11(1): 23957, 2021 12 14.

Artigo em Inglês | MEDLINE | ID: mdl-34907319

RESUMO

Enterprise data typically involves multiple heterogeneous data sources and external data that respectively record business activities, transactions, customer demographics, status, behaviors, interactions and communications with the enterprise, and the consumption and feedback of its products, services, production, marketing, operations, and management, etc. They involve enterprise DNA associated with domain-oriented transactions and master data, informational and operational metadata, and relevant external data. A critical challenge in enterprise data science is to enable an effective 'whole-of-enterprise' data understanding and data-driven discovery and decision-making on all-round enterprise DNA. Accordingly, here we introduce a neural encoder Table2Vec for automated universal representation learning of entities such as customers from all-round enterprise DNA with automated data characteristics analysis and data quality augmentation. The learned universal representations serve as representative and benchmarkable enterprise data genomes (similar to biological genomes and DNA in organisms) and can be used for enterprise-wide and domain-specific learning tasks. Table2Vec integrates automated universal representation learning on low-quality enterprise data and downstream learning tasks. Such automated universal enterprise representation and learning cannot be addressed by existing enterprise data warehouses (EDWs), business intelligence and corporate analytics systems, where 'enterprise big tables' are constructed with reporting and analytics conducted by specific analysts on respective domain subjects and goals. It addresses critical limitations and gaps of existing representation learning, enterprise analytics and cloud analytics, which are analytical subject, task and data-specific, creating analytical silos in an enterprise. We illustrate Table2Vec in characterizing all-round customer data DNA in an enterprise on complex heterogeneous multi-relational big tables to build universal customer vector representations. The learned universal representation of each customer is all-round, representative and benchmarkable to support both enterprise-wide and domain-specific learning goals and tasks in enterprise data science. Table2Vec significantly outperforms the existing shallow, boosting and deep learning methods typically used for enterprise analytics. We further discuss the research opportunities, directions and applications of automated universal enterprise representation and learning and the learned enterprise data DNA for automated, all-purpose, whole-of-enterprise and ethical machine learning and data science.

Assuntos

Bases de Dados de Ácidos Nucleicos , Aprendizado de Máquina , Análise de Sequência de DNA , Software

20.

e-RNSP: An Efficient Method for Mining Repetition Negative Sequential Patterns.

Dong, Xiangjun; Gong, Yongshun; Cao, Longbing.

IEEE Trans Cybern ; 50(5): 2084-2096, 2020 May.

Artigo em Inglês | MEDLINE | ID: mdl-30296247

RESUMO

Negative sequential patterns (NSPs), which capture both frequent occurring and nonoccurring behaviors, become increasingly important and sometimes play a role irreplaceable by analyzing occurring behaviors only. Repetition sequential patterns capture repetitions of patterns in different sequences as well as within a sequence and are very important to understand the repetition relations between behaviors. Though some methods are available for mining NSP and repetition positive sequential patterns (RPSPs), we have not found any methods for mining repetition NSP (RNSP). RNSP can help the analysts to further understand the repetition relationships between items and capture more comprehensive information with repetition properties. However, mining RNSP is much more difficult than mining NSP due to the intrinsic challenges of nonoccurring items. To address the above issues, we first propose a formal definition of repetition negative containment. Then, we propose a method to convert repetition negative containment to repetition positive containment, which fast calculates the repetition supports by only using the corresponding RPSP's information without rescanning databases. Finally, we propose an efficient algorithm, called e-RNSP, to mine RNSP efficiently. To the best of our knowledge, e-RNSP is the first algorithm to efficiently mine RNSP. Intensive experimental results on the first four real and synthetic datasets clearly show that e-RNSP can efficiently discover the repetition negative patterns; results on the fifth dataset prove the effectiveness of RNSP which are captured by the proposed method; and the results on the rest 16 datasets analyze the impacts of data characteristics on mining process.

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA