Your browser doesn't support javascript.
loading
An Online Semantic-Enhanced Graphical Model for Evolving Short Text Stream Clustering.
IEEE Trans Cybern ; 52(12): 13809-13820, 2022 Dec.
Article en En | MEDLINE | ID: mdl-34591776
ABSTRACT
Due to the popularity of social media and online fora, such as Twitter, Reddit, Facebook, and Wechat, short text stream clustering has gained significant attention in recent years. However, most existing short text stream clustering approaches usually work on static data and tend to cause a "term ambiguity" problem due to the sparse word representation. Beyond, they often exploit short text streams in a batch way and are difficult to find evolving topics in term-changing subspaces. In this article, we propose an online semantic-enhanced graphical model for evolving short text stream clustering (OSGM), by exploiting the word-occurrence semantic information and dynamically maintaining evolving active topics in term-changing subspaces in an online way. Compared to the existing approaches, our online model is not only free of determining the optimal batch size but also lends itself to handling large-scale data streams efficiently. It is also able to handle the "term ambiguity" problem without incorporating features from external resources. More importantly, to the best of our knowledge, it is the first work to extract evolving topics in term-changing subspaces automatically in an online way. Extensive experiments demonstrate that the proposed model yields better performance compared to many state-of-the-art algorithms on both synthetic and real-world datasets.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Semántica / Medios de Comunicación Sociales Límite: Humans Idioma: En Revista: IEEE Trans Cybern Año: 2022 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Semántica / Medios de Comunicación Sociales Límite: Humans Idioma: En Revista: IEEE Trans Cybern Año: 2022 Tipo del documento: Article
...