Distributed Spectrum Management in Cognitive Radio Networks by Consensus-Based Reinforcement Learning.
Dasic, Dejan; Ilic, Nemanja; Vucetic, Miljan; Peric, Miroslav; Beko, Marko; Stankovic, Milos S.
Affiliation
  • Dasic D; Artificial Intelligence Department, Vlatacom Institute, 11070 Belgrade, Serbia.
  • Ilic N; Faculty of Technical Sciences, Singidunum University, 11000 Belgrade, Serbia.
  • Vucetic M; COPELABS, Universidade Lusófona de Humanidades e Tecnologias, 1749-024 Lisbon, Portugal.
  • Peric M; Artificial Intelligence Department, Vlatacom Institute, 11070 Belgrade, Serbia.
  • Beko M; Department of Information Technologies, College of Applied Technical Sciences, 37000 Krusevac, Serbia.
  • Stankovic MS; Artificial Intelligence Department, Vlatacom Institute, 11070 Belgrade, Serbia.
Sensors (Basel); 21(9), 2021 Apr 23.
Article in English | MEDLINE | ID: mdl-33922677
ABSTRACT
In this paper, we propose a new consensus-based algorithm for distributed spectrum sensing and channel selection in cognitive radio networks. The algorithm operates within a multi-agent reinforcement learning scheme. The proposed consensus strategy, implemented over a directed, typically sparse, time-varying, low-bandwidth communication network, enforces collaboration between the agents in a completely decentralized and distributed way. The motivation for the proposed approach comes directly from practical scenarios in typical cognitive radio networks, where a decentralized setting and distributed operation are essential. Specifically, the proposed setting provides all the agents, in unknown environmental and application conditions, with viable network-wide information. Hence, the participating agents as a group become capable of calculating the optimal joint spectrum sensing and channel selection strategy even if no individual agent can do so alone. The proposed algorithm is, by its nature, scalable and robust to node and link failures. The paper presents a detailed discussion and analysis of the algorithm's characteristics, including the effects of denoising, the possibility of organizing coordinated actions, and the convergence rate improvement induced by the consensus scheme. The results of extensive simulations show that the proposed algorithm is highly effective and that its behavior is close to that of the centralized scheme even with sparse neighbor-based inter-node communication.
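The denoising effect of neighbor-based consensus mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it shows only a generic averaging step over a fixed directed ring, without the reinforcement learning update or a time-varying topology. All names (`consensus_step`, `spread`, `true_values`, `self_weight`) and all numeric values are assumptions made for illustration.

```python
import random


def consensus_step(estimates, in_neighbors, self_weight=0.5):
    """One round: each agent mixes its own per-channel estimate with the
    mean of the estimates received from its in-neighbors."""
    new = []
    for i, own in enumerate(estimates):
        nbrs = in_neighbors[i]
        if not nbrs:  # isolated agent keeps its own estimate
            new.append(list(own))
            continue
        mixed = []
        for c in range(len(own)):
            nbr_mean = sum(estimates[j][c] for j in nbrs) / len(nbrs)
            mixed.append(self_weight * own[c] + (1 - self_weight) * nbr_mean)
        new.append(mixed)
    return new


def spread(estimates):
    """Largest disagreement between agents on any single channel."""
    n_channels = len(estimates[0])
    return max(
        max(e[c] for e in estimates) - min(e[c] for e in estimates)
        for c in range(n_channels)
    )


rng = random.Random(0)
true_values = [0.2, 0.8, 0.5]  # hypothetical per-channel values
n_agents = 6

# Each agent starts from its own noisy local estimate of every channel.
estimates = [[v + rng.gauss(0, 0.3) for v in true_values] for _ in range(n_agents)]

# Sparse directed ring: agent i only hears from agent i - 1.
in_neighbors = {i: [(i - 1) % n_agents] for i in range(n_agents)}

initial_mean = [sum(e[c] for e in estimates) / n_agents for c in range(len(true_values))]
initial_spread = spread(estimates)

for _ in range(200):
    estimates = consensus_step(estimates, in_neighbors)

final_spread = spread(estimates)
print(f"disagreement before: {initial_spread:.3f}, after: {final_spread:.2e}")
```

On this particular ring the mixing weights are doubly stochastic, so every agent ends up at the network-wide average of the initial estimates, even though each one only ever hears from a single neighbor. With general directed or time-varying graphs the agreed value need not be the plain average.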
Full text: 1 Collections: 01-international Database: MEDLINE Language: En Journal: Sensors (Basel) Publication year: 2021 Document type: Article
