Joint Beam Training and Data Transmission Control for mmWave Delay-Sensitive Communications: A Parallel Reinforcement Learning Approach
- Lei, Wanlu (author)
- KTH, Teknisk informationsvetenskap, Ericsson AB, S-16480 Stockholm, Sweden
- Zhang, Deyou (author)
- KTH, Teknisk informationsvetenskap
- Ye, Yu (author)
- KTH, Teknisk informationsvetenskap
- Lu, Chenguang (author)
- Institute of Electrical and Electronics Engineers (IEEE), 2022
- English.
In: IEEE Journal of Selected Topics in Signal Processing. Institute of Electrical and Electronics Engineers (IEEE). ISSN 1932-4553. Vol. 16, no. 3, pp. 447-459
- Related links:
- https://urn.kb.se/re...
- https://doi.org/10.1...
Abstract
- Future communication networks call for new solutions that meet their capacity and delay demands by leveraging the potential of the millimeter wave (mmWave) frequency band. However, the beam training procedure in mmWave systems incurs significant overhead and high energy consumption. An adaptive control policy therefore benefits both delay-sensitive and energy-efficient data transmission over mmWave networks. To this end, we investigate the problem of joint beam training and data transmission control for delay-sensitive mmWave communications. Specifically, the problem is first formulated as a constrained Markov Decision Process (MDP) that aims to minimize the cumulative energy consumption over the considered time period under a delay constraint. By introducing a Lagrange multiplier, we transform the constrained MDP into an unconstrained one, which is then solved in a data-driven manner via a parallel-rollout-based reinforcement learning method. Our numerical results demonstrate that the policy optimized via parallel rollout significantly outperforms other baseline policies in both energy consumption and delay performance.
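The Lagrangian step described in the abstract can be illustrated with a small sketch. The record gives no model details, so everything below is an assumption made for illustration: the toy queue model, the action set, the energy values, the backlog used as a delay proxy, and all function names are hypothetical. The sketch also uses plain rollout with a single heuristic base policy, not the paper's parallel-rollout method. It only shows how a per-slot cost of the form energy + lambda * delay turns the delay constraint into a penalty that a simulation-based policy can minimize.

```python
import random

# Hypothetical toy model (not from the paper): a transmitter with a packet queue.
# Actions: 0 = idle, 1 = beam training (costly, aligns the beam),
#          2 = transmit (cheap, succeeds only if the beam is aligned).
ENERGY = {0: 0.0, 1: 3.0, 2: 1.0}  # assumed per-slot energy costs
ACTIONS = (0, 1, 2)
MAX_QUEUE = 5                      # assumed buffer size


def step(state, action, rng):
    """Advance the toy MDP by one slot; return (next_state, energy, delay)."""
    queue, aligned = state
    # Always draw both random numbers so every slot consumes the same amount
    # of randomness, whatever the action (needed for common random numbers).
    u_misalign, u_arrival = rng.random(), rng.random()
    if action == 1:
        aligned = True
    elif action == 2 and aligned and queue > 0:
        queue -= 1
    if u_misalign < 0.2:           # the beam may drift out of alignment
        aligned = False
    if u_arrival < 0.5:            # a new packet may arrive
        queue = min(queue + 1, MAX_QUEUE)
    return (queue, aligned), ENERGY[action], float(queue)  # backlog as delay proxy


def lagrangian_cost(energy, delay, lam):
    """Per-slot cost of the unconstrained problem: energy + lambda * delay."""
    return energy + lam * delay


def base_policy(state):
    """Heuristic followed inside rollouts: idle if empty, else train or transmit."""
    queue, aligned = state
    if queue == 0:
        return 0
    return 2 if aligned else 1


def rollout_value(state, first_action, lam, horizon, rng):
    """Simulated cost of taking first_action, then following base_policy."""
    total, action = 0.0, first_action
    for _ in range(horizon):
        state, energy, delay = step(state, action, rng)
        total += lagrangian_cost(energy, delay, lam)
        action = base_policy(state)
    return total


def rollout_policy(state, lam, horizon=10, n_rollouts=30, seed=0):
    """Pick the action with the lowest average simulated Lagrangian cost."""
    best_action, best_cost = None, float("inf")
    for a in ACTIONS:
        rng = random.Random(seed)  # same random stream for every candidate action
        cost = sum(rollout_value(state, a, lam, horizon, rng)
                   for _ in range(n_rollouts)) / n_rollouts
        if cost < best_cost:
            best_action, best_cost = a, cost
    return best_action
```

In this sketch a larger `lam` makes backlog more expensive relative to energy, so the selected action shifts toward transmitting. In a Lagrangian treatment the multiplier would typically be adjusted until the delay constraint is met; how the paper chooses it is not stated in this record.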
Keywords
- Training
- Data communication
- Transmitters
- Energy consumption
- Delays
- Array signal processing
- Reinforcement learning
- Beam training
- data-driven
- delay-sensitive
- Markov decision process
- millimeter wave
- reinforcement learning
Publication and content type
- ref (subject category)
- art (subject category)