IEEE Transactions on Communications,
vol. 68, no. 3, Mar. 2020, pp. 1581-1592, DOI 10.1109/TCOMM.2019.2961332.
One of the top 50 most accessed articles
each month during 2020, as well as eight months in 2021 (Jan., Feb., May, June, July, Aug., Sep., Nov. and Dec.) and
two months so far in 2022 (Mar. and Apr.) among all articles published in the IEEE Transactions on Communications
Deep Reinforcement Learning for 5G Networks: Joint Beamforming, Power Control, and Interference Coordination
Faris B. Mismar (1),
Brian L. Evans (1), and
Ahmed Alkhateeb (2)
Department of Electrical and Computer Engineering,
Wireless Networking and Communications Group,
The University of Texas at Austin,
Austin, TX 78712 USA
School of Electrical, Computer and Energy Engineering,
Arizona State University,
Tempe, AZ 85287 USA
Published on IEEE Explore
Paper draft on arXiv
The fifth generation of wireless communications (5G) promises massive
increases in traffic volume and data rates, as well as improved reliability
in voice calls.
Jointly optimizing beamforming, power control, and interference coordination
in a 5G wireless network to enhance the communication performance to end
users poses a significant challenge.
In this paper, we formulate the joint design of beamforming, power control,
and interference coordination as a non-convex optimization problem to
maximize the signal to interference plus noise ratio (SINR) and solve this
problem using deep reinforcement learning.
By using the greedy nature of deep Q-learning to estimate future rewards
of actions and using the reported coordinates of the users served by the
network, we propose an algorithm for voice bearers and data bearers in
sub-6 GHz and millimeter wave (mmWave) frequency bands, respectively.
The algorithm improves the performance measured by SINR and sum-rate
In realistic cellular environments, the simulation results show that
our algorithm outperforms the link adaptation industry standards
for sub-6 GHz voice bearers.
For data bearers in the mmWave frequency band, our algorithm approaches the
maximum sum rate capacity, but with less than 4% of the required run time.
COPYRIGHT NOTICE: All the documents on this server
have been submitted by their authors to scholarly journals or conferences
as indicated, for the purpose of non-commercial dissemination of
The manuscripts are put on-line to facilitate this purpose.
These manuscripts are copyrighted by the authors or the journals in which
they were published.
You may copy a manuscript for scholarly, non-commercial purposes, such
as research or instruction, provided that you agree to respect these
Last Updated 05/21/22.