吃什么最补肾壮阳| 伤口不愈合用什么药| 人生没有什么不可放下| 瑶柱是什么东西| 慢热是什么意思| sp是什么的缩写| 势均力敌什么意思| 护理部主任是什么级别| 墨菲定律是什么| 吃什么解辣最快方法| 痔疮是什么引起的| 酱油色尿是什么原因| 眼睛红是什么病的前兆| 女性吃金蝉有什么好处| 1975年属兔五行属什么| penis什么意思| 今天是什么纪念日| 眉毛长痘是什么原因| 宇字属于五行属什么| 谁也不知道下一秒会发生什么| 为什么会得炎症| 尿酸高要注意什么| 什么工作赚钱| 大姨夫是什么意思| 生理期可以吃什么水果| 小腿发痒是什么原因| mid什么意思| 吴亦凡属什么生肖| 素海参是什么做的| 睾丸是什么| 做梦梦到捡钱是什么征兆| 种牙和假牙有什么区别| 你本来就很美是什么广告| 想吐吃什么药| 男士戴什么手串好| 夜不能寐什么意思| 胰岛素高是什么意思| 双抗是什么药| 析是什么意思| 什么是前奶什么是后奶| 腹胀是什么病的前兆| 拔牙之后可以吃什么| 阴道镜是什么| 全员加速中什么时候播| 饱和脂肪酸是什么意思| 肾虚用什么补最好| 外阴痒用什么药膏| 帕金森看什么科室| 黄疸高是什么原因引起的| 姐姐的孩子叫什么| 肌酐高有什么症状| 半夜腿抽筋是什么原因| 刚字五行属什么| 茶色尿是什么原因引起的| 五险都有什么险| 世界上最长的英文单词是什么| 梦见鞋丢了是什么意思| qy是什么意思| jc是什么牌子| 501是什么意思| 什么的列车| 剁椒是什么辣椒| 总流口水是什么原因| 为什么会得子宫肌瘤| 梅长苏是什么电视剧| 羊内腰和外腰分别是什么| 日本打工需要什么条件| poct是什么意思| 鱿鱼属于什么类| 吃什么容易消化| 狐臭挂什么科| 小孩晚上睡不着是什么原因| 做梦梦见水是什么意思| 维生素检查项目叫什么| 体位性低血压是什么| 尿黄起泡是什么原因| 知了是什么意思| 什么是表达方式| 死不瞑目是什么意思| 单侧耳鸣是什么原因引起的| 三高挂号挂什么科| 妈妈的妹妹应该叫什么| 胃溃疡吃什么药好得快| 眼睛黄是什么原因| 消化不好吃什么药最好| 东边日出西边雨是什么生肖| 细菌性阴道炎用什么药效果最好| 高贵的什么| 冯巩什么军衔| 黄茶适合什么人喝| 机化是什么意思| 大保健什么意思| 猫喜欢吃什么| 小孩咳嗽挂什么科| 1129什么星座| 中心句是什么意思| 蚊子咬了为什么会痒| 痰湿中阻吃什么中成药| 木糖醇是什么东西| 一个小时尿一次是什么原因| 拍脑部ct挂什么科| 后背发热是什么原因| 一什么清风| 肾火旺吃什么药| 月经期间适合做什么运动| 风疹是什么| 神经性头痛吃什么药效果好| 脸色发黑是什么病的前兆| 遍体鳞伤是什么意思| 现在流行什么样的衣柜| 什么是激光| 什么时候初伏第一天| k开头的是什么车| 黑枸杞对男性性功能有什么帮助| 无料案内所是什么意思| 张家界莓茶有什么功效| 烧心吃点什么药| 氯硝西泮片是什么药| 梦见洗车是什么意思| 玫瑰花有什么作用| 爆菊什么意思| 嘴唇周围长痘痘是什么原因导致| 蝉的鸣叫声像什么| 胆结石不能吃什么东西| 车厘子什么时候成熟| 陌然是什么意思| 带牙套是什么意思| dr什么意思| 过期的啤酒有什么用处| 甲状腺是什么症状表现| 今年25岁属什么生肖| 老蜜蜡什么颜色最好| 急性荨麻疹不能吃什么食物| jhs空调是什么牌子| 杜甫的诗被称为什么| 吃核桃有什么好处和坏处| 扁桃体切除对身体有什么影响| 火腿炒什么菜好吃| 前列腺液和精液有什么区别| 豆角和什么一起炒好吃| 骨髓水肿是什么意思| 一个不一个好念什么| 北上广是什么意思| 所不欲勿施于人是什么意思| 胃泌素高是什么原因| hpv初期有什么症状女性| 骨折和断了有什么区别| 宝宝大便有泡沫是什么原因| 夏至为什么吃馄饨| 多事之秋是什么意思| 解脲支原体阳性是什么病| dpoy什么意思| 胃窦炎吃什么药效果最好| 大便溏薄是什么意思| 肺气肿有什么症状| 什么动物没尾巴| 38妇女节送老婆什么礼物| 右眼皮跳代表什么| 为什么医院都让喝雀巢奶粉| 大排畸主要检查什么| 中国防御系统叫什么| 紫癜吃什么药| 黄金桂是什么茶| 憩室炎吃什么药| 诸葛亮为什么气死周瑜| 搞基什么意思| 开什么玩笑| 荒唐是什么意思| 什么时候敷面膜效果最好| 什么是紫外线| 萎缩性胃炎是什么症状| 梅长苏结局是什么| 安宫牛黄丸什么时间吃最好| 省长是什么级别| 4月26日什么星座| 口甜是什么原因引起的| 贵州的特产是什么| 双皮奶是什么做的| 826是什么星座| 弃市是什么意思| 自然什么意思| 什么叫翡翠| 护肝片什么时候吃最好| 撰文是什么意思| 依托考昔片是什么药| 黑油是什么油| 跳楼是什么感觉| 蒲公英茶有什么功效| 这个季节种什么菜合适| cab是什么意思| 98年出生属什么| 庆帝为什么杀叶轻眉| 感冒喉咙痛吃什么药| 7月26日是什么日子| 鸡咳嗽吃什么药| 血脂高吃什么药好| 独角仙生活在什么地方| 海鲜不能和什么一起吃| pcl是什么意思| 欧阳修字什么| 9月是什么季节| 甲状腺做什么检查| 县宣传部长是什么级别| 二氧化碳是什么东西| 尿胆原norm是什么意思| 考拉是什么意思| 心肌缺血吃什么食物| 前列腺在人体什么位置| 白细胞高是什么问题| 绮罗是什么意思| 鱼油什么人不能吃| 广西为什么简称桂| 一岁宝宝发烧吃什么药| 颈部有肿块看什么科室| gp是什么意思| 耳膜破了有什么症状| 鱼石是什么| rover是什么意思| 无花果什么时候结果| 做生意的人最忌讳什么| 肝素是什么| 紫荆花代表什么生肖| 平均血红蛋白浓度偏低是什么意思| 编外人员是什么意思| 赎罪是什么意思| 甲抗是什么原因引起的| 我宣你是什么意思| 尿潜血阳性什么意思| 身上长黑痣是什么原因| 喝大麦茶有什么好处| 血液粘稠是什么原因| 莫须有什么意思| 钙片吃多了有什么副作用| 梦见好多水是什么预兆| 机能鞋是什么意思| 吃什么升血压快| 回盲瓣呈唇形什么意思| 男方派去接亲要说什么| 40gp是什么意思| 怀孕早期吃什么| 十二月份是什么星座| 跳梁小丑是什么生肖| 缘分使然是什么意思| 菜瓜是什么瓜| 仲夏夜是什么时候| 飞天是什么意思| 什么水果消炎| 月经推迟不来吃什么药| 薏米有什么作用| 前列腺钙化灶什么意思| 梦见好多狗是什么预兆| 12月10日是什么星座| 经常放响屁是什么原因| 新生儿血糖低是什么原因| 有狐臭是什么原因| ad什么时候吃最好| 身体缺糖有什么症状| 山竹里面黄黄的是什么可以吃吗| 血小板高什么原因| 八大碗都有什么菜| 阴蒂长什么样| 八月十二是什么星座| 顾名思义的顾什么意思| 蛋白粉什么时候吃| 百度

晋江市市长张文贤在晋江企业创新发展大会上的讲话

Chengwang?Ji, Kehui?Li, Haiquan?Lu, Qiaoyan?Peng, Jintao?Wang and?Shaodan?Ma The authors are with the State Key Laboratory of Internet of Things for Smart City and the Department of Electrical and Computer Engineering, University of Macau, Macao SAR, China (e-mails: {ji.chengwang, qiaoyan.peng, wang.jintao}@connect.um.edu.mo; yc47997@um.edu.mo; haiq_lu@163.com; shaodanma@um.edu.mo).
Abstract
百度 数据方面,辽宁队哈德森41分8篮板5助攻,赵继伟19分5助攻,郭艾伦18分8篮板5助攻,李晓旭16分7篮板,韩德君13分13篮板;北京队方硕24分4助攻,汉密尔顿21分12篮板,杰克逊38分10篮板10助攻,翟晓川12分5篮板。

Reconfigurable distributed antenna and reflecting surface (RDARS) is a promising architecture for future sixth-generation (6G) wireless networks. In particular, the dynamic working mode configuration for the RDARS-aided system brings an extra selection gain compared to the existing reconfigurable intelligent surface (RIS)-aided system and distributed antenna system (DAS). In this paper, we consider the RDARS-aided downlink multiple-input multiple-output (MIMO) system and aim to maximize the weighted sum rate (WSR) by jointly optimizing the beamforming matrices at the based station (BS) and RDARS, as well as mode switching matrix at RDARS. The optimization problem is challenging to be solved due to the non-convex objective function and mixed integer binary constraint. To this end, a penalty term-based weight minimum mean square error (PWM) algorithm is proposed by integrating the majorization-minimization (MM) and weight minimum mean square error (WMMSE) methods. To further escape the local optimum point in the PWM algorithm, a model-driven DL method is integrated into this algorithm, where the key variables related to the convergence of PWM algorithm are trained to accelerate the convergence speed and improve the system performance. Simulation results are provided to show that the PWM-based beamforming network (PWM-BFNet) can reduce the number of iterations by half and achieve performance improvements of 26.53% and 103.2% at the scenarios of high total transmit power and a large number of RDARS transmit elements (TEs), respectively.

Index Terms:
Reconfigurable distributed antennas and reflecting surface (RDARS), joint beamforming and mode switching, PWM, model-driven.

I Introduction

The demand of enhanced wireless communication performance has driven the development of various critical technologies. For example, by exploiting multiple antennas to harness spatial diversity and multiplexing gains, massive multi-input and multi-output (MIMO) has become the key technology in current fifth-generation (5G) and future sixth-generation (6G) mobile communication networks [1, 2]. To mitigate the practical issues of millimeter wave (mmWave) and terahertz (THz) high-frequency bands, such as severe path loss and limited material penetration capabilities, distributed antenna systems have been proposed, where multiple geographically distributed antennas coordinate to transmit signals, so as to significantly enhance the spatial multiplexing capabilities in mmWave communications [1, 3, 4, 5]. However, each distributed antenna requires a dedicated radio-frequency (RF) chain, which leads to high hardware cost and energy consumption [6]. To address this issue, reconfigurable intelligent surface (RIS)/intelligent reflecting surface (IRS) has been proposed as a cost-effective and energy-efficient technology. Specifically, RIS is able to create a virtual link to enable the incident signal towards the desired area, whereas its performance suffers from the multiplicative fading effect, thus requiring a large number of reflecting elements to obtain the high reflection gain [7, 8, 9, 10, 11, 12, 13]. Besides, the efficient control and fast phase adjustment for RIS are two challenging problems to be solved. On the one hand, the wireless link to control RIS occupies the frequency and space resources. On the other hand, the synchronization problem between the base station (BS) and RIS needs to be solved. In particular, the dedicated wire link can be established to transmit the control and passive phase signals, with reduced latency and resource consumption. Nonetheless, this results in higher hardware costs [14].

Recently, reconfigurable distributed antennas and reflecting surface (RDARS) has been proposed as a promising technology for 6G mmWave communications [15, 16, 14, 10, 17, 18]. By integrating the benefits of the RIS and distributed antennas, RDARS showcases the potential in many aspects, such as improving system capacity, overcoming multiplicative fading, and enhancing communication reliability. Specifically, RDARS is a novel type of programmable metasurface composed of reconfigurable elements. The working mode of each element can be dynamically adjusted between the connection mode and reflection mode via the RDARS controller. The element working in the connection mode is connected with the BS with a cable or fiber, which can transmit or receive signal as a distributed antenna. On the other hand, the element working in the reflection mode functions as the passive element as in the conventional RIS, which can reflect the incident signal to the desired direction. As a result, the dynamic position alteration of connected element is enabled by working mode adjustment.

Due to the promising advantages of RDARS, many research efforts have been devoted to this direction and demonstrated its superiority over RIS-aided systems and DAS via the theoretical performance analysis and prototype experiment [15, 16, 14, 10, 17, 18]. To reduce the overhead of channel estimation and computational complexity of the system, the authors in [15] proposed a two-timescale transceiver design. In [16], a reconfigurable codebook and low overhead beam training tailored for RDARS were proposed, so as to cater to practical communication scenarios. To improve transmission reliability, the mean square error (MSE) was minimized in the uplink MIMO communication system [14]. Furthermore, the authors in [10] developed a RDARS-aided uplink system prototype, and experiments were conducted to verify the improvement of ergodic achievable rate. Besides the wireless communication, an integrated sensing and communication (ISAC) system prototype was developed by leveraging the geometric relationship between BS and RDARS, and a high sensing performance was achieved without incurring communication performance loss [17]. In [18], the radar output signal-to-noise ratio (SNR) was maximized to improve sensing performance , thanks to the flexible working mode selection brought by RDARS.

It is worth mentioning that the efficient beamforming design is an important research hotpot in recent years, so as to meet the increasing demand of channel capacity [19, 20, 21, 22, 23]. In [19], the secrecy rate was maximized by jointly optimizing hybrid beamforming at BS and passive beamforming at RIS, where weighted minimum mean-squared error (WMMSE) transformation and penalty-dual-decomposition (PDD) were utilized to tackle this problem. In [20], the problem was firstly transformed into its equivalent form by using the fractional programming (FP) method, followed by PDD to maximize the weighted sum rate (WSR) in a double RIS-aided system. By taking both transceiver and RIS hardware impairments into consideration, the WSR maximization problem has been efficiently solved by invoking the block coordinate descent (BCD) framework for an active RIS-assisted system [21]. To investigate the cell-free massive MIMO system performance under both ring and star topologies, a penalty-majorization-minimization (MM)-based distributed beamforming design algorithm was proposed to maximize the achievable sum rate at each BS by decomposing the high-order terms[22]. Considering the user fairness, the minimum user achievable rate was maximized, and the formulated problem was efficiently solved by a randomized alternating direction method of multiplier (ADMM) algorithm [23]. Nonetheless, as the numbers of BS antennas and RIS elements increase, existing beamforming optimization algorithms are faced with high computational complexity, due to the enlarged channel matrix dimension. Besides, these algorithms involve multiple iterations, without guaranteed global optimality, and may lead to a local optimum solution. Moreover, random or fixed initialization points were mainly applied in existing algorithms, which may result in a low-speed convergence behavior.

To tackle these issues, deep learning (DL)-based beamforming design has gained the significant research interest [24, 25, 26, 27]. In [24], the federated learning method was utilized to train the mapping relationship between the channel and beam index, so as to design the beamforming vector. Similarly, parallel convolution neural networks (CNNs) were introduced to reduce the training overhead of beamforming design in [25]. However, the supervised learning relying on the pre-labeled dataset may incur performance degradation. Considering this issue, the sum rate maximization problem was efficiently solved by utilizing deep reinforcement learning (DRL), which embraces the advantages of DL in neural network training and reinforcement learning (RL) algorithms [26]. Additionally, an unsupervised learning network was proposed to realize the mapping from channel to beamforming with high computational efficiency in cell-free MIMO systems [27]. Note that the aforementioned works are mainly the data-driven methods, which usually have high network training overhead and computational memory requirement. Besides, data-driven methods are difficult to adapt to the complex and dynamic environments, and thus high performance can not be guaranteed. Moreover, data-driven neural networks with multiple hidden layers and the requirement for extensive hyperparameter tuning suffer from limited interpretability.

To overcome these drawbacks, the paradigm of model-driven neural network was proposed by integrating neural networks into the iteration steps in optimization algorithms, or unfolding all iterations into a lay-wise structure [28, 29, 30, 31, 32, 33]. Specifically, a fixed number of iterations is set to guarantee a fast convergence behavior, unlike the conventional optimization algorithms [29]. Moreover, the expert knowledge can be utilized to simplify the network training. For example, a simple solution structure of the active beamforming was introduced in [30], which helps to generate a better initialization of WMMSE algorithm. In addition, an approximating matrix inversion was proposed for the MM-based algorithm to reduce the computation complexity in [31]. To reduce the number of trainable parameters, several important parameters related to the specific model are set to be trainable, such as the scaling matrix of matrix inversion operation, the weights of forward-backward splitting, and the step sizes of retraction function in the manifold optimization algorithm [31, 32, 33].

Meanwhile, we notice that for a RDARS-aided multi-user communication system, several fundamental issues on WSR maximization remain unsolved. First, how to obtain the seamless blend of the optimal beamforming design and RDARS element configurations is still an intractable problem. To be specific, one of the challenges in the considered system is the joint design of BS beamforming and RDARS beamforming. Besides, how to achieve effective operation mode configuration remains unknown, due to that the interactively coupled variables. Moreover, the binary mode switching constraint exacerbates the challenges of system design. Second, how to obtain the efficient beamforming initialization points? Since inappropriate initialization may result in a low convergence speed and local optimal solution, it is important to develop a proper initialization method for the joint beamforming designs. Third, how to reduce the computational complexity by embedding the iterations of the proposed optimization algorithm within the neural networks? This problem arises from the high computational complexity suffered by the joint beamforming design, as well as the new mode switching requirement in RDARS systems. Specifically, some key hyper-parameters are critical to intertwine the deep unfolding layer-wise structure with the network to be trained.

In this paper, we investigate the WSR maximization problem in the RDARS-aided mmWave downlink MIMO system, by jointly optimizing the BS beamforming, RDARS beamforming, and mode switching, subject to the total transmit power and binary mode switching constraints. First, we propose a penalty term-based WMMSE (PWM) algorithm. To accelerate the convergence speed and improve the system performance, a model-driven neural network is next proposed by leveraging expert knowledge related to the active beamforming design and adaptively learning the penalty terms in the mode switching matrix optimization according to the current iteration step in progress. The main contributions of this paper are summarized as follows:

  • ?

    Firstly, we propose an efficient algorithm based on the convex optimization methods for WSR maximization. Specifically, the original optimization problem is equivalently transformed into a more tractable one based on the weighted minimum mean square error (WMMSE)-based algorithm, and the optimal active beamforming is derived in closed-form. Then, the power iteration algorithm is used to meet the unit-modulus constraint in the equivalent problem. Moreover, the binary mode selection constraints are satisfied by introducing the penalty term and majorization minimization (MM) method. With the alternative optimization (AO) algorithm, each block of variables is iteratively optimized in an alternate way until convergence is achieved.

  • ?

    Secondly, a model-driven DL is integrated into the proposed optimization algorithm to further accelerate the convergence speed and improve the system performance. Specifically, a simple solution structure of active beamforming is introduced for its initialization. The equivalent transmit power and auxiliary terms related to the simple solution structure are set as trainable parameters. Then, an adaptive penalty term is trained by taking the computational complexity of mode switching optimization into consideration. These trainable parameters are trained by the model-driven PWM-based beamforming network (PWM-BFNet).

  • ?

    Lastly, numerical results demonstrate the superiority of the RDARS architecture, in terms of reducing the number of transmit antennas and transmit power. It is shown that the proposed algorithms are capable of significantly accelerating the convergence speed and increasing the system performance, especially for the case with high transmit power and a large number of RDARS transmit elements (TEs).

Notations: For a complex vector ??\bf xbold_x, xix_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT represents the iiitalic_i-th entry. The elements of vector ???[a,b]{\bf x}[a,b]bold_x [ italic_a , italic_b ] are comprised of the elements of vector ??{\bf x}bold_x, beginning with the aaitalic_a-th element and ending at the bbitalic_b-th element. For a complex matrix ??\bf Xbold_X, ??[i,j]\mathbf{X}_{[i,j]}bold_X start_POSTSUBSCRIPT [ italic_i , italic_j ] end_POSTSUBSCRIPT represents the element in the iiitalic_i-th row and jjitalic_j-th column. Tr?(??)\operatorname{Tr}(\bf X)roman_Tr ( bold_X ), ??||\bf X||| | bold_X | | and ??F||\mathbf{X}||_{F}| | bold_X | | start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT denote its trace, the 2-norm and F-norm of ??\bf Xbold_X, respectively. ??T{\bf x}^{T}bold_x start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, ??H{\bf x}^{H}bold_x start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT, ??T{\bf X}^{T}bold_X start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT and ??H{\bf X}^{H}bold_X start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT stand for the transpose and conjugate transpose of vector ??\bf xbold_x and matrix ??\bf Xbold_X, respectively.

II System Model and Problem Formulation

II-A System Model

Fig. 1 shows a RDARS-aided MIMO system with a BS and KKitalic_K single-antenna users. The BS is equipped with NtN_{\rm{t}}italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT antennas, and the RDARS has NNitalic_N elements. Each element of RDARS can work in two modes: connection mode and reflection mode. The working mode of each element is determined by a diagonal mode switching matrix ???N×N{\bf{A}}\in\mathbb{R}^{N\times N}bold_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT with ??[i,i]{0,1}\mathbf{A}_{[i,i]}\in\{0,1\}bold_A start_POSTSUBSCRIPT [ italic_i , italic_i ] end_POSTSUBSCRIPT ∈ { 0 , 1 }. To be specific, when ??[i,i]=1\mathbf{A}_{[i,i]}=1bold_A start_POSTSUBSCRIPT [ italic_i , italic_i ] end_POSTSUBSCRIPT = 1, the iiitalic_i-th element works in the connection mode and can be regarded as a distributed antenna with the capabilities of transmitting and receiving signal. Otherwise, ??[i,i]=0\mathbf{A}_{[i,i]}=0bold_A start_POSTSUBSCRIPT [ italic_i , italic_i ] end_POSTSUBSCRIPT = 0 represents that the iiitalic_i-th element operates in the reflection mode which functions as a passive element to reflect the incident signals. Let aaitalic_a denote the number of elements working in the connection mode, while the remaining N?aN-aitalic_N - italic_a elements operate in the reflection mode. In other words, aaitalic_a RDARS elements are programmed as the distributed antennas like transmit elements, so as to serve multiple users cooperatively with BS antennas. Let ??t={1,2,,Nt}\mathcal{N_{\rm{t}}}=\{1,2,\dots,N_{\rm{t}}\}caligraphic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT = { 1 , 2 , … , italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT }, ??={1,2,,N}\mathcal{N}=\{1,2,\dots,N\}caligraphic_N = { 1 , 2 , … , italic_N }, ??={1,2,,a}\mathcal{A}=\{1,2,\dots,a\}caligraphic_A = { 1 , 2 , … , italic_a } and ??={1,2,,K}\mathcal{K}=\{1,2,\dots,K\}caligraphic_K = { 1 , 2 , … , italic_K } denote the sets of indices of BS antennas, RDARS elements, connected elements, and users, respectively.

By denoting the transmit symbol vector as ???K×1\mathbf{s}\in\mathbb{C}^{K\times 1}bold_s ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × 1 end_POSTSUPERSCRIPT with ???{????H}=??K{\mathbb{E}\{\mathbf{s}\mathbf{s}^{H}\}}={\bf{I}}_{K}blackboard_E { bold_ss start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } = bold_I start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT, the combined transmit signal from the BS and RDARS elements in the connection mode is

??\displaystyle\mathbf{x}bold_x =[??b??r]???=????,\displaystyle=\begin{bmatrix}{\mathbf{W}}_{\rm{b}}\\ {\mathbf{W}}_{\rm{r}}\end{bmatrix}\mathbf{s}=\mathbf{F}\mathbf{s},= [ start_ARG start_ROW start_CELL bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] bold_s = bold_Fs , (1)

where ??b=[??b,1,??b,2,?,??b,K]?Nt×K{\mathbf{W}}_{\rm{b}}=\left[\mathbf{w}_{{\rm{b}},1},\mathbf{w}_{{\rm{b}},2},\cdots,\mathbf{w}_{{\rm{b}},K}\right]\in\mathbb{C}^{N_{\rm{t}}\times K}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT = [ bold_w start_POSTSUBSCRIPT roman_b , 1 end_POSTSUBSCRIPT , bold_w start_POSTSUBSCRIPT roman_b , 2 end_POSTSUBSCRIPT , ? , bold_w start_POSTSUBSCRIPT roman_b , italic_K end_POSTSUBSCRIPT ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT × italic_K end_POSTSUPERSCRIPT and ??r=[??r,1,??r,2,?,??r,K]?a×K\mathbf{W}_{\rm{r}}=\left[\mathbf{w}_{{\rm{r}},1},\mathbf{w}_{{\rm{r}},2},\cdots,\mathbf{w}_{{\rm{r}},K}\right]\in\mathbb{C}^{a\times K}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT = [ bold_w start_POSTSUBSCRIPT roman_r , 1 end_POSTSUBSCRIPT , bold_w start_POSTSUBSCRIPT roman_r , 2 end_POSTSUBSCRIPT , ? , bold_w start_POSTSUBSCRIPT roman_r , italic_K end_POSTSUBSCRIPT ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_a × italic_K end_POSTSUPERSCRIPT denote the BS and RDARS beamforming matrices, respectively. ??=[??1,??2,?,??K]?(Nt+a)×K\mathbf{F}=\left[\mathbf{f}_{1},\mathbf{f}_{2},\cdots,\mathbf{f}_{K}\right]\in\mathbb{C}^{(N_{\rm{t}}+a)\times K}bold_F = [ bold_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ? , bold_f start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ] ∈ blackboard_C start_POSTSUPERSCRIPT ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) × italic_K end_POSTSUPERSCRIPT represents the equivalent transmit beamforming matrix with ??k??(Nt+a)×1{\bf{f}}_{k}\in\mathcal{C}^{(N_{\rm{t}}+a)\times 1}bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) × 1 end_POSTSUPERSCRIPT.

Refer to caption
Figure 1: A RDARS-assisted downlink MIMO architecture.

The direct links between the BS and user equipments (UEs) are assumed to be severely blocked by obstacles, due to the high directivity of mmWave signals. The channels from the BS to RDARS and that from RDARS to the kkitalic_k-th UE are denoted by ???N×Nt{{\bf{G}}}\in\mathbb{C}^{N\times N_{\rm{t}}}bold_G ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and ??r,k?N×1{\bf{h}}_{{\rm{r}},k}\in\mathbb{C}^{N\times 1}bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT, respectively. By considering the general Rician channel model, the channels ??{{\bf{G}}}bold_G and ??r,k{\bf{h}}_{{\rm{r}},k}bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT are given by

??=\displaystyle{\bf{G}}=bold_G = κb?(ξξ+1???~?(N,χ,ψ)???H?(Nt,θ)+1ξ+1???~),\displaystyle\kappa_{{\rm{b}}}\left(\sqrt{\frac{\xi}{\xi+1}}\tilde{{\bf{b}}}(N,\chi,\psi){\bf{b}}^{H}({{N_{\rm{t}}}},\theta)+\sqrt{\frac{1}{\xi+1}}\widetilde{{\bf{G}}}\right),italic_κ start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT ( square-root start_ARG divide start_ARG italic_ξ end_ARG start_ARG italic_ξ + 1 end_ARG end_ARG over~ start_ARG bold_b end_ARG ( italic_N , italic_χ , italic_ψ ) bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT , italic_θ ) + square-root start_ARG divide start_ARG 1 end_ARG start_ARG italic_ξ + 1 end_ARG end_ARG over~ start_ARG bold_G end_ARG ) , (2)
??r,k=\displaystyle{\bf{h}}_{{\rm{r}},k}=bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT = κr,k?(ξξ+1???~?(N,?k,υk)+1ξ+1???~r,k),\displaystyle\kappa_{{\rm{r}},k}\left(\sqrt{\frac{\xi}{\xi+1}}\tilde{\bf{b}}(N,\phi_{k},\upsilon_{k})+\sqrt{\frac{1}{\xi+1}}\widetilde{{\bf{h}}}_{r,k}\right),italic_κ start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ( square-root start_ARG divide start_ARG italic_ξ end_ARG start_ARG italic_ξ + 1 end_ARG end_ARG over~ start_ARG bold_b end_ARG ( italic_N , italic_? start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_υ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) + square-root start_ARG divide start_ARG 1 end_ARG start_ARG italic_ξ + 1 end_ARG end_ARG over~ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT ) , (3)

where ξ\xiitalic_ξ denotes the Rician factor, and κb\kappa_{{\rm{b}}}italic_κ start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT and κr,k\kappa_{\rm{r},k}italic_κ start_POSTSUBSCRIPT roman_r , roman_k end_POSTSUBSCRIPT denote the complex-valued channel coefficients of BS-RDARS and RDARS-UE kkitalic_k channels, respectively. Let χ\chiitalic_χ and ψ\psiitalic_ψ represent the vertical and horizontal angle-of-arrivals (AoAs) from the BS to RDARS, respectively, θ\thetaitalic_θ represent the angle-of-departure (AoD) from the BS to RDARS. The vertical and horizontal AoAs from the RDARS to the kkitalic_k-th UE are denoted by ?k\phi_{k}italic_? start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and υk\upsilon_{k}italic_υ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, respectively. The elements within the non-line-of-sight (NLoS) components and ??~r,k\widetilde{\bf{h}}_{r,k}over~ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT are characterized by a standard complex Gaussian distribution. Moreover, the steering vector is

???(N,θ)=1N?[ej?π?θ?0,ej?π?θ?1,?,ej?π?θ?(N?1)]T,{\bf{b}}(N,\theta)=\frac{1}{\sqrt{N}}[e^{j\pi\theta\cdot 0},e^{j\pi\theta\cdot 1},\cdots,e^{j{\pi}\theta\cdot(N-1)}]^{T},bold_b ( italic_N , italic_θ ) = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_N end_ARG end_ARG [ italic_e start_POSTSUPERSCRIPT italic_j italic_π italic_θ ? 0 end_POSTSUPERSCRIPT , italic_e start_POSTSUPERSCRIPT italic_j italic_π italic_θ ? 1 end_POSTSUPERSCRIPT , ? , italic_e start_POSTSUPERSCRIPT italic_j italic_π italic_θ ? ( italic_N - 1 ) end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (4)

where the inter-antenna spacing is half wavelength. Then, we have ??~?(N,χ,ψ)=???(Nz,χ)????(Ny,ψ)\tilde{\bf{b}}(N,\chi,\psi)={\bf{b}}(N_{\rm{z}},\chi)\otimes{{\bf{b}}}(N_{\rm{y}},\psi)over~ start_ARG bold_b end_ARG ( italic_N , italic_χ , italic_ψ ) = bold_b ( italic_N start_POSTSUBSCRIPT roman_z end_POSTSUBSCRIPT , italic_χ ) ? bold_b ( italic_N start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT , italic_ψ ), where NzN_{\rm{z}}italic_N start_POSTSUBSCRIPT roman_z end_POSTSUBSCRIPT and NyN_{\rm{y}}italic_N start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT denote the numbers of elements along with the vertical and horizontal directions, respectively.

The received signal for the kkitalic_k-th UE is given by

yk=??r,kH?(??N???)????????b???+??r,kH???~???r???+nk,{y_{k}}={\bf{h}}_{{\rm{r}},k}^{H}{\bf{(}}{{\bf{I}}_{N}}{\bf{-A)\Phi}}{{\bf{G}}}{\bf{W}}_{{\rm{b}}}{\bf{s}}+{\bf{h}}_{{\rm{r}},k}^{H}{\bf{\tilde{A}}}{\bf{W}}_{{\rm{r}}}{\bf{s}}+{n_{k}},italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_Φ bold_GW start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT bold_s + bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT bold_s + italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (5)

where ???N×N{\bf{\Phi}}\in\mathbb{C}^{N\times N}bold_Φ ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT denotes the RDARS’s phase shift matrix, mode switching matrix ??~?N×a{\bf{\tilde{A}}}\in\mathbb{C}^{N\times a}over~ start_ARG bold_A end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_a end_POSTSUPERSCRIPT consists of non-zero columns of the sparse mode switching matrix ??\bf{A}bold_A, and nkn_{k}italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes the additive white Gaussian noise (AWGN) with power σk2\sigma^{2}_{k}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Let ??=[φ1,,φN]H=diag?(??)\bm{\varphi}={[{\varphi_{1}},...,{\varphi_{N}}]^{H}}=\operatorname{diag}(\bf{\Phi})bold_italic_φ = [ italic_φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_φ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT = roman_diag ( bold_Φ ) and ??r,k=diag?(??r,kH)???{{\bf{H}}_{\mathrm{r},k}}=\operatorname{diag}({\bf{h}}_{{\rm{r}},k}^{H}){\bf{G}}bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT = roman_diag ( bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) bold_G, the received signal can be equivalently expressed as

yk=??k???k?sk+mkK??k???m?sm+nk,y_{k}={{\bf{h}}_{k}}{{\bf{f}}_{k}}{s_{k}}+\sum\nolimits_{m\neq k}^{K}{{{\bf{h}}_{k}}{{\bf{f}}_{m}}{s_{m}}}+{n_{k}},italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_m ≠ italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (6)

where ??k{{\bf{h}}_{k}}bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes the effective channel between the BS and the kkitalic_k-th UE, given by ??k=[??H?(??N???)???r,k,??r,kH???~]{{\bf{h}}_{k}}=[{{\bm{\varphi}}^{H}}{\bf{(}}{{\bf{I}}_{N}}{\bf{-A)}}{{\bf{H}}_{{\rm{r}},k}},{\bf{h}}_{{\rm{r}},k}^{H}{\bf{\tilde{A}}}]bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = [ bold_italic_φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT , bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG ]. The signal-to-noise-plus-interference ratio (SINR) for the kkitalic_k-th UE is

γk=|??k???k|2mkK|??k???m|2+σk2.{\gamma_{k}}=\frac{{{{\left|{{{\bf{h}}_{k}}{{\bf{f}}_{k}}}\right|}^{2}}}}{{{{\sum\nolimits_{m\neq k}^{K}|{{{\bf{h}}_{k}}{{\bf{f}}_{m}}}}|^{2}}+{\sigma_{k}}^{2}}}.italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG | bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_m ≠ italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT | bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (7)

The achievable rate of UE k is Rk=log2?(1+γk){R_{k}}={\log_{2}}(1+{\gamma_{k}})italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ).

II-B Problem Formulation

Our objective is to maximize the WSR, by jointly optimizing the BS beamforming, RDARS beamforming, and mode switching matrix. The optimization problem can be formulated as

max??,??,??,??~\displaystyle\mathop{\max}\limits_{\begin{subarray}{c}{{\bf{F}}},{\bf{\Phi}},{\bf{A}},{\tilde{\bf{A}}}\end{subarray}}roman_max start_POSTSUBSCRIPT start_ARG start_ROW start_CELL bold_F , bold_Φ , bold_A , over~ start_ARG bold_A end_ARG end_CELL end_ROW end_ARG end_POSTSUBSCRIPT k=1Kαk?Rk\displaystyle\;\;\sum\nolimits_{k=1}^{K}{{\alpha_{k}}{R_{k}}}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (8a)
s.t. Tr?(????H)Ptot,\displaystyle~\operatorname{Tr}({{\bf{F}}}{\bf{F}}^{H})\leq{P_{\rm{tot}}},roman_Tr ( bold_FF start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) ≤ italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT , (8b)
|??[i,i]|=1,?i??,\displaystyle~|\mathbf{\Phi}_{[i,i]}|=1,\forall i\in{\mathcal{N}},| bold_Φ start_POSTSUBSCRIPT [ italic_i , italic_i ] end_POSTSUBSCRIPT | = 1 , ? italic_i ∈ caligraphic_N , (8c)
i=1N??[i,i]=a,??[i,i]{0,1},?i??,\displaystyle~\sum\nolimits_{i=1}^{N}\!\mathbf{A}_{[i,i]}\!\!=a,\mathbf{A}_{[i,i]}\!\in\!\{0,1\},\forall i\in{\mathcal{N}},∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_A start_POSTSUBSCRIPT [ italic_i , italic_i ] end_POSTSUBSCRIPT = italic_a , bold_A start_POSTSUBSCRIPT [ italic_i , italic_i ] end_POSTSUBSCRIPT ∈ { 0 , 1 } , ? italic_i ∈ caligraphic_N , (8d)
i=1N??~[i,l]=1,??~[i,l]{0,1},?l??,i??,\displaystyle~\sum\nolimits_{i=1}^{N}\!\tilde{\mathbf{A}}_{[i,l]}\!\!=1,\tilde{\mathbf{A}}_{[i,l]}\!\!\in\!\!\{0,1\},\forall l\in{\mathcal{A}},i\in{\mathcal{N}},∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT [ italic_i , italic_l ] end_POSTSUBSCRIPT = 1 , over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT [ italic_i , italic_l ] end_POSTSUBSCRIPT ∈ { 0 , 1 } , ? italic_l ∈ caligraphic_A , italic_i ∈ caligraphic_N , (8e)
??=??~???~H,\displaystyle~{\bf{A}}=\tilde{{\bf{A}}}\tilde{{\bf{A}}}^{H},bold_A = over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , (8f)

where αk\alpha_{k}italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes the weighted factor of the kkitalic_k-th UE, and PtotP_{\rm{tot}}italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT denotes the maximum total transit power. It is observed from (8) that it is a mixed-integer nonlinear programming (MINLP) problem, which is a non-convex problem due to highly coupled variables and unit-modulus constraints. Moreover, the binary mode switching constraints exacerbate the challenge. To solve this problem, we propose an efficient penalty term-based WMMSE (PWM) algorithm in the following section.

III PWM Algorithm

In this section, we first reformulate the original WSR problem via the WMMSE method [34]. Then, an AO-based algorithm is proposed to solve the reformulated problem based on MM and penalty methods.

By temporarily removing constraint (8b) and introducing the auxiliary vectors ??=[λ1,?,λK]T\bm{\lambda}=[\lambda_{1},\cdots,\lambda_{K}]^{T}bold_italic_λ = [ italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ? , italic_λ start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT and ??=[u1,?,uK]T{\bf{u}}=[u_{1},\cdots,u_{K}]^{T}bold_u = [ italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ? , italic_u start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, problem (8) can be transformed into a WMMSE problem as

min??,??,??,??~,??,??k=1Kαk?(λk?ek?log?λk)?s.t.?(?8c?),(?8d?),(?8e?),(?8f?),\displaystyle\mathop{\min}\limits_{\begin{subarray}{c}{{\bf{F}}},{\bf{\Phi}},{\bf{A}},\\ {\tilde{\bf{A}}},\bf{u},\bm{\lambda}\end{subarray}}\;\sum\limits_{k=1}^{K}{{\alpha_{k}}({\lambda_{k}}{e_{k}}-\log{\lambda_{k}})}\;\;\textrm{s.t.}\;\eqref{con: Phi},\eqref{con: A},\eqref{con: A tilde},\eqref{con: A +A tilde},roman_min start_POSTSUBSCRIPT start_ARG start_ROW start_CELL bold_F , bold_Φ , bold_A , end_CELL end_ROW start_ROW start_CELL over~ start_ARG bold_A end_ARG , bold_u , bold_italic_λ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_log italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) s.t. italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , (9)

where eke_{k}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes the MSE for detecting UE kkitalic_k’s signal, which is given by

ek\displaystyle{e_{k}}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =1?ukH???k???k???kH???kH?uk\displaystyle=1-u_{k}^{H}{{\bf{h}}_{k}}{{\bf{f}}_{k}}-{\bf{f}}_{k}^{H}{\bf{h}}_{k}^{H}{u_{k}}= 1 - italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
+ukH???k?m=1K??m???mH???kH?uk+ukH?uk?σk2Ptot?m=1K??mH???m.\displaystyle+u_{k}^{H}{{\bf{h}}_{k}}\sum\limits_{m=1}^{K}{{{\bf{f}}_{m}}{\bf{f}}_{m}^{H}}{\bf{h}}_{k}^{H}{u_{k}}+u_{k}^{H}{u_{k}}\frac{{\sigma_{k}^{2}}}{P_{\rm{tot}}}\sum\limits_{m=1}^{K}{{\bf{f}}_{m}^{H}{{\bf{f}}_{m}}}.+ italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT . (10)

The following Lemma is given to verify the equivalence between problem (8) and (9).

Lemma 1:

Problem (9) is equivalent to the weight sum rate maximization problem (8), where the global optimal solutions for the two problems are identical.

Proof.

Please refer to Appendix A. ?

Note that problem (9) is still a non-convex problem due to the coupled variables. To this end, the AO-based method is applied, where all the variables are divided into four blocks, i.e., 1) active beamforming {??k,uk,λk}\{{\bf{f}}_{k},u_{k},\lambda_{k}\}{ bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT }, 2) passive beamforming ??\bf{\Phi}bold_Φ, 3) sparse mode switching matrix ??{\bf{A}}bold_A and 4) mode switching matrix ??~\tilde{\bf{A}}over~ start_ARG bold_A end_ARG. Then each block of variables is optimized in an iterative manner until convergence is achieved.

III-A Active Beamforming Optimization

With fixed ??\bf{\Phi}bold_Φ, ??{\bf{A}}bold_A and ??~\tilde{{\bf{A}}}over~ start_ARG bold_A end_ARG, problem (9) is reduced to

min??,??,??\displaystyle\mathop{\min}\limits_{{{\bf{F}}},\bf{u},\bm{\lambda}}\quadroman_min start_POSTSUBSCRIPT bold_F , bold_u , bold_italic_λ end_POSTSUBSCRIPT k=1Kαk?(λk?ek?log?λk).\displaystyle\sum\nolimits_{k=1}^{K}{{\alpha_{k}}({\lambda_{k}}{e_{k}}-\log{\lambda_{k}})}.∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_log italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . (11)

The solutions to problem (11) can be derived in closed-form as

λkopt=\displaystyle\lambda_{k}^{\mathrm{opt}}=italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = ek?1,\displaystyle{e_{k}^{-1}},italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , (12)
ukopt=\displaystyle u_{k}^{\mathrm{opt}}=italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = (??k?m=1K??m???mH???kH+σk2Ptot?m=1K??mH???m)?1???k???k,\displaystyle{({{\bf{h}}_{k}}\sum\limits_{m=1}^{K}{{{\bf{f}}_{m}}{\bf{f}}_{m}^{H}}{\bf{h}}_{k}^{H}+\frac{{\sigma_{k}^{2}}}{P_{\rm{tot}}}\sum\limits_{m=1}^{K}{{\bf{f}}_{m}^{H}}{{\bf{f}}_{m}})^{-1}}{{\bf{h}}_{k}}{{\bf{f}}_{k}},( bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (13)
??kopt=\displaystyle{\bf{f}}_{k}^{\mathrm{opt}}=bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = αk?uk?λk?(m=1Kαm?umH?um?λm?(σm2Ptot???N+a+??kH???k))?1???kH.\displaystyle{\alpha_{k}}{u_{k}}{\lambda_{k}}(\!\sum\limits_{m=1}^{K}{{\alpha_{m}}u_{m}^{H}{u_{m}}{\lambda_{m}}(\frac{{\sigma_{m}^{2}}}{P_{\rm{tot}}}{{\bf{I}}_{N+a}}\!\!+\!\!{\bf{h}}_{k}^{H}{{\bf{h}}_{k}}){)^{-1}}{\bf{h}}_{k}^{H}}.italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( divide start_ARG italic_σ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG bold_I start_POSTSUBSCRIPT italic_N + italic_a end_POSTSUBSCRIPT + bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT . (14)

By substituting (13) into (III), we obtain the minimum MSE (MMSE) as ekmmse=1???kH???kH?Jk?1???k???ke_{k}^{\mathrm{mmse}}=1-{\bf{f}}_{k}^{H}{\bf{h}}_{k}^{H}{J_{k}^{-1}}{{\bf{h}}_{k}}{{\bf{f}}_{k}}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_mmse end_POSTSUPERSCRIPT = 1 - bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , with Jk=??k?m=1K??m???mH???kH+σk2Ptot?m=1K??mH???m{J_{k}}={{\bf{h}}_{k}}\sum\nolimits_{m=1}^{K}{{{\bf{f}}_{m}}{\bf{f}}_{m}^{H}}{\bf{h}}_{k}^{H}+\frac{\sigma_{k}^{2}}{P_{\rm{tot}}}\sum\nolimits_{m=1}^{K}{{\bf{f}}_{m}^{H}}{{\bf{f}}_{m}}italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.

III-B Passive Beamforming Optimization

Given fixed uku_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, λk\lambda_{k}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, ??k{\bf{f}}_{k}bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, ??{\bf{A}}bold_A and ??~\tilde{{\bf{A}}}over~ start_ARG bold_A end_ARG, problem (9) is reduced to

min????H??????+???????+???????s.t.?(?8c?),\displaystyle\mathop{\min}\limits_{\bm{\varphi}}\quad{{\bm{\varphi}}^{H}}\bf{C}{\bm{\varphi}}+{{\bm{\beta}}^{H}}{\bm{\varphi}}+{{\bm{\varphi}}^{H}}{\bm{\beta}}\;\;\;\;\;\;\;\textrm{s.t.}\;\;\eqref{con: Phi},roman_min start_POSTSUBSCRIPT bold_italic_φ end_POSTSUBSCRIPT bold_italic_φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_C bold_italic_φ + bold_italic_β start_POSTSUPERSCRIPT bold_H end_POSTSUPERSCRIPT bold_italic_φ + bold_italic_φ start_POSTSUPERSCRIPT bold_H end_POSTSUPERSCRIPT bold_italic_β s.t. bold_( bold_) , (15)

where ??=k=1Kαk?λk?ukH?uk?(??N???)???r,k?m=1K??b,m???b,mH???r,kH?(??N???){\bf{C}}=\sum\nolimits_{k=1}^{K}{\alpha_{k}}{\lambda_{k}}u_{k}^{H}{u_{k}}({\bf{I}}_{N}-{\bf{A}}){{\bf{H}}_{{\rm{r}},k}}\sum\nolimits_{m=1}^{K}{{\bf{w}}_{\mathrm{b},m}}{\bf{w}}_{\mathrm{b},m}^{H}{{\bf{H}}^{H}_{{\rm{r}},k}}({\bf{I}}_{N}-{\bf{A}})bold_C = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) and ??=k=1Kαk?λk?ukH?uk?(??N???)???r,k?m=1K??b,m???r,mH???~H???r,k?αk?λk?ukH?(??N???)???r,k???b,k{\bm{\beta}}=\sum\nolimits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}u_{k}^{H}{u_{k}}({\bf{I}}_{N}-{\bf{A}}){{\bf{H}}_{{\rm{r}},k}}\sum\nolimits_{m=1}^{K}{{{\bf{w}}_{\mathrm{b},m}}{{\bf{w}}^{H}_{\mathrm{r},m}}{{{\bf{\tilde{A}}}}^{H}}}}{\bf{h}}_{{\rm{r}},k}-{\alpha_{k}}{\lambda_{k}}u_{k}^{H}({\bf{I}}_{N}-{\bf{A}}){{\bf{H}}_{{\rm{r}},k}}{{\bf{w}}_{\mathrm{b},k}}bold_italic_β = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_m end_POSTSUBSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT - italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_k end_POSTSUBSCRIPT. Let ??=[??,q]T\mathbf{p}=[\bm{\varphi},q]^{T}bold_p = [ bold_italic_φ , italic_q ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT where pnp_{n}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is the nnitalic_n-th element of ??{\bf{p}}bold_p, and qqitalic_q is an auxiliary variable. Therefore, problem (15) is equivalent to

max????H??????s.t.?|pn|=1,n=1,?,N+1,\displaystyle\mathop{\max}\limits_{\bf{p}}\;\;{{\bf{p}}^{H}}{\bf{Dp}}\;\;\textrm{s.t.}|p_{n}|=1,n=1,\cdots,N+1,roman_max start_POSTSUBSCRIPT bold_p end_POSTSUBSCRIPT bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Dp s.t. | italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | = 1 , italic_n = 1 , ? , italic_N + 1 , (16)

where ??=[???,???;???H,0]{\bf{D}}=\left[{-{\bf{C}}},{-{\bm{\beta}}};{-{{\bm{\beta}}^{H}}},0\right]bold_D = [ - bold_C , - bold_italic_β ; - bold_italic_β start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , 0 ].

It is observed that the optimization problem (16) is a unimodular quadratic program and can be solved by the power iteration algorithm. Specifically, the value of ppitalic_p in the qqitalic_q-th iteration is [35]

??(q+1)=ej?arg?((??+ε???N+1)???(q)),{\bf{p}}^{(q+1)}=e^{j\mathrm{arg}(({\bf{D}}+\varepsilon{\bf{I}}_{N+1}){\bf{p}}^{(q)})},bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT = italic_e start_POSTSUPERSCRIPT italic_j roman_arg ( ( bold_D + italic_ε bold_I start_POSTSUBSCRIPT italic_N + 1 end_POSTSUBSCRIPT ) bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT , (17)

where ε???N+1\varepsilon{\bf{I}}_{N+1}italic_ε bold_I start_POSTSUBSCRIPT italic_N + 1 end_POSTSUBSCRIPT is introduced to ensure that ??+ε???N+1{\bf{D}}+\varepsilon{\bf{I}}_{N+1}bold_D + italic_ε bold_I start_POSTSUBSCRIPT italic_N + 1 end_POSTSUBSCRIPT is positive definite. After the iteration converges, the passive beamforming vector can be derived as ??=ej?arg?(??[1:N]pN+1)\bm{\varphi}=e^{j\arg(\frac{\mathbf{p}{[1:N]}}{p_{N+1}})}bold_italic_φ = italic_e start_POSTSUPERSCRIPT italic_j roman_arg ( divide start_ARG bold_p [ 1 : italic_N ] end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_N + 1 end_POSTSUBSCRIPT end_ARG ) end_POSTSUPERSCRIPT, thus solving problem (15). This guarantees the convergence of the iterative algorithm, as shown in Lemma 2.

Lemma 2:

The problem is equivalent to problem (16) as follows:

max??\displaystyle\mathop{\max}\limits_{\bf{p}}\quadroman_max start_POSTSUBSCRIPT bold_p end_POSTSUBSCRIPT ??H??????\displaystyle{{\bf{p}}^{H}}{\bf{D}}^{{}^{\prime}}{\bf{p}}bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p (18a)
s.t. |pn|=1,n=1,?,N+1.\displaystyle|p_{n}|=1,n=1,\cdots,N+1.| italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | = 1 , italic_n = 1 , ? , italic_N + 1 . (18b)

Furthermore, the power iteration algorithm is guaranteed to converge to at least a local optimum of problem (16) when ??{\bf{D}}^{{}^{\prime}}bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is positive-defined.

Proof.

Please refer to Appendix B. ?

III-C Optimization of Sparse Mode Switching Matrix ??{\bf{A}}bold_A

To create a unified framework, we employ the penalty technique to consolidate constraint (8f) into penalty terms. With fixed ??\bf{F}bold_F,??\bf{u}bold_u, ??\bm{\lambda}bold_italic_λ, ??\bf{\Phi}bold_Φ, and ??~\tilde{\bf{A}}over~ start_ARG bold_A end_ARG, problem (8) can be reformulated as

min??k=1Kαk?(λk?ek?log?λk)+12?ρ??????~???~HF2?s.t.?(?8d?),\displaystyle\mathop{\min}\limits_{{\bf{A}}}\sum\limits_{k=1}^{K}{{\alpha_{k}}({\lambda_{k}}{e_{k}}\!\!-\!\!\log{\lambda_{k}})}\!\!+\!\!\frac{1}{2\rho}||\mathbf{A}\!\!-\!\!\tilde{\bf{A}}\tilde{\bf{A}}^{H}||_{F}^{2}\;\;\;\textrm{s.t.}\;\eqref{con: A},roman_min start_POSTSUBSCRIPT bold_A end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_log italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG | | bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT | | start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT s.t. italic_( italic_) , (19)

where 1/(2?ρ)1/(2\rho)1 / ( 2 italic_ρ ) is the penalty factor.

Let ??=diag?(??)\bf{a}=\operatorname{diag}(\bf{A})bold_a = roman_diag ( bold_A ) and ??~=[??~?[:,1]T,?,??~?[:,a]T]T=[??ˉ1T,?,??ˉaT]T?N?a×1\tilde{\mathbf{a}}=[\mathbf{\tilde{A}}[:,1]^{T},\cdots,\mathbf{\tilde{A}}[:,a]^{T}]^{T}=[\bar{\mathbf{a}}_{1}^{T},\cdots,\bar{\mathbf{a}}_{a}^{T}]^{T}\in\mathbb{C}^{Na\times 1}over~ start_ARG bold_a end_ARG = [ over~ start_ARG bold_A end_ARG [ : , 1 ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , ? , over~ start_ARG bold_A end_ARG [ : , italic_a ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT = [ overˉ start_ARG bold_a end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , ? , overˉ start_ARG bold_a end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N italic_a × 1 end_POSTSUPERSCRIPT, where ??~?[:,l]\mathbf{\tilde{A}}[:,l]over~ start_ARG bold_A end_ARG [ : , italic_l ] denotes the llitalic_l-th non-zero column of ??~\bf{\tilde{A}}over~ start_ARG bold_A end_ARG. As such, the objective function of problem (19) can be rewritten as

f3?(??)\displaystyle{f_{3}}({\bf{A}})italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( bold_A ) =??1H???+??T???1+??T???2+??2H???+??T???1???\displaystyle={\bf{r}}_{1}^{H}{\bf{a}}\!+\!{{\bf{a}}^{T}}{{\bf{r}}_{1}}\!+\!{{\bf{a}}^{T}}{{\bf{r}}_{2}}+{\bf{r}}_{2}^{H}{\bf{a}}+{{\bf{a}}^{T}}{{\bf{R}}_{1}}{\bf{a}}= bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a
+??3H???+??T???3+12?ρ?(??4T???+r5),\displaystyle+{\bf{r}}_{3}^{H}{\bf{a}}\!+\!{{\bf{a}}^{T}}{{\bf{r}}_{3}}\!+\!\frac{1}{{2\rho}}({\bf{r}}_{4}^{T}{\bf{a}}\!+\!{r_{5}}),+ bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG ( bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_a + italic_r start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) , (20)

where the auxiliary parameters are given by

??1=\displaystyle{{\bf{r}}_{1}}\!\!=bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = k=1Kαk?λk?uk?diag?(??b,kH???r,kH)???,\displaystyle\!\!\sum\nolimits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}{u_{k}}\operatorname{diag}({{\bf{w}}^{H}_{{\rm{b}},k}}{{\bf{H}}_{{\rm{r}},k}^{H}}){\bm{\varphi}}},∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_diag ( bold_w start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_b , italic_k end_POSTSUBSCRIPT bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) bold_italic_φ , (21a)
??2=\displaystyle{{\bf{r}}_{2}}\!\!=bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ?k=1Kαk?λk?ukH?uk???H???r,k?m=1K??b,m???b,mH???r,kH???,\displaystyle\!\!-\!\!\sum\limits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}u_{k}^{H}{u_{k}}{{\bf{\Phi}}^{H}}{{\bf{H}}_{{\rm{r}},k}}\!\!\sum\limits_{m=1}^{K}{{{\bf{w}}_{{\rm{b}},m}}{\bf{w}}_{{\rm{b}},m}^{H}}{\bf{H}}_{{\rm{r}},k}^{H}{\bm{\varphi}}},- ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_φ , (21b)
??3=\displaystyle{{\bf{r}}_{3}}\!\!=bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = ?k=1Kαk?λk?ukH?uk???H???r,k?m=1K??b,m???r,mH???~H???r,k,\displaystyle\!\!-\!\!\sum\limits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}u_{k}^{H}{u_{k}}{{\bf{\Phi}}^{H}}{{\bf{H}}_{{\rm{r}},k}}\!\!\sum\limits_{m=1}^{K}{{{\bf{w}}_{{\rm{b}},m}}{\bf{w}}_{{\rm{r}},m}^{H}}{{{\bf{\tilde{A}}}}^{H}}{{\bf{h}}_{{\rm{r}},k}}},- ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT roman_r , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT , (21c)
??4=\displaystyle{{\bf{r}}_{4}}\!\!=bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = diag?(??N?2???~???~H),\displaystyle\operatorname{diag}({{\bf{I}}_{N}}-2{\bf{\tilde{A}}}{{{\bf{\tilde{A}}}}^{H}}),roman_diag ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - 2 over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) , (21d)
r5=\displaystyle{r_{5}}\!\!=italic_r start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT = Tr?(??~???~H),\displaystyle\operatorname{Tr}({\bf{\tilde{A}}}{{{\bf{\tilde{A}}}}^{H}}),roman_Tr ( over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) , (21e)
??1=\displaystyle{{\bf{R}}_{1}}\!\!=bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = k=1Kαk?λk?ukH?uk???H???r,k?m=1K??b,m???b,mH???r,kH???.\displaystyle\!\!\sum\limits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}u_{k}^{H}{u_{k}}{{\bf{\Phi}}^{H}}{{\bf{H}}_{{\rm{r}},k}}\!\!\sum\limits_{m=1}^{K}{{{\bf{w}}_{{\rm{b}},m}}{\bf{w}}_{{\rm{b}},m}^{H}}{\bf{H}}_{{\rm{r}},k}^{H}{\bf{\Phi}}}.∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ . (21f)

The derivations of auxiliary parameters are shown in Appendix C.

In the following, we first apply the MM technique to find a tractable surrogate function of (19). Based on the second-order Taylor expansion, an upper bound of the term ??T???1???{{\bf{a}}^{T}}{{\bf{R}}_{1}}{\bf{a}}bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a can be derived as ??T???1?????T???1???+2???{??T?(??1???1)???t}+??tT?(??1???1)???t{{\bf{a}}^{T}}{{\bf{R}}_{1}}{\bf{a}}\leq{{\bf{a}}^{T}}{{\bf{\Lambda}}_{1}}{\bf{a}}+2\Re\left\{{{{\bf{a}}^{T}}({{\bf{R}}_{1}}-{{\bf{\Lambda}}_{1}}){{\bf{a}}_{t}}}\right\}+{{\bf{a}}_{t}}^{T}({{\bf{\Lambda}}_{1}}-{{\bf{R}}_{1}}){{\bf{a}}_{t}}bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a ≤ bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_Λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a + 2 roman_? { bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - bold_Λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } + bold_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_Λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, where ??1=λˉmax?(??1)???N{{\bf{\Lambda}}_{1}}={{\bar{\lambda}}_{\max}}({{\bf{R}}_{1}}){{\bf{I}}_{N}}bold_Λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = overˉ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, with λˉmax?(??1){{\bar{\lambda}}_{\max}}({{\bf{R}}_{1}})overˉ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) denoting the maximum eigenvalue of ??1{{\bf{R}}_{1}}bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then, the surrogate function with respect to (w.r.t.) ??\bf{A}bold_A is given by fˉ3?(??)=2???{??1H???}+2???{??2H???}+2???{??3H???}+2???{??T?(??1???1)???t}+12?ρ???4T???{{\bar{f}}_{3}}({\bf{A}})=2\Re\left\{{{\bf{r}}_{1}^{H}{\bf{a}}}\right\}+2\Re\left\{{{\bf{r}}_{2}^{H}{\bf{a}}}\right\}+2\Re\left\{{{\bf{r}}_{3}^{H}{\bf{a}}}\right\}+2\Re\left\{{{{\bf{a}}^{T}}({{\bf{R}}_{1}}-{{\bf{\Lambda}}_{1}}){{\bf{a}}_{t}}}\right\}+\frac{1}{{2\rho}}{\bf{r}}_{4}^{T}{\bf{a}}overˉ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( bold_A ) = 2 roman_? { bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a } + 2 roman_? { bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a } + 2 roman_? { bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a } + 2 roman_? { bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - bold_Λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_a. Thus, problem (19) is rewritten as

min????{??6H???}s.t.ai{0,1},\displaystyle\mathop{\min}\limits_{\bf{a}}\quad\Re\left\{{{\bf{r}}_{6}^{H}{\bf{a}}}\right\}\quad\ \textrm{s.t.}\quad{a_{i}}\in\left\{{0,1}\right\},roman_min start_POSTSUBSCRIPT bold_a end_POSTSUBSCRIPT roman_? { bold_r start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a } s.t. italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ { 0 , 1 } , (22)

with ??6=2???1+2???2+2???3+2?(??1???1)???t+12?ρ???4{{\bf{r}}_{6}}=2{{\bf{r}}_{1}}+2{{\bf{r}}_{2}}+2{{\bf{r}}_{3}}+2({{\bf{R}}_{1}}-{{\bf{\Lambda}}_{1}}){{\bf{a}}_{t}}+\frac{1}{{2\rho}}{{\bf{r}}_{4}}bold_r start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT = 2 bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 2 bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT + 2 ( bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - bold_Λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT. Let ?\mathcal{M}caligraphic_M denote the set of first aaitalic_a minimum elements of ??{??6}\Re\left\{{{\bf{r}}_{6}}\right\}roman_? { bold_r start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT }. Accordingly, the optimal solution to problem (19) can be derived as

aiopt={1,?{??6}[i]?,0,?{??6}[i]??.{a}^{\mathrm{opt}}_{i}=\begin{cases}1,&\Re\left\{{{\bf{r}}_{6}}\right\}_{[i]}\in\mathcal{M},\\ 0,&\Re\left\{{{\bf{r}}_{6}}\right\}_{[i]}\notin\mathcal{M}.\end{cases}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { start_ROW start_CELL 1 , end_CELL start_CELL roman_? { bold_r start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT } start_POSTSUBSCRIPT [ italic_i ] end_POSTSUBSCRIPT ∈ caligraphic_M , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL roman_? { bold_r start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT } start_POSTSUBSCRIPT [ italic_i ] end_POSTSUBSCRIPT ? caligraphic_M . end_CELL end_ROW (23)

It is observed from (23) that we need to find the first aaitalic_a minimum elements from ??{??6}\Re\left\{{\bf{r}}_{6}\right\}roman_? { bold_r start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT }, i.e., ai=1{a}_{i}=1italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1, where the corresponding indexes are the locations of elements working in the connection mode.

III-D Optimization of Mode Switching Matrix ??~\tilde{\bf{A}}over~ start_ARG bold_A end_ARG

By employing the penalty term and for fixed ??\bf{F}bold_F,??\bf{u}bold_u, ??\bm{\lambda}bold_italic_λ, ??\bf{\Phi}bold_Φ and ??{\bf{A}}bold_A, problem (8) can be reformulated as

min??~k=1Kαk?(λk?ek?log?λk)+12?ρ??????~???~HF2?s.t.?(?8e?),\displaystyle\mathop{\min}\limits_{\tilde{{\bf{A}}}}\sum\limits_{k=1}^{K}{{\alpha_{k}}({\lambda_{k}}{e_{k}}\!\!-\!\!\log{\lambda_{k}})}\!\!+\!\!\frac{1}{2\rho}||\mathbf{A}\!\!-\!\!\tilde{\bf{A}}\tilde{\bf{A}}^{H}||_{F}^{2}\;\;\;\textrm{s.t.}\eqref{con: A tilde},roman_min start_POSTSUBSCRIPT over~ start_ARG bold_A end_ARG end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_log italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG | | bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT | | start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT s.t. italic_( italic_) , (24)

where 1/(2?ρ)1/(2\rho)1 / ( 2 italic_ρ ) is the penalty factor. With fixed ??{\mathbf{a}}bold_a, the objective function can be expressed in terms of ??~\tilde{\mathbf{a}}over~ start_ARG bold_a end_ARG is

f4?(??~)=??~1H???~+??~H???~1+??~2H???~+??~H???~2+??~H???2???~,{f_{4}}({\bf{\tilde{A}}})={{{\bf{\tilde{r}}}}_{1}}^{H}{\bf{\tilde{a}}}+{{{\bf{\tilde{a}}}}^{H}}{{{\bf{\tilde{r}}}}_{1}}+{{{\bf{\tilde{r}}}}_{2}}^{H}{\bf{\tilde{a}}}+{{{\bf{\tilde{a}}}}^{H}}{{{\bf{\tilde{r}}}}_{2}}+{{{\bf{\tilde{a}}}}^{H}}{{\bf{R}}_{2}}{\bf{\tilde{a}}},italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ( over~ start_ARG bold_A end_ARG ) = over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_a end_ARG + over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_a end_ARG + over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over~ start_ARG bold_a end_ARG , (25)

where ??~1=k=1Kαk?λk?|uk|2?m=1K((??r,m???b,mH???r,kH?(??N???)???)????r,k){{{\bf{\tilde{r}}}}_{1}}\!\!=\!\!\sum\limits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}}|u_{k}|^{2}\sum\limits_{m=1}^{K}{(({{\bf{w}}_{{\rm{r}},m}}{{\bf{w}}^{H}_{{\rm{b}},m}}{{\bf{H}}^{H}_{{\rm{r}},k}}{\bf{(}}{{\bf{I}}_{N}}-{\bf{A){\bm{\varphi}}}})}^{*}\!\!\otimes\!\!{{\bf{h}}_{{\rm{r}},k}})over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( ( bold_w start_POSTSUBSCRIPT roman_r , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_b , italic_m end_POSTSUBSCRIPT bold_H start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_italic_φ ) start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT ? bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ), ??2=k=1Kαk?λk?m=1K(??r,m??uk???r,k)?(??r,mT???r,kH?ukH)?1ρ?(??a???){{\bf{R}}_{2}}\!\!=\!\!\sum\limits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}}\!\!\sum\limits_{m=1}^{K}{({{\bf{w}}^{*}_{{\rm{r}},m}}}\!\otimes\!{u_{k}}{{\bf{h}}_{{\rm{r}},k}})({{\bf{w}}^{T}_{{\rm{r}},m}}\!\otimes\!{{\bf{h}}^{H}_{{\rm{r}},k}}{u}_{k}^{H})\!-\!\frac{1}{\rho}({{\bf{I}}_{a}}\!\otimes\!{\bf{A}})bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( bold_w start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_m end_POSTSUBSCRIPT ? italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ) ( bold_w start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_m end_POSTSUBSCRIPT ? bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) - divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ( bold_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ? bold_A ) and ??~2=?k=1Kαk?λk?ukH?(??r,k????r,k){{{\bf{\tilde{r}}}}_{2}}=-\sum\limits_{k=1}^{K}{{\alpha_{k}}{\lambda_{k}}}u_{k}^{H}({{\bf{w}}^{*}_{{\rm{r}},k}}\otimes{{\bf{h}}_{{\rm{r}},k}})over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_w start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ? bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT ). Based on the second-order Taylor expansion, a convex surrogate function of ??~T???2???~{{{\bf{\tilde{a}}}}^{T}}{{\bf{R}}_{2}}{\bf{\tilde{a}}}over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over~ start_ARG bold_a end_ARG can be obtained as

??~T???2???~\displaystyle{{{\bf{\tilde{a}}}}^{T}}{{\bf{R}}_{2}}{{\bf{\tilde{a}}}}\leqover~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over~ start_ARG bold_a end_ARG ≤ ??~T???2???~+2???{??~T?(??2???2)???~t}\displaystyle{{{\bf{\tilde{a}}}}^{T}}{{\bf{\Lambda}}_{2}}{\bf{\tilde{a}}}+2\Re\left\{{{{{\bf{\tilde{a}}}}^{T}}({{\bf{R}}_{2}}-{{\bf{\Lambda}}_{2}})}\mathbf{\tilde{a}}_{t^{{}^{\prime}}}\right\}over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_Λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over~ start_ARG bold_a end_ARG + 2 roman_? { over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - bold_Λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) over~ start_ARG bold_a end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT }
+??~tT?(??2???2)???~t,\displaystyle+\mathbf{\tilde{a}}_{t^{{}^{\prime}}}^{T}({{\bf{\Lambda}}_{2}}-{{\bf{R}}_{2}})\mathbf{\tilde{a}}_{t^{{}^{\prime}}},+ over~ start_ARG bold_a end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_Λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) over~ start_ARG bold_a end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , (26)

where ??2=λ~max?(??2)???N{{\bf{\Lambda}}_{2}}={{\tilde{\lambda}}_{\max}}({{\bf{R}}_{2}}){{\bf{I}}_{N}}bold_Λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = over~ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, with λ~max?(??2){{\tilde{\lambda}}_{\max}}({{\bf{R}}_{2}})over~ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) denoting the maximum eigenvalue of ??2{{\bf{R}}_{2}}bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Then, the objective function is expressed in terms of ??~\tilde{\mathbf{a}}over~ start_ARG bold_a end_ARG as fˉ4?(??~)=??{??~3H???~}{{\bar{f}}_{4}}({\bf{\tilde{A}}})=\Re\left\{{{{{\bf{\tilde{r}}}}_{3}}^{H}{\bf{\tilde{a}}}}\right\}overˉ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ( over~ start_ARG bold_A end_ARG ) = roman_? { over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_a end_ARG }, where ??~3=2???~1+2???~2+2?(??2???2)???~t{{{\bf{\tilde{r}}}}_{3}}=2{{\bf{\tilde{r}}}_{1}}+2{{\bf{\tilde{r}}}_{2}}+2({{\bf{R}}_{2}}-{{\bf{\Lambda}}_{2}})\mathbf{\tilde{a}}_{t^{{}^{\prime}}}over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 2 over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 2 ( bold_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - bold_Λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) over~ start_ARG bold_a end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. Thus, the sub-problem w.r.t. ??~\tilde{\mathbf{a}}over~ start_ARG bold_a end_ARG is formulated as

min??~\displaystyle\mathop{\min}\limits_{\tilde{\mathbf{a}}}\quadroman_min start_POSTSUBSCRIPT over~ start_ARG bold_a end_ARG end_POSTSUBSCRIPT ??{??~??H???~}\displaystyle\Re\left\{{\bf{\tilde{r}}_{3}}^{H}{\tilde{\mathbf{a}}}\right\}roman_? { over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT bold_3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_a end_ARG } (27a)
s.t. i=1Naˉl,i=1,l=1,?,a,\displaystyle\sum\nolimits_{i=1}^{N}\bar{{a}}_{l,i}=1,\;l=1,\cdots,a,∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT overˉ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_l , italic_i end_POSTSUBSCRIPT = 1 , italic_l = 1 , ? , italic_a , (27b)
mn,m,n{v|aˉl,v=1,l=1,?,a},\displaystyle m\neq n,\;m,n\in\{v|\bar{{a}}_{l,v}=1,l=1,\cdots,a\},italic_m ≠ italic_n , italic_m , italic_n ∈ { italic_v | overˉ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_l , italic_v end_POSTSUBSCRIPT = 1 , italic_l = 1 , ? , italic_a } , (27c)

where aˉl,i\bar{{a}}_{l,i}overˉ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_l , italic_i end_POSTSUBSCRIPT denotes the iiitalic_i-th entry of the llitalic_l-th segment in ??~\tilde{\bf{a}}over~ start_ARG bold_a end_ARG. Note that each segment of ??~\tilde{\bf{a}}over~ start_ARG bold_a end_ARG consists of NNitalic_N elements, which come from the corresponding column of ??~\tilde{\bf{A}}over~ start_ARG bold_A end_ARG. For problem (27), constraint (27b) is the constraint of the total number of selected elements in each column of ??~\tilde{\bf{A}}over~ start_ARG bold_A end_ARG. Constraint (27c) guarantees that aaitalic_a different elements should be selected to operate in connection mode.

By temporarily relaxing constraint (27c), the solution to problem (27) is given by

aˉl,i={1,i=arg?minm{??[(l?1)N+1:lN]m},0,Otherwise,{\bar{a}}_{l,i}\!\!=\begin{cases}1,&\!\!i\!=\!\mathop{\arg\min}\limits_{m}\{{\bf c}[(l-1)N+1:lN]_{m}\},\\ 0,&\!\!\textrm{Otherwise},\end{cases}overˉ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_l , italic_i end_POSTSUBSCRIPT = { start_ROW start_CELL 1 , end_CELL start_CELL italic_i = start_BIGOP roman_arg roman_min end_BIGOP start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT { bold_c [ ( italic_l - 1 ) italic_N + 1 : italic_l italic_N ] start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL Otherwise , end_CELL end_ROW (28)

where ??=??{??~??}{\bf c}=\Re\left\{{\bf{\tilde{r}}_{3}}\right\}bold_c = roman_? { over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT bold_3 end_POSTSUBSCRIPT }, l{1,?,a}l\in\{1,\cdots,a\}italic_l ∈ { 1 , ? , italic_a }, i,m{1,?,N}i,m\in\{1,\cdots,N\}italic_i , italic_m ∈ { 1 , ? , italic_N }, and arg?minm{??[(l?1)N+1:lN]m}\mathop{\arg\min}\limits_{m}\{{\bf c}[(l-1)N+1:lN]_{m}\}start_BIGOP roman_arg roman_min end_BIGOP start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT { bold_c [ ( italic_l - 1 ) italic_N + 1 : italic_l italic_N ] start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } returns the index corresponding to the minimum value of ??[(l?1)N+1:lN]{\bf c}[(l-1)N+1:lN]bold_c [ ( italic_l - 1 ) italic_N + 1 : italic_l italic_N ]. It is observed from (28) that aˉl,iopt=1{\bar{a}}^{\rm{\mathrm{opt}}}_{l,i}=1overˉ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_l , italic_i end_POSTSUBSCRIPT = 1 holds when the iiitalic_i-th entry of ??[(l?1)N+1:lN]{\bf c}[(l-1)N+1:lN]bold_c [ ( italic_l - 1 ) italic_N + 1 : italic_l italic_N ] is the minimum value among the NNitalic_N elements of the l-th block of ??{\bf c}bold_c. By considering the constraint (27c), the mode switching matrix optimization should be further discussed in the following.

Let ??={i|aˉl,iopt=1,l=1,?,a}\mathcal{T}=\{i|\bar{a}^{\rm{opt}}_{l,i}=1,l=1,\cdots,a\}caligraphic_T = { italic_i | overˉ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_l , italic_i end_POSTSUBSCRIPT = 1 , italic_l = 1 , ? , italic_a } denote the set of selected indices and MMitalic_M represent the number of distinct index types in ??\mathcal{T}caligraphic_T. If M=aM=aitalic_M = italic_a, the solution to problem (27) is given in (28). If M<aM<aitalic_M < italic_a, the number of selected connection elements is insufficient, in other words, constraint (27c) is not satisfied. In this case, we need to reselect the index for the segments with repeated indexes. Specifically, We assign the repeated index to the segment that minimizes the objective function of problem (27) and then select the index corresponding to the sub-minimum value in the remaining segments until M=aM=aitalic_M = italic_a.

The above repeated reselection steps ensure that both constraint (27c) holds and the objective function of problem (27) is minimized. Moreover, in the (t+1)(t+1)( italic_t + 1 )-th round, the penalty term ρ\rhoitalic_ρ in the objective function is updated in a moderate step size η(0,1)\eta\in(0,1)italic_η ∈ ( 0 , 1 ), which is given by

ρ(t+1)=η?ρ(t).\rho^{(t+1)}=\eta\rho^{(t)}.italic_ρ start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = italic_η italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT . (29)
Input: KKitalic_K, NtN_{\rm{t}}italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT, NNitalic_N, aaitalic_a, ??\mathbf{G}bold_G, ??r,k{\bf{h}}_{{\rm{r}},k}bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT, σk{{\sigma_{k}}}italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, αk\alpha_{k}italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.
1Randomly initialize ??\bm{\varphi}bold_italic_φ, ??\bf{a}bold_a, ??~\bf{\tilde{a}}over~ start_ARG bold_a end_ARG ;
2 Initialize ??=[??bT\mathbf{F}=[{\bf W}_{\mathrm{b}}^{T}bold_F = [ bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, ??rT]T{\bf W}_{\mathrm{r}}^{T}]^{T}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT by the adaptive MRT-ZF initialization;
3 Calculate uku_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and λk{{\lambda_{k}}}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT according to (12) and (13) ;
4 Update ??b{\bf W}_{\mathrm{b}}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT, ??r{\bf W}_{\mathrm{r}}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT according to (14);
5 repeat
6??? Calculate uku_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and λk{{\lambda_{k}}}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT according to (13) and (12);
7??? Update ??\bm{\varphi}bold_italic_φ according to (17) until the objective function of (16) converges;
8??? Update ??~\bf\tilde{A}over~ start_ARG bold_A end_ARG according to (28);
9??? Update ??\bf Abold_A according to (23);
10??? Update ??\mathbf{F}bold_F according to (14) and scale ??\mathbf{F}bold_F according to (8b);
11??? Update ρ\rhoitalic_ρ according to (29);
12until?the convergence is satisfied;
13Return ??opt=diag?(??opt){\bf A}^{\mathrm{opt}}=\operatorname{diag}(\bf{a}^{\mathrm{opt}})bold_A start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = roman_diag ( bold_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ), ??~opt{\bf{\tilde{A}}}^{\mathrm{opt}}over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT, ??bopt{\mathbf{W}}_{\rm{b}}^{\mathrm{opt}}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT, ??ropt{\bf W}_{\rm{r}}^{\mathrm{opt}}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT, ??opt=diag(??opt)H{\bf{\Phi}}^{\mathrm{opt}}=\operatorname{diag}(\bm{\varphi}^{\mathrm{opt}})^{H}bold_Φ start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = roman_diag ( bold_italic_φ start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT;
Output: ??b{\bf W}_{\mathrm{b}}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT, ??r{\bf W}_{\mathrm{r}}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT, ??{\bf\Phi}bold_Φ, ??{\bf A}bold_A, ??~{\bf{\tilde{A}}}over~ start_ARG bold_A end_ARG.
Algorithm?1 PWM Algorithm for Joint Beamforming and Mode Switching

The proposed PWM algorithm for solving problem (8) is summarized in Algorithm 1. The passive beamforming and index matrices are randomly initialized in Step 1. The equivalent active beamforming matrix is initialized via the adaptive maximum ratio transmission (MRT) and zero-forcing (ZF) methods in Step 1. Specifically, we apply the MRT method to enhance the strength of the transmit signal in the low-SNR regime. While for the high-SNR regime, the ZF method is utilized to suppress the inter-user interference (IUI). With the auxiliary variables calculated in Step 1, we update the active begriming matrices in Step 1. Finally, the mode switching matrix is updated based on other variables according to the MM method from Steps 1 to 1.

III-E Convergence and Complexity Analysis

The sub-problem for updating ??{\bm{\varphi}}bold_italic_φ is optimally solved and thus the objective function of problem (16) is maximized, which is equivalent to problem (9). Moreover, the objective value of problem (III-C) is non-increasing via the MM method, which guarantees the convergence of Algorithm 1. Moreover, the computational complexity of the proposed algorithm is analyzed as follows. Specifically, for the active beamforming optimization, the computational complexity depends on the number of UEs KKitalic_K and the numbers of BS antennas and RDARS transmit elements, given by ???(K?(Nt+a)3)\mathcal{O}(K(N_{\rm{t}}+a)^{3})caligraphic_O ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ). For the passive beamforming optimization, the calculation of ??\bf{C}bold_C dominates the complexity, which is given by ???(K2?N2)\mathcal{O}(K^{2}N^{2})caligraphic_O ( italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). In addition, problem (16) is solved by power iteration, and the complexity is ???(Ip?N2)\mathcal{O}(I_{p}N^{2})caligraphic_O ( italic_I start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) where IpI_{p}italic_I start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT denotes the inner iteration steps required for convergence. For the mode switching matrix, the complexities of optimizing ??\bf{A}bold_A and ??~\tilde{\bf{A}}over~ start_ARG bold_A end_ARG mainly arise from eigenvalue decomposition (EVD), which are given by ???((2?N)3)\mathcal{O}((2N)^{3})caligraphic_O ( ( 2 italic_N ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) and ???(N3)\mathcal{O}(N^{3})caligraphic_O ( italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ), respectively. Therefore, the total complexity of Algorithm 1 is ???(I?(K?(Nt+a)3+K2?N2+Ip?(N)2+3?N2))\mathcal{O}(I(K(N_{\rm{t}}+a)^{3}+K^{2}N^{2}+I_{p}(N)^{2}+3N^{2}))caligraphic_O ( italic_I ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_I start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_N ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 3 italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ), where IIitalic_I denotes the number of iterations.

IV Model-Driven PWM-BFNet

In this section, we propose a model-driven PWM-BFNet to mitigate the number of iterations required by PWM algorithm and escape from local optima, thus reducing the computational complexity while improving performance.

Specifically, the PWM-BFNet leverages DL to learn crucial parameters associated with the convergence speed, i.e., the penalty term ρ(t)\rho^{(t)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT in each iteration and regularization parameter ε(t)\varepsilon^{(t)}italic_ε start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. After an in-depth presentation of the model-driven DL, a comprehensive comparison and analysis of the complexity for the proposed techniques and other existing algorithms is provided.

IV-A Convergence Acceleration

Though the convergence of the proposed PWM algorithm can be guaranteed, its convergence speed depends on the penalty term ρ\rhoitalic_ρ and regularization term ε\varepsilonitalic_ε.

Generally, it is preferable to set ρ\rhoitalic_ρ to moderate values during initialization, ensuring that the penalized objective function is primarily dominated by the original objective function rather than the penalty terms [18, 36]. Specifically, it is observed from problems (III-C), (25) and Algorithm 1 that ρ\rhoitalic_ρ should be sufficiently large in the initial steps, thus ensuring that the first terms for the objective function of problem (III-C) and (25) dominate these function values. Meanwhile, the aims of problem (III-C) and (25) are to minimize the corresponding first terms. When the beamforming matrices render the values of the first terms tending to be stable, a suitable ρ\rhoitalic_ρ guarantees that the associated terms dominate these function values, and the iteration processes aim to decrease the corresponding terms. Therefore, when the penalty term of each iterative step is not suitable, the convergence speed of problem (III-C) and (25) is greatly limited to by considerable degree, thus influencing the performance of the proposed algorithms. To mitigate the impact of the penalty term on the outer iteration, a model-driven DL is designed by selecting ρ\rhoitalic_ρ as a trainable variable.

In addition, another inner iteration related to ε\varepsilonitalic_ε in (17) impacts the convergence speed of the proposed algorithm. Specifically, when ε+\varepsilon\rightarrow+\inftyitalic_ε → + ∞, ??(q+1)=ej?arg?((??+ε???N+1)???(q))ej?arg?(ε???(q))=??(q){\bf{p}}^{(q+1)}=e^{j\mathrm{arg}(({\bf{D}}+\varepsilon{\bf{I}}_{N+1}){\bf{p}}^{(q)})}\rightarrow e^{j\mathrm{arg}(\varepsilon{\bf{p}}^{(q)})}={\bf{p}}^{(q)}bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT = italic_e start_POSTSUPERSCRIPT italic_j roman_arg ( ( bold_D + italic_ε bold_I start_POSTSUBSCRIPT italic_N + 1 end_POSTSUBSCRIPT ) bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT → italic_e start_POSTSUPERSCRIPT italic_j roman_arg ( italic_ε bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT = bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT, thus causing the iteration meaningless. On the other hand, when ε\varepsilonitalic_ε is too small, the positive definiteness of ??+??N+1{\bf{D}}+{\bf{I}}_{N+1}bold_D + bold_I start_POSTSUBSCRIPT italic_N + 1 end_POSTSUBSCRIPT cannot be guaranteed, which may compromise the convergence speed. It is observed from Algorithm 1 that the passive beamforming is updated by ensuring that the objective function of problem (15) converges in step 1, and the complexity of this step is linear with the number of inner iterations. Therefore, we introduce a model-driven DL to train ε\varepsilonitalic_ε, and set the number of inner iterations as 1, so as to reduce the complexity of updating passive beamforming and accelerate the convergence speed of the proposed algorithm.

Moreover, the common data-driven method depends on the suitable neural network to train beam indices or complex beamforming matrices, where a high-quality dataset is needed. Besides, the large number of hidden layers in training network leads to a large amount of trainable parameters, which increases the training complexity and memory requirements. Therefore, by integrating the expert model knowledge, we propose a model-driven DL to train a small number of parameters. To be specific, the optimal active beamforming vector in (14) can be represented by a simple solution structure as follows[37],

??kopt=pk?(??Nt+a+m=1Kδmσ2???mH???m)?1???kH(??Nt+a+m=1Kδmσ2???mH???m)?1???kH,\mathbf{f}^{\rm{opt}}_{k}=\sqrt{p_{k}}\frac{(\mathbf{I}_{N_{\rm{t}}+a}+\sum_{m=1}^{K}\frac{\delta_{m}}{\sigma^{2}}\mathbf{h}^{H}_{m}\mathbf{h}_{m})^{-1}\mathbf{h}^{H}_{k}}{||(\mathbf{I}_{N_{\rm{t}}+a}+\sum_{m=1}^{K}\frac{\delta_{m}}{\sigma^{2}}\mathbf{h}^{H}_{m}\mathbf{h}_{m})^{-1}\mathbf{h}^{H}_{k}||},bold_f start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = square-root start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG divide start_ARG ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG | | ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | | end_ARG , (30)

with

m=1Kpm=m=1Kδm=Ptot.\sum_{m=1}^{K}{p_{m}}=\sum_{m=1}^{K}{\delta_{m}}=P_{\rm{tot}}.∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT . (31)

To achieve a faster convergence speed, the active beamforming vector ??k\mathbf{f}_{k}bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can be initialized based on the simple structure in (30), and the parameters pkp_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and δk\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT are trainable parameters in the deep unfolding network. It is observed from the constraint (31) that the total value for pkp_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and δk\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT should be achieved. Therefore, the trainable parameters pkp_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and δk\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT are scaled by the total power constraint through a softmax layer as follows.

pk=epkm=1Kepm?Ptot,p_{k}=\frac{e^{p^{{}^{\prime}}_{k}}}{\sum_{m=1}^{K}e^{p^{{}^{\prime}}_{m}}}P_{\rm{tot}},italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_e start_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT , (32)
δk=eδkm=1Keδm?Ptot,\delta_{k}=\frac{e^{\delta^{{}^{\prime}}_{k}}}{\sum_{m=1}^{K}e^{\delta^{{}^{\prime}}_{m}}}P_{\rm{tot}},italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_e start_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT , (33)

where pmp^{{}^{\prime}}_{m}italic_p start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and δm\delta^{{}^{\prime}}_{m}italic_δ start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT are set as the input of the softmax layer.

For the network training, the unsupervised learning is adopted to train the PWM-BFNet. Since the goal of problem (8) is to maximize WSR, the loss function is formulated as:

?=?1Ns?n=1Nsf?(PWM-BFNet?(??(n),??~r(n))),\mathcal{L}=-\frac{1}{N_{\rm{s}}}\sum_{n=1}^{N_{\rm{s}}}f\left(\textrm{PWM-BFNet}(\mathbf{G}^{(n)},\widetilde{\mathbf{H}}^{(n)}_{\rm{r}})\right),caligraphic_L = - divide start_ARG 1 end_ARG start_ARG italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_f ( PWM-BFNet ( bold_G start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , over~ start_ARG bold_H end_ARG start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT ) ) , (34)

where NsN_{\rm{s}}italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT is the number of channel realizations in the training dataset, f?(?)f(\cdot)italic_f ( ? ) denotes the objective function of problem (8), and PWM-BFNet?(??(n),??~r(n))\textrm{PWM-BFNet}(\mathbf{G}^{(n)},\widetilde{\mathbf{H}}^{(n)}_{\rm{r}})PWM-BFNet ( bold_G start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , over~ start_ARG bold_H end_ARG start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT ) is the beamformer obtained by PWM-BFNet taking the nnitalic_n-th channel realization {??(n),??~r(n)}\{\mathbf{G}^{(n)},\widetilde{\mathbf{H}}^{(n)}_{\rm{r}}\}{ bold_G start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , over~ start_ARG bold_H end_ARG start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT } as input. Let ??~r=[??r,1,??r,2,?,??r,K]\widetilde{\mathbf{H}}_{\rm{r}}=[{\bf{h}}_{{\rm{r}},1},{\bf{h}}_{{\rm{r}},2},\cdots,{\bf{h}}_{{\rm{r}},K}]over~ start_ARG bold_H end_ARG start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT = [ bold_h start_POSTSUBSCRIPT roman_r , 1 end_POSTSUBSCRIPT , bold_h start_POSTSUBSCRIPT roman_r , 2 end_POSTSUBSCRIPT , ? , bold_h start_POSTSUBSCRIPT roman_r , italic_K end_POSTSUBSCRIPT ]. The proposed model-driven PWM-BFNet algorithm is summarized in Algorithm 2.

Refer to caption
Figure 2: An illustration of the network structure for PWM-BFNet.
Input: KKitalic_K, NtN_{\rm{t}}italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT, NNitalic_N, aaitalic_a, ??\mathbf{G}bold_G, ??r,k{\bf{h}}_{{\rm{r}},k}bold_h start_POSTSUBSCRIPT roman_r , italic_k end_POSTSUBSCRIPT, σk{{\sigma_{k}}}italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, αk\alpha_{k}italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.
1Randomly initialize ??\bf{a}bold_a, ??~\bf{\tilde{a}}over~ start_ARG bold_a end_ARG and ??\bm{\varphi}bold_italic_φ;
2 Initialize ??=[??bT\mathbf{F}=[{\bf W}_{\mathrm{b}}^{T}bold_F = [ bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT??rT]T{\bf W}_{\mathrm{r}}^{T}]^{T}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT with ZF beamforming;
3 Calculate λk{{\lambda_{k}}}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and uku_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT according to (13) and (12);
4 Update ??b{\bf W}_{\mathrm{b}}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT, ??r{\bf W}_{\mathrm{r}}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT according to (14);
5 Update ??\bm{\varphi}bold_italic_φ according to (17) and trainable variable ε(0)\varepsilon^{(0)}italic_ε start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT;
6 Initialize ??\mathbf{F}bold_F with pkp_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and δk\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, ?k??\forall k\in\mathcal{K}? italic_k ∈ caligraphic_K ;
7 repeat
8??? Calculate λk{{\lambda_{k}}}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and uku_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT according to (13) and (12);
9??? Update ??\bm{\varphi}bold_italic_φ according to (17) and trainable variable ε(i)\varepsilon^{(i)}italic_ε start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT;
10??? Update ??~\bf\tilde{A}over~ start_ARG bold_A end_ARG according to (28) and trainable variable ρ(i)\rho^{(i)}italic_ρ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT;
11??? Update ??\bf Abold_A according to (23) and trainable variable ρ(i)\rho^{(i)}italic_ρ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT;
12??? Update ??\mathbf{F}bold_F according to (14) and scale ??\mathbf{F}bold_F according to (8b);
13???
14until?the convergence is satisfied.;
15Return ??opt=diag?(??opt){\bf A}^{\mathrm{opt}}=\operatorname{diag}(\bf{a}^{\mathrm{opt}})bold_A start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = roman_diag ( bold_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ), ??~opt{\bf{\tilde{A}}}^{\mathrm{opt}}over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT, ??bopt{\mathbf{W}}_{\rm{b}}^{\mathrm{opt}}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT, ??ropt{\bf W}_{\rm{r}}^{\mathrm{opt}}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT, ??opt=diag(??opt)H{\bf{\Phi}}^{\mathrm{opt}}=\operatorname{diag}(\bm{\varphi}^{\mathrm{opt}})^{H}bold_Φ start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT = roman_diag ( bold_italic_φ start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT;
Output: ??b{\bf W}_{\mathrm{b}}bold_W start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT, ??r{\bf W}_{\mathrm{r}}bold_W start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT, ??{\bf\Phi}bold_Φ, ??{\bf A}bold_A, ??~{\bf{\tilde{A}}}over~ start_ARG bold_A end_ARG.
Algorithm?2 PWM-BFNet Algorithm for Joint Beamforming and Mode Switching

IV-B Complexity Analysis

Next, we compare the complexity of the proposed algorithm with other joint beamforming algorithms, including the PWM, PWM-BFNet, RCB-BF [16], and DLRCB-PWM algorithms, which are listed in Table I. For the RCB-BF algorithm, the computational complexity of active beamforming optimization arises from the codeword selection, i.e., ???(K?(Nt2+a2))\mathcal{O}(K(N^{2}_{\rm{t}}+a^{2}))caligraphic_O ( italic_K ( italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ). For the passive beamforming optimization, the water-filling iteration algorithm dominates the complexity, which is ???(Io?(K?N3+K2?N?(N+a)))\mathcal{O}(I_{\rm{o}}(KN^{3}+K^{2}N(N+a)))caligraphic_O ( italic_I start_POSTSUBSCRIPT roman_o end_POSTSUBSCRIPT ( italic_K italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N ( italic_N + italic_a ) ) ), where IoI_{\rm{o}}italic_I start_POSTSUBSCRIPT roman_o end_POSTSUBSCRIPT denotes the number of outer iterations. Thus, the total complexity of the RCB-BF algorithm is ???(Io?(K?N3+K2?N?(N+a))+K?(Nt2+a2))\mathcal{O}(I_{\rm{o}}(KN^{3}+K^{2}N(N+a))+K(N^{2}_{\rm{t}}+a^{2}))caligraphic_O ( italic_I start_POSTSUBSCRIPT roman_o end_POSTSUBSCRIPT ( italic_K italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N ( italic_N + italic_a ) ) + italic_K ( italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ). For the proposed DLRCB-PWM algorithm, the complexity of each iteration is the same as the PWM algorithm, where the initializations of beamforming matrices are obtained according to the RCB-based beamforming design. Due to the DLRCB initialization, the number of inner and outer iterations can be reduced, which leads to a lower complexity. Therefore, the total complexity of the DLRCB-PWM algorithm is ???(Id?(K?(Nt+a)3+K2?N2+Ipd?N2+3?N2))\mathcal{O}(I_{\rm{d}}(K(N_{\rm{t}}+a)^{3}+K^{2}N^{2}+I_{\rm{pd}}N^{2}+3N^{2}))caligraphic_O ( italic_I start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_I start_POSTSUBSCRIPT roman_pd end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 3 italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ), where IdI_{\rm{d}}italic_I start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT denotes the number of outer iterations and IpdI_{\rm{pd}}italic_I start_POSTSUBSCRIPT roman_pd end_POSTSUBSCRIPT denotes the number of inner iterations. For the PWM-BFNet algorithm, the number of inner iteration steps is reduced to 1, which significantly decreases the complexity. Furthermore, the computational complexity of active beamforming is only caused by the initial step that reconstructs the beamforming vectors according to (30), i.e., ???(K?(Nt+a)2)\mathcal{O}(K(N_{\rm{t}}+a)^{2})caligraphic_O ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). Thus, the total complexity of the PWM-BFNet algorithm is ???(Id?(K?(Nt+a)2+K2?N2+5?N3))\mathcal{O}(I^{{}^{\prime}}_{\rm{d}}(K(N_{\rm{t}}+a)^{2}+K^{2}N^{2}+5N^{3}))caligraphic_O ( italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 5 italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ) , where IdI^{{}^{\prime}}_{\rm{d}}italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT denotes the number of outer iterations. It is observed that the active beamforming vectors initialized by the PWM-BFNet algorithm are closer to the optimal solution, which indicates Id<Id?II^{{}^{\prime}}_{\rm{d}}<I_{\rm{d}}\ll Iitalic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT < italic_I start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ? italic_I, thus leading to a lower complexity and faster convergence speed.

TABLE I: Complexity Analysis
Algorithm Computational Complexity
PWM ???(I?(K?(Nt+a)3+K2?N2+Ip?N2+5?N3))\mathcal{O}(I(K(N_{\rm{t}}+a)^{3}+K^{2}N^{2}+I_{\rm{p}}N^{2}+5N^{3}))caligraphic_O ( italic_I ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_I start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 5 italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) )
RCB-BF ???(Io?(K?N3+K2?N?(N+a))+K?(Nt2+a2))\mathcal{O}(I_{\rm{o}}(KN^{3}+K^{2}N(N+a))+K(N^{2}_{\rm{t}}+a^{2}))caligraphic_O ( italic_I start_POSTSUBSCRIPT roman_o end_POSTSUBSCRIPT ( italic_K italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N ( italic_N + italic_a ) ) + italic_K ( italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) )
DLRCB-PWM ???(Id?(K?(Nt+a)3+K2?N2+Ipd?N2+5?N3))\mathcal{O}(I_{\rm{d}}(K(N_{\rm{t}}+a)^{3}+K^{2}N^{2}+I_{\rm{pd}}N^{2}+5N^{3}))caligraphic_O ( italic_I start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_I start_POSTSUBSCRIPT roman_pd end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 5 italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) )
PWM-BFNet ???(Id?(K?(Nt+a)2+K2?N2+5?N3))\mathcal{O}(I^{{}^{\prime}}_{\rm{d}}(K(N_{\rm{t}}+a)^{2}+K^{2}N^{2}+5N^{3}))caligraphic_O ( italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ( italic_K ( italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT + italic_a ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 5 italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) )

V Simulation Results

In this section, numerical results are provided to verify the proposed schemes. The number of UEs is K=4K=4italic_K = 4, the number of antennas at BS is 161616, and the number of total elements for RDARS is 128128128, respectively. The number of TEs is a=8a=8italic_a = 8. The BS and the RDARS are located at (0, 0, 15) m and (10, 0, 15) m, respectively. The UEs are randomly distributed within a circle, where its center and radius are (10, 50, 2) m and 5 m, respectively. The path loss models are given by c0?(dD0)?δc_{0}(\frac{d}{D_{0}})^{-\delta}italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( divide start_ARG italic_d end_ARG start_ARG italic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - italic_δ end_POSTSUPERSCRIPT, where c0c_{0}italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the path loss at the reference distance D0=1D_{0}=1italic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1 m, dditalic_d denotes the link distance, and δ\deltaitalic_δ denotes the path loss exponent. We set c0=60.4c_{0}=60.4italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 60.4 dB. The path loss exponents of the BS-RDARS and RDARS-UE KKitalic_K links are δb=2.2\delta_{\mathrm{b}}=2.2italic_δ start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT = 2.2 and δr=2.4\delta_{\mathrm{r}}=2.4italic_δ start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT = 2.4, respectively. Other system parameters are set as follows: ξ=10\xi=10italic_ξ = 10, ρ(0)=106\rho^{(0)}=10^{6}italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = 10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT, η=10?3\eta=10^{-3}italic_η = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT, and σk2=?80\sigma_{k}^{2}=-80italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = - 80 dBm. For the PWM-BFNet algorithm, we set {ρ~ini,ρ~(i),ε~ini,ε~(i),pk,δk}\{\tilde{\rho}_{\rm{ini}},\tilde{\rho}^{(i)},\tilde{\varepsilon}_{\rm{ini}},\tilde{\varepsilon}^{(i)},p_{k},\delta_{k}\}{ over~ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT roman_ini end_POSTSUBSCRIPT , over~ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , over~ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT roman_ini end_POSTSUBSCRIPT , over~ start_ARG italic_ε end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } as trainable parameters, and the corresponding parameters are substituting into the PWM algorithm for 1iId1\leq i\leq I^{{}^{\prime}}_{\rm{d}}1 ≤ italic_i ≤ italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT and k??k\in\mathcal{K}italic_k ∈ caligraphic_K, where ρ~ini=ρ(0)\tilde{\rho}_{\rm{ini}}=\rho^{(0)}over~ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT roman_ini end_POSTSUBSCRIPT = italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT, ρ~(i)=ρ(i)\tilde{\rho}^{(i)}=\rho^{(i)}over~ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = italic_ρ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT, and ε~ini=ε~(0)=ε(0)\tilde{\varepsilon}_{\rm{ini}}=\tilde{\varepsilon}^{(0)}=\varepsilon^{(0)}over~ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT roman_ini end_POSTSUBSCRIPT = over~ start_ARG italic_ε end_ARG start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = italic_ε start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT at the initialization step. Similarly, we generate 10,000 channel realizations to train the model-driven network. Note that the label corresponding to each sample is not required due to the unsupervised learning. PyTorch, a DL-based architecture, is used for building and training the proposed algorithms, and the networks are trained by using the SGD optimizer. The momentum is set as 0.7. The number of batches in each epoch is 100010001000, where the number of samples in each batch is 10. Moreover, the number of training epochs is 30.

For comparison, the following architectures are considered: 1) RDARS PWM-BFNet: The RDARS-aided system with the PWM-BFNet is considered; 2) RDARS PWM: The RDARS-aided system with the PWM algorithm is considered; 3) RDARS DLRCB-PWM: The RDARS-aided system with the DLRCB-PWM algorithm is considered, where the initializations of beamforming matrices are generated by RCBs; 4) RDARS Fixed Index: The RDARS-aided system with the PWM algorithm is considered, where the the working modes of RDARS elements are fixed; The first aaitalic_a elements are chosen as the transmit elements, and the others are passive elements; 5) DAS: The DAS is considered, where the number of BS antennas N~t\tilde{N}_{\rm{t}}over~ start_ARG italic_N end_ARG start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT is equal to NtN_{\rm{t}}italic_N start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT, and the number of distributed antennas a~\tilde{a}over~ start_ARG italic_a end_ARG is equal to aaitalic_a. The active beamforming matrix is optimized based on the PWM algorithm, which indicates that aaitalic_a distributed antennas are located in optimal positions determined in the RDARS-aided system; 6) RIS: The RIS-aided system is considered, where the number of RIS elements is N~=N\tilde{N}=Nover~ start_ARG italic_N end_ARG = italic_N. The active and passive beamforming matrices are optimized based on the PWM algorithm. Furthermore, the performance of the PWM-BFNet trained in different parameters is compared.

V-A Convergence Analysis

Refer to caption
Figure 3: Convergence behaviors of proposed algorithms. The curves corresponding to RIS are scaled by the right y-axis, while the curves corresponding to the other schemes are scaled by the left y-axis.

In Fig, 3, we plot the WSR versus the number of iterations with the different architectures. It is observed that the PWM-based algorithm achieves convergence for different architectures, which verifies the effectiveness of this algorithm. Besides, the DLRCB-PWM algorithm has a better convergence performance than the PWM algorithm. This is because the RCB-based initialization generates better initialization points for the beamforming matrices. It is also observed that the PWM-BFNet significantly improves the performance due to the integration of the model-driven DL. Specifically, the parameters in each iteration step are trained for multiple channel realizations, so as to escape the local optimum of the PWM algorithm. Moreover, the performance of the PWM-BFNet algorithm after one iteration attains a significant performance enhancement compared to the PWM algorithm after convergence, and the number of iterations required for convergence is reduced. By incorporating the simple solution structure of active beamforming and model-driven DL, PWM-BFNet achieves a fast convergence performance, which converges around 5 iterations. Moreover, it is observed that PWM-BFNet yields the best results, which is attributed to its more effective initialization. Specifically, the simple structure solutions of active beamforming matrices reconstructed by trained parameters are close to spatial directions of the optimal solutions. In addition, the inner iterations of the passive beamforming are reduced to 1, which accelerates the convergence.

Furthermore, the RDARS architecture achieves a better performance than the DAS and RIS architectures, even for the fixed element configuration, which demonstrates the inherent benefits of RDARS. Moreover, the WSR of the RDARS-aided system with mode switching optimization is significantly improved compared with the fixed index scheme, which verifies the necessity of optimizing the mode switching.

V-B Robustness Analysis

In this subsection, the performance of the proposed algorithms is evaluated by considering the impacts of total transmit power PtotP_{\rm{tot}}italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT, numbers of UEs KKitalic_K, numbers of RDARS elements NNitalic_N, and the Rician factor ξ\xiitalic_ξ. Specifically, the curves of RDARS with the PWM-BFNet algorithm (trained at corresponding parameters) denote that the networks to be tested are trained at corresponding parameters. By contrast, the curves of RDARS with the PWM-BFNet algorithm (trained at a given parameter) indicate that the networks to be tested are trained always at a given parameter. For example, the blue curve using the cross-shaped markers in Fig. 4 indicates that the test networks are all trained at Ptot=30P_{\rm{tot}}=30italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT = 30 dBm, while the blue square curve represents the test networks are trained at corresponding total transmit powers.

It is observed from Figs. 4-8 that the proposed PWM-BFNet outperforms other algorithms in various scenarios, by leveraging the inherent advantages of the model-driven DL and the proposed PWM algorithm. It is worth mentioning that the proposed PWM-BFNet maintains a superior performance under different parameters which implies its robustness to variations in system setups. This is attributed to the ability of PWM-BFNet in capturing the mapping function from channels to beamforming vectors and model switching matrix, a task that is arguably performed more effectively than conventional data-driven DL approaches. This inherently improves performance and robustness. Furthermore, the slight performance degradation of the DL method due to differences between training and test environments is mitigated by subsequent iterations.

V-B1 Impact of The Total Transmit Power

Refer to caption
Figure 4: WSR versus total transmit power with different architectures.

Fig. 4 plots the WSR versus the total transmit power for different architectures. The PWM-BFNet achieves a better performance than the PWM algorithm, and the performance gap increases with the total transmit power. This is because the IUI increases, and the PWM algorithm is prone to fall into local optimality. Furthermore, the RDARS architecture with the fixed index scheme achieves a performance comparable to that of DAS, and the RDARS with the mode switching has a further performance improvement. This demonstrates the extra mode selection gain provided by dynamic element configurations of RDARS.

V-B2 Impact of The Number of UEs

Refer to caption
Figure 5: WSR versus the number of UEs with different architectures.

In Fig. 5, we plot the WSR versus the number of UEs for different architectures. It is observed that the performance curves of all schemes decrease with the number of UEs. The reason is that the spatial diversity gain is limited as the number of UEs increases, leading to more severe IUI. Moreover, the PWM-BFNet trained at K=4K=4italic_K = 4 still remains a high performance for scenarios with different numbers of UEs. Specifically, the WSR of PWM-BFNet trained at K=4K=4italic_K = 4 when K=5K=5italic_K = 5 is comparable to that of the PWM algorithm when K=2K=2italic_K = 2, which shows the advantage of model-driven DL. Furthermore, the performance of the RIS-aided system has a notable decline with the number of UEs, which shows that this system is more sensitive to the number of UEs compared with other architectures. This is because the multiplicative fading suffered by RIS significantly impacts the system performance for a large number of UEs.

V-B3 Impact of The Number of RDARS elements

Refer to caption
Figure 6: WSR versus the number of RDARS elements with different architectures.

In Fig. 6, we plot the WSR versus the number of RDARS elements with the different architectures. It is observed that the performance curves of RDARS-aided system increase with the number of RDARS elements. Specifically, the performance enhancement arising from the increase in the number of RDARS elements is limited, which is due to the limited passive beamforming gain provided by passive elements. Moreover, the RDARS-aided system with around 100 elements using the fixed index scheme achieves a comparable performance to the DAS. This demonstrates the need for dynamic element mode switching, especially for widely fluctuating channel conditions.

V-B4 Impact of The Rician Factor

Refer to caption
Figure 7: WSR versus the Rician factor with different architectures.

In Fig. 7, we plot the WSR versus the Rician factor with the different architectures. It is seen that the WSR curves decrease as the Rician factor increases. This is because as the Rician factor increases, the line-of-sight (LoS) component dominates the channel, thus resulting in the spatial diversity loss, i.e., the BS-RDARS channel. Thus, the mitigation of IUI is limited due to more significant interference. However, the performance degradation fluctuates within a small range, which further demonstrates the robustness of the proposed algorithms.

V-B5 Impact of The Number of TEs

Refer to caption
Figure 8: WSR versus the number of RDARS transmit elements with different architectures.

In Fig. 8, we plot the WSR versus the number of RDARS TEs with the different architectures. It is observed that the performance of PWM-BFNet has a considerable performance improvement compared to that of the PWM algorithm. The reason is that the PWM algorithm is more easily trapped into a local optimum, due to the intractable integer constraint of the mode switching matrix, especially for a larger number of RDARS TEs.

VI Conclusion

In this paper, we considered a RDARS-aided downlink MIMO system, where the WSR was maximized by jointly optimizing the beamforming matrices for the BS and RDARS and the mode switching matrix for RDARS. The PWM algorithm was first proposed to solve the MINLP problem by utilizing the WMMSE, MM, and power iteration algorithms. Moreover, the parameters determining the convergence speed of the PWM algorithm and the system performance were analyzed, and then the PWM algorithm was deeply unfolded into a model-driven DL network, i.e., PWM-BFNet, with these parameters trained. Simulation results verified the PWM-BFNet achieved a significantly high convergence speed and considerable performance improvement, especially at a high total transmit power and a large number of RDARS TEs. Meanwhile, compared to the benchmark schemes, the improved initialization and trainable variables reduced the number of iterations required for convergence, which significantly reduced the computational complexity. The results in this paper showed the superiority of the RDARS-aided system and illustrated the effectiveness of integrating the model-driven DL into joint beamforming and mode switching design for this system.

Appendix A: Proof of Lemma 1

With (5), the MSE of yky_{k}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is

ek\displaystyle e_{k}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =???{(y^k?yk)?(y^k?yk)H}\displaystyle=\mathbb{E}\{(\hat{y}_{k}-y_{k})(\hat{y}_{k}-y_{k})^{H}\}= blackboard_E { ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT }
=??{((ukH??k??k)?1)((ukH??k??k)?1)H\displaystyle=\mathbb{E}\{\left((u^{H}_{k}{\bf{h}}_{k}{\bf{f}}_{k})-1\right)\left((u^{H}_{k}{\bf{h}}_{k}{\bf{f}}_{k})-1\right)^{H}= blackboard_E { ( ( italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - 1 ) ( ( italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - 1 ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT
+ukHikK??k??i??iH??iHuk+ukHnknkHuk}\displaystyle+u^{H}_{k}\sum_{i\neq k}^{K}{\bf{h}}_{k}{\bf{f}}_{i}{\bf{f}}^{H}_{i}{\bf{h}}^{H}_{i}u_{k}+u^{H}_{k}n_{k}n^{H}_{k}u_{k}\}+ italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i ≠ italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT }
=1?ukH???k???k???kH???kH?uk+ukH?ikK??k???i???iH???iH?uk\displaystyle=1-u^{H}_{k}{\bf{h}}_{k}{\bf{f}}_{k}-{\bf{f}}^{H}_{k}{\bf{h}}^{H}_{k}u_{k}+u^{H}_{k}\sum_{i\neq k}^{K}{\bf{h}}_{k}{\bf{f}}_{i}{\bf{f}}^{H}_{i}{\bf{h}}^{H}_{i}u_{k}= 1 - italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i ≠ italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
+ukH?uk?σk2?m=1K??mH???mPtot.\displaystyle+u^{H}_{k}u_{k}\sigma^{2}_{k}\sum_{m=1}^{K}\frac{{\bf{f}}^{H}_{m}{\bf{f}}_{m}}{P_{\rm{tot}}}.+ italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT divide start_ARG bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG . (35)

An auxiliary term m=1K(??mH???m)/Ptot\sum_{m=1}^{K}({\bf{f}}^{H}_{m}{\bf{f}}_{m})/P_{\rm{tot}}∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) / italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT is introduced to the last term of (Appendix A: Proof of Lemma 1) to remove the maximum transmit power constraint in (8). Furthermore, given other fixed variables, by letting the first-order derivative of eke_{k}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT with respect to μk\mu_{k}italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be equal to zero, we have

?ek?uk\displaystyle\frac{\partial e_{k}}{\partial u_{k}}divide start_ARG ? italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ? italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG =???k???k+i=1K??k???i???iH???kH?ukH+uk?σk2=0.\displaystyle=-{\bf{h}}_{k}{\bf{f}}_{k}+\sum_{i=1}^{K}{\bf{h}}_{k}{\bf{f}}_{i}{\bf{f}}^{H}_{i}{\bf{h}}^{H}_{k}u^{H}_{k}+u_{k}\sigma^{2}_{k}=0.= - bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0 . (36)

The optimal solution of uk{u_{k}}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in (9) is given by

ukopt=Jk?1???k???k,u^{\rm{opt}}_{k}={{J}}_{k}^{-1}{\bf{h}}_{k}{\bf{f}}_{k},italic_u start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (37)

where Jk=i=1K??k???i???iH???kH+σk2{{J}}_{k}=\sum_{i=1}^{K}{{\bf{h}}_{k}{\bf{f}}_{i}{\bf{f}}^{H}_{i}{\bf{h}}^{H}_{k}}+\sigma^{2}_{k}italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Similarly, by letting the first-order derivative of eke_{k}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT with respect to the auxiliary term λk\lambda_{k}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be equal to zero, we have

λkopt=ek?1.\lambda^{\rm{opt}}_{k}=e^{-1}_{k}.italic_λ start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (38)

By substituting ukoptu^{\rm{opt}}_{k}italic_u start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and λkopt\lambda^{\rm{opt}}_{k}italic_λ start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, ?k??\forall k\in\mathcal{K}? italic_k ∈ caligraphic_K, into problem (9), we have the following equivalent problem

max??,??,??,??~\displaystyle\mathop{\max}\limits_{{{\bf{F}}},{\bf{\Phi}},{\bf{A}},{\tilde{\bf{A}}}}\;roman_max start_POSTSUBSCRIPT bold_F , bold_Φ , bold_A , over~ start_ARG bold_A end_ARG end_POSTSUBSCRIPT k=1Kαklog2(ekmin)?1\displaystyle\;\;\sum\limits_{k=1}^{K}{{\alpha_{k}}\log_{2}{(e^{\rm{min}}_{k})^{-1}}}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (39a)
s.t. (?8c?),(?8d?),(?8e?),(?8f?),\displaystyle\eqref{con: Phi},\eqref{con: A},\eqref{con: A tilde},\eqref{con: A +A tilde},italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , (39b)

where

ekmin=(1???kH???kH?Jk?1???k???k)?1.e^{\rm{min}}_{k}=(1-{\bf{f}}^{H}_{k}{\bf{h}}^{H}_{k}J_{k}^{-1}{\bf{h}}_{k}{\bf{f}}_{k})^{-1}.italic_e start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( 1 - bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (40)

By applying the Woodbury matrix identity to (40), we have

log2(ekmin)?1=log2(1+??k??k??kH??kHΥk?1),\displaystyle\log_{2}{(e^{\rm{min}}_{k})^{-1}}=\log_{2}{(1+{\bf{h}}_{k}{\bf{f}}_{k}{\bf{f}}^{H}_{k}{\bf{h}}^{H}_{k}\varUpsilon_{k}^{-1})},roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_Υ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) , (41)

with

Υk=Jk?1???k???k???kH???kH.\varUpsilon_{k}=J_{k}^{-1}-{\bf{h}}_{k}{\bf{f}}_{k}{\bf{f}}^{H}_{k}{\bf{h}}^{H}_{k}.roman_Υ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_f start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (42)

By combining (39) and (41), the proof of Lemma 1 is thus completed.

Appendix B: Proof of Lemma?2

With ??H??????=??H???+ε???N???=??H?????+(N+1){\bf{p}}^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}={\bf{p}}^{H}{{\bf{D}}+\varepsilon{\bf{I}}_{N}}{\bf{p}}={\bf{p}}^{H}{\bf{D}}{\bf{p}}+(N+1)bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p = bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D + italic_ε bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_p = bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Dp + ( italic_N + 1 ), problem (18) is thus equivalent to problem (16). Then, based on the definition, we have

??{(??(q+1))H??????(q)}\displaystyle\Re\{\left({\bf{p}}^{(q+1)}\right)^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{(q)}\}roman_? { ( bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT } =max|pn=1|??{??H??????q}\displaystyle=\mathop{\max}\limits_{|p_{n}=1|}\Re\{{\bf{p}}^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{q}\}= roman_max start_POSTSUBSCRIPT | italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 1 | end_POSTSUBSCRIPT roman_? { bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT }
(??(q))H??????(q).\displaystyle\geq\left({\bf{p}}^{(q)}\right)^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{(q)}.≥ ( bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT . (43)

When ??(q+1)??(q){\bf{p}}^{(q+1)}\neq{\bf{p}}^{(q)}bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT ≠ bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT and the matrix ??{\bf{D}}^{{}^{\prime}}bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is positive definite, we have

(??(q+1)???(q))H????(??(q+1)???(q))>0.({\bf{p}}^{(q+1)}-{\bf{p}}^{(q)})^{H}{\bf{D}}^{{}^{\prime}}({\bf{p}}^{(q+1)}-{\bf{p}}^{(q)})>0.( bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT - bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ( bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT - bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) > 0 . (44)

It then follows that

(??(q+1))H??????(q+1)\displaystyle({\bf{p}}^{(q+1)})^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{(q+1)}( bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT >2???{(??(q+1))H??????(q)}?(??(q))H??????(q)\displaystyle\!>\!2\Re\{({\bf{p}}^{(q+1)})^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{(q)}\}-({\bf{p}}^{(q)})^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{(q)}> 2 roman_? { ( bold_p start_POSTSUPERSCRIPT ( italic_q + 1 ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT } - ( bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT
>(??(q))H??????(q),\displaystyle\!>\!({\bf{p}}^{(q)})^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}^{(q)},> ( bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ( italic_q ) end_POSTSUPERSCRIPT , (45)

which implies that the objective function of problem (18) monotonically increases with ??{\bf{p}}bold_p. Furthermore, the objective function of problem (18) is upper-bounded by

??H??????\displaystyle{\bf{p}}^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p =m=1N+1n=1N+1pm??Dm,n?pnm=1N+1n=1N+1|pm??Dm,n?pn|\displaystyle=\sum_{m=1}^{N+1}\sum_{n=1}^{N+1}p^{*}_{m}{{D}}^{{}^{\prime}}_{m,n}p_{n}\leq\sum_{m=1}^{N+1}\sum_{n=1}^{N+1}|p^{*}_{m}{{D}}^{{}^{\prime}}_{m,n}p_{n}|= ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT | italic_p start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT |
m=1N+1n=1N+1|Dm,n|.\displaystyle\leq\sum_{m=1}^{N+1}\sum_{n=1}^{N+1}|{{D}}^{{}^{\prime}}_{m,n}|.≤ ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT | italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT | . (46)

Thus, the power iteration algorithm is guaranteed to converge to a stationary point ??ˉ\bar{\bf{p}}overˉ start_ARG bold_p end_ARG, given by

?????ˉ=abs(?????ˉ)ej?arg(?????ˉ)=abs(?????ˉ)??ˉ.{\bf{D}}^{{}^{\prime}}\bar{\bf{p}}=\operatorname*{abs}({\bf{D}}^{{}^{\prime}}\bar{\bf{p}})\odot e^{j\operatorname*{arg}({\bf{D}}^{{}^{\prime}}\bar{\bf{p}})}=\operatorname*{abs}({\bf{D}}^{{}^{\prime}}\bar{\bf{p}})\odot\bar{\bf{p}}.bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT overˉ start_ARG bold_p end_ARG = roman_abs ( bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT overˉ start_ARG bold_p end_ARG ) ⊙ italic_e start_POSTSUPERSCRIPT italic_j roman_arg ( bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT overˉ start_ARG bold_p end_ARG ) end_POSTSUPERSCRIPT = roman_abs ( bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT overˉ start_ARG bold_p end_ARG ) ⊙ overˉ start_ARG bold_p end_ARG . (47)

Furthermore, we have

??H?(diag(abs(?????ˉ))???)???\displaystyle{\bf{p}}^{H}(\operatorname*{diag}(\operatorname*{abs}({\bf{D}}^{{}^{\prime}}\bar{\bf{p}}))\!-\!{\bf{D}}^{{}^{\prime}}){\bf{p}}bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( roman_diag ( roman_abs ( bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT overˉ start_ARG bold_p end_ARG ) ) - bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_p
=??H?diag(abs(?????ˉ))??????H??????\displaystyle={\bf{p}}^{H}\operatorname*{diag}(\operatorname*{abs}({\bf{D}}^{{}^{\prime}}\bar{\bf{p}})){\bf{p}}-{\bf{p}}^{H}{\bf{D}}^{{}^{\prime}}{\bf{p}}= bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_diag ( roman_abs ( bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT overˉ start_ARG bold_p end_ARG ) ) bold_p - bold_p start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT bold_p
=m=1N+1|n=1N+1Dm,n?pn|?m=1N+1n=1N+1??m??Dm,n?pn\displaystyle=\sum_{m=1}^{N+1}\left|\sum_{n=1}^{N+1}{{D}}^{{}^{\prime}}_{m,n}p_{n}\right|-\sum_{m=1}^{N+1}\sum_{n=1}^{N+1}{\bf{p}}^{*}_{m}{{D}}^{{}^{\prime}}_{m,n}p_{n}= ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT | ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | - ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT
|m=1N+1n=1N+1??m??Dm,n?pn|?m=1N+1n=1N+1??m??Dm,n?pn\displaystyle\geq\left|\sum_{m=1}^{N+1}\sum_{n=1}^{N+1}{\bf{p}}^{*}_{m}{{D}}^{{}^{\prime}}_{m,n}p_{n}\right|-\sum_{m=1}^{N+1}\sum_{n=1}^{N+1}{\bf{p}}^{*}_{m}{{D}}^{{}^{\prime}}_{m,n}p_{n}≥ | ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | - ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N + 1 end_POSTSUPERSCRIPT bold_p start_POSTSUPERSCRIPT ? end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m , italic_n end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT
0.\displaystyle\geq 0.≥ 0 . (48)

Therefore, ??ˉ\bar{\bf{p}}overˉ start_ARG bold_p end_ARG is a local optimum of problem (16) based on the results revealed in Appendix B of [35]. This thus completes the proof of Lemma 2.

Appendix C: Derivation of Auxiliary Parameters

For any ??{\bf{a}}bold_a, the last term of the objective function of (19) can be expressed as

12?ρ??????~???~HF2\displaystyle\frac{1}{2\rho}||{\bf{A}}-\tilde{\bf{A}}\tilde{\bf{A}}^{H}||^{2}_{{F}}divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG | | bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT =12?ρ?Tr((??H???~???~H)?(?????~???~H))\displaystyle=\frac{1}{2\rho}\operatorname*{Tr}\left(({\bf{A}}^{H}-\tilde{\bf{A}}\tilde{\bf{A}}^{H})({\bf{A}}-\tilde{\bf{A}}\tilde{\bf{A}}^{H})\right)= divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG roman_Tr ( ( bold_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) ( bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) )
=12?ρ?Tr(???(??N?2???~???~H)+??~???~H)\displaystyle=\frac{1}{2\rho}\operatorname*{Tr}\left({\bf{A}}({\bf{I}}_{N}-2\tilde{\bf{A}}\tilde{\bf{A}}^{H})+\tilde{\bf{A}}\tilde{\bf{A}}^{H}\right)= divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG roman_Tr ( bold_A ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - 2 over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) + over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT )
=12?ρ?(??4T???+r5),\displaystyle=\frac{1}{2\rho}({\bf{r}}^{T}_{4}{\bf{a}}+r_{5}),= divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG ( bold_r start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT bold_a + italic_r start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) , (49)

where ??4{{\bf{r}}_{4}}bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT and r5r_{5}italic_r start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT are given in (21). The objection function of (19) can be rewritten as

f3?(??)=k=1Kαk?λk?ek?(??)+12?ρ??????~???~HF2.\displaystyle f_{3}({\bf{A}})=\sum_{k=1}^{K}\alpha_{k}\lambda_{k}e_{k}({\bf{a}})+\frac{1}{2\rho}||{\bf{A}}-\tilde{\bf{A}}\tilde{\bf{A}}^{H}||^{2}_{{F}}.italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( bold_A ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_a ) + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG | | bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT . (50)

with ek?(??)e_{k}({\bf{a}})italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_a ) given by

ek?(??)\displaystyle{e_{k}}({\bf{a}})italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_a ) =ukH???H?diag?(??r,k???b,k)???+uk???T?diag?(??b,kH???r,kH)???\displaystyle=u_{k}^{H}{{\bm{\varphi}}^{H}}\operatorname{diag}({{\bf{H}}_{r,k}}{{\bf{w}}_{b,k}}){\bf{a}}+{u_{k}}{{\bf{a}}^{T}}\operatorname{diag}({\bf{w}}_{b,k}^{H}{{\bf{H}}_{r,k}}^{H}){\bm{\varphi}}= italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_diag ( bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_k end_POSTSUBSCRIPT ) bold_a + italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_diag ( bold_w start_POSTSUBSCRIPT italic_b , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) bold_italic_φ
???T?ukH?uk???H???r,k?m=1K??b,m???b,mH???r,kH???\displaystyle-{{\bf{a}}^{T}}u_{k}^{H}{u_{k}}{{\bf{\Phi}}^{H}}{{\bf{H}}_{r,k}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{b,m}}{\bf{w}}_{b,m}^{H}}{\bf{H}}_{r,k}^{H}{\bm{\varphi}}- bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_φ
?ukH?uk???H???r,k?m=1K??b,m???b,mH???r,kH??????\displaystyle-u_{k}^{H}{u_{k}}{{\bm{\varphi}}^{H}}{{\bf{H}}_{r,k}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{b,m}}{\bf{w}}_{b,m}^{H}}{\bf{H}}_{r,k}^{H}{\bf{\Phi a}}- italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_italic_φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ bold_a
+ukH?uk???T???H???r,k?m=1K??b,m???b,mH???r,kH??????\displaystyle+u_{k}^{H}{u_{k}}{{\bf{a}}^{T}}{{\bf{\Phi}}^{H}}{{\bf{H}}_{r,k}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{b,m}}{\bf{w}}_{b,m}^{H}}{\bf{H}}_{r,k}^{H}{\bf{\Phi a}}+ italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ bold_a
?ukH?uk?m=1K??r,kH???~???r,m???b,mH???r,kH??????\displaystyle-u_{k}^{H}{u_{k}}\sum\limits_{m=1}^{K}{{\bf{h}}_{r,k}^{H}{\bf{\tilde{A}}}{{\bf{w}}_{r,m}}{\bf{w}}_{b,m}^{H}}{{\bf{H}}_{r,k}}^{H}{\bf{\Phi a}}- italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG bold_w start_POSTSUBSCRIPT italic_r , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ bold_a
?ukH?uk???T???H???r,k?m=1K??b,m???r,mH???~H???r,k.\displaystyle-u_{k}^{H}{u_{k}}{{\bf{a}}^{T}}{{\bf{\Phi}}^{H}}{{\bf{H}}_{r,k}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{b,m}}{\bf{w}}_{r,m}^{H}}{{{\bf{\tilde{A}}}}^{H}}{{\bf{h}}_{r,k}}.- italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_r , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT . (51)

Therefore, we have

f3?(??)\displaystyle{f_{3}}({\bf{A}})italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( bold_A ) =??1H???+??T???1+??T???2+??2H???+??T???1???+??3H???\displaystyle={\bf{r}}_{1}^{H}{\bf{a}}+{{\bf{a}}^{T}}{{\bf{r}}_{1}}+{{\bf{a}}^{T}}{{\bf{r}}_{2}}+{\bf{r}}_{2}^{H}{\bf{a}}+{{\bf{a}}^{T}}{{\bf{R}}_{1}}{\bf{a}}+{\bf{r}}_{3}^{H}{\bf{a}}= bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a + bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a + bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a
+??T???3+12?ρ?(??4T???+r5),\displaystyle+{{\bf{a}}^{T}}{{\bf{r}}_{3}}+\frac{1}{{2\rho}}({\bf{r}}_{4}^{T}{\bf{a}}+{{{r}}_{5}}),+ bold_a start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG ( bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_a + italic_r start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) , (52)

where ??1{\bf{r}}_{1}bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, ??2{{\bf{r}}_{2}}bold_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, ??3{{\bf{r}}_{3}}bold_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, ??4{{\bf{r}}_{4}}bold_r start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, and r5{{{r}}_{5}}italic_r start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT are given in (21).

Similarly, the objective function of (24) can be expressed as

f4?(??~)=k=1Kαk?λk?ek?(??~)+12?ρ??????~???~HF2,\displaystyle f_{4}(\tilde{{\bf{A}}})=\sum_{k=1}^{K}\alpha_{k}\lambda_{k}e_{k}(\tilde{{\bf{a}}})+\frac{1}{2\rho}||{\bf{A}}-\tilde{\bf{A}}\tilde{\bf{A}}^{H}||^{2}_{{F}},italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ( over~ start_ARG bold_A end_ARG ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( over~ start_ARG bold_a end_ARG ) + divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG | | bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT , (53)

with ek?(??~)e_{k}(\tilde{{\bf{a}}})italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( over~ start_ARG bold_a end_ARG ) given by

ek?(??~)\displaystyle{e_{k}}(\tilde{{\bf{a}}})italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( over~ start_ARG bold_a end_ARG ) =ukH?uk???r,kH???~?m=1K??r,m???b,mH???r,kH?(??N???)???\displaystyle=u_{k}^{H}{u_{k}}{{\bf{h}}^{H}_{r,k}}{\bf{\tilde{A}}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{r,m}}{\bf{w}}_{b,m}^{H}}{\bf{H}}_{r,k}^{H}{\bf{(}}{{\bf{I}}_{N}}{\bf{-A){\bm{\varphi}}}}= italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT over~ start_ARG bold_A end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_r , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_italic_φ
?ukH???r,kH???~???r,k???r,kH???~H???r,k?uk\displaystyle-u_{k}^{H}{\bf{h}}_{r,k}^{H}{\bf{\tilde{A}}}{{\bf{w}}_{r,k}}-{\bf{w}}_{r,k}^{H}{{{\bf{\tilde{A}}}}^{H}}{{\bf{h}}_{r,k}}{u_{k}}- italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG bold_w start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT - bold_w start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
+ukH?uk???H?(??N???)???r,k?m=1K??b,m???r,mH???~H???r,k\displaystyle+u_{k}^{H}{u_{k}}{{\bm{\varphi}}^{H}}{\bf{(}}{{\bf{I}}_{N}}{\bf{-A)}}{{\bf{H}}_{r,k}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{b,m}}{\bf{w}}_{r,m}^{H}}{{{\bf{\tilde{A}}}}^{H}}{{\bf{h}}_{r,k}}+ italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_italic_φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - bold_A ) bold_H start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_b , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_r , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT
+ukH?uk???r,kH???~?m=1K??r,m???r,mH???~H???r,k.\displaystyle+u_{k}^{H}{u_{k}}{\bf{h}}_{r,k}^{H}{\bf{\tilde{A}}}\sum\limits_{m=1}^{K}{{{\bf{w}}_{r,m}}{\bf{w}}_{r,m}^{H}}{{{\bf{\tilde{A}}}}^{H}}{{\bf{h}}_{r,k}}.+ italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_r , italic_m end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_r , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_r , italic_k end_POSTSUBSCRIPT . (54)

Moreover, the last term of (53) can be expressed as

12?ρ??????~???~HF2\displaystyle\frac{1}{2\rho}||{\bf{A}}-\tilde{\bf{A}}\tilde{\bf{A}}^{H}||^{2}_{{F}}divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG | | bold_A - over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT =12?ρ?(Tr(??+??~???~H)?2?Tr(?????~???~H))\displaystyle=\frac{1}{2\rho}\left(\operatorname*{Tr}({\bf{A}}+\tilde{\bf{A}}\tilde{\bf{A}}^{H})-2\operatorname*{Tr}({\bf{A}}\tilde{\bf{A}}\tilde{\bf{A}}^{H})\right)= divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG ( roman_Tr ( bold_A + over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) - 2 roman_Tr ( bold_A over~ start_ARG bold_A end_ARG over~ start_ARG bold_A end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) )
=12?ρ?(a???~H?(??a???)???~).\displaystyle=\frac{1}{2\rho}(a-{\tilde{\bf{a}}^{H}}({\bf{I}}_{a}\otimes{\bf{A}})\tilde{\bf{a}}).= divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG ( italic_a - over~ start_ARG bold_a end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ? bold_A ) over~ start_ARG bold_a end_ARG ) . (55)

Therefore, we have (III-C), (21), and (25), which completes the proof.

References

  • [1] Q.?Xue, C.?Ji, S.?Ma, J.?Guo, Y.?Xu, Q.?Chen, and W.?Zhang, “A survey of beam management for mmWave and THz communications towards 6G,” IEEE Commun. Surveys Tuts., vol.?26, no.?3, pp. 1520–1559, 3rd Quart., 2024.
  • [2] H.?Lu, Y.?Zeng, C.?You, Y.?Han, J.?Zhang, Z.?Wang, Z.?Dong, S.?Jin, C.-X. Wang, T.?Jiang, X.?You, and R.?Zhang, “A tutorial on near-field XL-MIMO communications toward 6G,” IEEE Commun. Surveys Tuts., vol.?26, no.?4, pp. 2213–2257, 4th Quart., 2024.
  • [3] E.?G. Larsson, “Massive synchrony in distributed antenna systems,” IEEE Trans. Signal Process., vol.?72, pp. 855–866, Jan. 2024.
  • [4] W.?Jiang, Q.?Zhou, J.?He, M.?A. Habibi, S.?Melnyk, M.?El-Absi, B.?Han, M.?D. Renzo, H.?D. Schotten, F.-L. Luo, T.?S. El-Bawab, M.?Juntti, M.?Debbah, and V.?C.?M. Leung, “Terahertz communications and sensing for 6G and beyond: A comprehensive review,” IEEE Commun. Surveys Tuts., vol.?26, no.?4, pp. 2326–2381, 4th Quart., 2024.
  • [5] S.?Uchimura, K.?Ando, G.?T.?F. de?Abreu, and K.?Ishibashi, “Joint design of equalization and beamforming for single-carrier MIMO transmission over millimeter-wave and sub-terahertz channels,” IEEE Trans. Wireless Commun., vol.?24, no.?3, pp. 1978–1991, 2025.
  • [6] M.?Mohammadi, Z.?Mobini, H.?Quoc?Ngo, and M.?Matthaiou, “Ten years of research advances in full-duplex massive MIMO,” IEEE Trans. Commun., vol.?73, no.?3, pp. 1756–1786, Mar. 2025.
  • [7] Y.?Liu, X.?Liu, X.?Mu, T.?Hou, J.?Xu, M.?Di?Renzo, and N.?Al-Dhahir, “Reconfigurable intelligent surfaces: Principles and opportunities,” IEEE Commun. Surveys Tuts., vol.?23, no.?3, pp. 1546–1577, 3rd Quart., 2021.
  • [8] H.?Lu, Y.?Zeng, S.?Jin, and R.?Zhang, “Single-carrier delay alignment modulation for multi-IRS aided communication,” IEEE Trans. Wireless Commun., vol.?23, no.?4, pp. 3267–3282, Apr. 2024.
  • [9] A.?Alkhateeb, G.?Leus, and R.?W.?H. Jr., “Limited feedback hybrid precoding for multi-user millimeter wave systems,” IEEE Trans. Wirel. Commun., vol.?14, no.?11, pp. 6481–6494, Nov. 2015.
  • [10] C.?Ma, X.?Yang, J.?Wang, G.?Yang, and S.?Ma, “Reconfigurable distributed antennas and reflecting surface (RDARS): A new architecture for wireless communications,” IEEE Trans. Commun., vol.?72, no.?10, pp. 6583–6598, Oct. 2024.
  • [11] P.?V. Trinh and S.?Sugiura, “Maximum secrecy throughput analysis of practical RIS-aided mmWave systems over unified FTR/IFTR fading channels,” IEEE Trans. Wireless Commun., vol.?23, no.?11, pp. 16?590–16?605, Nov. 2024.
  • [12] Z.?Zhang, L.?Dai, X.?Chen, C.?Liu, F.?Yang, R.?Schober, and H.?V. Poor, “Active RIS vs. passive RIS: Which will prevail in 6G?” IEEE Trans. Commun., vol.?71, no.?3, pp. 1707–1725, Mar. 2023.
  • [13] Q.?Peng, Q.?Wu, W.?Chen, S.?Ma, M.-M. Zhao, and O.?A. Dobre, “Semi-passive intelligent reflecting surface-enabled sensing systems,” IEEE Trans. Commun., vol.?72, no.?12, pp. 7674–7688, Dec. 2024.
  • [14] J.?Wang, C.?Ma, S.?Gong, X.?Yang, and S.?Ma, “Joint beamforming optimization and mode selection for RDARS-aided MIMO systems,” IEEE Trans. Wireless Commun., vol.?23, no.?11, pp. 17?557–17?572, Nov. 2024.
  • [15] C.?Ma, J.?Wang, X.?Yang, G.?Yang, W.?Zhang, and S.?Ma, “RDARS empowered massive MIMO system: Two-timescale transceiver design with imperfect CSI,” IEEE Trans. Wireless Commun., vol.?23, pp. 18?806–18?821, Dec. 2024.
  • [16] C.?Ji, Q.?Xue, H.?Lu, J.?Wang, Q.?Peng, S.?Ma, and W.?Zhang, “Reconfigurable codebook-based beamforming for RDARS-aided mmWave MU-MIMO systems,” arXiv preprint arXiv:2504.01333, 2025.
  • [17] J.?Wang, C.?Ji, J.?Guo, and S.?Ma, “Demo: Reconfigurable distributed antennas and reflecting surface (RDARS)-aided integrated sensing and communication system,” in Proc. IEEE ICCC, 2023, pp. 1–2.
  • [18] P.?Zhang, J.?Wang, Y.?Shao, and S.?Ma, “Integrated sensing and communication with reconfigurable distributed antenna and reflecting surface: Joint beamforming and mode selection,” IEEE Internet Things J., vol.?12, no.?13, pp. 24?401–24?416, Jul. 2025.
  • [19] S.?Pala, O.?Taghizadeh, M.?Katwe, K.?Singh, C.?Li, and A.?Schmeink, “Secure RIS-assisted hybrid beamforming design with low-resolution phase shifters,” IEEE Trans. Wireless Commun., vol.?23, no.?8, pp. 10?198–10?212, Aug. 2024.
  • [20] B.?Wang, C.?Pan, H.?Ren, Z.?Yu, Y.?Zhang, M.?Liu, and G.?Zhou, “Beamforming design for double-active-RIS-aided communication systems with inter-excitation,” IEEE Trans. Wireless Commun., vol.?24, no.?7, pp. 5855–5870, Jul. 2025.
  • [21] Z.?Peng, Z.?Zhang, C.?Pan, M.?D. Renzo, O.?A. Dobre, and J.?Wang, “Beamforming optimization for active RIS-aided multiuser communications with hardware impairments,” IEEE Trans. Wirel. Commun., vol.?23, no.?8, pp. 9884–9898, Feb. 2024.
  • [22] M.?Liu, M.?Li, R.?Liu, and Q.?Liu, “Distributed distortion-aware beamforming designs for cell-free mMIMO systems,” IEEE J. Sel. Top. Signal Process., vol.?19, no.?2, pp. 381–397, Mar. 2025.
  • [23] B.?Wang, J.?Fang, Y.?Xiao, and M.?Haardt, “Max-min beamforming for large-scale cell-free massive MIMO: A randomized ADMM algorithm,” IEEE Trans. Wireless Commun., vol.?24, no.?5, pp. 4315–4328, May 2025.
  • [24] A.?M. Elbir and S.?Coleri, “Federated learning for hybrid beamforming in mm-wave massive MIMO,” IEEE Commun. Lett., vol.?24, no.?12, pp. 2795–2799, Dec. 2020.
  • [25] L.?Shen, T.?Chang, K.?Feng, and P.?Huang, “Design and implementation for deep learning based adjustable beamforming training for millimeter wave communication systems,” IEEE Trans. Veh. Technol., vol.?70, no.?3, pp. 2413–2427, Mar. 2021.
  • [26] C.?Huang, R.?Mo, and C.?Yuen, “Reconfigurable intelligent surface assisted multiuser MISO systems exploiting deep reinforcement learning,” IEEE J. Sel. Areas Commun., vol.?38, no.?8, pp. 1839–1850, Aug. 2020.
  • [27] G.?Chen, Z.?Wang, H.?Lin, Y.?Huang, and L.?Yang, “Computationally efficient unsupervised deep learning for robust joint AP clustering and beamforming design in cell-free systems,” IEEE Trans. Wireless Commun., vol.?24, no.?5, pp. 4250–4266, May 2025.
  • [28] A.?Balatsoukas-Stimming and C.?Studer, “Deep unfolding for communications systems: A survey and some new directions,” in Proc. IEEE Int. Workshop Signal Process. Syst. (SiPS), 2019, pp. 266–271.
  • [29] H.?He, S.?Jin, C.?Wen, F.?Gao, G.?Y. Li, and Z.?Xu, “Model-driven deep learning for physical layer communications,” IEEE Wirel. Commun., vol.?26, no.?5, pp. 77–83, Oct. 2019.
  • [30] W.?Jin, J.?Zhang, C.?Wen, S.?Jin, X.?Li, and S.?Han, “Low-complexity joint beamforming for RIS-assisted MU-MISO systems based on model-driven deep learning,” IEEE Trans. Wireless Commun., vol.?23, no.?7, pp. 6968–6982, Dec. 2023.
  • [31] Q.?Xu, J.?Sun, and Z.?Xu, “Efficient MU-MIMO beamforming based on majorization-minimization and deep unfolding,” IEEE Trans. Wireless Commun., vol.?24, no.?5, pp. 3949–3963, May 2025.
  • [32] G.?Sun, M.?Cao, W.?Wang, W.?Xu, and C.?Studer, “Deep-unfolded massive grant-free transmission in cell-free wireless communication systems,” IEEE Trans. Signal Process., vol.?73, pp. 1094–1109, Feb. 2025.
  • [33] K.-M. Chen, H.-Y. Chang, R.?Y. Chang, and W.-H. Chung, “Deep unfolded hybrid beamforming in reconfigurable intelligent surface aided mmwave mimo-ofdm systems,” IEEE Wireless Communications Letters, vol.?13, no.?4, pp. 1118–1122, 2024.
  • [34] Q.?Shi, M.?Razaviyayn, Z.?Luo, and C.?He, “An iteratively weighted MMSE approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,” IEEE Trans. Signal Process., vol.?59, no.?9, pp. 4331–4340, Sep. 2011.
  • [35] M.?Soltanalian and P.?Stoica, “Designing unimodular codes via quadratic optimization,” IEEE Trans. Signal Process., vol.?62, no.?5, pp. 1221–1234, May 2014.
  • [36] Q.?Wu and R.?Zhang, “Joint active and passive beamforming optimization for intelligent reflecting surface assisted SWIPT under QoS constraints,” IEEE J. Sel. Areas Commun., vol.?38, no.?8, pp. 1735–1748, Aug. 2020.
  • [37] E.?Bj?rnson, M.?Bengtsson, and B.?E. Ottersten, “Optimal multiuser transmit beamforming: A difficult problem with a simple solution structure [lecture notes],” IEEE Signal Process. Mag., vol.?31, no.?4, pp. 142–148, Jun. 2014.
一什么冰箱 白玫瑰适合送什么人 压片糖果是什么意思 吃雪燕有什么好处 胰腺钙化灶是什么意思
脱发缺乏什么维生素 阿奇霉素是什么药 皮肤软组织感染是什么意思 胃痉挛什么症状 kksk是什么意思
3月2号什么星座 所剩无几是什么意思 奶霜是什么 一什么阳光填量词 弱精是什么意思
脸上长红色的痘痘是什么原因 海兔是什么 口干口苦是什么病 龙眼什么时候上市 哇哦什么意思
输卵管造影是什么意思hcv8jop0ns9r.cn 腋窝下疼痛是什么原因hcv9jop2ns5r.cn 为什么生气会胃疼0735v.com 人为什么会出汗hcv7jop5ns4r.cn 货号是什么hcv8jop8ns9r.cn
蜘蛛怕什么hcv7jop9ns8r.cn 颖五行属什么hcv9jop2ns5r.cn 纯化水是什么水kuyehao.com gsp全称是什么hcv7jop6ns7r.cn 减肥喝什么hcv8jop9ns1r.cn
蛇蝎心肠是什么生肖hcv9jop8ns2r.cn 新疆有什么烟hcv8jop4ns2r.cn 榴莲树长什么样子图片hcv9jop7ns3r.cn 肠炎有什么表现hcv8jop5ns1r.cn 夏天穿什么衣服比较凉爽hcv8jop6ns0r.cn
58什么意思wuhaiwuya.com 水饮是什么意思hcv9jop5ns5r.cn 双侧卵巢显示不清是什么意思hcv8jop2ns5r.cn 生闷气容易得什么病hcv8jop9ns2r.cn 长此以往什么意思hcv9jop5ns6r.cn
百度