14
TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 1 MIMO Systems with Restricted Pre/Post-coding – Capacity Analysis based on Coupled Doubly-correlated Wishart Matrices Vishnu V. Ratnam, Student Member, IEEE, Andreas F. Molisch, Fellow, IEEE, and Haralabos C. Papadopoulos, Member, IEEE Abstract—Many practical communication systems have some form of restricted precoding or postcoding such as Antenna Selection, Selection Combining, Beam Selection and Limited Feedback precoding, to name a few. The capacity analysis of such systems is, in general, difficult and previous works in the literature provide results only for certain simplified cases. The current paper derives a novel approach to analyze the capacity for such systems under a very generic setting. The results are based on asymptotic closed-form expressions for second-order statistics and joint distributions of eigenvalues for a set of coupled, doubly-correlated Wishart matrices. A tight approximation to the joint distribution of the eigenvalues in the non-asymptotic regime is also proposed. These results are then used to show that the system capacity can be approximated as the largest element of a correlated Gaussian vector. Showing that this is equivalent to the problem of finding the distribution of sum of lognormals, we propose a novel approach to characterize its distribution. As an application, the capacity for an antenna selection system and a limited feedback precoding system are compared to their respective approximations. The paper also demonstrates how the results can be used to design the precoding codebook in limited feedback systems. Index Terms—Restricted precoding, Antenna Selection, Lim- ited Feedback precoding, Joint eigenvalue distribution, Wishart matrices, Capacity distribution. I. I NTRODUCTION With the rising amount of downlink data traffic but a limited available spectrum, there is an impending spectrum crunch and a need for higher spectral efficiency. Multiple-input- multiple-output (MIMO) transmission technologies promise large gains in spectral efficiency by offering spatial degrees of freedom for data transmission. In fact, with the progress in digital and radio-frequency (RF) hardware technology, the development of low complexity precoding algorithms and also the possibility of using the mm-wave frequency band for cellular transmission, cellular networks are moving towards the massive MIMO regime i.e., base stations with dozens or hundreds of antenna elements. First proposed in [1], data transmission in massive MIMO systems offers several ad- vantages such as simplified precoding, higher beamforming gains etc. However, having a massive antenna array brings with it several problems. Firstly, the cost of channel state V. V. Ratnam and A. F. Molisch are with the Department of Electrical Engineering, University of Southern California, Los Angeles, CA, 90089 USA (e-mail: {ratnam, molisch}@usc.edu) H. C. Papadopoulos is with Docomo Innovations, Palo Alto, CA, 94304 USA (e-mail: [email protected]) information (CSI) feedback from user terminal to the base- station, in the downlink scenario, increases significantly for frequency division duplexing systems 1 . This has led to the proposition of limited feedback precoding (LFP) [2], wherein the transmitter maintains a codebook of precoding vectors and the receiver feeds back only the codebood index based on the CSI. Secondly, since RF hardware such as analogue-to-digital converters, digital-to-analogue converters, mixers, RF filters etc. are power hungry and expensive, with massive antenna arrays it may be impractical to equip each antenna with a dedicated RF chain. This has led to the proposition of transmit antenna selection (TAS) [3], [4], wherein a smaller number of RF chains feed the transmit antennas via an array of RF switches. Another such example is a beam selection system [5]–[7]. In all of these precoding methods, the data precoding is restricted to take-on only a restricted set of values (either due to limited CSI or due to limitations in the RF hardware). We shall refer to any such system with restrictions on data precoding as a restricted pre-coded system. 2 Such systems form an important class of practically viable massive MIMO systems. The performance analysis for restricted precoded systems is difficult in general and no closed form expressions (for the most general setting) are known to date. For the case of limited feedback precoding, lower bounds on the capacity with beamforming in an isotropic channel were proposed in [8]. System upper bounds under similar settings were considered in [9], [10] etc. The lower bound in [8] was extended to the case of spatial multiplexing in [11]. For correlated channels, heuristic designs for the codebook were suggested in [8], [12], [13]. However, bounds on the performance and the optimal design of the codebook for spatial multiplexing in a correlated channel are not available in literature to the best of the authors’ knowledge. Even in the relatively simple case of random vector quantized beamforming, performance bounds and good codebook designs for a correlated MIMO channel were found only recently [14]. A more complete discussion of the results prior to 2008 are available in [2]. Similarly, for the case of 1 Unlike with time division duplexing, where CSI from uplink training can be used for downlink transmission, with frequency division duplexing, we need to rely on downlink training and uplink CSI feedback. 2 A system with similar restrictions at the receiver, for example: Receive Antenna Selection, shall be referred to as a Restricted postcoded system

TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL ...TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 1 MIMO Systems with Restricted Pre/Post-coding – Capacity Analysis based on

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 1

    MIMO Systems with Restricted Pre/Post-coding –Capacity Analysis based on CoupledDoubly-correlated Wishart MatricesVishnu V. Ratnam, Student Member, IEEE, Andreas F. Molisch, Fellow, IEEE,

    and Haralabos C. Papadopoulos, Member, IEEE

    Abstract—Many practical communication systems have someform of restricted precoding or postcoding such as AntennaSelection, Selection Combining, Beam Selection and LimitedFeedback precoding, to name a few. The capacity analysis ofsuch systems is, in general, difficult and previous works inthe literature provide results only for certain simplified cases.The current paper derives a novel approach to analyze thecapacity for such systems under a very generic setting. Theresults are based on asymptotic closed-form expressions forsecond-order statistics and joint distributions of eigenvalues fora set of coupled, doubly-correlated Wishart matrices. A tightapproximation to the joint distribution of the eigenvalues in thenon-asymptotic regime is also proposed. These results are thenused to show that the system capacity can be approximated asthe largest element of a correlated Gaussian vector. Showing thatthis is equivalent to the problem of finding the distribution ofsum of lognormals, we propose a novel approach to characterizeits distribution. As an application, the capacity for an antennaselection system and a limited feedback precoding system arecompared to their respective approximations. The paper alsodemonstrates how the results can be used to design the precodingcodebook in limited feedback systems.

    Index Terms—Restricted precoding, Antenna Selection, Lim-ited Feedback precoding, Joint eigenvalue distribution, Wishartmatrices, Capacity distribution.

    I. INTRODUCTION

    With the rising amount of downlink data traffic but a limitedavailable spectrum, there is an impending spectrum crunchand a need for higher spectral efficiency. Multiple-input-multiple-output (MIMO) transmission technologies promiselarge gains in spectral efficiency by offering spatial degreesof freedom for data transmission. In fact, with the progressin digital and radio-frequency (RF) hardware technology, thedevelopment of low complexity precoding algorithms and alsothe possibility of using the mm-wave frequency band forcellular transmission, cellular networks are moving towardsthe massive MIMO regime i.e., base stations with dozensor hundreds of antenna elements. First proposed in [1], datatransmission in massive MIMO systems offers several ad-vantages such as simplified precoding, higher beamforminggains etc. However, having a massive antenna array bringswith it several problems. Firstly, the cost of channel state

    V. V. Ratnam and A. F. Molisch are with the Department of ElectricalEngineering, University of Southern California, Los Angeles, CA, 90089 USA(e-mail: {ratnam, molisch}@usc.edu)

    H. C. Papadopoulos is with Docomo Innovations, Palo Alto, CA, 94304USA (e-mail: [email protected])

    information (CSI) feedback from user terminal to the base-station, in the downlink scenario, increases significantly forfrequency division duplexing systems1. This has led to theproposition of limited feedback precoding (LFP) [2], whereinthe transmitter maintains a codebook of precoding vectors andthe receiver feeds back only the codebood index based on theCSI. Secondly, since RF hardware such as analogue-to-digitalconverters, digital-to-analogue converters, mixers, RF filtersetc. are power hungry and expensive, with massive antennaarrays it may be impractical to equip each antenna with adedicated RF chain. This has led to the proposition of transmitantenna selection (TAS) [3], [4], wherein a smaller numberof RF chains feed the transmit antennas via an array of RFswitches. Another such example is a beam selection system[5]–[7]. In all of these precoding methods, the data precodingis restricted to take-on only a restricted set of values (eitherdue to limited CSI or due to limitations in the RF hardware).We shall refer to any such system with restrictions on dataprecoding as a restricted pre-coded system.2 Such systemsform an important class of practically viable massive MIMOsystems.

    The performance analysis for restricted precoded systemsis difficult in general and no closed form expressions (forthe most general setting) are known to date. For the case oflimited feedback precoding, lower bounds on the capacity withbeamforming in an isotropic channel were proposed in [8].System upper bounds under similar settings were consideredin [9], [10] etc. The lower bound in [8] was extended to thecase of spatial multiplexing in [11]. For correlated channels,heuristic designs for the codebook were suggested in [8], [12],[13]. However, bounds on the performance and the optimaldesign of the codebook for spatial multiplexing in a correlatedchannel are not available in literature to the best of the authors’knowledge. Even in the relatively simple case of randomvector quantized beamforming, performance bounds and goodcodebook designs for a correlated MIMO channel were foundonly recently [14]. A more complete discussion of the resultsprior to 2008 are available in [2]. Similarly, for the case of

    1Unlike with time division duplexing, where CSI from uplink training canbe used for downlink transmission, with frequency division duplexing, weneed to rely on downlink training and uplink CSI feedback.

    2A system with similar restrictions at the receiver, for example: ReceiveAntenna Selection, shall be referred to as a Restricted postcoded system

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 2

    TAS3, bounds on the distribution of the capacity for spatialmultiplexing in an isotropic channel were found in [15]. Theergodic capacity in the high and low signal-to-noise ratio(SNR) regimes were discussed in [16]. A loose upper boundon the outage probability for a spatial multiplexing systemwith receive antenna selection, in a correlated fading channelwas considered in [17]. However, the performance analysis ofspatial multiplexing with TAS in a correlated channel is notavailable in the literature to the best of the authors’ knowledge.A more complete review of literature on antenna selection isavailable in [2], [4], [18].

    Evaluation of the system performance is important in de-signing good systems. For example, it can aid in the design ofgood codebooks for a LFP system. In this paper we develop amathematical framework for the analysis of such systems. Asshall be shown in Sec. II, some of the important performancemeasures like mean and outage capacity are a function of theeigenvalues of a set of coupled4, doubly-correlated Wishartmatrices. Therefore characterizing the joint eigenvalue distri-bution across these coupled Wishart matrices is an essentialstep towards characterizing the performance. The asymptoticeigenvalue distribution [19], [20], non-asymptotic diagonaldistribution [21] and joint eigenvalue distributions [22] for asingle Wishart matrix have been widely characterized bothwith and with-out correlated entries. The joint eigenvaluedistribution for a pair of correlated Wishart matrices wascharacterized in [23], [24]. However, the joint eigenvaluedistribution across a larger set of correlated, let alone coupled,Wishart matrices has not been studied in literature to the bestof our knowledge.

    The contributions of this paper are as follows: We derivethe asymptotic second-order statistics and joint distribution ofeigenvalues across a set of coupled, doubly-correlated Wishartmatrices. We also propose a tight approximation for thejoint distribution of eigenvalues in the non-asymptotic regime.These results are then used to approximate the distributionof capacity for a restricted precoded system. In the process,we propose a new technique for finding the distribution ofthe largest element of a correlated Gaussian vector. As anapplication of the proposed techniques, we also design anefficient codebook for a limited feedback system. Though wefocus here on restricted precoding, the presented analysis canalso be extended to the case of restricted postcoding.

    The rest of the paper is organized as follows: In Sec. II,the channel model is introduced and the problem of findingthe capacity of a restricted precoded system is formulated. Thejoint distribution of the channel eigenvalues are derived in Sec.III. Using these results, the approximate capacity distributionfor a restricted precoded system is derived in Sec. IV. To studythe effectiveness of the approximation, simulations under somepractical channel parameters are performed in Sec. V. As anapplication of the results, we also demonstrate how it can be

    3It is well known that transmit antenna selection without CSI at thetransmitter (CSIT) can be interpreted as a type of LFP [2]. However withthe presence of CSIT, this is not true.

    4By coupling, we mean that the Gaussian matrices generating the set ofWishart matrices have some common elements.

    used to design a good codebook for a limited feedback systemin Sec. VI. Finally, the conclusions are presented in Sec. VII.

    Notation used in this work is as follows: scalars are rep-resented by light-case letters; vectors by bold-case letters;matrices are represented by capitalized bold-case letters andsets; and subspaces are represented by calligraphic letters.Additionally, ai represents the i-th element of a vector a, ‖a‖Prepresents the LP norm of a vector a, [A]i, j represents the(i, j)-th element of a matrix A, [A]c{i } and [A]r{i } representthe i-th column and row vectors of matrix A, respectively,‖A‖F represents the Frobenius norm of a matrix A, A† isthe conjugate transpose of a matrix A and |A| representsthe cardinality of a set A or dimension of a space A. Also,E{} represents the expectation operator, P is the probabilityoperator, Ii and Oi, j are the i × i and i × j identity and zeromatrices respectively, and R and C represent the field of realand complex numbers.

    II. GENERAL ASSUMPTIONS AND CHANNEL MODEL

    We consider a point-to-point MIMO link where the trans-mitter has an array with N � 1 antenna elements and the re-ceiver has M ≤ N antenna elements, respectively. We assume anarrow-band system with a frequency flat and temporally blockfading channel. The channel fading statistics are assumed tobe Rayleigh in amplitude, doubly spatially correlated (both attransmitter and receiver end) and to follow the widely usedKronecker correlation model [25]. The transmitter is assumedto have restricted precoding, wherein, the transmit data vectoris precoded by a precoding matrix of dimension N ×K , whereK ≤ N . For LFP, K corresponds to the number of transmit datastreams and in the case of TAS, K corresponds to the numberof RF chains, respectively. Note that in LFP, we typically havethe number of data streams K ≤ M . However, for analyticaltractability, in this paper we assume K ≥ M and the case ofK < M is deferred to future work. Under these assumptions,the baseband downlink received signal vector can be expressedas:

    y =√ρHTx + n

    =√ρR1/2rx GR

    1/2tx Tx + n (1)

    where y is the M × 1 received signal vector, ρ is the SNR,H is the M × N small-scale fading channel matrix, Rtx is theN × N transmit spatial-correlation matrix, Rrx is the M × Mreceive spatial-correlation matrix, G is an M × N matrix withindependent and identically distributed (i.i.d.) CN(0,1) com-ponents (circularly symmetric zero-mean complex Gaussianentries with unit variance), n ∼ CN(OM×1, IM ) is the M × 1normalized AWGN noise vector, T is a N × K transmit pre-coding matrix and x is the K × 1 transmit data vector.

    In a restricted precoded system, the precoding matrix T canonly attain a restricted set of values, i.e., T ∈ T . Here, we shallrefer to this set T as a codebook. For example, in LFP, T isthe set of all N×K precoding matrices in the codebook and inthe case of TAS, it is the set of all possible N ×K submatricesof IN formed by picking K out of N (distinct) columns. LetT = {T1, ...,T |T |}. We assume that the precoding matrices are

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 3

    semi-unitary i.e. T†i Ti = IK for all i ∈ {1, .., |T |}5. We furtherassume that H is quasi-static and therefore the system capacityis computed for each channel realization as:

    C(H) = maxP,1≤i≤ |T |

    log���IM + ρHTiPT†i H†��� (2)

    s.t Tr{P} ≤ 1

    where P = E{xx†} is the transmit power allocation matrix. Inunitary LFP, since the transmitter does not have CSI, typicallyequal power allocation is used i.e. P = 1K IK .

    6 In the caseof transmit antenna selection with CSI however, the capacityoptimal water filling power allocation can be used. This is themajor difference between LFP and TAS with CSIT. Thoughthe results can also be extended to the case of water-fillingwith a little effort, for the case of TAS we assume that equalpower is allocated to all the non-zero channel eigenvalues.This scheme is capacity optimal in the high SNR regime [26].Furthermore, since with antenna selection the best precoder islikely to yield less skewed channel eigenvalues, the capacityloss due to equal power allocation is small even at low SNR.Under these assumptions, the capacity expressions7 in eithercase can be expressed as:

    CLFP(H) = max1≤i≤ |T |

    {M∑m=1

    log(1 + ρKλ̃im)

    }(3)

    CTAS(H) = max1≤i≤ |T |

    {M∑m=1

    log[1 +

    ρλ̃imrank{HTi}

    ]}≈ max

    1≤i≤ |T |

    {M∑m=1

    log[1 +

    ρλ̃imM

    ]}(4)

    where λ̃im is the m-th largest eigenvalue of HTiT†i H† and

    the last step follows from the fact that M ≤ K and the bestchannel is rank deficient with very low probability. Since theeffective channel for a given precoder matrix T is HT, weshall henceforth refer to λ̃im as the m-th “channel" eigenvaluefor precoder Ti .

    III. JOINT DISTRIBUTION OF CHANNEL EIGENVALUES

    From (3) and (4) it is clear that the capacity distributiondepends only on the eigenvalues of HTiT†i H

    †. It can be easilyverified from (1) that {HTiT†i H† |1 ≤ i ≤ |T |} forms a setof coupled, doubly-correlated Wishart matrices. The couplingcomes from the fact that all these matrices are generated fromthe same i.i.d. random matrix G. In this section, we character-ize the joint distribution of the eigenvalues of these coupledWishart matrices. We first derive the asymptotic second-orderstatistics and the joint distribution of the eigenvalues in the

    5This assumption is valid for TAS and also holds for the most commoncase of LFP called limited feedback unitary precoding.

    6Note that with availability of second-order CSI at transmitter, statisticalpower allocation can also be used to improve performance slightly. Also, in themost general non-unitary LFP setting, the codebook can be augmented withadditional feedback bits to convey power allocation information. Though suchnon-unitary precoding is beyond the scope of this work, the analysis presentedhere can also be extended to these scenarios with little effort.

    7Due to the use of sub-optimal power allocation, strictly speaking, theseexpressions correspond to “achievable data-rate". However in this work, witha slight abuse of notation, we shall refer to them as capacity.

    large antenna limit i.e., for N,K →∞ (with a fixed ratio) whileM is kept fixed (finite). Note that this is counter intuitive in thecase of LFP since there we typically have K ≤ M . However,this scaling is required for analytical tractability and wherenecessary, we shall also consider approximations for the, morepractical, finite antenna regime (including the case of K = M).

    For the large antenna limit, we define the scaled param-eters N = sNo and K = sKo, where No,Ko are constantsand s is the scaling factor. We define a family of N × Ntransmit correlation matrices Rtx and a family of codebooksT = {T1, ..,T |T |} as a function of s. For the family ofcodebooks, the codebook size |T | is fixed but the precodingmatrices Ti are N×K semi-unitary matrices as a function of s.The eigenvalues of HTiT†i H

    † typically diverge as s increases.Therefore we shall instead characterize the eigenvalues of

    its normalized counterpart Qi ,HTiT†iH

    Tr{T†iRtxTi }.8 We define the

    eigen decomposition Qi = EiΛiE†i where Ei and Λi are theunordered eigenvector and eigenvalue matrices, respectively.We shall refer to these eigenvalues λim = [Λi]m,m as thenormalized channel eigenvalues. We also define the eigendecompositions Rrx = ErxΛrx[Erx]† and Rtx = EtxΛtx[Etx]†where, λtx

    k= [Λtx]kk , λrxk = [Λtx]kk are the k-th largest

    eigenvalues of Rtx, Rrx respectively.

    A. First-order approximation and second-order statistics

    The expression for the normalized channel eigenvalues andtheir second-order statistics, in the large antenna limit, aregiven by the following theorem, which extends the results in[20] to the joint statistics case:

    Theorem III.1. Consider a family of transmit correlationmatrices Rtx and a family of precoding matrices T ={T1, ..,T |T |} as a function of s. If the eigenvalues of Rrx areall distinct and lims→∞

    ‖T†iRtxTi ‖FTr{T†iRtxTi }

    = 0 for all i ∈ {1, .., |T |},then as s→∞:

    λim ' erxm †Qierxm , Ûλim (5)µim = E{λim} ' λrxm , Ûµim (6)

    K i jmn = E{λimλjn

    }− µimµjn

    'δmnλ

    rxmλ

    rxn

    T†jRtxTi

    2FTr{T†i RtxTi}Tr{T

    †jRtxTj}

    , ÛK i jmn (7)

    for all 1 ≤ m,n ≤ M , i, j ∈ {1, .., |T |} and where we use “'" todenote a first-order approximation (i.e., an equality in whichthe higher order terms that do not influence the asymptoticstatistics of λim as s→∞ are neglected), λim = [Λi]m,m anderxm = [Erx]c{m}.Proof. See Appendix A. �

    These normalized channel eigenvalues [λi1, .., λiM ] are un-ordered and are picked in the permutation such that theHoffman-Weilandt inequality holds (see proof of TheoremIII.1). Note that the conditions required for the above theorem

    8Note that if Tr{T†iRtxTi } = 0, the corresponding channel eigenvalues aretrivially zero. Here we only consider the non-trivial case of Tr{T†iRtxTi } > 0.

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 4

    are somewhat difficult to verify since they depend on thecodebook. A simpler sufficient condition, independent of thecodebook, is given by the following proposition.

    Proposition III.1.1 (Simpler sufficient condition). TheoremIII.1 is satisfied if eigenvalues of Rrx are all distinct and eitherlims→∞

    ∑Kk=1 (λtxk )

    2

    [∑K`=1 λtxN+1−`]2 = 0 or lims→∞ (λtx1 )

    2∑K`=1 (λtxN+1−`)2

    = 0

    Proof. See Appendix B. �

    Intuitively, the theorem states that as long as the eigenvaluesof the transmit correlation matrix are not too skewed (so thatthe law of large numbers is applicable) the normalized channeleigenvalues asymptotically converge. Therefore, the first-orderapproximations to the normalized channel eigenvalues arevalid for large s, and these are used to derive the second-orderstatistics. Some examples of families of transmit correlationmatrices which satisfy the skewness constraints in PropositionIII.1.1 are discussed in Appendix D.

    Though the presented results are asymptotic, we are inter-ested in how quickly the terms in (5)-(7) converge to their first-order approximations as a function of s. It is worth mentioningthat for the special case of a single receive antenna (M = 1),(5)-(7) are exact for all values of s. For larger values of M ,a comparison of the convergence speeds of the first-orderapproximations to results from Monte-Carlo simulations arestudied in Fig. 1 for a sample restricted precoded system. Here,the unordered eigenvalues of Λi are being compared with theordered eigenvalues obtained from Monte-Carlo simulations.Such a comparison is reasonable if the overlap between themarginal distributions of the unordered eigenvalues is low i.e.,if the eigenvalues of Rrx are sufficiently well separated and ifs � 1. A comparison of the convergence of the asymptoticfirst-order expression (5) to Monte-Carlo simulation results isstudied in Fig. 1a. It shows that while the convergence of (5)is very quick with K for large eigenvalues it is slower for thesmaller eigenvalues. A comparison of the approximate second-order eigen statistics to Monte-Carlo simulations as a functionof s is presented in Fig. 1b. It shows that the second-orderstatistics match even for K = 2, validating quick convergence.Similar results have been observed for a wide variety of systemparameters. The seemingly slow convergence of Ûλim for smalleigenvalues in Fig. 1a and µim in Fig. 1b is a result of thecomparison of ordered with unordered eigenvalues.

    Approximation III.1. Due to the accuracy and quick conver-gence of the first-order approximations, we will use the ÛX’s inplace of the X’s in (5)-(7), even in the non-asymptotic regime,i.e., for finite values of s.

    B. Joint Distribution of eigenvalues

    In this section we find the joint distribution of the normal-ized channel eigenvalues. Since the actual distribution is hardto characterize, we first derive the asymptotic joint distributionand later consider approximations for finite values of K . Thefollowing theorem gives us partial results on the asymptoticjoint distribution of eigenvalues:

    0 10 20 30 40 50 60 7010

    −3

    10−2

    10−1

    100

    101

    102

    103

    E{[λ̇

    1m−λ1m]2}/K

    11

    mm

    K

    m=1, η=0.9

    m=2, η=0.9

    m=1, η=0.5

    m=2, η=0.5

    m=1, η=0.2

    m=2, η=0.2Smaller eigen value

    Larger eigen value

    (a) Eigenvalue estimation

    0 5 10 15 20 25 30 35 40 45 50

    0

    0.2

    0.4

    0.6

    0.8

    1

    1.2

    1.4

    1.6

    1.8

    2

    K

    µ11 (Monte-Carlo)µ̇11

    K1111 (Monte-Carlo)

    K̇1111

    K1112 (Monte-Carlo)

    K̇1112

    K1211 (Monte-Carlo)

    K̇1211

    K1212 (Monte-Carlo)

    K̇1212

    (b) Eigenvalue statistics

    Fig. 1. Convergence of normalized channel eigenvalues to their first-orderapproximations for a restricted precoded system, as a function of K : (a)Compares the mean square error E{[λim − Ûλim]2 } normalized by K11mm ,as a function of K for different values of transmit correlation η (b) Comparesµim , K

    i jmn , Ûµim , ÛK i jmn as a function of K for η = 0.5

    (system parameters:

    N = 2K ,M = 2, [Rtx]ab = η |a−b | , [Rrx]ab = (0.5)|a−b | , T1 = [IN ]c{1:K }and T2 = [IN ]c

    {1+

    ⌊K2

    ⌋:⌊

    3K2

    ⌋} )

    Theorem III.2. Consider a family of transmit correlationmatrices Rtx and a family of precoding matrices T ={T1, ..,T |T |} as a function of s. Then the vector of eigenvaluesv = [λi1m1, λi2m2, ..., λiLmL ] for any finite L, 1 ≤ i1, ..., iL ≤ |T |and 1 ≤ m1, ...,mL ≤ M are jointly Gaussian distributed ass→∞, with second-order statistics as given in Theorem III.1,if:

    1) The eigenvalues of Rrx are all distinct.2) Ti` = Ui` ⊗ V for all 1 ≤ ` ≤ L, where ⊗ defines the

    Kronecker product, V is a s × s unitary matrix and Ui`is any fixed No × Ko semi-unitary matrix.

    3) The transmit-correlation matrix eigenvalues satisfy

    lims→∞(λtx1 )

    2∑sk=1 (λtxN+1−k )

    2 = 0.

    Proof. See Appendix C. �

    Some examples of families of transmit correlation matrices

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 5

    0 1 2 3 4 5 60

    0.5

    1

    1.5

    λ11

    PD

    F

    K=2

    K=4

    K=10

    K=20

    K=50

    (a) Marginal PDF

    0 5 10 15 20 25 30 35 40 45 508

    10

    12

    14

    16

    18

    20

    K

    Ku

    rto

    sis

    0 5 10 15 20 25 30 35 40 45 50

    0

    1

    2

    3

    4

    5

    6

    K

    Ske

    wn

    ess

    v = {λ11,λ21}v = {logλ11, logλ21}Bivariate Gaussian

    v = {λ11,λ21}v = {logλ11, logλ21}Bivariate Gaussian

    (b) Skewness-Kurtosis

    Fig. 2. Asymptotic convergence of the distribution of normalized channeleigen-values for a restricted precoded system, as a function K : (a) Plotsthe empirical probability distribution of the eigenvalue λ11 (b) Compares thekurtosis and skewness of the bi-variate random vectors v = {λ11, λ21 } andv = {logλ11, logλ21 } to a bi-variate Gaussian with same second-order statis-tics

    (system parameters: N = 2K ,M = 2, [Rtx]ab = (0.5)|a−b | , [Rrx]ab =

    (0.5)|a−b | , T = {T1, T2 } where T1 = [IN ]c{1:K } , T2 = [IN ]c{1+

    ⌊K2

    ⌋:⌊

    3K2

    ⌋}and 5000 samples

    )

    which satisfy the skewness constraint (condition 3) are dis-cussed in Appendix D. Intuitively, the theorem states that ifthe eigenvalues of the transmit correlation matrix are not tooskewed (so that Lyapunov’s central limit theorem is applicable)then the normalized channel eigenvalues corresponding tothe precoding matrices that are sufficiently well separated intheir column space are asymptotically jointly Gaussian. Tocharacterize this convergence, the empirical distribution ofthe normalized channel eigenvalues for a sample restrictedprecoded system is plotted in Fig. 2a for different values ofK . Following the approach in [27], to test the joint normality,the Kurtosis and Skewness of a vector of eigenvalues v(corresponding to a well-separated codebook) are plotted inFig. 2b as a function of K . From [27], for a large sampleset from a p-variate Gaussian distribution, the skewness andkurtosis converge to the values of 0 and p(p+2), respectively.

    Both figures suggest that though the joint distribution isasymptotically Gaussian, the convergence is very slow. Suchlarge values of K may be impractical and therefore, otherapproximations to the joint distribution are required in thefinite antenna regime.

    In this paper, we propose a joint lognormal distributionas an approximation for the normalized channel eigenvaluedistribution in the finite antenna regime. We observe from Fig.2a that in the non-asymptotic regime, a lognormal distributionmay indeed be a better fit for the marginal distribution. In Fig.2b, the kurtosis and skewness for the logarithm of eigenvaluesare also depicted. The quick convergence of these parameterswith K provides further credence to this hypothesis. Apartfrom ensuring that the eigenvalues are always non-negative,a joint-lognormal approximation for eigenvalues also ensuresthat the capacity is Gaussian distributed in the high SNRregime. This is an intuitively pleasing result and is consistentwith prior literature [28], [29].

    Approximation III.2. In the rest of the paper, we shallapproximate the normalized channel eigenvalues for any setof precoding matrices to be jointly lognormally distributed inthe non-asymptotic regime.

    Unlike in Theorem III.2, which considers only precodingmatrices that are sufficiently well separated, here we approx-imate the eigenvalues corresponding to any set of precodingmatrices to be jointly lognormal distributed. The validity ofthis approximation is studied in Fig. 3 wherein the Skewnessand Kurtosis for the eigenvalues corresponding to all precodingmatrices of a sample antenna selection system are plotted.These results also show that a jointly lognormal distributionis a better fit than a Gaussian fit for the normalized channeleigenvalues. However, even for the logarithm of eigenvalues,the Skewness and Kurtosis values deviate partially from thoseof a Gaussian distribution, thereby suggesting that Approx.III.2 is not very accurate. However, this approximation isneeded for analytical tractability. In Sec. IV, it is demonstratedthat the resulting approximation error in estimating the channelcapacity is relatively small.

    IV. CAPACITY ANALYSIS

    The individual channel capacities for i = 1, .., |T | can beexpressed in the form Ci(αi,H) ,

    ∑Mm=1 log(1 + αiλim) where

    the αis are suitably chosen constants. In particular, inspectionof (3)-(4) and the definition of normalized channel eigenvalues

    (see Section III) reveals that for LFP αi =ρTr{T†iRtxTi }

    K while

    for TAS, αi =ρTr{T†iRtxTi }

    M . For sufficiently large values of αi ,we can approximate:

    Ci(αi,H) ≈M∑m=1

    log (αiλim) (8)

    Now, from Approx. III.2, for moderately large values of αiwe have:

    {C1(α1,H), . . . ,C |T |(α |T |,H))} ∼ Jointly Gaussian (9)

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 6

    1 1.5 2 2.5 3 3.5 4 4.5 50

    50

    100

    150

    200

    K

    Ku

    rto

    sis

    1 1.5 2 2.5 3 3.5 4 4.5 50

    20

    40

    60

    K

    Ske

    wn

    ess

    v = {λ11, ..,λ|T |,1}

    v = {logλ11, .., logλ|T |,1}N-variate Gaussian

    v = {λ11, ..,λ|T |,1}

    v = {logλ11, .., logλ|T |,1}

    N-variate Gaussian

    Fig. 3. Comparison of Skewness and Kurtosis of the set of normal-ized channel eigenvalues v = {λ11, .., λ|T | ,1 } and their logarithms v ={logλ11, .., logλ|T | ,1 } to a Gaussian distribution with same second-orderstatistics, in an antenna selection system

    (system parameters: N = 2K ,M =

    1, [Rtx]ab = (0.5)|a−b | , [Rrx]ab = (0.5)|a−b | , T = {T |T is a N ×K submatrx of IN } and 10000 samples

    ) 9Additionally, using Approx. III.1, (8) and results on thesecond-order statistics of a lognormal random vector [30], wecan easily show that:

    C̄i , E{Ci(αi,H)} ≈M∑m=1

    log

    αi Ûµ2im√Ûµ2im + ÛK iimm

    (10)κi j , E{Ci(αi,H)Cj(αj,H)}

    −E{Ci(αi,H)}E{Cj(αj,H)}

    ≈∑m,n

    log

    [Ûµim Ûµjn + ÛK i jmnÛµim Ûµjn

    ]=

    M∑m=1

    log

    [Ûµim Ûµjm + ÛK i jmmÛµim Ûµjm

    ](11)

    A comparison of the approximate joint statistics and marginaldistribution of Ci(αi,H) to Monte-Carlo simulations for asample antenna selection system is given in Fig. 4, as afunction of K . The results show that the approximations aretight for K ≥ 4. In Fig. 4c, the skewness and kurtosis of thevector of individual capacities corresponding to all precodingmatrices for the antenna selection system are studied. Theclose fit to a Gaussian distribution suggests that the impact ofApprox. III.2 on capacity is small. Similar results have beenobserved for a wide variety of system parameters.

    A. System capacity

    A general abstraction of the system capacity expressions(3)–(4) can be given by:

    Cmax(H) = max1≤i≤ |T |

    {Ci(αi,H)} (12)

    Note that (9)–(11) provide a model that fully characterizesthe joint distribution of the individual capacities. From (9),

    9Though v is a(NK

    )random vector, the Kurtosis and Skewness are computed

    only for the dominant subspace, which has N principal components.

    0 5 10 15 20 25 30 35 40 45 5010

    −2

    10−1

    100

    101

    K

    Mean{C1(α1,H) }Mean{C1(α1,H)} sim.Var{C1(α1,H)}Var{C1(α1,H)} sim.Crosscov{C1(α1,H), C2(α2,H)}Crosscov{C1(α1,H), C2(α2,H)} sim.Crosscov{C1(α1,H), C3(α3,H)}Crosscov{C1(α1,H), C3(α3,H)} sim.

    (a) Joint statistics of Ci (αi ,H)

    0 1 2 3 4 5 6 7 8 9 100

    0.1

    0.2

    0.3

    0.4

    0.5

    0.6

    0.7

    Capacity (nats/s/Hz)

    PD

    F

    Simulation, K=8

    Gaussian Approx., K=8

    Simulation, K=4

    Gaussian Approx, K=4

    Simulation, K=2

    Gaussian Approx, K=2

    (b) PDF of C1(α1,H)

    1 1.5 2 2.5 3 3.5 4 4.5 5

    0

    20

    40

    60

    80

    100

    120

    140

    K

    Kurtosis: v = {C1(α1,H), .., C|T |(α|T |,H)}

    Kurtosis: N-variate Gaussian

    Skewness: v = {C1(α1,H), .., C|T |(α|T |,H)}

    Skewness: N-variate Gaussian

    (c) Skewness-Kurtosis

    Fig. 4. Individual capacity distribution of an antenna selection system:(a) Compares the joint second-order statistics of capacities across differentprecoding matrices as a function of K (b) Compares the empirical PDFof C1(α1,H) to a Gaussian distribution with mean and variance as givenby (10)-(11) (c) Compares skewness and kurtosis of the set of channelcapacities v = {C1(α1,H), ..,CT (α|T | ,H)} to a Gaussian distribution withsame second-order statistics

    (system parameters: N = 2K ,M = 2, ρ =

    10, [Rtx]ab = (0.5)|a−b | , [Rrx]ab = (0.5)|a−b | , αi =ρTr{T†

    iRtxTi }

    M ,T1 = [IN ]c{1:K } , T2 = [IN ]c

    {1+

    ⌊K2

    ⌋:⌊

    3K2

    ⌋} , T3 = [IN ]c{2:(K+1)} , T ={T |T is a N × K submatrx of IN } and 10000 samples

    )9

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 7

    Cmax(H) can be expressed as the largest element of a correlatedGaussian vector.

    For the maximum of a set of correlated Gaussian randomvariables, neither the exact distribution nor even the mean isnot known in closed form. Existing methods [31] to solvefor them are too cumbersome, especially when |T | is large.Though several bounds exist on the mean [32]–[36], theyare not uniformly tight across all correlation structures. Onthe other hand, numerical approaches like in [37], [38] arerecursive and therefore are likely to accumulate significantamount of error when the number of variables |T | are large.This is specifically relevant to our scenario since the codebooksize |T | can be very large. We therefore formulate a newapproach to compute the distribution and mean of the largestelement of a correlated Gaussian vector. Note that:

    Cmax(H) =

    [C1(α1,H), ...,C |T |(α |T |,H)]

    = log

    [eC1(α1 ,H), ..., eC|T |(α|T | ,H)]

    ≈ log

    [eC1(α1 ,H), ..., eC|T |(α|T | ,H)]

    p

    for p ≥ log |T |

    =1p

    log[ |T |∑i=1

    epCi (αi ,H)]

    (13)

    where the second last step follows from the norm inequalitiesL−1/p ‖a‖p ≤ ‖a‖∞ ≤ ‖a‖p for any vector a of length L.Since epCi (αi ,H) ≈

    (∏Mm=1 αiλim

    )pis lognormal distributed,

    equation (13) above shows that the largest element of acorrelated Gaussian vector can be approximately representedas the logarithm of a sum of correlated lognormals.

    B. Sum of correlated lognormals

    It is well known that the sum of correlated lognormal ran-dom variables is approximately lognormal (see [39] and refer-ences therein). Of the many approximations for characterizingthis sum, the moment and cumulant matching approaches, suchas [40], [41], yield a poor fit in the lower tail regions ofthe distribution. On the other hand, the moment generatingfunction matching approaches, like [42], are too cumbersomewhen the number of variables |T | is large. Here, we proposethe use of the approach in [43] (reproduced here as algorithm1), which extends the work in [44] to the correlated case.Similar to [37], this algorithm is recursive and therefore alsoshares the same drawback of accumulating error. However, thedrawback is a by-product of the algorithm in [43] and not ofour approach in (13). Any new results on sum of correlatedlognormals can readily be used to resolve this drawback. Tocheck the goodness of fit, the empirical distribution of thelargest element of a sample Gaussian vector, and its p-normapproximation are compared to the distributions obtained usingAlgorithm 1, Clark [38] and second-order moment-matching[40] in Fig. 5. The results show that both Clark as well asAlgorithm 1 give good approximations to the distribution ofthe largest element.

    In summary, the system capacity Cmax(H) can be approx-imated as a Gaussian random variable and its mean andvariance can be computed via Algorithm 1.

    Algorithm 1: Statistics of the largest element of acorrelated Gaussian vector

    Inputs: p, C̄i, κi j forall 1 ≤ i, j ≤ |T | // Defined as in(10)-(13)µw(1) = µs(1) = pC̄1σ2w(1) = σ2s (1) = p2κ11Q(1,∗) = p2κ1∗for i = 2 to |T | doµw(i) = pC̄i − µs(i − 1)σ2w(i) = p2κii + σ2s (i − 1) − 2Q(i − 1, i)µs(i) = µs(i − 1) + G1

    (σw(i), µw(i)

    )σ2s (i) = σ2s (i − 1) − G1

    (σw(i), µw(i)

    )+G2

    (σw(i), µw(i)

    )+2

    [Q(i − 1, i) − σ

    2s (i−1)G3

    (σw (i),µw (i)

    )σ2w (i)

    ]for j = 1 to |T | do

    Q(i, j) = Q(i − 1, j)[1 − G3

    (σw (i),µw (i)

    )σ2w (i)

    ]+

    p2κi jG3(σw (i),µw (i)

    )σ2w (i)

    end forend for// G1(σ, µ),G2(σ, µ),G3(σ, µ) are as defined inAppendix of [43]return µs(|T |)/p // Mean of Cmaxreturn σ2s (|T |)/p2 // Variance of Cmax

    0 1 2 3 4 5 6 70

    0.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4

    0.45

    0.5

    Cmax(H)

    PD

    F

    PDF {maxi{xi}}PDF {log ‖y‖p}

    Est.PDF{Algorithm-1}Est.PDF{Clark}Est.PDF{Moment-match}

    Fig. 5. Distribution of largest element of a Gaussian vector: x-jointly Gaussianvector, y , exp [x] (element-wise); Clark, moment-match refer to solutionsfrom [38] and [40], respectively

    (simulation parameters: |X | = 40, E{Xi } = 1,

    E{XiX j } = 1.5 + δi j , p = 8)

    V. SIMULATION RESULTS

    Using the results derived in the previous sections, we shallnow analyze the system capacity of several practical restrictedprecoded systems. The simulation layout considers a singleuser, cellular downlink channel operating at 2.4 GHz. Boththe transmitter and receiver have a uniform, linear antennaarray with antenna spacings of dtx = 5cm and drx = 2cm,respectively (unless otherwise stated). The transmitter expe-riences a Laplacian power angle spectrum (PAS) with meanangle of arrival (AoA) = π/6 rads and an angle spread (AS)of π/10 rads. The receiver on the other hand experiences a

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 8

    uniform power angle spectrum.10 As a first simple example,

    −2 0 2 4 6 8 10 120

    0.1

    0.2

    0.3

    0.4

    0.5

    0.6

    0.7

    Capacity (nats/s/Hz)

    PD

    F

    Monte−Carlo sim.Algorithm 1Gauss approx.

    K=2 K=6

    (a) PDF{Cmax(H)}

    0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.52.8

    3

    3.2

    3.4

    3.6

    3.8

    4

    4.2

    dtx (m)

    Capacity (

    nats

    /s/H

    z)

    Monte−Carlo sim.

    Algorithm 1

    Gauss approx.

    (b) Tx antenna spacing

    Fig. 6. Comparison of capacity of an antenna selection system as predictedby Algorithm1 to Monte-Carlo simulations and Gauss-approx (a) Plots PDFof system capacity for K = 2, 6 (b) Plots the mean capacity as a functionof transmit antenna spacing (K = 2)

    (system parameters: N = 2K , M = 2,

    SNR ρ = 10)

    we consider an antenna selection system in Fig. 6. In Fig. 6a,the PDF of capacity as computed by Algorithm 1 is comparedto Monte-Carlo simulations. To quantify the origin of themismatch in distributions, the PDF of the largest element of aGaussian vector with the second-order statistics given by (10)-(11) is also plotted, labeled as Gauss-approx. The gap betweenMonte-Carlo and Gauss-approx quantifies the error due toinaccuracy of Approx. III.1 and III.2. We observe that this gapdoes not increase much with K . On the other hand, the gapbetween Gauss-approx and Algorithm 1 quantifies the errordue to inaccuracy of the approach in [43]. This gap increaseswith K , owing to the error accumulation in the recursive stepsof [43] for large codebooks

    (|T | =

    (NK

    ) ). In Fig. 6b, the impact

    of transmit antenna spacing on ergodic capacity is compared.As seen from the results, though Algorithm 1 overestimates the

    10Tx/Rx correlation matrix is calculated as: [Rx]ab =∫ π−π PAS(θ)e

    2π j(a−b)dx sin θλ dθ

    / ∫ π−π PAS(φ)dφ, where j =

    √−1, λ is

    the wavelength at 2.4GHz and x=tx/rx. All arrays and multipath componentsare in the horizontal plane.

    capacity, it accurately reflects the impact of system parameterslike antenna spacing on capacity.

    For the same simulation layout, the impact of the codebookon ergodic capacity for a limited feedback precoding systemis studied in Fig. 7. The influence of the codebook shape oncapacity is studied in Fig. 7a, where we use skewed code-books Tα generated from a Grassmannian packed codebook T̂as: Tα =

    {(Rtx)αT̂i

    [T̂†i (Rtx)

    2αT̂i]−1/2��T̂i ∈ T̂ }. The skewing

    factor α controls the spacing between the precoding matricesof the codebook. The results suggest that Algorithm 1 givesa good estimate of dependence of capacity on the codebookshape. The impact of codebook size on ergodic capacity isstudied in Fig. 7b. Here, we use Grasmannian codebooks ofdifferent sizes. We observe that Algorithm 1 gives accurateresults for small codebooks but the error increases withcodebook size |T |. This is again due to the error accumulationin the recursive steps of Algorithm 1. The sudden dip at 4bits of feedback is because the codebook is arbitrary and notcustomized to the chosen PAS.

    0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 25

    5.2

    5.4

    5.6

    5.8

    6

    6.2

    6.4

    Skewing Factor α

    Capacity (

    nats

    /s/H

    z)

    Monte−Carlo sim.Algorithm 1Gauss Approx

    (a) Skewing Factor

    2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 73.4

    3.6

    3.8

    4

    4.2

    4.4

    4.6

    Feedback log2 |T | (bits)

    Capacity (

    nats

    /s/H

    z)

    Monte−Carlo sim.

    Algorithm 1

    Gauss Approx

    (b) Codebook Size

    Fig. 7. Impact of the codebook on capacity of a limited feedback precodingsystem, as predicted by Algorithm1, Monte-Carlo simulations and Gauss-approx (a) Studies impact of the codebook skewing-factor

    (N = 8, K =

    M = 2, SNR ρ = 10, | T̂ | = 256, T̂ is a Grassmannian codebook from [45])

    (b) Studies impact of codebook size(N = 4, K = M = 2, SNR ρ = 10, T

    for each codebook size is from [45])

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 9

    VI. APPLICATION TO CODEBOOK DESIGN FOR LIMITEDFEEDBACK SYSTEMS

    It is widely accepted that for an uncorrelated i.i.d. channel,a Grassmannian-packed codebook is near optimal for a limitedfeedback system [9], [11], [46]. However, for a correlatedchannel, no consensus exists on the best codebook design.This is partly because no uniformly tight (over the spaceof all codebooks) bounds or approximations to the systemcapacity are available for a correlated channel. In this section,using Algorithm 1 as the objective function, we shall use anumerical-gradient ascent algorithm to search for the code-book that maximizes the mean capacity. Here, the gradient ofmean capacity (as predicted by Algorithm 1) with respect toa matrix TCB (formed by appending all the precoders in thecodebook T ) is computed numerically. For ease of pictorialrepresentation, we consider a limited feedback beamformingsetting with N = 3, K = M = 1 and other parameterssame as in Sec. V. For a codebook size of |T | = 3, wecompare the beam-patterns formed by the precoding vectorsof the optimized codebook to a Discrete Fourier Transform(DFT) codebook12 in Fig. 8. The estimated PAS11 is alsoplotted for comparison. As the results show, unlike the DFTcodebook that is designed for generic correlated channels, theoptimized codebook here adapts to the user PAS leading to anincrease in capacity from 2.74 to 2.9 nats/s/Hz. Though severalother families of codebooks have been proposed in literature[8], [12]–[14] for a correlated channel, they involve someparameters which need to be chosen. Algorithm 1 is also usefulin such scenarios since it enables picking the mean/outagecapacity maximizing values for these parameters.

    VII. CONCLUSIONS

    This paper analyzes a special class of MIMO systems calledrestricted precoded systems and discusses how many practi-cally relevant systems like antenna selection, beam selectionand limited feedback precoding fall under this class. It isshown that the system capacity of restricted precoded systemscan be expressed as a function of the eigenvalues of a set ofcoupled doubly-correlated Wishart matrices. The eigenvaluesare shown to be jointly Gaussian in the large antenna limit, ifa set of conditions on the transmit correlation and codebookare satisfied. The asymptotic second-order statistics of theeigenvalues are also derived and the results suggest that theirconvergence is very quick. We propose, and verify, that in thefinite antenna regime, a joint-lognormal distribution is a betterfit to the eigenvalue distribution. Using these results, and a fewsimplifying approximations, we show that the system capacityfor a restricted precoded system can be approximated as thelargest element of a correlated Gaussian vector. We propose anew approach for characterizing its distribution and, in theprocess, show that the problem of finding the distributionof the sum of lognormals and the problem of finding thedistribution of the largest element of a Gaussian vector are

    11Est.PAS(θ) = ∑ab [Rtx]abe− 2π j(a−b)dtx sin θλ , where j = √−1 and λ isthe wavelength at 2.4GHz.

    12For K = 1, |T | = N , the columns of a DFT matrix form a Grassmanniancodebook that is well suited for correlated channels [6].

    0.2

    0.4

    0.6

    0.8

    1

    30

    210

    60

    240

    90

    270

    120

    300

    150

    330

    180 0

    PAS estimateBeam pattern T1Beam pattern T2Beam pattern T3

    (a) Optimized codebookE{Cmax(H)} = 2.91nats/s/Hz

    0.4

    0.8

    1

    30

    210

    60

    240

    90

    270

    120

    300

    150

    330

    180 0

    0.2

    0.6

    PAS estimateBeam pattern T1Beam pattern T2Beam pattern T3

    (b) DFT codebookE{Cmax(H)} = 2.74nats/s/Hz

    Fig. 8. Comparison of beam-patterns for optimized codebook and theGrassmannian codebook

    (system parameters: N = 3, K = M = 1, SNR

    ρ = 10)11

    equivalent. Simulations results, for both an antenna selectionsystem and a limited feedback precoded system, suggest thatthe proposed algorithm slightly overestimates the capacity butpredicts the dependence of capacity on system parameters likenumber of antennas, antenna spacing and codebook shape &size accurately. We also observe that a significant portion ofthe mismatch comes from error accumulation in the recursivesteps of the proposed algorithm. Any non-recursive approachto characterize the sum of lognormals can be useful in tacklingthis problem. We also demonstrate, via an example, that the

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 10

    proposed algorithm can be used in the design of near-optimalcodebooks for a limited feedback system.

    APPENDIX A

    (Proof of Theorem III.1). For any value of N , K and ∀i ∈{1, .., |T |}, from (1) we have:

    Qi =HTiT†i H

    Tr{T†i RtxTi}=

    R1/2rx GR1/2tx TiT

    †i R

    1/2tx G

    †R1/2rxTr{T†i RtxTi}

    = R1/2rx

    [K∑k=1

    ĥik[ĥik]†]

    R1/2rx (14)

    where ĥik= GR1/2tx [Ti]c{k }

    /√Tr{T†i RtxTi}. Defining Q̂i =∑K

    k=1 ĥik[ĥi

    k]† and taking expectations we get:

    E{[Q̂i]ab}

    =∑k

    E{[G]r{a}R1/2tx [Ti]c{k }[Ti]†c{k }R

    1/2tx

    ([G]r{b}

    )†}Tr{T†i RtxTi}

    =∑k

    [Ti]†c{k }R1/2tx E{

    ([G]r{b}

    )†[G]r{a}}R1/2tx [Ti]c{k }Tr{T†i RtxTi}

    = δab (15)

    where δab = 1 if a = b and δab = 0 if a , b (the Kroneckerdelta function) and the last step follows from the fact that Ghas i.i.d. CN(0,1) entries.

    E{��[Q̂i]ab ��2} = K∑

    k=1

    K∑̀=1E

    {[ĥik]a[ĥ

    ik]†b[ĥi`]b[ĥ

    i`]†a

    }=∑k ,`

    [E

    {[ĥik]a[ĥ

    ik]†b

    }E

    {[ĥi`]b[ĥ

    i`]†a

    }+E

    {[ĥik]a[ĥ

    i`]†a

    }E

    {[ĥik]

    †b[ĥi`]b

    } ](16a)

    =��E{[Q̂i]ab}��2 +∑

    k ,`

    ���E {[ĥik]†b[ĥi`]b}���2 (16b)=

    ��E{[Q̂i]ab}��2 +∑k ,`

    [Ti†RtxTi]2lk

    Tr{T†i RtxTi}2

    =��E{[Q̂i]ab}��2 + ‖T†i RtxTi ‖2F

    Tr{T†i RtxTi}2(16c)

    where (16a) follows from the result on the expectation ofthe product of four circularly symmetric jointly Gaussianrandom variables [47] and (16b) follows from the fact thatthe vector ĥiβ has i.i.d entries ∀β ∈ {1, ..,K}. Therefore, if

    lims→∞‖T†iRtxTi ‖F

    Tr{T†iRtxTi }= 0, from (14), (16c) we have:

    lims→∞

    Q̂ims= IM ⇒ lim

    s→∞Qi

    ms= Rrx (17)

    where ms= denotes element-wise mean square convergence.From the Hoffman-Weilandt inequality [48], there exists apermutation matrix P such that ‖PΛiP†−Λrx‖F ≤ ‖Qi−Rrx‖F,

    where Λi,Λrx are the eigenvalue matrices of Qi,Rrx, respec-tively. Without loss of generality, assuming Λi is always pickedin this permutation, we have:

    lims→∞Λi

    ms= Λrx (18)

    Let eim be an eigenvector of Qi corresponding to an eigenvalueλim = [Λi]m,m. Now as s→∞:

    Qieim = λimeim (19a)⇒ Rrxeim

    ms= λrxm eim [From (17) and (18)]

    ⇒ eimms= e jφerxm (19b)

    for some angle φ (which may be a function of G), where(19b) follows from the fact that all the eigenvalues of Rrxare distinct. Now since both the eigenvalues and eigenvectorsconverge in mean square sense, following a similar procedureto [20], from (19a) we have:

    [Qi − Rrx + Rrx][eim − e jφerxm + e jφerxm ]= [λim − λrxm + λrxm ][eim − e jφerxm + e jφerxm ]

    ⇒ [Qi − Rrx]e jφerxm + Rrx[eim − e jφerxm ]' [λim − λrxm ]e jφerxm + λrxm [eim − e jφerxm ] (20a)

    ⇒ erxm †[Qi − Rrx]erxm ' λim − λrxm (20b)⇒ λim ' erxm †Qierxm (20c)

    where, as in [20], we use “'" to denote a first-order approxi-mation (i.e., an equality in which the higher order terms thatdo not influence the asymptotic statistics of λim as s→∞ areneglected). Note that (20a) follows by neglecting the higherorder terms and (20b) follows by premultiplying both sides bye−jφ[erxm ]†. This proves the asymptotic first-order expressionfor the eigenvalues (5).

    By taking expectations on both sides of (20c), the asymp-totic mean can be expressed as:

    µim = E{λim} ' erxm †E {Qi} erxm' λrxm [From (17)] (21)

    Similarly, for cross-correlation we have:

    E{λimλjn

    }' E

    {erxm†Qierxmerxn

    †Qjerxn}

    = λrxmλrxn

    K∑k=1

    K∑̀=1E

    {erxm†ĥik[ĥik]

    †erxmerxn†ĥj`[ĥj`]†erxn

    }(22)

    where the last step follows from (14). Notice that erxη†ĥαβ are

    all circularly symmetric jointly Gaussian random variables forall 1 ≤ η ≤ M , 1 ≤ α ≤ |T |, 1 ≤ β ≤ K . From the resulton the expectation of the product of four complex, circularlysymmetric jointly Gaussian random variables [47], we have:

    E{λimλjn

    }' λrxmλrxn

    K∑k=1

    K∑̀=1

    erxm†E

    {ĥik[ĥik]

    †} erxmerxn †E {ĥj`[ĥj`]†} erxn+ λrxmλ

    rxn

    K∑k=1

    K∑̀=1

    erxm†E

    {ĥik[ĥ

    j`]†

    }erxn erxn

    †E{ĥj`[ĥik]

    †} erxm⇒ E

    {λimλjn

    }− µimµjn

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 11

    '∑k ,`

    λrxmλrxn

    ���erxm †E{GR1/2tx [Ti]c{k }[Tj]†c{` }R1/2tx G†}erxn ���2[Tr{T†i RtxTi}Tr{T

    †jRtxTj}]

    =∑k ,`

    λrxmλrxn

    ���[Tj]†c{` }R1/2tx E{G†erxn erxm †G}R1/2tx [Ti]c{k }���2[Tr{T†i RtxTi}Tr{T

    †jRtxTj}]

    =∑k ,`

    δmnλrxmλ

    rxn

    ��� ([Tj]c{` } )†Rtx[Ti]c{k }���2[Tr{T†i RtxTi}Tr{T

    †jRtxTj}]

    =

    δmnλrxmλ

    rxn

    T†jRtxTi

    2FTr{T†i RtxTi}Tr{T

    †jRtxTj}

    (23)

    where, the penultimate step follows from the fact that G hasi.i.d. entries and erxm

    †erxn = δmn. This concludes the proof. �

    APPENDIX B

    (Proof of proposition III.1.1). From Theorem III.1, the re-quired condition is that

    limK→∞

    ‖T†i RtxTi ‖FTr{T†i RtxTi}

    = 0 ∀i ∈ {1, .., |T |}

    Note that:

    ‖T†i RtxTi ‖FTr{T†i RtxTi}

    =‖λ{T†i RtxTi}‖2‖λ{T†i RtxTi}‖1

    =‖λ{TiT†i Rtx}‖2‖λ{TiT†i Rtx}‖1

    (24)

    where λ{A} is the vector of eigenvalues for a square matrixA. We define λ↑{A} and λ↓{A} as sortings of λ{A} inascending and descending orders, respectively. From resultson eigenvalue majorization [49, Eqn 3.20], we have for all1 ≤ L ≤ N:

    L∏̀=1λ↓`{TiT†i }λ

    ↑`{Rtx} ≤

    L∏̀=1λ↓`{TiT†i Rtx}

    ≤L∏̀=1λ↓`{TiT†i }λ

    ↓`{Rtx} (25)

    where λ:`{A} is the `-th element of λ:{A}. By taking the

    logarithm on both sides we get:

    log[λ↓{TiT†i } ◦ λ

    ↑{Rtx}]≺ log

    [λ↓{TiT†i Rtx}

    ]≺ log

    [λ↓{TiT†i } ◦ λ

    ↓{Rtx}]

    (26)

    where ◦ denotes the Hadamard product, the logarithm is takenelement-wise and a ≺ b implies b majorizes a. Now considerthe function fα(x) = ‖ex‖α where α ∈ {1,2, ...} and theexponent ex is taken element wise. The function is clearlypermutation invariant in vector x and for any elements xa, xbof vector x, we have:[

    ∂ fα(x)∂xa

    − ∂ fα(x)∂xb

    ](xa−xb) = (xa−xb)

    eαxa − eαxb‖ex‖α−1α

    ≥ 0 (27)

    Therefore from [49, Th 2.3.14], fα(x) is a Schur-convexfunction. From the definition of a Schur convex function and

    from (26) we have:

    f2(log[λ↓{TiT†i Rtx}]

    )f1(log[λ↓{TiT†i Rtx}]

    ) ≤ f2 ( log[λ↓{TiT†i } ◦ λ↓{Rtx}])f1(log[λ↓{TiT†i } ◦ λ

    ↑{Rtx}])

    (28)

    ⇒‖T†i RtxTi ‖FTr{T†i RtxTi}

    √∑Kk=1 (λtxk )

    2[∑K`=1 λ

    txN+1−`

    ] (29)where in the last step we have used the fact that Ti is semi-unitary with dimension N × K . Alternately, from Hölder’sinequality:

    ‖λ{TiT†i Rtx}‖2‖λ{TiT†i Rtx}‖1

    ≤‖λ{TiT†i Rtx}‖∞‖λ{TiT†i Rtx}‖2

    Now following similar steps to before, we have:

    ‖T†i RtxTi ‖FTr{T†i RtxTi}

    ≤f∞

    (log[λ↓{TiT†i } ◦ λ

    ↓{Rtx}])

    f2(log[λ↓{TiT†i } ◦ λ

    ↑{Rtx}])

    ≤λtx1√∑K

    `=1

    (λtxN+1−`

    )2 (30)Therefore a sufficient condition for Theorem III.1 is that theeigenvalues of Rrx be distinct, and either the right hand sidein (29) or (30) go to zero as s→∞. �

    APPENDIX C

    (Proof of Theorem III.2). Since K = sKo ≥ s, we have:

    (λtx1 )2∑K

    k=1 (λtxN+1−k)2 ≤

    (λtx1 )2∑s

    k=1 (λtxN+1−k)2 (31)

    Therefore using (31) and conditions 1,3 of the theoremstatement, from proposition III.1.1, for any 1 ≤ i ≤ |T |,1 ≤ m ≤ M as s→∞:

    λim = erxm†Qierxm =

    λrxm ĝmR1/2tx TiT

    †i R

    1/2tx ĝ

    †m

    Tr{T†i RtxTi}(32)

    where ĝm = erxm†G is a 1 × N vector with i.i.d. CN(0,1)

    entries. Additionally, the second-order statistics of λim are alsoas given by (6) and (7).

    Now, for any 1 ≤ m ≤ M , consider a set of eigenvaluesv = [λi1m, λi2m, ..., λiLm]. From the Cramer-Wold theorem [50],these eigenvalues are asymptotically jointly Gaussian iff anyweighted sum converges to a Gaussian distribution as s→∞.A weighted sum is given by:

    w†v =L∑̀=1

    w`λi`m

    = λrxm ĝmR1/2tx

    [L∑̀=1

    w`Ti`T

    †i`

    Tr{T†i`RtxTi` }

    ]R1/2tx ĝ

    †m

    Using condition 2 of the theorem statement, we have:

    w†v

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 12

    = λrxm ĝmR1/2tx

    ©«

    L∑̀=1

    w`Ui`U†i`

    Tr{T†i`RtxTi` }︸ ︷︷ ︸W

    ª®®®®®®¬⊗ VV†

    R1/2tx ĝ

    †m

    d=

    N∑n=1

    λrxm��[ĝm]n��2λ↓n{R1/2tx [W ⊗ VV†]R1/2tx } (33)

    where d= represents equality in distribution and λ↓n{A} is the n-th largest eigenvalue of a matrix A. Using Lyapunov’s centrallimit theorem, it is easy to show that (33) is asymptoticallyGaussian if (see [20], [51] for details):

    lims→∞

    ‖λ{R1/2tx [W ⊗ VV†]R1/2tx }‖∞

    ‖λ{R1/2tx [W ⊗ VV†]R1/2tx }‖2

    = 0 (34)

    where, λ{A} represents the vector of eigenvalues of a matrixA. Now following similar steps to those in Appendix B (see(28)), we have:

    ‖λ{R1/2tx [W ⊗ VV†]R1/2tx }‖∞

    ‖λ{R1/2tx [W ⊗ VV†]R1/2tx }‖2

    =‖λ{[W ⊗ Is]Rtx}‖∞‖λ{[W ⊗ Is]Rtx}‖2

    (35)

    ≤ ‖λ↓{W ⊗ Is} ◦ λ↓{Rtx}‖∞‖λ↓{W ⊗ Is} ◦ λ↑{Rtx}‖2

    ≤λ↓1{Rtx}λ

    ↓1{W}√∑s

    `=1

    [λ↑`{Rtx}λ↓1{W}

    ]2=

    λtx1√∑s`=1 (λtxN+1−`)

    2(36)

    where we use the fact that VV† = Is and λ↓{W ⊗ Is} =λ↓{W} ⊗ λ↓{Is}. From (34), (36) and condition 3 of thetheorem, w†v is Gaussian distributed ∀w as s→∞. Thereforethe vector v = [λi1m, λi2m, ..., λiLm] is asymptotically jointlyGaussian distributed. Note that the joint Gaussianity abovetrivially also ensures marginal Gaussianity. Now, the jointGaussianity of λim and λjn for n , m directly follows fromthe marginal Gaussianity and the independence of ĝm and ĝnin (32). �

    APPENDIX D

    In this section we shall enumerate some families of N × Ntransmit correlation matrices, where N = Nos, which satisfy:

    lims→∞(λtx1 )

    2∑sk=1 (λtxN+1−k )

    2 = 0. As discussed in Appendix C, thiscondition also implies Proposition III.1.1.

    1) Exponential Correlation: In this case, the elements ofthe correlation matrix are given by [Rtx]ab = ρ |a−b | for|ρ| < 1. That this matrix satisfies the required constraint canbe verified using the bounds on eigenvalues as derived in [52].

    2) Arbitrary power angle spectrum (PAS): We consider auniform linear antenna array at the transmitter with spacingdtx. Assuming the multipath components to be only in the

    horizontal plane, the elements of the transmit correlationmatrix can be expressed as:

    [Rtx]ab =∫ 1

    0e j2π f (a−b)

    [ ∞∑n=−∞

    ξ( f − n)]

    ︸ ︷︷ ︸ζ ( f )

    df (37)

    where, ξ( f )

    =

    λPAS

    (arcsin( f λdtx )

    )+λPAS

    (π−arcsin( f λdtx )

    )√d2tx−λ2 f 2

    for λ | f |dtx ≤ 1

    0 for λ | f |dtx > 1(38)

    j =√−1, λ is the wavelength and PAS(θ) is the normalized

    PAS computed as PAS(θ) = PAS(θ)/ ∫ π−π PAS(φ)dφ. We define

    Fa as the smallest, possibly non-contiguous, sub-interval of[0,1] such that:

    minf ∈[0,1]−Fa

    {ζ( f )} ≥ maxf ∈Fa{ζ( f )} and

    ∫Fa

    f df = 1/a

    where ζ( f ) is as defined in (37). Since Rtx is a Toeplitz matrix,from Szego’s results on eigenvalues of Toeplitz matrices [53],as s→∞ we have:

    (λtx1 )2∑s

    k=1 (λtxN+1−k)2 =

    max f ∈F1{ζ2( f )

    }N

    ∫f ′∈FNo

    ζ2( f ′)df ′(39)

    It can be easily verified that the right hand side of (39) goesto zero if:• PAS(θ) is continuous, maxθ∈(−π,π] {PAS(θ)} < ∞ and

    PAS( π2 ) = PAS(−π2 ) = 0

    • The constant No is such that∫f ′∈FNo

    ζ2( f ′)df ′ > 0.The former condition is almost always satisfied for a sectoredbase-station. The latter condition is more stringent. However,it can be relaxed if we restrict the codebook T such that∀i lims→∞ rank{T†i RtxTi}/s > Ko − 1.

    REFERENCES

    [1] T. Marzetta, “Noncooperative Cellular Wireless with Unlimited Numbersof Base Station Antennas,” Wireless Communications, IEEE Transac-tions on, vol. 9, pp. 3590–3600, November 2010.

    [2] D. Love, R. Heath, V. Lau, D. Gesbert, B. Rao, and M. Andrews,“An overview of limited feedback in wireless communication systems,”Selected Areas in Communications, IEEE Journal on, vol. 26, pp. 1341–1365, October 2008.

    [3] D. GORE and A. Paulraj, “MIMO antenna subset selection withspace-time coding,” Signal Processing, IEEE Transactions on, vol. 50,pp. 2580–2588, Oct 2002.

    [4] A. Molisch, “MIMO systems with antenna selection - an overview,”in Radio and Wireless Conference, 2003. RAWCON ’03. Proceedings,pp. 167–170, Aug 2003.

    [5] X. Zhang, A. Molisch, and S.-Y. Kung, “Phase-shift-based antenna se-lection for MIMO channels,” in Global Telecommunications Conference,2003. GLOBECOM ’03. IEEE, vol. 2, pp. 1089–1093 Vol.2, Dec 2003.

    [6] A. Molisch and X. Zhang, “Fft-based hybrid antenna selection schemesfor spatially correlated MIMO channels,” Communications Letters,IEEE, vol. 8, pp. 36–38, Jan 2004.

    [7] P. Sudarshan, N. Mehta, A. Molisch, and J. Zhang, “Channel statistics-based RF pre-processing with antenna selection,” Wireless Communica-tions, IEEE Transactions on, vol. 5, pp. 3501–3511, December 2006.

    [8] D. Love and R. Heath, “Grassmannian beamforming on correlatedMIMO channels,” in Global Telecommunications Conference, 2004.GLOBECOM ’04. IEEE, vol. 1, pp. 106–110 Vol.1, Nov 2004.

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 13

    [9] K. Mukkavilli, A. Sabharwal, E. Erkip, and B. Aazhang, “On beamform-ing with finite rate feedback in multiple-antenna systems,” InformationTheory, IEEE Transactions on, vol. 49, pp. 2562–2579, Oct 2003.

    [10] B. Mondal and R. Heath, “Performance analysis of quantized beamform-ing MIMO systems,” Signal Processing, IEEE Transactions on, vol. 54,pp. 4753–4766, Dec 2006.

    [11] D. Love and R. Heath, “Limited feedback unitary precoding for spa-tial multiplexing systems,” Information Theory, IEEE Transactions on,vol. 51, pp. 2967–2976, Aug 2005.

    [12] V. Raghavan, R. Heath, and A. Sayeed, “Systematic codebook designsfor quantized beamforming in correlated MIMO channels,” SelectedAreas in Communications, IEEE Journal on, vol. 25, pp. 1298–1310,September 2007.

    [13] V. Raghavan, A. Sayeed, and V. Veeravalli, “Limited feedback precoderdesign for spatially correlated MIMO channels,” in Information Sciencesand Systems, 2007. CISS ’07. 41st Annual Conference on, pp. 113–118,March 2007.

    [14] V. Raghavan and V. Veeravalli, “Ensemble properties of RVQ-basedlimited-feedback beamforming codebooks,” Information Theory, IEEETransactions on, vol. 59, pp. 8224–8249, Dec 2013.

    [15] A. Molisch, M. Win, Y. seok Choi, and J. Winters, “Capacity of MIMOsystems with antenna selection,” Wireless Communications, IEEE Trans-actions on, vol. 4, pp. 1759–1772, July 2005.

    [16] S. Sanayei and A. Nosratinia, “Capacity of MIMO channels with antennaselection,” Information Theory, IEEE Transactions on, vol. 53, pp. 4356–4362, Nov 2007.

    [17] H. Shen and A. Ghrayeb, “Analysis of the outage probability for spatiallycorrelated MIMO channels with receive antenna selection,” in GlobalTelecommunications Conference, 2005. GLOBECOM ’05. IEEE, vol. 5,pp. 5 pp.–2464, Dec 2005.

    [18] S. Sanayei and A. Nosratinia, “Antenna selection in MIMO systems,”Communications Magazine, IEEE, vol. 42, pp. 68–73, Oct 2004.

    [19] A. M. Tulino and S. Verdu, “Random matrix theory and wirelesscommunications,” Foundations and TrendsÂő in Communications andInformation Theory, vol. 1, no. 1, pp. 1–182, 2004.

    [20] C. Martin and B. Ottersten, “Asymptotic eigenvalue distributions andcapacity for MIMO channels under correlated fading,” Wireless Com-munications, IEEE Transactions on, vol. 3, pp. 1350–1359, July 2004.

    [21] D. Morales-Jimenez, J. Paris, J. Entrambasaguas, and K.-K. Wong, “Onthe diagonal distribution of a complex wishart matrix and its applicationto the analysis of MIMO systems,” Communications, IEEE Transactionson, vol. 59, pp. 3475–3484, December 2011.

    [22] A. Maaref and S. Aissa, “Joint and marginal eigenvalue distributionsof (non)central complex wishart matrices and pdf-based approach forcharacterizing the capacity statistics of MIMO ricean and rayleigh fadingchannels,” Wireless Communications, IEEE Transactions on, vol. 6,pp. 3607–3619, October 2007.

    [23] P. J. Smith and L. M. Garth, “Distribution and characteristic functions forcorrelated complex wishart matrices,” Journal of Multivariate Analysis,vol. 98, no. 4, pp. 661 – 677, 2007.

    [24] P.-H. Kuo, P. Smith, and L. Garth, “Joint density for eigenvalues oftwo correlated complex wishart matrices: Characterization of MIMOsystems,” Wireless Communications, IEEE Transactions on, vol. 6,pp. 3902–3906, November 2007.

    [25] J. Kermoal, L. Schumacher, K. Pedersen, P. Mogensen, and F. Frederik-sen, “A stochastic MIMO radio channel model with experimental vali-dation,” Selected Areas in Communications, IEEE Journal on, vol. 20,pp. 1211–1226, Aug 2002.

    [26] T. M. Cover and J. A. Thomas, Elements of Information Theory(Wiley Series in Telecommunications and Signal Processing). Wiley-Interscience, 2006.

    [27] K. V. Mardia, “Measures of multivariate skewness and kurtosis withapplications,” Biometrika, vol. 57, no. 3, pp. 519–530, 1970.

    [28] P. Smith and M. Shafi, “On a gaussian approximation to the capacity ofwireless MIMO systems,” in Communications, 2002. ICC 2002. IEEEInternational Conference on, vol. 1, pp. 406–410, 2002.

    [29] P. Smith, S. Roy, and M. Shafi, “Capacity of MIMO systems withsemicorrelated flat fading,” Information Theory, IEEE Transactions on,vol. 49, pp. 2781–2788, Oct 2003.

    [30] “Fundamental multidimensional variables,” in Probability DistributionsInvolving Gaussian Random Variables, pp. 17–23, Springer US, 2002.

    [31] R. B. Arellano-Valle and M. G. Genton, “On the exact distribution of lin-ear combinations of order statistics from dependent random variables,”Journal of Multivariate Analysis, vol. 98, no. 10, pp. 1876 – 1894, 2007.

    [32] H. Nagaraja, “Order statistics from independent exponential randomvariables and the sum of the top order statistics,” in Advances inDistribution Theory, Order Statistics, and Inference (N. Balakrishnan,

    J. Sarabia, and E. Castillo, eds.), Statistics for Industry and Technology,pp. 173–185, BirkhÃd’user Boston, 2006.

    [33] A. Ross, “Computing bounds on the expected maximum of correlatednormal variables,” Methodology and Computing in Applied Probability,vol. 12, no. 1, pp. 111–138, 2010.

    [34] D. R. Hoover, “Bounds on expectations of order statistics for dependentsamples,” Statistics and Probability Letters, vol. 8, no. 3, pp. 261 – 265,1989.

    [35] R. A. G. Barry C. Arnold, “Bounds on expectations of linear systematicstatistics based on dependent samples,” The Annals of Statistics, vol. 7,no. 1, pp. 220–223, 1979.

    [36] Y. Tong, “Order statistics of normal variables,” in The MultivariateNormal Distribution, Springer Series in Statistics, pp. 123–149, SpringerNew York, 1990.

    [37] C. E. Clark, “The greatest of a finite set of random variables,” Oper.Res., vol. 9, pp. 145–162, Apr. 1961.

    [38] D. Sinha, H. Zhou, and N. Shenoy, “Advances in computation of themaximum of a set of random variables,” in Quality Electronic Design,2006. ISQED ’06. 7th International Symposium on, pp. 6 pp.–311,March 2006.

    [39] S. Asmussen, J. L. Jensen, and L. Rojas-Nandayapa, “A literature reviewon lognormal sums,”

    [40] L. Fenton, “The sum of log-normal probability distributions in scattertransmission systems,” Communications Systems, IRE Transactions on,vol. 8, pp. 57–67, March 1960.

    [41] A. Abu-Dayya and N. Beaulieu, “Outage probabilities in the presenceof correlated lognormal interferers,” Vehicular Technology, IEEE Trans-actions on, vol. 43, pp. 164–173, Feb 1994.

    [42] N. Mehta, J. Wu, A. Molisch, and J. Zhang, “Approximating a sum ofrandom variables with a lognormal,” Wireless Communications, IEEETransactions on, vol. 6, pp. 2690–2699, July 2007.

    [43] A. Safak, “Statistical analysis of the power sum of multiple correlatedlog-normal components,” Vehicular Technology, IEEE Transactions on,vol. 42, pp. 58–61, Feb 1993.

    [44] S. C. Schwartz and Y. S. Yeh, “On the distribution function and momentsof power sums with log-normal components,” Bell System TechnicalJournal, vol. 61, no. 7, pp. 1441–1462, 1982.

    [45] D. J. Love, “Tables of complex grassmannian packings [online],” 2004.Available: https://engineering.purdue.edu/ djlove/grass.html.

    [46] D. Love, R. Heath, and T. Strohmer, “Grassmannian beamforming formultiple-input multiple-output wireless systems,” Information Theory,IEEE Transactions on, vol. 49, pp. 2735–2747, Oct 2003.

    [47] F. Dittrich, “Useful formula for moment computation of normal randomvariables with nonzero means,” Automatic Control, IEEE Transactionson, vol. 16, pp. 263–265, Jun 1971.

    [48] A. J. Hoffman and H. W. Wielandt, “The variation of the spectrum ofa normal matrix,” Duke Math. J., vol. 20, pp. 37–39, 03 1953.

    [49] R. Bhatia, Matrix Analysis (Graduate Texts in Mathematics). SpringerNew York, 1997.

    [50] H. Cramér and H. Wold, “Some theorems on distribution functions,”Journal of the London Mathematical Society, vol. s1-11, no. 4, pp. 290–294, 1936.

    [51] G. Levin and S. Loyka, “Comments on "Asymptotic eigenvalue dis-tributions and capacity for MIMO channels under correlated fading",”Wireless Communications, IEEE Transactions on, vol. 7, pp. 475–479,February 2008.

    [52] J. Pierce and S. Stein, “Multiple diversity with nonindependent fading,”Proceedings of the IRE, vol. 48, pp. 89–104, Jan 1960.

    [53] U. Grenander and G. Szegö, Toeplitz Forms and Their Applications.AMS Chelsea Publishing Series, University of California Press, 2001.

    Vishnu V. Ratnam (S’10) received his Bachelorof Technology (Hons.) in electronics and electricalcommunication engineering from Indian Instituteof Technology, Kharagpur in 2012. He graduatedas the Salutatorian for the class of 2012. He iscurrently pursuing a Ph.D in Electrical Engineeringat University of Southern California. His researchinterests are in: the design and analysis of lowcomplexity transceivers for large antenna systems(Massive MIMO) and Ultra Wide-Band systems;and resource allocation problems in multi-antenna

    networks such as cooperative/ distributed antenna systems.

  • TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. -, NO. -, 14

    Andreas F. Molisch (S’89–M’95–SM’00–F’05) re-ceived the Dipl. Ing., Ph.D., and habilitation degreesfrom the Technical University of Vienna, Vienna,Austria, in 1990, 1994, and 1999, respectively. Hesubsequently was with AT&T (Bell) LaboratoriesResearch (USA); Lund University, Lund, Sweden,and Mitsubishi Electric Research Labs (USA). Heis now a Professor of Electrical Engineering at theUniversity of Southern California, Los Angeles. Hiscurrent research interests are the measurement andmodeling of mobile radio channels, ultra-wideband

    communications and localization, cooperative communications, multiple-inputâĂŞmultiple-output systems, wireless systems for healthcare, and novelcellular architectures. He has authored, coauthored, or edited four books(among them the textbook Wireless Communications, Wiley-IEEE Press), 16book chapters, some 200 journal papers, 270 conference papers, as well asmore than 80 patents and 70 standards contributions.

    Dr. Molisch has been an Editor of a number of journals and specialissues, General Chair, Tecnical Program Committee Chair, or SymposiumChair of multiple international conferences, as well as Chairman of variousinternational standardization groups. He is a Fellow of the National Academyof Inventors, Fellow of the AAAS, Fellow of the IET, an IEEE DistinguishedLecturer, and a member of the Austrian Academy of Sciences. He has receivednumerous awards, among them the Donald Fink Prize of the IEEE, and theEric Sumner Award of the IEEE.

    Haralabos C. Papadopoulos (S’92–M’98) receivedthe S.B., S.M., and Ph.D. degrees from the Mas-sachusetts Institute of Technology, Cambridge, MA,all in electrical engineering and computer science,in 1990, 1993, and 1998, respectively.

    Since December 2005, he has been with DO-COMO Innovations, Palo Alto, CA, working onphysical-layer algorithms for wireless communica-tion systems and architectures. From 1998 to 2005,he was on the faculty of the Department of Electricaland Computer Engineering, University of Maryland,

    College Park, MD, and held a joint appointment with the Institute of SystemsResearch. During his 1993–1995 summer visits to AT&T Bell Labs, MurrayHill, NJ, he worked on shared time-division duplexing systems and digitalaudio broadcasting. His research interests are in the areas of communicationsand signal processing, with emphasis on resource-efficient algorithms andarchitectures for wireless communication systems.

    Dr. Papadopoulos is the recipient of an NSF CAREER Award (2000), theG. Corcoran Award (2000) given by the University of Maryland, College Park,and the 1994 F. C. Hennie Award (1994) given by the MIT EECS department.He is also a coauthor of the VTC Fall 2009 Best Student Paper Award. He isa member of Eta Kappa Nu and Tau Beta Pi. He is also active in the industryand an inventor on several issued and pending patents.