40
Novel Architectures for Novel Architectures for Novel Architectures for Baseband Processing Novel Architectures for Baseband Processing Viktor Öwall Dept. of Electrical and Information Technology Lund University, Sweden [email protected] 2010-09-23 Lund Circuit Design Workshop 2010

NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Novel Architectures forNovel Architectures forNovel Architectures for Baseband ProcessingNovel Architectures for Baseband Processinggg

Viktor ÖwallDept. of Electrical and Information Technology

Lund University, [email protected]

2010-09-23 Lund Circuit Design Workshop 2010

Page 2: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Outline

• Multibase project and DFE Rx• Reconfigurable computationsg p• Sign bit synchronization• Analog decodingAnalog decoding• SVD-based channel estimation

2010-09-23 Lund Circuit Design Workshop 2010

Page 3: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

FP7 STREP Multi-Base

Multibase: January 2008 – April 2011Multibase: January 2008 – April 2011

Scalable Multi-tasking Baseband for Mobile Communications1. Multi-streaming radio (concurrent execution of multiple standards)2 S l bl bl / fi bl lti t h l2. Scalable programmable/reconfigurable multi-processor technology3. Algorithm/architecture co-design for maximum energy efficiency

2010-09-22 Lund Circuit Design Workshop 2010

LU Focus: Synch and Channel Estimation, algorithms and architectures.

Page 4: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

FP7 STREP Multi-Base

Lund focus in circuits: DFE RxLund focus in circuits: DFE RxContributors:

I. Diaz1, L. Hollevoet2 , T. Olsson3, J. Rodrigues1,

J S 3 L V D P 2J. Svensson3, L. Van Der Perre2, L. Wilhelmsson3, C. Zhang1,

and V Öwall1and V. Öwall1Lund University

2IMEC2IMEC3Ericsson

2010-09-22 Lund Circuit Design Workshop 2010

Page 5: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Motivation

• Design a multi-standard digital front-end receiver including:– Automatic Gain and Resource Activity Controller– Compensation– Filtering and resamplingg p g– Synchronization

• Support for concurrent data streams

• Supported standards: LTE, DVB-H and IEEE 802.11n

2010-09-23 Lund Circuit Design Workshop 2010

Page 6: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

The standards

• All three standards are OFDM based, however with different FFT-size:LTE 128 2048– LTE: 128-2048

– DVB-H: 2048, 4092 (and 8192)– IEEE 802.11n: 64-128

• …and different sample rates:LTE 30 72 Msps LTELTE– LTE: 30.72 Msps

– DVB-H: 9.143 Msps– IEEE 802.11n: 20 or 40 Msps

LTELTE

DVBDVBPilot based

DVBDVB

802.11n802.11nPreamble based

2010-09-23 Lund Circuit Design Workshop 2010

Preamble based

Page 7: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

The multi stream DFE-Rx

Two concurrent data streams in hardware.

More data streamspossible since theydo not transmit all th tithe time.

2010-09-23 Lund Circuit Design Workshop 2010

Page 8: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

AGRAC - Automatic Gain and Resource Activity ControllerResource Activity Controller

• Performs AGC of the analog frontend• Enables/disables other subblocks of the DFE Rx and• Enables/disables other subblocks of the DFE Rx and baseband processing as required.

• Hierarchical wakeup to conserve power.Hierarchical wakeup to conserve power.

2010-09-23 Lund Circuit Design Workshop 2010

Page 9: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Hierarchical wakeup: principle

Activation probability Power consumptionComponentLow High

Baseband

Sync

High Low

Sync

AGRAC High Low

2010-09-23 Lund Circuit Design Workshop 2010

Page 10: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Example: reception of WLAN

DATAChannel

Preamble Noise

Powerconsumption

DATAChannel

p

timeAGRAC (signal detection)

Coarse time syncBaseband processing False trigger, no power wasted

because baseband engine notactivated

2010-09-23 Lund Circuit Design Workshop 2010

Page 11: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

DFE Decimation Chain

IN

OUTbypass for test

Adaptive Farrowvalid in valid outFiltering Resamplervalid_in valid_out

clk control

• Reconfigurable for LTE, 802.11n, and DVB-H• One decimation chain for each DFE stream

2010-09-23 Lund Circuit Design Workshop 2010

Page 12: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Farrow Resampler

• Polynomial order = 3, Length = 8• Transposed FIRs for reduced delay• Optimized fixed point coefficients and maximal coefficient sharing

- Realized through integer linear programming - Realized through integer linear programming

Page 13: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Synchronization

WLAN

Preamble Signal Data

OFDM SymbolSymbolCopy

Allstandards

CP Data CP

Page 14: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Synchronization Architecture

Size of FFT Size of CP or preamble/2

Page 15: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Synchronization Hardware

Mapped onto a reconfiguarble processing array:Mapped onto a reconfiguarble processing array:• flexibility since multi-standard• hardware reuse since sync only part of the time

Page 16: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Multi-standard OFDM time synchronizationsynchronization

• Multiple radio standard support• Dual-standard concurrent support• Scenario-aware adaptive resource allocation• Scenario-aware adaptive resource allocation• Expandable to more streams

arg

2010-09-23 Lund Circuit Design Workshop 2010

Page 17: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Interleaved data storage

• Micro-block function in MC eliminates data alignment operations in PC.alignment operations in PC.

Sign SignInphase Quadrature

011162731 19 3

12 bit > 4 bit

(a)

723

I Q

12 bits -> 4 bits

PC0 -> MC0

(b)Sign Sign

Shift by 0 &

2(I) 2(Q)

1(I) 1(Q)

3(I) 3(Q)(c)

Shift by 0 & mask

Shift by 16& mask

Shift by 4& maskLogic

“or”

4(I) 4(Q)

3(Q) 1(Q)4(Q) 2(Q)3(I) 1(I)4(I) 2(I)

After 4 iterations

(d)Address ‘X’

Shift by 20& mask

2010-09-23 Lund Circuit Design Workshop 2010

Page 18: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Coarse-grained reconfigurable architecture

• Heterogeneous cell array.

• Decoupled processing and memory cells.

• Dedicated local connections for high data throughput

R R

throughput.

• Hierarchical global routing network for communication flexibility. R R

R

communication flexibility.

• Single-Cycle-Per-Hop communication latency.

• Centralized & Distributed cell configuration

R R

gmanagement.

2010-09-23 Lund Circuit Design Workshop 2010

Page 19: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Application mapping with CAL

Mapping of applications and algorithms is a crucial task inalgorithms is a crucial task in order to make them useful.• In cooperation with the

t icomputer science department at LTH.

• CAL, data flow language toCAL, data flow language to specify parallelism for parallel hardware.D t t i li ti• Data centric application mapping.

2010-09-23 Lund Circuit Design Workshop 2010

Page 20: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

The Chip

• Infineon 65nm CMOS process

• Total area 5mm2

• 144 Pads

• Clock frequencies:Main part 80MHz– Main part 80MHz

– Synchronization engine 320MHz through on-chip clock multiplier

2010-09-23 Lund Circuit Design Workshop 2010

Page 21: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

The Chip

Infineon 65nm CMOS

2010-09-23 Lund Circuit Design Workshop 2010

Page 22: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Alternative: Sign-Bit Synch.

Wh t d ith l th i f th IQ t ?What can we do with only the sign of the IQ components?

Presented at ISCAS´10 and VTC´10Presented at ISCAS 10 and VTC 10.

2010-09-23 Lund Circuit Design Workshop 2010

Page 23: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Alternative: Sign-Bit Synch.

Wh t d ith l th i f th IQ t ?What can we do with only the sign of the IQ components?

To be further evaluated OK for coarse synch?To be further evaluated, OK for coarse synch?

2010-09-23 Lund Circuit Design Workshop 2010

Page 24: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Sign-Bit Synchronization:implementation resultsimplementation results

97% area reduction

Memory in correlation

Memory in moving

Comb Logic

Total0.250.3

Logic Process: ST65nm CMOS

correlation in moving sum

Logic

Sign  0.033 0.012 0.0018 0.047

8 Bits 0.25 0.4 0.006 0.668 0 10.150.20.25

[mm²]Memory [mm²]

8 Bits 0.25 0.4 0.006 0.668

00.050.1

8 Bit 1 Bit

The chip is currentlybeing measured.

2010-09-23 Lund Circuit Design Workshop 2010

Page 25: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

UPD: analog decoding

Analog decoder in a digital environment.

System Level Aspects

RF Front -End DecodingDigital Baseband

Sigma Delta A-to-DAntenna

Interface Decoding

System Level Control

gBasebandConverterInterface Decoding

2010-09-22 Lund Circuit Design Workshop 2010

Page 26: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

UPD: analog decodingSimulated spectrum for the output signal from the delta-sigma modulator with SNDR calculated

Why analog decoders:l ti• very low power consumption

• less area on silicon• high throughput due to parallel computations

Analog Decoding

Alternative Decoding Approaches

D-to-ADetection & Synch.

Analog Decoding

pp

Digital DecodingRF Front-End Digital

Baseband

Sigma Delta A-to-DAntenna

Interface Decoding

System Level Control

BasebandConverterInterface

2010-09-22 Lund Circuit Design Workshop 2010

Page 27: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Analog decoder: analog in/digital out

2010-09-23 Lund Circuit Design Workshop 2010

Page 28: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Analog Core Module for BCJR

Analog CoreThe structure of the core

will decide the code, e g Hamming ore.g. Hamming or

tailbitingconvolutional (7,5)

Analog Core Module used for BCJR calculations.

2010-09-23 Lund Circuit Design Workshop 2010

Page 29: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Analog Core Module for BCJR

Forward metric calculationsForward metric calculations in BCJR algorithm.Backward in the same way. Equivalent analog circuit.

2010-09-23 Lund Circuit Design Workshop 2010

y Equivalent analog circuit.

Page 30: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Alternative structure: digital in/digital out

2010-09-23 Lund Circuit Design Workshop 2010

Page 31: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Decoder Output for a Single Bit

1decisionlevel

0

level

Cadence netlist simulations:• Less than 4 µs is needed to decode the transmitted codeword

D i i h i it t t d t t

2010-09-23 Lund Circuit Design Workshop 2010

• Decision: when circuit converges to an steady state

Page 32: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

BER performance

2010-09-23 Lund Circuit Design Workshop 2010

Page 33: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Results: Analog Hamming Decoder

2010-09-23 Lund Circuit Design Workshop 2010

Page 34: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Future work: extend to Convolutional

Expectations: Better performance compared to Hamming decoder

Corresponding tail-biting trellis

Cost: More complexity => More power consumption and area

How much? => Research is ongoing

2010-09-23 Lund Circuit Design Workshop 2010

How much? => Research is ongoing

Page 35: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

SVD-based Channel Estimation

• SVD reduces the complexity of a linear MMSE estimator• SVD reduces the complexity of a linear MMSE estimator– A number of values are close to zero and can be ignored

H1 HH 2121HVUQW 1 HH UDUDUDUW )( 2121

SVD-based Architecture,where U and V are

th l t i d

Conceptual Architecture showing a number of

values ignoredorthonormal matrices and Q−1 is a filtering matrix.

values ignored

Presented at NORCHIP 2009

Page 36: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

SVD-based Channel Estimation

• SVD reduces the complexity of a linear MMSE estimator• SVD reduces the complexity of a linear MMSE estimator– A number of values are close to zero and can be ignored. – HOW MANY?

HH UDUDUDUW )( 2121

Conceptual Architecture showing a number of values ignored

Full MMSE “same”

Where K=8 is the length of the Cyclic Prefix, i.e. CP + some.

Presented at NORCHIP 2009

Page 37: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

SVD-based Channel Estimation

• SVD reduces the complexity of a linear MMSE estimator• SVD reduces the complexity of a linear MMSE estimator– A number of values are close to zero and can be ignored. – HOW MANY?

HH UDUDUDUW )( 2121

Conceptual Architecture showing a number of values ignored

Full MMSE “same” Complexity is reduced by 66% using K=11.

Where K=8 is the length of the Cyclic Prefix, i.e. CP + some.

Presented at NORCHIP 2009

Page 38: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

SVD-based Channel Estimation

• An architecture using four multipliers has been designed for FPGA and ASIC.- FPGA: Functional verification and Resource Usage- ASIC in UMC130nm: Area and Power Simulation

Simulated Power Consumption• Average power consumption of 14.2 mW• Peak power consumption 84.5 mW

Page 39: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

...and now to:Faster-than-Nyquist (FTN) receiverFaster-than-Nyquist (FTN) receiver

FTN iterative receiverFTN iterative receiver

f T∆ = 1

F∆ = 1

t

F∆ < 1

Designed for ST 65nm CMOSMotivation: to increase bandwidth efficiency by compressing in time

d/ f

Designed for ST 65nm CMOSTape-out in November

2010-09-22 Lund Circuit Design Workshop 2010

and/or frequency

Page 40: NovelArchitecturesforNovel Architectures for Baseband ...cdworkshop.eit.lth.se/fileadmin/eit/group/71/2010/Owall-baseband.pdf · JSJ. Svensson3, LV D PL. Van Der Perre2, L. Wilhelmsson3,C

Th k Y !Thank You!

2010-09-23 Lund Circuit Design Workshop 2010