Compiler Design for Digital Signal Processors · LSI Proprietary 3 Storage Networking Systems...

Compiler Design

for Digital Signal Processors

Erik Eckstein

03/25/2010

2LSI Proprietary

LSI Corporation

• LSI is a global leader in semiconductors and software solutions

• Solutions for storage and networking markets

• Founded 1981

• Headquarter: Milpitas, California

• Design centers around the world

– Small design center in Vienna: development of DSP software tools

3LSI Proprietary

StorageNetworking

Systems Semiconductors

Entry RAID Systems

Mid-range RAID

Systems

Host RAID Adapters

Host RAID SW

Data Management

HDD ICs: HDCs, SoCs,

RCs, Pre-Amps

SAS & SAS ROC

Controllers

SAS Expanders, Bridges

SAN Fabric Products

Custom Storage

Products

Custom Products –

Enterprise Datacom

Network Processors

Framers/Mappers

Link Layer Products

Modems, USB,

Firewire

Market Focus

4LSI Proprietary

• Digital Signal Processors– Used in mobile phones, wireless devices, infrastructure, etc.

– Audio, video processing

– High performance for signal processing algorithms

– Low power consumption

• Example: StarPro2716– 16 StarCore SC3400e DSP cores @ 750 MHz

– 8 ARM cores for networking packet processing

– Ethernet I/O support

– 6MB on chip memory

– 96 GMACs (96*109 multiply-accumulate instructions/sec)

– Multichannel speech processing in voice gateways

DSP Overview

5LSI Proprietary

DSP Characteristics

• RISC-like architecture

• Multiple parallel units

• VLIW (very long instruction word)

• Multiple memory buses

• 40-bit registers

• Complex addressing modes

• Hardware loops

• Fixed-point arithmetic

6LSI Proprietary

SC3400e Core

PCUprogram control unit

AGUaddress generation unit

DALUdata ALU

PROGRAM

XPA (32)

XPD (128)

XAA (32)

XAD.in XAD.out (64) (64)

XBA (32)

2 FETCH BUFFERS

16DATA

REGISTERS

27ADDRESS

REGISTERS

7LSI Proprietary

DSP-C Compiler Requirements

• Support for all DSP features– with language extensions (fractional data types)

– with automatic recognition

• Optimizing for speed– time critical code

• Optimizing for code size– control code

8LSI Proprietary

high intermediate representation(general)

Compiler Design

low intermediate representation(processor specific)

component component component...

component

9LSI Proprietary

Code Generator

C Code Parser

Optimizations on Low Level IR

Optimizations on High Level IR

Front End

Back End

Compilation Stages

10LSI Proprietary

addr of addr of

High Level IR Example

assign

addr of

obj "b"

read read

obj "c"

obj "a"

a = b + c;

11LSI Proprietary

Low Level IR Example

move.w #<32,r0

suba r1,r0

asll d7,d1

move.w #<-1,d5 move.l (r6),d4

12LSI Proprietary

Optimizations are Important!

• Compiler contains approx. 50 different optimizations– 70% high level optimizations

– 30% low level optimizations

• Most optimization are invoked several times– up to 17 times!

13LSI Proprietary

Optimization Example: FIR Filter

long fir(short *x, short *y)

long sum = 0;

int i;

for (i = 0; i < 128; i++) {

sum += x[i] * y[i];

return sum;

14LSI Proprietary

Loop Strength Reduction

long sum = 0;

int i;

for (i = 0; i < 128; i++) {

sum += *x * *y;

return sum;

15LSI Proprietary

Hardware Loop Support

long sum = 0;

int i;

loop 128 {

sum += *x * *y;

return sum;

16LSI Proprietary

Instruction Selection

doensh3 #128

clr sum

loopstart3

move.w (x),tmp1

move.w (y),tmp2

imac tmp2,tmp1,sum

adda #4,x

adda #4,y

loopend3

17LSI Proprietary

Register Allocation

doensh3 #128

clr d0

loopstart3

move.w (r0),d1

move.w (r1),d2

imac d2,d1,d0

adda #4,r0

adda #4,r1

loopend3

5 cycles/iteration

18LSI Proprietary

Addressing Mode Optimization

doensh3 #128

sub d0,d0,d0

loopstart3

move.w (r0)+,d1

move.w (r1)+,d2

imac d2,d1,d0

loopend3

3 cycles/iteration

19LSI Proprietary

Instruction Scheduling

doensh3 #128

sub d0,d0,d0

loopstart3

[ move.w (r0)+,d1

move.w (r1)+,d2 ]*)

imac d2,d1,d0

loopend3

2 cycles/iteration*) executed in parallel

20LSI Proprietary

Software Pipelining

1 cycles/iteration

sub d0,d0,d0

doensh3 #127

[ move.w (r0)+,d1 move.w (r1)+,d2 ]

loopstart3

[ move.w(r0)+,d1

move.w(r1)+,d2

imac d2,d1,d0 ]

loopend3

imac d2,d1,d0

21LSI Proprietary

Loop Transformation

0.25 cycles/iteration

loopstart3

[ move.4w (r0)+,d0:d1:d2:d3

move.4w (r1)+,d8:d9:d10:d11

imac d0,d8,d12

imac d1,d9,d13

imac d2,d10,d14

imac d3,d11,d1 ]

loopend3

Questions?

23LSI Proprietary

Compiler Design for Digital Signal Processors · LSI Proprietary 3 Storage Networking Systems...

Documents

RocketRAID 4320 SAS Host Adapter · 2018. 4. 9. · • Web browser-base RAID management software • Disk scrubbing to prevent degraded RAID arrays • Bad sector repair and re-mapping

Lecture 11: Storage Systems Disk, RAID, Dependability Kai Bu kaibu@zju.edu.cn

Xserve RAID - AppleRAID controllers support RAID levels 0, 1, 3, 5, and 0+1. Xserve RAID also supports hybrid RAID levels 10, 30, and 50 when used in conjunction with host-based software

Dell EMC PowerEdge RAID Controller S140 · Windows RAID: Volume, RAID 1, RAID 0, RAID 5, RAID 10 Linux RAID: RAID 1 NOTE: Non-boot virtual disks of any supported RAID level by the

RocketRAID 3530/3540 SATAII Host Adapter User’s Guide · • Battery Backup Unit (BBU) Optional • RoHS compliant Advanced RAID Features • Support RAID 0, 1, 3, 5, 6,10, 50 and

Intel® RAID Controller SASMF8I Host-Bus Adapter ...Product Brief Intel® RAID Controller SASMF8I Intel® RAID Controller SASMF8I Host-Bus Adapter Enhances System Resources to Deliver

SAS-3 Integrated RAID Solution User Guide › docs-and-downloads › host-bus-adapter… · SAS-3 Integrated RAID Solution User Guide November 2012 Chapter 1: Introduction to the

Power Systems: Serial-attached SCSI RAID enablement and - IBM

Power Systems SAS RAID controllers for AIX - IBM - United States

Intel Embedded Server RAID Technology 2 (ESRT2) · 2019-05-30 · 1.1 Intel Embedded Server RAID Technology 2 Features The ESRT2 is a host-based RAID solution that utilizes the chipset,

Non-Volatile CACHE for Host- Based RAID Controllersi.dell.com/.../Documents/NV-Cache-for-Host-Based-RAID-Controllers… · Non-Volatile CACHE for Host-Based RAID ... Non-Volatile

Power Systems SAS RAID controllers for AIX - IBM · Power Systems SAS RAID controllers for AIX. Note Before using this information and the product it supports, read the information

HP ProLiant ML150 Generation 3 servidor...Storage Controller HP Embedded SATA RAID Controller or HP 8 Internal Port SAS Host Bus Adapter with RAID Storage Diskette Drive None ship

RAID Storage, Network File Systems, and DropBoxcseweb.ucsd.edu/~gmporter/classes/wi15/cse124/lectures/lecture12.pdf · RAID Storage, Network File Systems, and DropBox George Porter

VMware vSphere Storage Appliance · install devices for the ESXi 5.0 host. ESXi Physical Hardware SCSI Disk RAID 10 SAS RAID Adapter RAID 10 Array PSA/VMFS/vSCSI Partition Table ESXi

Intel Embedded Server RAID Technology II · Intel® Embedded Server RAID Technology is host-based RAID that utilizes the chipset, processors, and memory of the server board to provide

Advanced UNIX File Systems Berkley Fast File System, Logging File Systems And RAID

Intel® RAID Module RMS3CC080/RMS3CC040 and Intel® … · 2 Prepare the Host Computer CAUTION: Before you install the RAID module, make sure that the host computer is disconnected

ABCD A’B’C’D’ SVC Raid 5 (TB)Raid 1 (TB) Host Raid 5 (TB)Raid 1 (TB) ABasic Virt. Disk100 A'Basic Virt. Disk100 BBasic Virt. Disk 100B'Basic Virt. Disk

Non-Volatile CACHE for Host- Based RAID Controllers...Non-Volatile CACHE for Host-Based RAID Controllers Page ii THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN