Image Processing
9. Image Compression
Computer Engineering, Sejong University
Dongil Han
The storage and communication requirements of video are immense
• storage requirement for uncompressed video :
  512 pixels x 512 pixels x 3 bytes/pixel x 30 frames/sec = 23.6 MBytes/sec
• image compression : reducing the amount of data required to represent a digital image
  ex) 23.6 MBytes => 187 KBytes

Fundamentals of image compression
• reduce or eliminate the redundant data in the image or video
Application
• Digital TV broadcasting
• VOD (Video on Demand)
• Video teleconferencing
• Medical imaging
• Facsimile transmission
• other diverse multimedia environments
=> mainly used for storage and transmission

Evaluation of compression efficiency
• compression ratio

  compression ratio = original data / compressed data

• the greater the compression ratio, the smaller the compressed image will be
Selection criteria for image compression techniques
• achievable compression ratio
• compression/decompression time
• complexity of the algorithm
• cost of computational resources
• availability of computational resources
• adoption of a standard or not
• lossy or lossless compression
• the quality of the decoded image when lossy compression is used
Fundamentals
Two basic types of image compression
• Lossless compression : encodes and decodes the data perfectly; the resulting image matches the original image exactly
• Lossy compression : allows redundant and nonessential information to be lost

Data compression
• process of reducing the amount of data required to represent a given quantity of information
• key idea : reducing the data redundancy
Three basic data redundancies in digital images
• coding redundancy
• interpixel redundancy
• psychovisual redundancy
Elimination of coding redundancy
• assigning fewer bits to the more probable data
• assigning more bits to the less probable data
• this process is referred to as variable-length coding
• reduces the total number of encoded bits
• image data : assigning fewer bits to the gray levels with higher histogram counts
coding redundancy
• a discrete random variable r_k in the interval [0,1] represents the gray levels of an image
• each r_k occurs with probability p_r(r_k) = n_k / n
  where L is the number of gray levels, n_k is the number of occurrences of the k-th gray level, and n is the total number of pixels
• if the number of bits used to represent each value of r_k is l(r_k), then the average number of bits required to represent each pixel is

  L_avg = Σ_{k=0}^{L-1} l(r_k) p_r(r_k)

• elimination of coding redundancy => minimize L_avg
Elimination of coding redundancy

code 1 (fixed length) :

  L_avg = Σ_{k=0}^{7} l_1(r_k) p_r(r_k)
        = 3(0.19) + 3(0.25) + 3(0.21) + 3(0.16) + 3(0.08) + 3(0.06) + 3(0.03) + 3(0.02)
        = 3.0 bits

code 2 (variable length) :

  L_avg = Σ_{k=0}^{7} l_2(r_k) p_r(r_k)
        = 2(0.19) + 2(0.25) + 2(0.21) + 3(0.16) + 4(0.08) + 5(0.06) + 6(0.03) + 6(0.02)
        = 2.7 bits
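The two averages above can be checked with a short script; this is a sketch of my own (the probabilities and code lengths are taken from the example):

```python
# Average code length: L_avg = sum over k of l(r_k) * p_r(r_k)
def avg_length(lengths, probs):
    return sum(l * p for l, p in zip(lengths, probs))

probs = [0.19, 0.25, 0.21, 0.16, 0.08, 0.06, 0.03, 0.02]
code1 = [3] * 8                    # fixed-length code, 3 bits per gray level
code2 = [2, 2, 2, 3, 4, 5, 6, 6]  # variable-length code from the example

print(round(avg_length(code1, probs), 4))  # 3.0
print(round(avg_length(code2, probs), 4))  # 2.7
```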
Elimination of interpixel redundancy
• use similarities between adjacent pixels
• use similarities between adjacent fields
• Run-length encoding, DPCM, ADPCM, etc.
• terminologies for representing interpixel redundancy
  - spatial redundancy
  - geometric redundancy
  - interframe redundancy
Elimination of psychovisual redundancy
• the brightness of a region depends on factors other than simply the light reflected by the region
• certain information has less importance than other information in normal visual processing
• this information is said to be psychovisually redundant
• this redundancy can be eliminated without significantly impairing the quality of image perception
  => results in a loss of quantitative information
• related to sampling and quantization
Fidelity Criteria
• reproducible means of quantifying the nature and extent of information loss
  - objective fidelity criteria
  - subjective fidelity criteria
• objective fidelity criteria offer a simple and convenient mechanism for evaluating information loss
• sometimes, subjective evaluations by a human observer are more appropriate
Example of objective fidelity criteria
• root-mean-square (rms) error between an input and output image
• let f(x,y) be the input image and f̂(x,y) the estimate of the input image; the total error between the two images is

  Σ_{x=0}^{M-1} Σ_{y=0}^{N-1} [f̂(x,y) - f(x,y)]

  where the images are of size M x N
• the root-mean-square error e_rms between the two images is

  e_rms = [ (1/MN) Σ_{x=0}^{M-1} Σ_{y=0}^{N-1} [f̂(x,y) - f(x,y)]² ]^{1/2}
Example of objective fidelity criteria
• the mean-square signal-to-noise ratio of the estimated image, denoted SNR_ms, is

  SNR_ms = Σ_{x=0}^{M-1} Σ_{y=0}^{N-1} f̂(x,y)² / Σ_{x=0}^{M-1} Σ_{y=0}^{N-1} [f̂(x,y) - f(x,y)]²
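Both criteria are straightforward to compute; a minimal sketch of my own (function names and the tiny 2x2 test images are illustrative, not from the slides):

```python
import math

def rms_error(f, f_hat):
    """Root-mean-square error between an M x N image f and its estimate f_hat."""
    m, n = len(f), len(f[0])
    total = sum((f_hat[x][y] - f[x][y]) ** 2 for x in range(m) for y in range(n))
    return math.sqrt(total / (m * n))

def snr_ms(f, f_hat):
    """Mean-square signal-to-noise ratio of the estimated image."""
    m, n = len(f), len(f[0])
    signal = sum(f_hat[x][y] ** 2 for x in range(m) for y in range(n))
    noise = sum((f_hat[x][y] - f[x][y]) ** 2 for x in range(m) for y in range(n))
    return signal / noise

f     = [[1, 2], [3, 4]]
f_hat = [[1, 2], [3, 6]]    # one pixel off by 2
print(rms_error(f, f_hat))  # 1.0
print(snr_ms(f, f_hat))     # 12.5
```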
Image Compression Model
Image Compression System
• two distinct structural blocks : Encoder and Decoder
• source encoder : removes input redundancies
• channel encoder : increases the noise immunity
• in a noise-free environment, the channel encoder/decoder are omitted

  f(x,y) -> [ Source encoder -> Channel encoder ] -> Channel -> [ Channel decoder -> Source decoder ] -> f̂(x,y)
Source encoder
• reduces or eliminates any coding, interpixel, and psychovisual redundancies in the input image

Source encoding stages
• Mapper : reversible
• Quantizer : irreversible
• Symbol encoder : reversible

  f(x,y) -> Mapper -> Quantizer -> Symbol encoder -> Channel
Mapper
• transforms the input data into a format designed to reduce interpixel redundancies in the input image
• may or may not reduce the amount of data
• example : run-length coding
• transformed result : array of coefficients

Quantizer
• reduces the accuracy of the mapper's output
• reduces the psychovisual redundancy
• must be omitted when error-free compression is desired

Symbol Encoder
• reduces the coding redundancy
• creates a fixed- or variable-length code
Source Decoder
• performs the inverse operations of the source encoder
• an inverse quantizer block is not included in the general source decoder, since quantization is irreversible

Source Decoding stages
• Symbol decoder : inverse operations of the symbol encoder
• Inverse mapper : inverse operations of the mapper

  Channel -> Symbol decoder -> Inverse mapper -> f̂(x,y)
Error-free compression
Error-free compression
• means for data reduction
• reduces interpixel and coding redundancy

Application
• medical or business documents
• satellite imagery : cost of collecting the data
• digital radiography : for diagnostic accuracy
• normally provides compression ratios of 2 to 10

Error-free compression techniques
• Variable-length coding
• Arithmetic coding
• Bit-plane coding : Run-length coding
• Lossless predictive coding, etc.
Variable-length coding
• the simplest approach to error-free compression
• reduces coding redundancy only
• assigns the shortest possible code words to the most probable gray levels
• source symbols could be gray levels, pixel differences of an image, output of run-length encoding, etc.

Types of variable-length coding
• Huffman coding
• Truncated Huffman coding
• Shift coding
• Huffman shift coding, etc.
Huffman coding
• introduced in a 1952 paper by David Huffman
• variable-length codes can achieve a higher data density than fixed-length codes
• assigns short codes to the most frequently occurring data
• yields the smallest possible number of code symbols per source symbol
• the resulting code is optimal, with the constraint that the source symbols be coded one at a time

Huffman coding process
1. Order the probabilities of the symbols
2. Combine the lowest-probability symbols into a single symbol; repeat until a reduced source with two symbols is reached
3. Code each reduced source, starting with the smallest source
4. Work back to the original source
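The four steps map naturally onto a priority queue; the sketch below is my own (using Python's heapq) and reproduces the 2.7 bits/symbol average from the earlier example:

```python
import heapq

def huffman_codes(probs):
    """Build Huffman codes for a {symbol: probability} dict."""
    # Step 1: order the probabilities (the heap keeps them ordered).
    heap = [(p, i, {s: ""}) for i, (s, p) in enumerate(probs.items())]
    heapq.heapify(heap)
    tiebreak = len(heap)
    # Step 2: repeatedly combine the two lowest-probability entries.
    while len(heap) > 1:
        p1, _, c1 = heapq.heappop(heap)
        p2, _, c2 = heapq.heappop(heap)
        # Steps 3-4: prepend one bit while working back toward the full source.
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (p1 + p2, tiebreak, merged))
        tiebreak += 1
    return heap[0][2]

probs = {"r0": 0.19, "r1": 0.25, "r2": 0.21, "r3": 0.16,
         "r4": 0.08, "r5": 0.06, "r6": 0.03, "r7": 0.02}
codes = huffman_codes(probs)
avg = sum(len(codes[s]) * p for s, p in probs.items())
print(round(avg, 10))  # 2.7
```

Tie-breaking may produce different code trees, but every Huffman tree for this distribution yields the same minimal average length.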
Huffman coding
[Figure: source symbol reductions and code assignment procedure]
Prefix property
• no code word can be a prefix of another code word
  - e.g., if the code for the letter e is "01", then codes such as "010", "011", or "0100" cannot exist
  - i.e., the code "01" for e cannot be a prefix of another code
  - if "010" were assigned as the code for b, decoding "010" would be contradictory:
    => decode "01" as e, or
    => decode "010" as b

Another property
• bit operations are required
Advantages
• creates the optimal code for a given set of symbols and probabilities
• coding/decoding is accomplished with a simple lookup table
• block code : each source symbol is mapped into a fixed sequence of code symbols
• can be decoded without referencing succeeding symbols

Disadvantages
• encoding requires two nontrivial passes over the data (probability calculation, code table creation)
• one corrupted bit can wipe out the rest of the data (a disadvantage of variable-length coding)
• when a large number of symbols must be coded, constructing the optimal code is a nontrivial task
Other near optimal VLCs
[Figure: examples of other near-optimal variable-length codes]
Arithmetic coding
• nonblock code : a sequence of source symbols is assigned a single arithmetic code word
• the code word defines an interval of real numbers between 0 and 1
• as the number of source symbols increases, the interval used to represent the sequence becomes smaller
• any number within the final subinterval can be used to represent the sequence
• encoding requires two nontrivial passes over the data
  - probability calculation
  - code table creation
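The interval-narrowing idea can be sketched in a few lines; this is an illustrative floating-point version of my own (real coders use integer arithmetic and emit bits incrementally):

```python
def cum_intervals(probs):
    """Assign each symbol a subinterval of [0, 1) proportional to its probability."""
    cum, lo = {}, 0.0
    for s, p in probs.items():
        cum[s] = (lo, lo + p)
        lo += p
    return cum

def arith_encode(message, probs):
    """Narrow [0, 1) once per symbol; any number in the final interval codes the message."""
    cum = cum_intervals(probs)
    lo, hi = 0.0, 1.0
    for s in message:
        span = hi - lo
        s_lo, s_hi = cum[s]
        lo, hi = lo + span * s_lo, lo + span * s_hi
    return (lo + hi) / 2

def arith_decode(code, probs, n):
    """Recover n symbols by locating the code inside successive subintervals."""
    cum = cum_intervals(probs)
    lo, hi, out = 0.0, 1.0, []
    for _ in range(n):
        span = hi - lo
        for s, (s_lo, s_hi) in cum.items():
            if lo + span * s_lo <= code < lo + span * s_hi:
                out.append(s)
                lo, hi = lo + span * s_lo, lo + span * s_hi
                break
    return out

probs = {"a": 0.5, "b": 0.3, "c": 0.2}
msg = list("abcab")
code = arith_encode(msg, probs)
print(arith_decode(code, probs, len(msg)) == msg)  # True
```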
Arithmetic coding example
[Figure: arithmetic coding of a source symbol sequence by successive interval subdivision]
LZW (Lempel-Ziv-Welch) coding
• assigns fixed-length code words to variable-length sequences of source symbols
• for 8-bit images
  - the first 256 words of the dictionary are assigned to the gray values 0, 1, …, 255
  - image sequences are added to the next code words (39-39 => 256, 39-126 => 257, etc.)

Properties of LZW coding
• requires no a priori knowledge of the symbol probabilities
• the codebook is created while the data are being encoded
• repeated source sequences such as "the", ". " are compressed effectively
• has been integrated into the gif, tiff, pdf file formats, etc.
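A minimal LZW encoder (my own sketch) reproduces the 39-39 => 256 dictionary growth described above:

```python
def lzw_encode(seq):
    """LZW for 8-bit sources: the dictionary starts with the 256 single gray values."""
    dictionary = {(i,): i for i in range(256)}
    w, out = (), []
    for s in seq:
        ws = w + (s,)
        if ws in dictionary:
            w = ws                            # keep growing the current sequence
        else:
            out.append(dictionary[w])         # emit code for the longest known prefix
            dictionary[ws] = len(dictionary)  # add the new sequence as the next code word
            w = (s,)
    if w:
        out.append(dictionary[w])
    return out

# 39-39 becomes code 256, 39-126 becomes 257, and so on
print(lzw_encode([39, 39, 126, 126, 39, 39, 126, 126]))
# [39, 39, 126, 126, 256, 258]
```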
LZW coding example
[Figure: LZW dictionary construction while encoding an image sequence]
Bit-plane coding
• effective technique for reducing an image's interpixel redundancy
• processes the image's bit planes individually
• compresses each binary image via well-known binary compression methods
Bit-Plane Coding
Bit-plane decomposition
• small changes in gray level can generate many bit changes (e.g. 127(01111111) => 128(10000000))
• m-bit gray code => reduces the bit changes
  - successive code words differ in only one bit position
  - small changes in gray level are less likely to affect all m bit planes

  Decimal  Binary  Gray
  0        0000    0000
  1        0001    0001
  2        0010    0011
  3        0011    0010
  4        0100    0110
  5        0101    0111
  6        0110    0101
  7        0111    0100
  8        1000    1100
  9        1001    1101
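The binary-to-gray conversion is a single XOR; this sketch of mine reproduces the table above and the 127 => 128 example:

```python
def to_gray(b):
    """Convert a binary number to its reflected gray code."""
    return b ^ (b >> 1)

# Reproduces the table for decimals 0-9
print([format(to_gray(i), "04b") for i in range(10)])
# ['0000', '0001', '0011', '0010', '0110', '0111', '0101', '0100', '1100', '1101']

# The 127 -> 128 transition flips all 8 bits in binary, but only 1 bit in gray code
print(bin(to_gray(127) ^ to_gray(128)).count("1"))  # 1
```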
[Figure: original image with its binary-coded and gray-coded bit planes]
Run-length coding
• reduces interpixel redundancy
• a binary image compression method
• applied to each bit plane of a gray-level image
• describes successive runs of black and white pixels
• standard compression approach in FAX coding
• compression technique in the bmp file format
• applied to 1-, 2-, and 4-bit gray-level images
• additional compression can be realized by variable-length coding the run lengths themselves
  - input : AAAABBBBBCCCCCCCCDEEEE
  - output : 4A5B8C1D4E
  - compression ratio : 22/10 = 2.2
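The character-run example above can be reproduced with a few lines (a sketch of my own):

```python
def rle(s):
    """Run-length encode a string as count-symbol pairs."""
    out, i = [], 0
    while i < len(s):
        j = i
        while j < len(s) and s[j] == s[i]:
            j += 1                   # extend the current run
        out.append(f"{j - i}{s[i]}")
        i = j
    return "".join(out)

data = "AAAABBBBBCCCCCCCCDEEEE"
encoded = rle(data)
print(encoded)                   # 4A5B8C1D4E
print(len(data) / len(encoded))  # 2.2
```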
Run-length coding
Notes for RLC
• if the compression ratio is less than 1, send the original data
  - input : MyDogHasFleas
  - output : 1M1y1D1o1g1H1a1s1F1l1e1a1s
  - compression ratio : 13/26 = 0.5
• use a special prefix character (for example : +)
  - input : ABCDDDDDDDDEEEEEEEEE
  - output : ABC+8D+9E
  - compression ratio : 20/9 = 2.22
• it makes sense to encode only runs of 3 or longer
• if the special prefix character is found in the source : use a 3-byte notation (run of length 1 : + => +1+)
Lossless predictive coding
• reduces interpixel redundancy
• does not require decomposition of an image into bit planes
• extracts and codes only the new information in each pixel
• new information : the difference between the actual and predicted value of that pixel
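With the simplest predictor (the previous pixel), encode and decode look like this; the sample row is my own illustration:

```python
def predictive_encode(pixels):
    """Previous-pixel predictor: code only the prediction error e_n = f_n - f_{n-1}."""
    prev, errors = 0, []
    for p in pixels:
        errors.append(p - prev)
        prev = p
    return errors

def predictive_decode(errors):
    """Add each error back onto the running prediction: exact, lossless recovery."""
    prev, pixels = 0, []
    for e in errors:
        prev += e
        pixels.append(prev)
    return pixels

row = [154, 155, 155, 157, 160, 160, 159]     # a smooth scan line (illustrative)
errs = predictive_encode(row)
print(errs)                            # [154, 1, 0, 2, 3, 0, -1]
print(predictive_decode(errs) == row)  # True
```

The errors cluster near zero, which is exactly why the prediction-error histogram is so much narrower than the image histogram.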
Lossless predictive coding
[Figure: original image and prediction error image, with the histogram of each]
Lossy compression
Lossy compression
• compromises the accuracy of the reconstructed image in exchange for increased compression
• reduces interpixel, coding, and psychovisual redundancy

Application
• Digital TV : MPEG-2; video conferencing : H.261
• still image coding : JPEG
• normally provides compression ratios of 10 to 100

Lossy compression techniques
• Lossy predictive coding
• Transform coding
• Hierarchical coding
• Hybrid coding, wavelet coding, etc.
Lossy Predictive Coding
Lossy predictive coding
• predictive coding with a quantizer
• the predictions generated by the encoder and decoder are equivalent
• well-known form of lossy predictive coding
  - DM (Delta Modulation) - the simplest
• other lossy predictive coding techniques
  - ADM (Adaptive Delta Modulation)
  - DPCM (Differential Pulse Code Modulation)
  - ADPCM (Adaptive Differential Pulse Code Modulation)
• optimal predictor : DPCM
  - minimizes the encoder's mean-square prediction error
Lossy predictive coding
[Figure: lossy predictive coding model (encoder and decoder)]
DM (Delta Modulation)
• utilizes a 1-bit fixed-length code
• delta modulation process
  - evaluate the input against the present encoded level
  - allocate 1 when the input level exceeds the present encoded level
  - else allocate 0
• implementation
  - a positive/negative constant delta value is assigned to each code
  - code 1 : + delta
  - code 0 : - delta
• relatively smooth regions : granular noise appears
• rapidly changing regions : slope overload appears
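Both artifacts fall straight out of a toy implementation; the signals below are my own illustrations:

```python
def dm_encode(signal, delta, start=0.0):
    """1-bit code: emit 1 and step up by delta if the input exceeds the
    present encoded level, else emit 0 and step down by delta."""
    bits, level = [], start
    for s in signal:
        if s > level:
            bits.append(1)
            level += delta
        else:
            bits.append(0)
            level -= delta
    return bits

def dm_decode(bits, delta, start=0.0):
    level, out = start, []
    for b in bits:
        level += delta if b else -delta
        out.append(level)
    return out

# Flat input: the code oscillates around it (granular noise)
print(dm_encode([10.0] * 6, delta=2.0, start=10.0))  # [0, 1, 0, 1, 0, 1]

# Steep input with a small delta: the decoded level lags far behind (slope overload)
ramp = [0.0, 8.0, 16.0, 24.0]
print(dm_decode(dm_encode(ramp, delta=2.0), delta=2.0))  # [-2.0, 0.0, 2.0, 4.0]
```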
DM (Delta Modulation)
[Figure: delta modulation example showing granular noise and slope overload]
Problems of Delta Modulation
• big delta : granular noise is increased
• small delta : slope overload is increased

ADM (Adaptive Delta Modulation)
• adaptively adjusts the delta value
• relatively smooth regions
  - reduce the delta value
  - encodes small variations of the input signal
  - granular noise is reduced
• rapidly changing regions
  - increase the delta value
  - follows the input signal rapidly
  - slope overload is reduced
DPCM (Differential Pulse Code Modulation)
• minimizes the encoder's mean-square prediction error

  E{e_n²} = E{ [f_n - f̂_n]² }

• optimal predictor
• the prediction is constrained to a linear combination of the m previous pixels
Linear Prediction Example

  f̂(x,y) = 0.97 f(x, y-1)

  f̂(x,y) = 0.5 f(x, y-1) + 0.5 f(x-1, y)

  f̂(x,y) = 0.75 f(x, y-1) + 0.75 f(x-1, y) - 0.5 f(x-1, y-1)

  f̂(x,y) = 0.97 f(x, y-1)   if Δh ≤ Δv
            0.97 f(x-1, y)   otherwise

  where Δh = |f(x-1, y) - f(x-1, y-1)| and Δv = |f(x, y-1) - f(x-1, y-1)|
Quantization example
[Figure: quantizer function used in lossy predictive coding]
DPCM example
[Figure: DPCM results at 1, 1.25, 2, 2.125, 3, and 3.125 bits/pixel]
DPCM error example
[Figure: DPCM error images at 1, 1.25, 2, 2.125, 3, and 3.125 bits/pixel]
Compression with transform coding
• most information is located in the low-frequency region
• modifies the transform of an image
• maps the image into a set of transform coefficients, which are then quantized and coded
• a significant number of the coefficients have small magnitude and can be coarsely quantized with little image distortion

Transform coding techniques
• the image is divided into subimages of size n x n (8x8, 16x16)
• transform each block
  - pack as much information as possible into the smallest number of transform coefficients
• more coarsely quantize the coefficients that carry the least information
• final stage : the quantized coefficients are usually variable-length coded
Transform Coding
Typical transform coding system
[Figure: block diagram of a typical transform coding system]
Transform coding
• the transform kernel (basis functions) characterizes each transform coding technique
• types of transform coding
  - KLT (Karhunen-Loeve Transform)
  - DFT (Discrete Fourier Transform)
  - DCT (Discrete Cosine Transform)
  - WHT (Walsh-Hadamard Transform)
• transform pair

  F(u,v) = Σ_{x=0}^{N-1} Σ_{y=0}^{N-1} f(x,y) g(x,y,u,v)

  f(x,y) = Σ_{u=0}^{N-1} Σ_{v=0}^{N-1} F(u,v) h(x,y,u,v)

  where g(x,y,u,v) is the forward transformation kernel and h(x,y,u,v) is the inverse transformation kernel
Types of transform coding
• Discrete Fourier Transform (DFT)

  g(x,y,u,v) = e^{-j2π(ux+vy)/N},   h(x,y,u,v) = (1/N²) e^{j2π(ux+vy)/N}

• Walsh-Hadamard Transform (WHT)

  g(x,y,u,v) = h(x,y,u,v) = (1/N) (-1)^{Σ_{i=0}^{m-1} [b_i(x)p_i(u) + b_i(y)p_i(v)]},   N = 2^m

• Discrete Cosine Transform (DCT)

  g(x,y,u,v) = h(x,y,u,v) = α(u)α(v) cos[(2x+1)uπ / 2N] cos[(2y+1)vπ / 2N]

  where α(u) = √(1/N) for u = 0, and √(2/N) for u = 1, 2, …, N-1
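The DCT kernel above can be evaluated directly; this pure-Python sketch of mine verifies that forward and inverse transforms form a pair (the 2x2 test block is illustrative):

```python
import math

def alpha(u, n):
    """Normalization factor from the DCT kernel definition."""
    return math.sqrt(1.0 / n) if u == 0 else math.sqrt(2.0 / n)

def dct_kernel(x, y, u, v, n):
    """g(x,y,u,v) = h(x,y,u,v): the DCT kernel serves as both forward and inverse."""
    return (alpha(u, n) * alpha(v, n)
            * math.cos((2 * x + 1) * u * math.pi / (2 * n))
            * math.cos((2 * y + 1) * v * math.pi / (2 * n)))

def dct2(f):
    n = len(f)
    return [[sum(f[x][y] * dct_kernel(x, y, u, v, n)
                 for x in range(n) for y in range(n))
             for v in range(n)] for u in range(n)]

def idct2(F):
    n = len(F)
    return [[sum(F[u][v] * dct_kernel(x, y, u, v, n)
                 for u in range(n) for v in range(n))
             for y in range(n)] for x in range(n)]

f = [[52, 55], [61, 59]]
g = idct2(dct2(f))   # forward then inverse transform
print(all(abs(g[x][y] - f[x][y]) < 1e-9 for x in range(2) for y in range(2)))  # True
```

This direct evaluation is O(N⁴) per block; practical coders use fast separable DCT algorithms instead.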
Transform kernel example
[Figure: Walsh-Hadamard basis functions for N=4; discrete-cosine basis functions for N=4]
[Figure: approximations using (a) Fourier, (b) Hadamard, and (c) Cosine transforms, with a difference image for each; 8x8 blocks, compression ratio 2; rms errors - (a) 1.28, (b) 0.86, (c) 0.68]
RMS error comparison
• reconstruction error versus subimage size comparison
[Figure: reconstruction error versus subimage size]
Transform coding quantization
• quantization of transform coefficients

  F̂(u,v) = round[ F(u,v) / Z(u,v) ]
Typical normalization matrix : Z(u,v)
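The formula above is a one-liner in code; the 2x2 values below are my own illustration, with Z drawn from typical normalization-matrix entries:

```python
def quantize(F, Z):
    """F_hat(u,v) = round(F(u,v) / Z(u,v)); this is the irreversible step."""
    n = len(F)
    return [[round(F[u][v] / Z[u][v]) for v in range(n)] for u in range(n)]

def dequantize(F_hat, Z):
    """Decoder side: multiply back; the rounding error is not recoverable."""
    n = len(F_hat)
    return [[F_hat[u][v] * Z[u][v] for v in range(n)] for u in range(n)]

F = [[214, 49], [26, 13]]   # illustrative coefficient values
Z = [[16, 11], [14, 12]]    # illustrative normalization values
print(quantize(F, Z))                 # [[13, 4], [2, 1]]
print(dequantize(quantize(F, Z), Z))  # [[208, 44], [28, 12]] - close, not exact
```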
Transform coding quantization
• DCT example

original 8x8 image block :

  168 161 161 150 154 168 164 154
  171 154 161 150 157 171 150 164
  171 168 147 164 164 161 143 154
  164 171 154 161 157 157 147 132
  161 161 157 154 143 161 154 132
  164 161 161 154 150 157 154 140
  161 168 157 154 161 140 140 132
  154 161 157 150 140 132 136 128

[Figure: DCT coefficients of the block; the upper-left coefficient is the DC component, the remaining coefficients are the AC components]
Quantization Example
• the DCT coefficient matrix is divided element-by-element by the quantization matrix W and rounded

quantization matrix W :

  16  11  10  16  24  40  51  61
  12  12  14  19  26  58  60  55
  14  13  16  24  40  57  69  56
  14  17  22  29  51  87  80  62
  18  22  37  56  68 109 103  77
  24  35  55  64  81 104 113  92
  49  64  78  87 103 121 120 101
  72  92  95  98 112 100 103  99

quantized coefficients :

  13  4  0  1  0  0  0  0
   3 -2  1  1  0  0  0  0
   0  0  1  0  0  0  0  0
   1 -1  0  0  0  0  0  0
  -1  0  0  0  0  0  0  0
   0  0  0  0  0  0  0  0
   0  0  0  0  0  0  0  0
   0  0  0  0  0  0  0  0
Video Compression Technique
Typical compression technique with DPCM, DCT and VLC
[Figure: block diagram combining DPCM, DCT, and variable-length coding]
Zig-Zag scan / Alternate scan
[Figure: zig-zag and alternate scan patterns]

quantized coefficients :

  13  4  0  1  0  0  0  0
   3 -2  1  1  0  0  0  0
   0  0  1  0  0  0  0  0
   1 -1  0  0  0  0  0  0
  -1  0  0  0  0  0  0  0
   0  0  0  0  0  0  0  0
   0  0  0  0  0  0  0  0
   0  0  0  0  0  0  0  0

Zig-Zag scan result :

  13 4 3 0 -2 0 1 1 0 1 -1 -1 1 1 0 0 0 … 0

Scan
• AC values are strung together in a sequence
• this zig-zag ordering groups the low-frequency coefficients together
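The zig-zag traversal just walks the anti-diagonals, alternating direction; this sketch of mine reproduces the sequence from the quantized block in the example:

```python
def zigzag_order(n):
    """Visit an n x n block along anti-diagonals, alternating direction."""
    order = []
    for s in range(2 * n - 1):
        diag = [(i, s - i) for i in range(n) if 0 <= s - i < n]
        if s % 2 == 0:
            diag.reverse()   # even diagonals run bottom-left to top-right
        order.extend(diag)
    return order

block = [
    [13,  4,  0,  1, 0, 0, 0, 0],
    [ 3, -2,  1,  1, 0, 0, 0, 0],
    [ 0,  0,  1,  0, 0, 0, 0, 0],
    [ 1, -1,  0,  0, 0, 0, 0, 0],
    [-1,  0,  0,  0, 0, 0, 0, 0],
    [ 0,  0,  0,  0, 0, 0, 0, 0],
    [ 0,  0,  0,  0, 0, 0, 0, 0],
    [ 0,  0,  0,  0, 0, 0, 0, 0],
]
seq = [block[i][j] for i, j in zigzag_order(8)]
print(seq[:14])  # [13, 4, 3, 0, -2, 0, 1, 1, 0, 1, -1, -1, 1, 1]
```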
Run-Length Coding
• represents the zig-zag sequence as pairs of (number of preceding zeros, nonzero value)
• example :
  [ 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 3 1 0 0 0 0 2 ]
  => (5, 1) (11, 3) (0, 1) (4, 2)

Huffman Coding
• assigns fewer bits to frequently occurring values
• lossless coding
• example :

  Symbol   Probability   Code
  A        0.5           0
  B        0.3           10
  C        0.2           11
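The (zeros, value) pairing can be reproduced with a short loop (my own sketch, using the sequence from the example):

```python
def run_length_pairs(seq):
    """Encode a zig-zag sequence as (zero-run length, nonzero value) pairs."""
    pairs, zeros = [], 0
    for v in seq:
        if v == 0:
            zeros += 1       # extend the current run of zeros
        else:
            pairs.append((zeros, v))
            zeros = 0
    return pairs             # a trailing run of zeros is handled by an end-of-block marker in practice

seq = [0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 3, 1, 0, 0, 0, 0, 2]
print(run_length_pairs(seq))  # [(5, 1), (11, 3), (0, 1), (4, 2)]
```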
Video Compression Process

[Figure: end-to-end example on a 4x4 block - the original image (or the difference between the original and predicted images) is DCT-transformed, quantized by dividing by 16, zig-zag scanned, run-length coded, and Huffman coded into the final bitstream]