22
CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS MPEG Pauli Jutila Cristina Petre

CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

  • Upload
    eros

  • View
    79

  • Download
    0

Embed Size (px)

DESCRIPTION

CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS. MPEG Pauli Jutila Cristina Petre. Contents. 1. Introduction to MPEG 2. MPEG Family 3. Intra Frame Coding Techniques 4. Non-Intra Frame Coding Techniques - PowerPoint PPT Presentation

Citation preview

Page 1: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

CT20A6100MACHINE VISION AND DIGITAL

IMAGE ANALYSIS

MPEG

Pauli Jutila

Cristina Petre

Page 2: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Contents

1. Introduction to MPEG 2. MPEG Family3. Intra Frame Coding Techniques4. Non-Intra Frame Coding Techniques5.Summary

Page 3: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Why do we need video compression?

Example video compression- 1920x1080 pixels 30 frames/sec 8 bits/color => 1.5Gb/sec

- 6MHz channel bandwidth => 19.2Mb/sec

=> compression 83:1

Page 4: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

The MPEG Family

Page 5: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG-1

● Basic form using motion compensated block-based transform coding techniques

● Optimized to work for 352x240 at 30frames/sec

or 352x288 at 25frames/sec ,possible to go up

to 4095x4095

● Defined for progressive frames only

● No direct provision for interlaced video applications

Page 6: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG-2

● Designed for digital television broadcasting

● Potentially high quality video (4-9Mb/sec)

● Profiles and levels :

profile - bitstream scalability, colorspace resolution

level - image resolution, maximum bit-rate/profile

● MP@ML : 720x480 at 30 frames/sec

● MP@HL : 1920x1080 at 30 frame/sec

Page 7: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG-4

● Interactive content

● Object-oriented composite files

● Image blocks are arbitrary shaped

● Multiplexes and synchronizes data

● Divided into a number of parts-compatibility issue

Page 8: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Coding Techniques

Page 9: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Intra Frame Coding

Video Filter DCT Quantizer Run-Lenght VLC

Bit-StreamBuffer

Bit-Rate Control

Page 10: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Video Filter

• The human eye is less sensitive to chrominance than luminance.

• RGB YCbCr• Choice of sub-sampling ratios:

• 4:4:4 (Professional, Post-Production)• 4:2:2 (same as above)• 4:1:1 (DV Camcorder) • 4:2:0 (Web, DVD)

Page 11: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Discrete Cosine Transform

• Nearly optimal in energy consentration and decorrelating

• The human eye is less sensitive to changes in higher frequencies

• Calculated in blocks, usually 8x8

Page 12: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

DCT Coefficient Quantization

• 12-bit coefficient is divided by a corresponding value from intra quantization matrix

• Each value in the resultant matrix is pre-scaled by multiplying by a single value, known as the quantizer scale code (1-112)

• The goal is to force as many of the DCT coefficients to zero or near zero

Page 13: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Variable Length Coding

• Zigzag-scanning long runs of consecutive zeros

• Alternate pattern in MPEG-2 (for interlaced video)• Huffman coding

Page 14: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Coding Techniques

Page 15: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

I,P,B Frames

I-pictures: intra-coded no motion-compensated prediction used as reference picture

P-picture: inter-coded forward predicted (from I or P frame)

B-picture: inter-coded bi-directional prediction high coding efficiency

Page 16: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

I,P,B Frames

Page 17: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Motion Estimation

Page 18: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Matching

Page 19: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Prediction Error Compensation

Page 20: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Non-Intra Decoding

Page 21: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Summary

● Video compression is needed!

● MPEG enables a wide range of applications

● Different standards but same principles

● Spatial and temporal processing

Page 22: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

References

● Barry G. Haskell, Atul Puri, Arun N. Netravali, Digital Video: An Introduction to MPEG-2, Chapman and Hall, 1997.

● Joan L. Mitchell, William B. Pennebaker, Chad E. Fogg, Didier J. LeGall, MPEG Video Compression Standard, Chapman and Hall, 1997.

● John Wiseman, An Introduction to MPEG Video Compression, 1998

● Marius Preda, Les standards MPEG dans l'espace multimedia, 2005

● E. G. Richardson, Video Codec Design, Wiley, 2002● Wikipedia