102
Audio Coding S Audio Coding S tandards tandards MPEG MPEG 及及及及及及及及及及及 及及及及 及及及及及及及及及及及 及及及及 包包 包包 MPEG-1 Layer I, II & III MPEG-1 Layer I, II & III MPEG-2 AAC MPEG-2 AAC 及及 及及 及及及及及 及及及及及 Dolby AC-3 Dolby AC-3 MPEG-4 MPEG-4 Natural Audio Natural Audio 及及 及及 MPEG MPEG 及及及及及及及及及及及及 及及 AVS (Audio Video Standard) AVS (Audio Video Standard) 及及及及 及及及及 §2.3 包包包 包包包包包包 包包包 包包包包包包 包包 包包 及及及及 及及及及

Audio Coding Standards

Embed Size (px)

DESCRIPTION

Audio Coding Standards. §2.3. 主要介绍. MPEG 及其相关的音频编码标准、算法原理 包括 : MPEG-1 Layer I, II & III , MPEG-2 AAC 以及与之相关的 Dolby AC-3 , MPEG-4 的 Natural Audio ,未来 MPEG 发展方向等。 国家 AVS (Audio Video Standard) 发展计划. 第二章 音频信息处理 ▪ 标准. Contents. 2.3.1 Overview of Audio Coding Standards - PowerPoint PPT Presentation

Citation preview

  • Audio Coding StandardsMPEGMPEG-1 Layer I, II & IIIMPEG-2 AACDolby AC-3MPEG-4Natural AudioMPEGAVS (Audio Video Standard)2.3

  • Contents 2.3.1 Overview of Audio Coding Standards 2.3.2 ITU-T Audio Coding Recommendations 2.3.3 Perceptual Audio Coding Basic 2.3.4 MPEG Audio Coding Systems 2.3.5 Dolby Audio Coding 2.3.6 China Audio Coding Initiatives 2.3.7 Next Step of Audio CodingReferences

  • ReferencesBasics about MPEG Perceptual Audio Codinghttp://www.iis.fraunhofer.de/amm/techinf/basics.html

    MPEG _ Chinahttp://www.mpegchina.com.cn/index.htm

    MPEG-4 Industry Forumhttp://www.m4if.org/

    MPEG.ORGhttp://www.mpeg.org/MPEG/index.html

  • 1. An Overview of Audio & Sound Coding Standards From Source Coding to Perceptual Coding Audio / Sound Coding Technologies Multimedia Communication Multimedia Framework

  • Audio Coding OverviewFrom source coding to perceptual codingPsychoacoustic ModelLow data rate, Hi-FiFrom signal to contentStructured Audio & Audio RetrievingFrom local application to global accessMPEG 21 perspectiveMultimedia FrameworkFrom stereo to surrounding multi-channelDolby AC-3 5.1 system & more

  • International ORG. / COM.International Telecommunication Unionhttp://www.itu.int/home/CCITTthe International Consultative Committee on Telephony and TelegraphyInternational Organization for Standardizationhttp://www.iso.org/International Electrotechnical Commissionhttp://www.iec.org/MPEGMoving Picture Experts Group http://mpeg.telecomitalialab.com/

    Dolby Laboratories, Inc.http://www.dolby.com/ http://www.dolby.com.cn/

  • 2. ITU Recommendations G.711-PCM G.721-32kbit/s G.722-64kbit/s(7KHz) CELP-16kbit/s ITU

  • Chronicle1972G.711 64kb/s A PCM 1984G.721 32kb/s ADPCM G.722 64kb/s ADPCM G.723.1 5.3kb/s6.3kb/s LSF G.726 16kb/s1990G.727 16-40kb/s ADPCM1992G.728 / G.729 16kb/s LD-CELP1988RPE-LTP 13kb/sGSM1989VSELP 6.7kb/s

  • ITU Recommendations

    PCM64kb/sG.711ISDN4.04.5(A)(A)APCMDPCMADPCM32kb/sG.72 1SB-ADPCM64kb/sG.722G.726G.727LPC2.4kb/s2.53.5

  • ITU RecommendationsG.711PCM8000Hz8AA 13PCMA14PCM8G.721198664kbit/sAPCM32kbit/s ADPCMPCM

  • ITU RecommendationsADPCM/ APCMPCMPCM415(15)04 PCMAPCM(synchronous coding adjustment)()

  • 3. Preliminary for Perceptual Audio Coding(Psychoacoustic Model)(Perceptual Sub-band Coding)Dolby AC-3/MPEG Audio Coding

  • Some Conceptions( dyn/cm2 )( W/cm2 )(dB)10-16 W/cm2 = 0 (dB)( phon ) ( sone ) = 0 () ( Hz ) Mel ()Mel = 1000 Log2(1+)

  • Perceptual Audio Coding (1)HzdB 1 kHz120dB()1202 kHz4 kHz

  • Perceptual Audio Coding (2) 20 Hz18000 Hz40 dB

  • Perceptual Audio Coding (3)(masking tone)(masked tone)

    (Frequency Domain Masking)(Simultaneous Masking)

    (Time Domain Masking)

  • Perceptual Audio Coding (3 cont.)

  • Perceptual Audio Coding (3 cont.)250 Hz1 kHz4 kHz8 kHz = 0.25, 1, 4 kHz

  • Perceptual Audio Coding (3 cont.)(Critical Band)20 Hz16 kHz24

    Bark ()1 Bark = () < 500 Hz, 1 Bark /100 () > 500 Hz, 1Bark 9 + 4log( /1000)

  • Perceptual Audio Coding (3 cont.)

    (pre-masking)(post-masking)

    520 ms50200 ms

  • Perceptual Audio Coding (3 cont.)Audio Masking

  • 4. MPEG Audio MPEG-1 Audio : Layer I, II &III MPEG-2 Audio : Back Compatible (BC)AAC : Advanced Audio Coding (Non BC) MPEG-4 Audio : Overview MPEG 21 : The Next Step of MPEGMoving Picture Experts Group Audio Coding Standards

  • About MPEGISOIEC WG111986MPEGMPEGMPEG MPEG-11992MPEG-2MPEG-319927(High-Definition TVHDTV)MPEG-4(1999)MPEG-5MPEG-6MPEG-7()

  • Chronicle of MPEG StandardsMPEG Audio Coding Standards199308 MPEG-1 ISO/IEC 11172199408 MPEG-2 ISO/IEC 13818199901 MPEG-4 ISO/IEC 14496 V1.0199912 MPEG-4 ISO/IEC 14496 V2.0199810 MPEG-7 200107 200109 200003 MPEG-21

  • Some ExplanationsMPEG-1 ISO/IEC 11172 MPEG-2 ISO/IEC 13818MPEG-4 ISO/IEC 14496 V1.0MPEG-4 ISO/IEC 14496 V2.0(video object)MPEG-7

  • Some Explanations (cont.)MPEG Audio(2 kHz5 kHz)()

  • Prices Aspects MPEG-LAMPEGMPEG-LAMPEGMPEG-LAWhyPrice ?MPEG 4

  • 4. MPEG Audio MPEG-1 Audio : Layer I, II &III MPEG-2 Audio : Back Compatible MPEG-2 AAC : Advanced Audio Coding MPEG-4 Audio : OverviewMoving Picture Experts Group Audio Coding Standards

  • 4. MPEG Audio MPEG-1 Audio : Layer I, II &III MPEG-2 Audio : Back CompatibleAAC : Advanced Audio Coding MPEG-4 Audio : OverviewMoving Picture Experts Group Audio Coding Standards

  • 4. MPEG Audio MPEG-1 Audio : Layer I, II &III MPEG-2 Audio : Back CompatibleMPEG-2 AAC : Advanced Audio Coding MPEG-4 Audio : OverviewMoving Picture Experts Group Audio Coding Standards

  • 4. MPEG Audio MPEG-1 Audio : Layer I, II &III MPEG-2 Audio :Back CompatibleAAC : Advanced Audio Coding MPEG-4 Audio : OverviewMoving Picture Experts Group Audio Coding Standards

  • MPEG-1 AudioAudio Coding Algorithms(sub-band codingSBC)SBCMPEGMPEG48 kHz16256 kb/s61 (40 kHz44.1 kHz)

  • MPEG-1 Audio (cont.)MPEG Audio ISO/IEC 11172-3

  • MPEG-1 Audio (cont.)/

    Layer I, II & III

  • MPEG-1 Audio (cont.)CD*MUSICAM ( Masking pattern adapted Universal Sub-band Integrated Coding And Multiplexing ) **ASPEC ( Adaptive Spectral Perceptual Entropy Coding of high quality musical signal ) ()

  • MPEG-1 Audio (cont.)MPEG /

    CRC MPEG

  • MPEG-1 Audio (cont.)(Layer III)

  • MPEG-1 Audio (cont.)-()(masking threshold)

    (signal-to-mask ratioSMR)

    (frame)

  • MPEG-1 Audio (cont.)32

  • MPEG-1 Audio (cont.)MPEG31231SBCSBCSBC (frame)Layer I 384 3212Layer II Layer III 1152

  • MPEG-1 Audio (cont.)

  • MPEG-1 Audio (cont.)Layer I1-DCT (discrete cosine transform)SMR 6(scale factor)SMR(bit allocation)

  • MPEG-1 Audio (cont.)Layer I(12) MUX

    32

    CRC16

    4

    6

  • MPEG-1 Audio (cont.)Layer II13111521 23 1(12)

  • MPEG-1 Audio (cont.)Layer III (Huffman)ASPEC (Audio Spectral Perceptual Entropy Encoding)OCF (Optimal Coding In The Frequency domain)12123(modified discrete cosine transformMDCT)12MDCT3

  • Layer IIIISO/MPEG Audio Layer III Coder / Decoder

  • MPEG-1 Audio (cont.)MPEG3

  • MPEG-2 Audio OverviewMPEG

    MPEG-2 AudioMPEG-2 MultichannelMPEG-1 AudioMPEG-2 BC (Backward Compatible)MPEG-2 AAC (Advanced Audio Coding)MPEG-1MPEG-2 NBC (Non-Backward Compatible)

    MPEG-2 Audio : BC

  • MPEG-2 BCISO/IEC 13818-3MPEG-2 BCMPEG-1 Audio (ISO/IEC 1117-3)-1, -2-316 kHz, 22.05 kHz24 kHz32384 kb/s8640 kb/s5.17.1Linear PCM(PCM)Dolby AC-3(Audio Code Number 3)

  • MPEG-2 BCMPEG-2 BC

  • MPEG-2 BC Multichannel5.13/2-LFE.1LFE3()2LFE (low frequency effects3Hz~120Hz) 7.15.1 5.1

  • MPEG-2 BC ISO/IEC 13818-3

  • MPEG AAC MPEG-2 AAC MPEG-2 AAC MPEG-2 AAC

  • MPEG-2 AACMPEG-2 AACMPEG-2MPEG-2 AACAAC8 kHz96 kHzAACAAC4816LFE (low frequency effects)16(overdub channel)(multilingual channel)16MPEG-2 AAC11:1(44.116 )/11=64 kb/s5320 kb/sMPEG2MPEG-2 AAC1MPEG370

  • MPEG-2 AAC MPEG-2 AACMPEG AudioAACAAC(advanced audio coding tools)(modular)(tool)AAC

  • (Main Profile)(Gain Control)AACAAC

  • (Low Complexity Profile)(temporal noise shapingTNS)(Scalable Sampling Rate Profile)TNS

  • MPEG-2 AAC (Gain control)PQF (polyphase quadrature filter)(gain detector)(gain modifier)4PQF(Filter Bank)MPEG-2 AACMDCTTDAC(time domain aliasing cancellation)

  • MDCTKBD (Kaiser-Bessel derived)(sine)MDCT

    MDCT

    n N i =

  • TNSTNSTNS(joint stereo coding)MPEG-2 AACM/S(Mid/Side encoding)/(Intensity /Coupling)M/SM/S(matrixed stereo coding)M/SM(middle)S(side)M/S-(sum-difference coding)/(intensity stereo coding)(channel coupling coding)(irrelevance)

  • (Prediction)(stationary)(Quantizer)(Noiseless coding)

  • MPEG-2 AAC

  • MPEG-4 AudioMPEG-4 Audio(parametric coding)(code excited linear predictiveCELP)/T / F (time / frequency)SA (structured audio)-TTS (text-to-speech)

  • MPEG-42 kb/s64 kb/s(natural audio)MPEG-48 kHz(speech)24 kb/s8 kHz16 kHz(audio)416 kb/s CELPCELP(code excited linear predictive)624 kb/s8 kHz16 kHzT/F-(time-to-frequencyT/F)(vector quantizationVQ)16 kb/s8 kHz

  • MPEG-4 Audio* UMTS (universal mobile telecommunication system)

  • MPEG-4TTSMIDI-Text-to-Speech

  • 5. Dolby Audio CodingOverview of Dolby Audio Coding SystemAC-1 : Dolbys first digital coding systemAC-2 : 2-channel stereo systemDolby AC-3 : Multi-channel Digital Audio Compression System

  • Brief HistoryAC-1 (1987)Dolbys first digital coding systemSimple delta modulation based coding techniques4-2-4 multi-channel system, 2-1 bit-rate reductionAC-2 (1989)TDAC (Time Domain Aliasing Cancellation) Filter Bank based on MDCT/MDST2-channel stereo systemBit allocation based on Psychoacoustic ModelAC-2a : pre-echo control by block size adaptation

  • Brief HistoryAC-3 (1991~)TDAC Filter Bank based on MDCT5.1 multi-channel (320 kb/s) digital audioUSA HDTV Digital Audio Coding StandardFirst cinema demonstration : Star Trek VIChannel coupling techniques is applied to reduce bit-rate at high frequencies

  • AC-3 IntroductionInput Audio : 1 ~ 5.1 channels of source0.1 channel : low frequency (Subwoofer) signalsampling rate : 32 kHz, 44.1 kHz, 48kHzwindowing : 50% overlap/add Fielder windowBit Rate : 32 kb/s ~ 640 kb/sbandwidth reduction factor : 13.5uncompressed PCM sample : 6 channel * 48 kHz * 18 bits = 5.184 Mb/sstandard bit rate : 384 kb/s

  • AC-3 FeaturesAC-3 Encoder

  • AC-3 FeaturesAC-3 Decoder

  • AC-3 FeaturesBit-stream Syntax1 Frame represents 1536 PCM samples for all channels

    1 Block represents 256 PCM for each channelSI=Sync. Info BSI=Bit-stream Info CRC for error correction Aux Data for private control

  • AC-3 FeaturesThe AC-3 Multi-channel CoderThe Conception of Multi-channel

  • 6. Audio Coding Initiatives in China An Overview of Chinese AVS Project Audio Coding Quality Assessment Methods

  • China AVS ProjectAVS : audio video coding standard ()2002 621 Official Homepage : http://www.avs.org.cn

  • 7. Next Step of Audio Coding Standardization MPEG 21:(Multimedia Framework)

  • Next StepMPEG-7MPEG-21MPEG62MPEG28JPEG2002102125

  • & MIDI MIDI FM, Wavetable MIDI MIDISMF/XMF/GM2.4

  • MIDIMusical Instrument Digital Interface (MIDI) (music synthesizers)(musical instruments) MIDI()MIDI MIDIMIDIMIDI

  • MIDI(cont.) () WAVEWAVEMIDI

  • WAVEMIDI WAVEMIDI

    MIDIWAVE MIDI MIDIMic CD 5KB3.6MB 5242

  • MIDI(cont.)MIDI frequency modulation (FM) (Wavetable) MIDI

  • FMFM,, ()

  • FM(cont.)(Yamaha OPL-III)

    FM1314ROMROMFM

  • FM 44.1 kHz16CD-DAROM

  • (cont.) ADSRFM

  • MIDIMIDIMIDIMIDIMIDIMIDI(local control)

  • MIDI(cont.)MIDI(bit stream)31.25 kbps10(181)(MIDI controller)(MIDI sequencer) MIDI MIDI3MIDIIN, OUTTHRU()MIDIMIDI OUTMIDI(MIDI sound generator)(MIDI sound module)IN(MIDI messages)

  • MIDI(cont.)Yamaha MIDI InstrumentsSimple for LaptopWith TX81Z Synthesizer Module

  • MIDI(cont.)MIDIMIDI(MIDI channel)16

  • MIDI(cont.)MIDI

  • MIDI(cont.)PCMIDIMIDIPCMIDIMIDIIN

    MPC(Multimedia PC)(muti-timbral)(polyphonic)voicespatches()()(note)MPC(Base-level synthesizer)(Extended synthesizer)

  • MIDI(cont.) 33()63998

  • MIDIMIDI82(All MIDI status byte and data byte values are in hexadecimal )21n 0~F

    Status ByteData BytesMessageBn 78 00 All Sound Off

  • MIDI(cont.) MIDI

  • MIDI MIDI MIDIMIDI note on MIDI note off MIDIMIDInote on MIDIMIDIMIDI(time-stamping)

  • MIDI SMF (Standard MIDI File) / XMF (eXtensible Music Format) MIDI(International MIDI Association)MIDI(Standard MIDI Files)MIDIMIDI MIDI(Standard MIDI File)3MIDIMIDIMIDI(tracks)MIDI0 (Format 0)MIDI(MIDI sequence data)MIDI1 (Format 1)MIDI2 (Format 2)

  • MIDIMIDI(International MIDI Association) MIDI(General MIDI Specification) MIDI(General MIDI Instruments)MIDI(General MIDI Sound Set)(patch map)MIDI(General MIDI Percussion Set)MIDI(General MIDI Performance)MIDI MIDIMIDI19111610

  • http://www.midi.org/http://crystal.apana.org.au/~ghansper/midi_introduction/

    2MIDIMIDI