3 Multimedia for Linux IVI

  • Upload
    clux

  • View
    218

  • Download
    0

Embed Size (px)

Citation preview

  • 7/30/2019 3 Multimedia for Linux IVI

    1/7

    Executive Summary

    Intelligent in-vehicle systems increasingly offer end users rich

    multimedia experiences as part of their core functionalit y.

    To compete in a dynamic marketplace, automotive original

    equipment manufacturers (OEMs) and integrators must provide

    digital audio and video, animated 2D and 3D maps and s treet

    views, and speech-based interfaces in their next-generation

    in-vehicle systems.

    Linux, open source sof tware (OSS), and proprietary/commercial

    software hos ted on Linux offer automotive OEMs and suppliers

    a wide range of multimedia technologies, toolkits, media

    players, and plug-ins. This paper provides an overview of

    available multimedia software resources for automotive

    applications that include audio, video, speech, and digital

    rights management (DRM). When possible, it also supplies

    heuristics for making design and deployment decisions based

    on project maturit y and licensing information, for example.

    Core Automotive Multimedia Requirements

    To support real-world automotive applications, Linux-hosted

    multimedia technology or automotive applications needs to

    deliver certain key capabilities:

    Supportforpopular,industry-standardmediaformats

    and fle types

    Ready-to-useaudio/videocodecsandplayers

    ScreenresolutionsfromVGA(videographicsarray)

    (640x480) upward through high defnition (HD) resolutions

    Performanceandloadcharacteris ticsmatchedtoavailable

    automotive embedded hardware

    CPUsupportforautomotiveprocessorarchitec tures:ARM,

    Hitachi SH, and Intel

    Speechoutputandrecognition

    Challenges to Meeting Multimedia Requirements

    with Open Source Software

    Untilrecently,manyOEMsfounditextremelydifficultto

    integrate a useul and comprehensive set o multimedia

    components or use in devices based on embedded Linux.

    Despite broad embedded and desktop deployment and a

    worldwide community o Linux developers and end users,

    eorts to accommodate the gamut o media types, codecs,

    ormats, DRM regimes, intellectual property restrictions, and

    licenses stymied the deployment o ull-eatured Linux-based

    media devices. These challenges include the ollowing:

    Highlyproprietary,nonstandardnatureofmanyaudioand

    video codecs

    VariationinmaturityofOSSprojectsimplementing

    multimedia components

    Conformancetotheparticularsofcopyrightlegislation(in

    particulartheDigitalMillenniumCopyrightAct,orDMCA)

    and patent law in choosing and integrating OSScomponents or multimedia

    This paper, in surveying available multimedia technologies

    or embedded Linux, reers to relevant inormation regarding

    maturity, licensing, and intellectual property status o the

    sotware in question.

    Table of Contents

    Executive Summary ............................................................ 1

    Core Automotive Multimedia Requirements ...................... 1

    Challenges to Meeting Multimedia Requirements

    with Open Source Software .............................................1

    Multimedia Infrastructure: Frameworks, Engines, Etc. ......2

    Open Source Media Formats .............................................4

    Media Players ..................................................................... 4

    Speech Technology ............................................................. 5

    Voice Recognition ...... ...... ..... ...... ...... ...... ..... ...... ...... ...... ..... 6

    Digital Rights Management ................................................ 7

    Conclusion .......................................................................... 7

    Multimedia Resources in Linux for Automotive

    Use Cases and Applications

    The primary use cases or multimedia technology in

    Linux-based automotive applications include the

    ollowing:

    Vehicledashboardandseatbackdisplayswith

    medium to high resolution Navigationsystemswith2Dand3Dmapdisplays,

    audio output, text-to-speech, etc.

    Entertainmentsystemsforlocalmediaplaybackand

    streaming

    Operationalfeedback(e.g.,vehiclereversevideo

    camera)

  • 7/30/2019 3 Multimedia for Linux IVI

    2/7

    2 | Multimedia Resources in Linux for Automotive

    Multimedia Infrastructure: Frameworks, Engines, Etc.

    AswithmanyotherLinux-basedtechnologies,thereexist

    multiple, sometimes competing OSS projects to implement

    multimedia unctionality as well as versions o cross-platorm

    commercial technology targeted and optimized or Linux-

    hosted execution.

    ALSA

    ALSA,theAdvancedLinuxSoundArchitecture,provides

    standard audio and MIDI services to the Linux operating

    systemandapplicationsrunningonit.ALSAsupportsa

    rangeofaudiointerfaces,fromPCsoundcardsto

    professional-qualitymultichannelaudio.ALSAoffers

    eatures advantageous to in-vehicle inotainment use:

    Dynamicallyloadable,modularaudiodevicedrivers

    Nativesupportforsymmetricmultiprocessing(SMP)

    and thread saety

    alsa-liblibrary,withuser-spaceAPIstosimplify

    application programming SupportforlegacyopensoundsystemAPIs,withbinary

    compatibility or most Open Sound System programs

    ALSAispartoftheLinuxkernelandiscoveredbyGPLv2and

    theLinuxCOPYINGfile.alsa-libisdistributedundertheGNU

    LesserGeneralPublicLicense(LGPL),easingOEMintegration.

    Learn more at http://www.alsa-project.org.

    GStreamer

    GStreamerisamultimediaframeworkservingarangeof

    applications, including media players, streaming media

    broadcasters,andvideoeditors.GStreameriswidely

    deployed in both desktop and embedded settings and has

    been adopted by mobile and embedded Linux platorms,

    including Maemo, Moblin, and LiMo.

    GStreamerconnectsmediaprocessingelementsintoa

    pipeline, where each element is provided by a plug-in.

    Elements communicate by means o pads (like solder points

    onsilicon).Asourcepadononeelementcanbeconnected

    to a sink pad on another. When the pipeline is in the playing

    state, data buers low rom the source pad to the sink pad.

    Padsnegotiatethekindofdatathatwillbesentusing

    abstractions or deining capabilities.

    GStreamerletsdevelopersintegratecodecsandfilters

    through plug-ins with generic interaces. By decoupling

    codecs rom the ramework architecture, multimedia

    applications can take advantage o advances in codec and

    filtertechnologyasynchronouslyfromGStreamer

    development.

    GStreamerisreleasedundertheLGPL,facilitatingintegrationand

    commercial deployment with minimal reciprocal disclosure. Learn

    moreaboutGStreamerathttp://gstreamer.freedesktop.org/.

    MP3Codec

    DIVXCodec

    OGGCodec

    DVDCodec

    AudioEffects

    AudioPlayer

    VideoPlayer

    RippingTool

    VideoEditor

    StreamingServer

    Figure 1: GStreamer plug-in and application architecture

  • 7/30/2019 3 Multimedia for Linux IVI

    3/7

    3 | Multimedia Resources in Linux for Automotive

    Helix DNA Client

    HelixDNAClientisauniversalopensourcedigitalmedia

    playback engine, supporting playback of any format and

    codecacrossarangeofdevicetypes.HelixDNAprovides

    underlying multimedia functionality to a range of applications

    requiring local playback and streaming of multiple formats in

    disparate environments.

    HelixDNAClientunderliesthedigitalmediaplatformfor

    RealNetworksRealPlayeronWindows,Apple,andLinuxas

    well as the Rhapsody player.

    HelixDNAandcodecsourcecodeisavailableforreference

    and research and development under Open Source Initiative

    (OSI)-approvedlicenses(GPLv2andRealNetworksPublic

    Source License) or via commercial licenses or deployment

    indevices.LearnmoreaboutHelixDNAClientat

    https://helix-client.helixcommunity.org/.

    Adobe Flash Lite

    AdobeFlashLitesoftwareprovidesarun-timeenginefor

    mobile and consumer electronics devices with substantial

    interoperability beneits or device manuacturers and

    contentdevelopers.AutomotiveOEMsandintegrators

    canbuildonthepopularityandubiquityofAdobeFlash

    content to provide end users with web browsing, video,

    andinteractivecontentcomparabletousersdesktop

    experiences, in the car, on the go.

    AdobeFlashLiteoffersthefollowingtoLinux-based

    automotive systems developers:

    FLVletypesupport,includingH.264,On2VP6,and

    Sorenson video codecs

    AccesstocontentandvideocreatedwithAdobeFlash

    Integratedauthoringenvironment Cross/multiplatformsupport

    FlashLitefeaturesanopenSDKbutislicensedunderstrictly

    commercialtermsbyAdobefordevice-baseddeployment.

    To learn more, visit http://www.adobe.com/products/flashlite/

    for general information and ht tp://www.adobe.com/products/

    flashlite/version/foracomparisonofAdobeFlashproducts,

    capabilities, and platform support.

    Xine

    Xine is a multimedia playback engine built around the shared

    library xine-lib. Xine oers a variety o ront-end player

    applications,includingCodeine,gzine,Kaffeine,KMPlayer,

    and Totem.

    Xine can open and execute container ormats that include

    3gp,AVI,ASF,FLV,Matroska,MOV(QuickTime),MP4,NUT,

    Ogg,OGM,andRealMedia.Xinecandecode/playarange

    o audio and video ormats:

    Audio:AAC,AC3,ALAC,AMR,FLAC,MP3,RealAudio,

    Shorten,Speex,Vorbis,WMA

    Video: Cinepak,DV,H.263,H.264/MPEG-4AVC,HuffYUV,

    Indeo,MJPEG,MPEG-1,MPEG-2,MPEG-4ASP,RealVideo,

    Sorenson,Theora,WMV(partial,includingWMV1,WMV2,

    andWMV3,viaFFmpeg)

    XineisreleasedundertheGNUGeneralPublicLicenseand

    may not be suitable or all commercial applications. Learn

    more about Xine at http://xinehq.de/.

    Figure 2: Helix DNA Client system architecture

    Application (GUI, Control Flow)

    Playback Controls

    Sync and Timing

    Audio/Video Mixing

    Streaming Protocols

    File Formats

    Codecs

    Capture and Encoding

    Transcoding

    Metadata Editing

    Download Manager

    Media Services

    DRM

    Core Run-Time Libraries

    Linux Kernel Device Drivers

  • 7/30/2019 3 Multimedia for Linux IVI

    4/7

    4 | Multimedia Resources in Linux for Automotive

    Open Source Media Formats

    CurrentmediaformatsonWindowsdesktopmachinesand

    ubiquitousmediadevices(MPEG,WMV,etc.)areprobably

    amiliar. But the range o reely licensed media ormats

    supported by Linux-based rameworks and players is

    probably less amiliar. These ormats provide additional

    options or OEMs to oer high-quality multimedia

    experiences without the obligation o patents, proprietarylicenses, and so on. However, they are not as widely used as

    theirproprietaryequivalents.Notethatwhiletheseandother

    open ormats are native to open source multimedia sot ware,

    OSS also supports amiliar commercial ormats through a mix

    o open source and proprietary plug-ins and codecs.

    Ogg Vorbis

    OggVorbisisanopen,patent-free,audio-encodingand

    compression ormat that also oer s streaming technology. It

    is comparable to other ormats used to store and play digital

    music,suchasMP3,VQF,AAC,andsoon.OggVorbisdiffers

    rom other ormats by being completely ree, open, and

    unpatented. Ogg is the native audio ile ormat or many

    open source media tools and suppor ted by an increasing

    number o commercially available media players and devices

    as well.

    OggVorbiscomprisestwocomponents:

    Ogg: Containerformatforaudio,video,andmetadata

    rom Xiph.org

    Vorbis: The audio compression scheme contained in Ogg;

    otherformatscanbeembeddedinOggsuchasFLAC

    and Speex

    Learn more at ht tp://www.vorbis.com.

    TheoraTheora is a free and op en video compression format. Theora

    scales from postage stamp to high-definition resolution and is

    competitive with commercial formats, especially at low bit

    rates. Theora is supported by most open source engines and

    players,includingMPlayer,Xine,Helix,andVideoLAN.Theora

    iscomparabletoMPEG-4andDiVX,withalargeandactive

    developer community engaged in incremental enhancement.

    Theora can be used to distribute ilm and video online

    and on disc without licensing or royalty ees or vendor

    lock-in associated with other ormats. Learn more at

    http://www.theora.org and ht tp://www/xiph.org.

    Media Players

    Gnash

    GnashisafreesoftwareFlashmovieplayerandisoneof

    theGNUfamilyprojectshostedandsupportedbytheFree

    SoftwareFoundation.GnashisbasedonGameSWFand

    supportsmostSWFv7featuresandsomeSWFv8andv9

    eatures.

    Gnashfeaturesincludethefollowing:

    Standalone and plug-in operation:Gnashcanrunin

    either mode.

    Browser-based execution: Gnashcanrunasaplug-in

    fromwithinMozilla-derivedbrowsers,suchasFirefox,as

    wellasKonqueror.

    SWF v7+ compliant: GnashcanrunSWF-compatibleles.

    Streaming video support:

    Gnashsupportsviewingof

    streaming video rom popular sharing sites such as Lulu.tv

    orYouTube.

    XML message server: GnashsupportsXML-based

    messagingfortheFlashformat.

    High-quality output: GnashusesOpenGLforrendering

    graphicsonthedesktop,andAntigrainGeometry(AGG)

    or embedded rame bueronly devices.

    Extensible: GnashsupportsextendingActionScriptwith

    custom scripts or wrappers or third-party libraries.

    AspartofthemainGNUprojecttree,Gnashislicensed

    underGPLv3.LearnmoreaboutGnashathttp://www.gnu.org/

    software/gnash/.

    Helix Player

    HelixPlayerisanopensourcemediaplayerwidelydeployedon

    both desktop Linux and embedded Linux, Solaris, and Symbian

    OS.HelixPlayerisbasedontheHelixDNAClientmediaengine.

    HelixPlayerisdual-licensedunderopensource(RealNetworks

    PublicSourceLicense)andcommercialcommunitysource

    (RealNetworksCommunitySourceLicense)licenses.HelixPlayer

    contains only open source components and plays open source

    formats,suchasOggVorbisandTheora.

    RealPlayerforLinuxbuildsontopoftheHelixPlayerand

    supportsRealAudio,RealVideo,WindowsMedia,MP3,Flash,

    MPEG-4(fee-based),andothernonopensourcecomponents.

    LearnmoreaboutbothHelixPlayerandRealPlayerat

    https://player.helixcommunity.org/.

    Kaffeine

    Kaffeineisanopensource,full-featuredmediaplayerfor

    KDE(KDesktopEnvironment)andaccompaniesKDEin

    deployment in enterprise workstation and home desktop

    settings. It supports a broad range o local and network

    mediaanddigitalvideobroadcasting(DVB).Kaffeine(like

    Totembelow)canusexine-liborGStreamerasabackend.

    KaffeineislicensedunderGPLv2butasanapplicationimposes

    minimal reciprocal disclosure requirements upon OEMs. Learn

    moreabouttheKaffeineplayerathttp://kaffeine.kde.org/.

    Totem

    TotemisthestandardmovieplayerfortheGNOMEdesktop

    environment and widely available with desktop Linux

    distributions.TotemistightlyintegratedwithGNOMEand

    itsfilemanager,Nautilus,featuringthegenerationofvideo

    ile thumbnails and a video plug-in or Mozilla browsers

    (Firefox,etal.).

  • 7/30/2019 3 Multimedia for Linux IVI

    5/7

    5 | Multimedia Resources in Linux for Automotive

    Totemexistsintwoversions:onebasedonGStreamerthat

    eatures greater extensibility and supports a larger variety o

    media ormat, the other based on Xine that oers bet ter

    encryptedDVDplaybacksupportandslightlybroaderfile

    formatsupportthantheGStreamerversion.

    BuildingonGStreamer,Totemcanplayallmainstreammedia

    ormats, both open and proprietary. It also understands

    numerousplaylistformats,includingSHOUTcast,M3U,XML

    ShareablePlaylistFormat(XSPF),SMIL,WindowsMedia

    Player,andRealAudio.Totemsupportsfull-screenvideo

    playback including multihead conigurations.

    TotemislicensedunderGPLv2butasanapplicationimposes

    minimal reciprocal disclosure requirements upon OEMs.

    Learn more at ht tp://www.gnome.org/projects/totem/.

    LinDVD

    LinDVDfromCorelisasoftwareDVDplayerfromthesame

    teamthatcreatedtheubiquitousWinDVDfortheWindows

    desktop.LinDVDispackagedasacustomproduct/services

    offeringtargetedatdeviceOEMsandisnotaCOTS(commercial o-the-shel) product per se.

    MPlayer

    MPlayerisamovieplayerstructuredasacommand-line

    applicationwithGUIsforeachsupportedoperatingsystem.

    ForLinux,MPlayeroffersGNOME-basedgmplayer.

    MPlayersupportsawiderangeofformatsandencoding

    standards,includingMPEG/VOB,AVI,Ogg/OGM,VIVO,

    ASF/WMA/WMV,QT/MOV/MP4,RealMedia,Matroska,NUT,

    NuppelVideo,FLI,YUV4MPEG,FILM,RoQ,andPVAfiles.

    MPlayerletsendusersplayVideoCD,SVCD,DVD,3ivx,

    DivX3/4/5,WMV,andH.264movies.

    MPlayersupportsagamutofoutputdrivers,includingX11,

    Xv,DGA,OpenGL,SVGAlib,fbdev,AAlib,DirectFB,GGI,

    SDL,VESA,andvariouslow-levelcard-specificdrivers(e.g.,

    Matrox,3Dfx,andATI).Inmanycases,MPlayeralsosupports

    sotware and/or hardware scaling.

    MostvideoandaudioformatsaresupportedinMPlayer

    nativelythroughthelibavcodeclibraryoftheFFmpeg

    project.Forformatslackingopensourcecodecs,MPlayer

    relies on binary codecs. It can use Windows DLLs directly

    with the help o a DLL loader.

    MPlayerislicensedunderGPLv2and,asanapplication,

    imposes minimal reciprocal disclosure requirements uponOEMs. However, the inormal use and availability o (reverse -

    engineered)MPlayercodecsforotherwiseproprietarymedia

    ormats may expose OEMs or suppliers to intellectual

    property claims against use o sotware patents in open

    source sotware. Learn more at ht tp://www.mplayerhq.hu/.

    VLC and VideoLAN

    VLCis afreesof twaremediaplayerdevelopedunderthe

    VideoLANproject .Monol ithicinsize,itencompassesapor table

    multimedia player, encoder, and streamer in a single code base

    (thefront-endandplug-inapproachesofGStreamer,Xine,etc.).

    VLCcanst reamcontentovernetworksandtranscodemedia

    into various different formats, with sup port for many audio and

    videocodecsandfileformatsaswellasDVDs,VCDs,andstreaming protocols.

    VLCincludesalargenumberoffreecodeclib rar ies ,reducing

    the need or integration and tuning o proprietary plug-ins.

    ManyVLCcodecsshipwithlibavcodecfromtheFFmpeg

    project.VLCisalsothefirstplayertosupportplaybackof

    encryptedDVDsonLinuxwiththelibdvdcssDVDdecryption

    library.

    VLCi slicensedunderGPLv2butasanapplicationimposes

    minimal reciprocal disclosure requirements upon OEMs.

    Learn more at ht tp://www.videolan.org/.

    Speech Technology

    The commercial marketplace, academia, and the open source

    bazaar are lush with speech tec hnologies. While many are

    quite mature and stable, ew have the quality and

    performancetomakethecutasautomotivegrade.

    Most o the projects and products available or Linux or

    automotive ocus individually on speech output (speech

    synthesis and text-to-speech) or input (recognition).

    Speech output or in-car systems can be a key dierentiator

    in a crowded marketplace. Requirements include smooth,

    continuous speech generation, choice o voices, and o-

    the-shel internationalization capability.

    Festival

    Festivaloffersaframeworkforbuildingspeechsynthesis

    systems and oers ull text-to-speech (TTS) through multiple

    APIsandchannels:shell/CLI,Schemecommandinterpreter,

    C++andJavaclasslibraries,andEmacsinterface.Festival

    supportsEnglish(BritishandAmerican)andSpanish,with

    English being the most mature. Tools and documentation

    fornewvoicesareavailablethroughCarnegieMellon

    UniversitysFestvoxproject(http://festvox.org).

    FestivalandthespeechtoolsaredistributedunderanX11-type

    license, allowing unrestricted commercial and noncommercial use

    alike. Learn more at http://www.cstr.ed.ac.uk/projects/festival/.

    MBROLA

    TheMBROLAprojectcreatesspeechsynthesizersforagrowing

    list of languages and is provide d free for noncommercial

    applications. The project goal is to foster research into speech

  • 7/30/2019 3 Multimedia for Linux IVI

    6/7

    6 | Multimedia Resources in Linux for Automotive

    synthesis,specificallyonprosody.Assuch,MBROLAisnota

    TTS synthesizer; rather it accepts phoneme representations as

    input, together with prosodic information, and produces

    speech output based on concatenation of diphones.

    MBROLAisdistributedunderitsownlicensingterms

    (essentially a limited ree copyr ight license). Learn more

    at http://tcts.pms.ac.be/synthesis/mbrola.html.

    Nuance RealSpeak Solo

    RealSpeak Solo is a TTS solution, optimized to enhance

    automotive and other embedded conversational applications.

    Highly scalable, it provides high speech quality output across a

    range of footprints, t ypically from 8MB to 20MB for automotive

    use.Recently,Nuanceintroducedtheoptionforvoicecapture

    of preferred speakers for greater OEM differentiation.

    Nuanceoffersseveraladditionalproductsforin-vehicle

    speechgeneration,includingPromptSculptorand

    SpeechPAKAutomotive.LearnmoreaboutNuanceproducts

    at http://www.nuance.com/automotive/.

    Speex

    Speex is an open source, patent-ree audio compression

    ormat designed or speech. The Speex project aims to lower

    the barrier o entry or voice applications by providing a ree

    alternative to expensive proprietary speech codecs.

    Moreover,SpeexisbasedonCELP(CodeExcitedLinear

    Prediction)andcancompressvoiceatbitratesfrom2kbpsto

    44kbps.SomeofSpeexsfeaturesincludethefollowing:

    Narrowband(8kHz),wideband(16kHz),andultra-wideband

    (32kHz) compression in the same bit stream

    Intensitystereoencoding

    Packetlossconcealment

    Variablebitrateoperation(VBR) Voiceactivitydetection(VAD)

    Discontinuoustransmission(DTX)

    Acousticechocancellationandnoisesuppression

    SpeexispartoftheGNUProjectandisavailableunderthe

    revised Berkeley Sotware Distribution (BSD) license. Learn

    more at http://www.speex.org/.

    NeoSpeech VoiceText

    VoiceText generatessynthes izedspeechfrominputtex tand

    targetscustom,standaloneapplications.VoiceTextishighly

    scalable,withfootprintsassmallas16MB.VoiceTextsupports

    text-to-speechforU.S.English,Latin-AmericanSpanish,

    Korean,Japanese,andMandarinChinese,withacollection

    of11nativevoices.

    NeoSpeechsupportsdesktopLinuxoff-the-shelfand

    embedded Linux in custom conigurations. Learn more

    at http://www.neospeech.com.

    Voice Recognition

    Vehicle-basedspeechrecognitionisoneofthemore

    exacting uses o this input technology: viable recognition

    systems require high-quality noise rejection and

    compensation to oset high levels o ambient noise (wind,

    traffic,streetsounds).Voicerecognitionsystemsmustbe

    speaker-independent and oer low recognition latency and

    highaccuracy.Assuch,onlyafewofthemanyspeechinputtechnologies available as open source and/or Linux-based

    commercial sotware can pass muster as automotive grade.

    Nuance VoCon

    VoCon3200is designedfor add ingspeechrecognit ionto

    automotiveandotherembeddedapplications.TheVoCon

    3200 speech recognition engine delivers speaker-independent

    and continuous speech-recognition capabilities and is suitable

    for navigation systems, telematics applications, and voice

    control of in-vehicle devices.

    Modularandscalable,VoCon3200meetstherequirementsof

    different applications and markets with modules for speakeradaptation, natural language understanding, spelling, and

    onlinegrammarcompiler.VoCon3200featuresincludethe

    following:

    Speaker-independent,continuous-digit,andcommand

    recognition

    Voicedestinationentryforlargecountriesandmarkets

    Automotiveacousticalmodelsandautomaticnoise

    cancellation

    TheVoCon3200packageincludessupportforU.S.English,

    U.K.English,andGermanwiththeautomotiveacoustical

    models. Other languages can be suppor ted easily via data

    filesavailablefromNuance.

    LearnmoreaboutVoCon3200athttp://www.nuance.com/

    vocon/3200/.

    Sensory FluentSoft

    SensorysFluentSofttechnologyiscapableofrecognizing

    thousands o words. Designed to create a natural human-

    machineinterface(HMI),FluentSoftemployswordandphrase

    spotting technologies to avoid the limitations o disc rete

    word command and control.

    To aid in speech application development, Sensory oe rs a

    speaker-independentsoftwaredevelopmentkit(SDK)witha

    large vocabulary or integrating high-quality speech

    recognition technology into automotive applications, with

    modestmemoryandCPUresourceuse.TheFluentSoft

    phonetic recognition engine can build new vocabularies

    based on text input, even on the device.

  • 7/30/2019 3 Multimedia for Linux IVI

    7/7

    Sphinx

    Sphinx is an open source speech recognition system implemented

    inJava.SphinxcomesfromacollaborationamongCarnegie

    MellonUniversity,SunMicrosystemsLaboratories,Mitsubishi

    ElectricResearchLabs(MERL),andHewlett-Packard(HP),with

    contributionsfromtheUniversityofCaliforniaatSantaCruz

    (UCSC)andtheMassachusettsInstituteofTechnology(MIT).

    Sphinx eatures live and batch mode speech recognizers

    capable o recognizing discrete and continuous speech. It

    also oers a range o plug-in interaces or dierent ront

    ends, language models, and acoustic models. Sphinx also

    boasts standalone tools or displaying waveorms and

    spectrograms and generating eatures rom audio.

    Sphinx is distributed under a BSD license. Learn more at

    http://cmusphinx.sourceorge.net/.

    Digital Rights Management

    DRM sotware and related technologies that enorce

    licensing do not sit easily with open source sotware and

    developer communities that create and support OSS. The

    latestversionoftheGNUGeneralPublicLicense(GPLv3)

    evenfeaturesexplicitprohibitionagainstplacingGPLv3

    sotware itsel under the control o a rights management

    regime. However, deployment and use o DRM, while

    incompatiblewithmanagingGPLsoftwareitself,isviablefor

    othersoftwareandformultimediacontent.Fortheirpart,

    OEMs, content owners, and network operators express real

    requirements or DRM, and a number o solutions are

    available or Linux-hosted automotive applications.

    Helix DRM

    Helix DRM for Devices enables chipset and device manufacturersto include DRM in intelligent device software. Helix DRM for

    Devices allows content rights holders to determine device

    delivery policy while also serving the needs of device

    manufacturers and end users. Helix DRM for Devices gives

    content rights holders and device manufacturers flexibility

    over their business models through two modes of operation.

    Primary Device Support

    Aprimarydeviceisanetwork-connectedsystem,aswithPCs,

    set-topboxes,homemediaservers,ormobilephones.A

    primary device can connect directly to the Internet and

    eatures local storage. With Helix DRM primary device

    support,boththeHelixDRMsoftwareandtheHelixDNAclient run natively on the managed device. To access and play

    managed content, an end user or program must acquire the

    secured content rom a media server. When the attempt is

    made to use the encrypted ile on the primar y device, Helix

    DRM requires a license key to decrypt it. In response, the

    Helix DRM client on the primary device requests a license,

    and the Helix DRM license server generates a license key per

    the associated business rules and policy deined by the

    content owner.

    Secondary Device Support

    Asecondarydevicerequiresaconnectiontoaprimarydevice

    for the transfer of data and rules for that data. On secondary

    devices, Helix DRM lets OEMs enable standards-based secure

    subscription, online purchase, and other business models on a

    wide range of portable devices and c onnected home appliances.

    Learn more at https://devicedrm.helixcommunity.org/.

    OpenIPMP

    OpenIPMPisanopensourceDRMimplementationforMPEG-4

    andMPEG-2content.TheprojectadherestoISO/MPEGIPMP

    (intellectual property management and protection) open

    standards(MPEGIPMPhooksandIPMP-X),ISMAcryp,andOpenMobileAlliance(OMA)DRM2specifications.OpenIPMP

    includesCA,PKI,DOImanagement,ISMAstreaming,license

    server, encoding/encryption, player, and plug-in architecture.

    TheOpenIPMPprojectcodeisdistributedundertheMozilla

    PublicLicense(MPL1.1)andsoeasilyintegratesinto

    commercial/proprietary device sotware stacks. Learn more

    at http://sourceorge.net/projects/openipmp.

    Conclusion

    Thanks to substantial investment by the automotive and

    consumer electronics industries, academic researchers, and

    open source community work and through the efforts ofcommercialindependentsoftwarevendors(ISVs),embedded

    Linux today hosts a rich and varied toolbox of multimedia,

    speech, and DRM technologies. This document is not intended

    to be exhaustive but rather to provide an overview of the

    options in these areas that are appropriate for automotive

    development and deployment. In many cases, the software

    technologies and commercial offerings highlighted here are

    already integrated by Wind River or deployed by global

    automotive OEMs.

    Notes

    1.GNUGeneralPublicLicenseversion3,Section3,http://www.s.org/licensing/licenses/gpl.html.

    Wind River is the global leader in Device Sotware Optimization (DSO). We enable companies to develop,

    run, and manage device sotware aster, better, at lower cost, and more reliably. www.windriver.com

    2008WindRiverSystems,Inc.TheWindRiverlogoisatrademarkofWindRiverSystems,Inc.,andWindRiverandVxWorksareregisteredtrademarksofWindRiverSystems,Inc.

    Othermarksusedhereinarethepropertyoftheirrespectiveowners.Formoreinformation,seewww.windriver.com/company/terms/trademark.html.Rev.10/2008