52
Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Embed Size (px)

Citation preview

Page 1: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Multimedia Grand Challenge 2010

Mei-Chen Yeh03/30/2010

Page 2: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

• Research-oriented– Paper reading and

discussion– Hand-on assignments– Teamwork– Brainstorm activities

• Student-driven– Bid for paper

presentations! ( 論文競標 )

– Play with useful tools!

– Find your partners!– Think without limits!

天馬行空15% grade

Page 3: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Why should I spend time on this?

• I want to pass this course.• Writing a report / doing a project always take

time. Why not turn the report/project into something beneficial? $$ 要花在刀口上

Page 4: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Here comes the opportunity!

• $1500 (~NT47688) if we earn the first prize• Make your resume stand out

Page 5: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

………………………….………………………….

………………………….

………………………….

………………………….

………………………….

Education Educationmaster, NTNU………………………….………………………….

master, NXU………………………….………………………….

Experiences

……………………………………………

……………………………………………

……………………………………………

……………………………………………

Experiences……………………………………………………………………………………………………………………………………………………………………………………

Publicationxxx, “An fast approach for automatic photo organization”, ACM Multimedia, 2010.

Page 6: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

10 Challenges this year

• Identified by leading companies!– two from Google– two from Yahoo!– two from Radvision– one from Nokia, HP, 3DLife, and CeWe

Page 7: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Nokia Challenge 2010: Where was this Photo Taken, and How?

Page 8: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Can you tell where are these photos taken?

Page 9: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Nokia Challenge• Goal– Try to derive exact camera poses (location and

orientation) of given photos that are lacking location annotation

• You can assume the availability of nearby photos/video with known location that can be used to derive unknown camera poses; other ideas that do not require existing content will be welcome.

• Applications– http://betalabs.nokia.com/apps/nokia-image-space

Page 10: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

<geolocation alt="0" lat="43.731634200204" lng="7.421395112594" /> <orientation pitch="0" roll="0" yaw="100" />

<geolocation alt="0" lat="43.731639899896" lng="7.421372230007" /> <orientation pitch="0" roll="0" yaw="104" />

<geolocation alt="0" lat="43.731658423895" lng="7.421072325625" /> <orientation pitch="7.5" roll="0" yaw="240" />

<geolocation alt="0" lat="43.731594553817" lng="7.421487732589" /> <orientation pitch="0" roll="0" yaw="276" />

緯度 , 經度

Page 11: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Nokia Challenge

• Matlab tools and datasets are available!– photos captured with the Nokia 6210 Navigator– each image has an associated GPS/orientation

measurement

Page 12: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Google Challenge 2010: Robust, As-Accurate-As-Human Genre

Classification for Video

Page 13: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Current video search engine

language ( 語言 )length ( 影片長度 )date ( 上傳日期 )

Page 14: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Google Challenge

• Goal– take user generated videos (along with their

sparse and noisy metadata) and automatically classify them into genres

【法】文藝作品之類型

Page 15: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

A genre hierarchy

Page 16: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Google Challenge 2010: Indexing and Fast Interactive Searching in Personal

Diaries ( 個人日記 )

Page 17: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Google Challenge

• Diaries can be any combination of audio, video, geographic location, photos, phone logs, and whatever other multimedia data the user generates or accesses.

• Goal– develop good schema, algorithms, UI, etc., that

will be useful for diaries from audio-only through full-featured multimedia.

Page 18: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

slide from 彎彎’ s blog

Page 19: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Google Challenge

• Some thoughts: – What would be a good design for blogs?– We now have photos, comments, game records,

… on facebook. Could we design new tools for users to create their own “e-diary”?

Page 20: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge 2010: Robust Automatic Segmentation of Video

According to Narrative Themes故事形式的

Page 21: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

An episode of Friends

One example:

Page 22: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge

• Goal– develop methods, techniques, and algorithms to

automatically generate narrative themes for a given video, as well as present the content in an easy-to-consume manner to end-users in a search engine experience

• Applications– allow users consume pieces of a video that would be

of interest to them– let users kill time during lunch breaks in creative ways

Page 23: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Another example:

2010/03/30 中天新聞

政治.財經 社會.地方 影視.體育 生活.醫療

1. Train a 馬英九總統 detector, use it to identify video segments, and tag them with “ 政治”

2. Identify the sound of the anchor man ( 主播 ), from there identify keywords such as “ 娛樂” , “ 社會” , etc.

3. …..

Page 24: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

One more example:

2010/03/28 La New vs. 兄弟 @ 新莊

被三振

安打

失誤

廣告

1. Background music is different, tag the video segment “ 廣告”2. ….

Page 25: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

• My suggestions– select a certain type of videos– Examples:• sitcom or some popular tv show such as 康熙來了• your favorite movie• a wedding video of your family ( 進場 , 致詞 , 敬酒 ,

玩遊戲… )• sport videos• educational videos• …

• You need to specify not only the themes, but also how your are going to segment the input video into themes automatically

Page 26: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge 2010: Novel Image Understanding

Page 27: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge

• Move beyond simple image classification• Goal– develop novel and useful ways to organize and

structure image content

Page 28: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge

• Example– Sort celebrity pictures by their subject’s age

1999 2000 2001 2002…

or hair style

Page 29: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge

• Example– discover how a logo/advertisement/product has

evolved over time

Page 30: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Yahoo! Challenge

• There are many ways to organize photos! – What are the ways that are not obvious? – What can we do better than we can do today?

Page 31: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

HP Challenge 2010: High Impact Visual Communication

Page 32: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

HP Challenge

• Goal– find a solution which can create a collage ( 美術

拼貼 ) and generate a textual description that tells the story of a set of photos

• How do we create a high impact picture that can convey information across cultural boundaries and find a thousand words that best describe such a picture?

Page 33: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

HP Challenge

• Input– a digital photo collection, such as photos taken

during a vacation

• Outputs– the most appealing collage picture (1600x1200

pixels) that best represents the original collection– a description (<100 words) of the collage picture

Page 34: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

my favorite cartoon: Sponge bob and square pants….

A show at xxx pub. Hang out with my friends….

Page 35: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

HP Challenge

• 6 datasets are available!– each with 20 photos

Page 36: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

CeWe Challenge 2010: Automatic Theme Identification of Photo Sets for

Digital Print Products

Page 37: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

CeWe Challenge

• About CeWe– the Number One services partner for first-class

trade brands on the European photographic market

– supplies both stores and Internet retailers with photographic products

posters, calendars or photo books

Page 38: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

CeWe Challenge

• Goal– simplify the “style selection” step

• In the CEWE PHOTOBOOK software about 100 styles are available to suit different user tastes and different types of photo books.– events (party, holiday, BBQ…)– seasons (Christmas, summer, Easter, new year…) – design styles (classical, funky, cute, …)

Page 39: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

CeWe Challenge

• Based on the photo contents– not necessarily to automatically determine the

one and only perfect style, – but rather to provide the user with a reasonable

selection of styles he or she can choose from.

Page 40: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

CeWe Challenge

• Example– What would be good background colors for the

set of photos?

Page 41: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

3DLife Challenge 2010: Sports Activity Analysis in Camera Networks

Page 42: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

3DLife Challenge

• Goal– Explore the limits of what is possible in terms of

2D and 3D data extraction from a low-cost camera network for sports

– Tennis is chosen as a case study

http://www.hawkeyeinnovations.co.uk/

Page 43: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

3DLife Challenge• The capture infrastructure– 720 x 680, MPEG-4 25Hz cameras – not calibrated or synchronized– share only limited overlapping fields of view

• Subjects of interests– Player localization and tracking– Event-based analysis and human behavior modeling – 3D reconstruction of the playing arena and/or the players

or their actions– Player activity and motion over an entire training session– Novel visualization and feedback mechanisms of any

analysis results

Page 44: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Radvision Challenge 2010: Real-time Data Collaboration Adaptation for Multi-Device Video Conferencing

Page 45: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Radvision Challenge

• Goal– adapt, in real-time, the data collaboration channel

to different receiving devices, in a way that would be regarded as optimal perceptually by users

• Example video– http://comminfo.rutgers.edu/conferences/mmch

allenge/2010/02/10/radvision-challenge-adaptation/

Page 46: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Radvision Challenge 2010: Video Conferencing To Surpass “In-Person”

Meeting Experience

Page 47: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Radvision Challenge

• Goal– developing new technologies and ideas to surpass

( 超越 , 改善 ) the “in-person” meeting experience

Page 48: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Radvision Challenge

• Example:– Come up with ways to maintain a long-term

relationship

Page 49: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

For more information…

http://comminfo.rutgers.edu/conferences/mmchallenge/

Page 50: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Report requirements

• File format– Subject: MM-Challenge#X-Team#X– File name: MM-Challenge#X-Team#X.doc

1 2 3 4 5

6 7 8 9 10

Page 51: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Report requirements

• 2 pages• Write-up as a formal paper– title, names– abstract, keywords– introduction– system description (or prototype)– references

template is available on the course website

Page 52: Multimedia Grand Challenge 2010 Mei-Chen Yeh 03/30/2010

Report requirements

• Language– English is preferred; Chinese is fine.

• Due date– 04/20 11:59 pm