2
Demo Abstract: FOCUS: Clustering Crowdsourced Videos by Line-of-Sight Puneet Jain Duke University [email protected] Justin Manweiler IBM T. J. Watson [email protected] Arup Achary IBM T. J. Watson [email protected] Kirk Beaty IBM T. J. Watson [email protected] ABSTRACT We present a demonstration of FOCUS [1], a system to appear in the SenSys 2013 main conference. FOCUS is a video-clustering service for live user video streams, indexed automatically and in realtime by shared content. FOCUS uniquely leverages visual, 3D model reconstruction and mul- timodal sensing to decipher and continuously track a video’s line-of-sight. Through spatial reasoning on the relative ge- ometry of multiple video streams, FOCUS recognizes shared content even when viewed from diverse angles and distances. We believe FOCUS can enable a new family of applications, such as instant replay, augmented reality, citizen journalism, security breach detection, and disaster assessment. In the demonstration, we will show 325 video clips taken at Duke University Wallace Wade Stadium being processed in real-time via FOCUS pipeline. The recorded video clips con- tain one of three spots in the stadium: East Stand, Score- board, and West Stand. The demo shall be shown in the form of a web interface, first showing randomly clustered video clips. Later, on a button click, FOCUS shall process displayed videos in real-time, outputting clusters of videos, containing the common shared subject in each of them. For each successfully processed video clip in a cluster, we will further show similar clips from near, medium, and wide an- gle. To display performance and accuracy of FOCUS in indoor environments, a similar demonstration will be shown for an office space. FOCUS shall run on multi-node Hadoop cluster built on top of IBM SmartCloud platform. General Terms Algorithms, Design, Experimentation, Performance Copyright is held by the owner/author(s). SenSys ’13, Nov 11-15 2013, Roma, Italy ACM 978-1-4503-2027-6/13/11 . Categories and Subject Descriptors H.3.4 [Information Storage and Retrieval]: Systems and Software Keywords Video Analytics, Localization, Structure from Motion 1. DEMONSTRATION FOCUS is implemented in two parts: (1) a mobile app (pro- totyped for Android) (2) a distributed service, deployed on infrastructure-as-a-service cloud using Hadoop. The app lets users to capture and upload live video streams, annotated with time-synchronized sensor data, from GPS, compass, and gyroscope. The cloud service receives video streams in parallel, analyzing each, leveraging Computer Vision and sensing to continuously model and track the video’s line- of-sight. FOCUS performs a series of steps on cloud before inferring a video’s line-of-sight: (1) Frame decoding, (2) Fea- ture extraction, (3) Features to model alignment. The above steps estimate relative camera pose of any given frame, with respect to the provided model; We define line-of-sight as a ray emerging from camera position, extending toward the infinity in the outward facing direction of camera. The es- timated line-of-sight is used to quantify the shared physical space between any two videos and an spatio-temporal ma- trix of similarity is constructed. Finally, clustering using modularity maximization is performed to construct groups of similar clips. The similar clips in a group are further di- vided into subgroup by looking at 3D-angular separation in their line-of-sight. 2. SETUP We shall demo FOCUS on a web browser. The participants will be asked to select a set of clips from a pre-captured pool of videos. The selected video clips shall be processed via FOCUS pipeline and results will be displayed in three column format from different angle of views. A tentative screenshot of our demo is shown in Figure 1. FOCUS es- timates video’s line-of-sight with the help of a 3D model. To display the significance of the model, along with each video, we shall also show the corresponding images used in the model construction.

Demo Abstract: FOCUS: Clustering Crowdsourced Videos by ...puneet/papers/focus-sensys-demo.pdf · The demo shall be shown in the form of a web interface, rst showing randomly clustered

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Demo Abstract: FOCUS: Clustering Crowdsourced Videos by ...puneet/papers/focus-sensys-demo.pdf · The demo shall be shown in the form of a web interface, rst showing randomly clustered

Demo Abstract: FOCUS: Clustering Crowdsourced Videosby Line-of-Sight

Puneet JainDuke University

[email protected]

Justin ManweilerIBM T. J. Watson

[email protected]

Arup AcharyIBM T. J. Watson

[email protected]

Kirk BeatyIBM T. J. Watson

[email protected]

ABSTRACTWe present a demonstration of FOCUS [1], a system toappear in the SenSys 2013 main conference. FOCUS is avideo-clustering service for live user video streams, indexedautomatically and in realtime by shared content. FOCUSuniquely leverages visual, 3D model reconstruction and mul-timodal sensing to decipher and continuously track a video’sline-of-sight. Through spatial reasoning on the relative ge-ometry of multiple video streams, FOCUS recognizes sharedcontent even when viewed from diverse angles and distances.We believe FOCUS can enable a new family of applications,such as instant replay, augmented reality, citizen journalism,security breach detection, and disaster assessment.

In the demonstration, we will show 325 video clips taken atDuke University Wallace Wade Stadium being processed inreal-time via FOCUS pipeline. The recorded video clips con-tain one of three spots in the stadium: East Stand, Score-board, and West Stand. The demo shall be shown in theform of a web interface, first showing randomly clusteredvideo clips. Later, on a button click, FOCUS shall processdisplayed videos in real-time, outputting clusters of videos,containing the common shared subject in each of them. Foreach successfully processed video clip in a cluster, we willfurther show similar clips from near, medium, and wide an-gle. To display performance and accuracy of FOCUS inindoor environments, a similar demonstration will be shownfor an office space. FOCUS shall run on multi-node Hadoopcluster built on top of IBM SmartCloud platform.

General TermsAlgorithms, Design, Experimentation, Performance

Copyright is held by the owner/author(s).SenSys ’13, Nov 11-15 2013, Roma, ItalyACM 978-1-4503-2027-6/13/11 .

Categories and Subject DescriptorsH.3.4 [Information Storage and Retrieval]: Systemsand Software

KeywordsVideo Analytics, Localization, Structure from Motion

1. DEMONSTRATIONFOCUS is implemented in two parts: (1) a mobile app (pro-totyped for Android) (2) a distributed service, deployed oninfrastructure-as-a-service cloud using Hadoop. The app letsusers to capture and upload live video streams, annotatedwith time-synchronized sensor data, from GPS, compass,and gyroscope. The cloud service receives video streams inparallel, analyzing each, leveraging Computer Vision andsensing to continuously model and track the video’s line-of-sight. FOCUS performs a series of steps on cloud beforeinferring a video’s line-of-sight: (1) Frame decoding, (2) Fea-ture extraction, (3) Features to model alignment. The abovesteps estimate relative camera pose of any given frame, withrespect to the provided model; We define line-of-sight as aray emerging from camera position, extending toward theinfinity in the outward facing direction of camera. The es-timated line-of-sight is used to quantify the shared physicalspace between any two videos and an spatio-temporal ma-trix of similarity is constructed. Finally, clustering usingmodularity maximization is performed to construct groupsof similar clips. The similar clips in a group are further di-vided into subgroup by looking at 3D-angular separation intheir line-of-sight.

2. SETUPWe shall demo FOCUS on a web browser. The participantswill be asked to select a set of clips from a pre-capturedpool of videos. The selected video clips shall be processedvia FOCUS pipeline and results will be displayed in threecolumn format from different angle of views. A tentativescreenshot of our demo is shown in Figure 1. FOCUS es-timates video’s line-of-sight with the help of a 3D model.To display the significance of the model, along with eachvideo, we shall also show the corresponding images used inthe model construction.

Page 2: Demo Abstract: FOCUS: Clustering Crowdsourced Videos by ...puneet/papers/focus-sensys-demo.pdf · The demo shall be shown in the form of a web interface, rst showing randomly clustered

Figure 1: Random unclustered videos (left): show-ing video previews, Videos clustered by line-of-sight(right): columns show similar videos from variousangular separation.

3. REFERENCES[1] P. Jain, J. Manweiler, A. Acharya, and K. Beaty.

Focus: Clustering crowdsourced videos by line-of-sight.In Proceedings of the 11th ACM Conference onEmbedded Networked Sensor Systems. ACM, 2013.