11
Yueting Zhuang Shiqiang Yang Yong Rui Qinming He (Eds.) Advances in Multimedia Information Processing - PCM 2006 7th Pacific Rim Conference on Multimedia Hangzhou, China, November 2-4, 2006 Proceedings Springer

Advances in Multimedia Information Processing - PCM 2006

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Yueting Zhuang Shiqiang Yang Yong Rui Qinming He (Eds.)

Advances in Multimedia Information Processing -PCM 2006

7th Pacific Rim Conference on Multimedia Hangzhou, China, November 2-4, 2006 Proceedings

Springer

Table of Contents

Expressive Speech Recognition and Synthesis as Enabling Technologies for Affective Robot-Child Communication 1

Selma Yümazyildiz, Wesley Mattheyses, Yorgos Patsis, Werner Verhelst

Embodied Conversational Agents: Computing and Rendering Realistic Gaze Patterns 9

Gerard Bailly, Frederic Elisei, Stephan Raidt, Alix Casari, Antoine Picot

DBN Based Models for Audio-Visual Speech Analysis and Recognition 19

Ilse Ravyse, Dongmei Jiang, Xiaoyue Jiang, Guoyun Lv, Yunshu Hou, Hichem Sahli, Rongchun Zhao

An Extensive Method to Detect the Image Digital Watermarking Based on the Known Template 31

Yang Feng, Senlin Luo, Linmin Pan

Fast Mode Decision Algorithm in H.263+/H.264 Intra Transcoder 41 Min Li, Guiming He

Binary Erasure Codes for Packet Transmission Subject to Correlated Erasures 48

Frederik Vanhaverbeke, Frederik Simoens, Marc Moeneclaey, Danny De Vleeschauwer

Image Desynchronization for Secure Collusion-Resilient Fingerprint in Compression Domain 56

Zhongxuan Liu, Shiguo Lian, Zhen Ren

A Format-Compliant Encryption Framework for JPEG2000 Image Code-Streams in Broadcasting Applications 64

Jinyong Fang, Jun Sun

Euclidean Distance Transform of Digital Images in Arbitrary Dimensions 72

Dong Xu, Hua Li

JPEG2000 Steganography Possibly Secure Against Histogram-Based Attack 80

Hideki Noda, Yohsuke Tsukamizu, Michiharu Niimi

XIV Table of Contents

Perceptual Depth Estimation from a Single 2D Image Based on Visual Perception Theory 88

Bing Li, De Xu, Songhe Feng, Aimin Wu, Xu Yang

A System for Generating Personalized Virtual News 96 Jian-Jun Xu, Jun Wen, Dan-Wen Chen, Yu-Xiang Xie, Ling-Da Wu

Image Fingerprinting Scheme for Print-and-Capture Model 106 Won-gyum Kim, Seon Hwa Lee, Yong-seok Seo

16x16 Integer Cosine Transform for HD Video Coding 114 Jie Dong, King Ngi Ngan

Heegard-Berger Video Coding Using LMMSE Estimator 122 Xiaopeng Fan, Oscar Au, Yan Chen, Jiantao Zhou

Real-Time BSD-Driven Adaptation Along the Temporal Axis of H.264/AVC Bitstreams 131

Wesley De Neue, Davy De Schrijver, Davy Van Deursen, Peter Lambert, Rik Van de Walle

Optimal Image Watermark Decoding 141 Wenming Lu, Wanqing Li, Rei Safavi-Naini, Philip Ogunbona

Diagonal Discrete Cosine Transforms for Image Coding 150 Jingjing Fu, Bing Zeng

Synthesizing Variational Direction and Scale Texture on Planar Region 159

Yan-Wen Guo, Xiao-Dong Xu, Xi Chen, Jin Wang, Qun-Sheng Peng

Fast Content-Based Image Retrieval Based on Equal-Average K-Nearest-Neighbor Search Schemes 167

Zhe-Ming Lu, Hans Burkhardt, Sebastian Boehmer

Characterizing User Behavior to Improve Quality of Streaming Service over P2P Networks 175

Yun Tang, Lifeng Sun, Jianguang Luo, Yuzhuo Zhong

Interacting Activity Recognition Using Hierarchical Durational-State Dynamic Bayesian Network 185

Youtian Du, Feng Chen, Wenli Xu, Weidong Zhang

Table of Contents XV

Improving the Image Retrieval Results Via Topic Coverage Graph 193 Kai Song, Yonghong Tian, Tiejun Huang

Relevance Feedback for Sketch Retrieval Based on Linear Programming Classification 201

Bin Li, Zhengxing Sun, Shuang Liang, Yaoye Zhang, Bo Yuan

Hierarchical Motion-Compensated Frame Interpolation Based on the Pyramid Structure 211

Gun-Ill Lee, Rae-Hong Park

Varying Microphone Patterns for Meeting Speech Segmentation Using Spatial Audio Cues 221

Eva Cheng, Ian Burnett, Christian Ritz

Region-Based Sub-pixel Motion Estimation from Noisy, Blurred, and Down-Sampled Sequences 229

Osama A. Oraer, Toshihisa Tanaka

Differential Operation Based Palmprint Authentication for Multimedia Security 237

Xiangqian Wu, Kuanquan Wang, David Zhang

A Broadcast Model for Web Image Annotation 245 Jia Li, Ting Liu, Weiqiang Wang, Wen Gao

An Approach to the Compression of Residual Data with GPCA in Video Coding 252

Lei Yao, Jian Liu, Jiangqin Wu

A Robust Approach for Object Recognition 262 Yuanning Li, Weiqiang Wang, Wen Gao

A Novel Method for Spoken Text Feature Extraction in Semantic Video Retrieval 270

Juan Cao, Jintao Li, Yongdong Zhang, Sheng Tang

A Semantic Image Category for Structuring TV Broadcast Video Streams 279

Jinqiao Wang, Lingyu Duan, Hanqing Lu, Jesse S. Jin

Markov Chain Monte Carlo Super-Resolution Image Reconstruction with Simultaneous Adaptation of the Prior Image Model 287

Jing Tian, Kai-Kuang Ma

XVI Table of Contents

Text Detection in Images Using Texture Feature from Strokes 295 Caifeng Zhu, Weiqiang Wang, Qianhui Ning

Robust Mandarin Speech Recognition for Car Navigation Interface 302 Pei Ding, Lei He, Xiang Yan, Rui Zhao, Jie Hao

GKDA: A Group-Based Key Distribution Algorithni for WiMAX MBS Security 310

Huijie Li, Guangbin Fan, Jigang Qiu, Xiaokang Lin

A Watermarking Algorithm for JPEG File 319 Hongmei Liu, Huiying Fu, Jiwu Huang

SNR Scalability in H.264/AVC Using Data Partitioning 329 Stefaan Mys, Peter Lambert, Wesley De Neve, Piet Verhoeve, Rik Van de Walle

A Real-Time XML-Based Adaptation System for Scalable Video Formats 339

Davy Van Deursen, Davy De Schrijver, Wesley De Neve, Rik Van de Walle

Generic, Scalable Multimedia Streaming and Delivery with Example Application for H.264/AVC 349

Joseph Thomas-Kerr, Ian Burnett, Christian Ritz

Shape-Based Image Retrieval in Botanical Collections 357 Itheri Yahiaoui, Nicolas Herve, Nozha Boujemaa

Macroblock Mode Decision Scheme for Fast Encoding in H.264/AVC . . . . 365 Donghyung Kim, Joohyun Lee, Kicheol Jeon, Jechang Jeong

A Mathematical Model for Interaction Analysis Between Multiview Video System and User 375

You Yang, Gangyi Jiang, Mei Yu, Zhu Peng

Motion Composition of 3D Video 385 Jianfeng Xu, Toshihiko Yamasaki, Kiyoharu Aizawa

EKM: An Efficient Key Management Scheme for Large-Scale Peer-to-Peer Media Streaming 395

Feng Qiu, Chuang Lin, Hao Yin

Using Earth Mover's Distance for Audio Clip Retrieval Yuxin Peng, Cuihua Fang, Xiaoou Chen

405

Table of Contents XVII

Streaming-Mode MB-Based Integral Image Techniques for Fast Multi-view Video Illumination Compensation 414

Jiangbo Lu, Gauthier Lafruit, Francky Catthoor

A Motion Vector Predictor Architecture for AVS and MPEG-2 HDTV Decoder 424

Junhao Zheng, Di Wu, Lei Deng, Don Xie, Wen Gao

Inter-camera Coding of Multi-view Video Using Layered Depth Image Representation 432

Seung-Uk Yoon, Eun-Kyung Lee, Sung-Yeol Kim, Yo-Sung Ho, Kugjin Yun, Sukhee Cho, Namho Hur

Optimal Priority Packetization with Multi-layer UEP for Video Streaming over Wireless Network 442

Huanying Zou, Chuang Lin, Hao Yin, Zhen Chen, Feng Qiu, Xuening Liu

A Multi-channel MAC Protocol with Dynamic Channel Allocation in CDMA Ad Hoc Networks 450

Jigang Qiu, Guangbin Fan, Huijie Li, Xiaokang Lin

Fuzzy Particle Swarm Optimization Clustering and Its Application to Image Clustering 459

Wensheng Yi, Min Yao, Zhiwei Jiang

A New Fast Motion Estimation for H.264 Based on Motion Continuity Hypothesis 468

Juhua Pu, Zhang Xiong, Lionel M. Ni

Statistical Robustness in Multiplicative Watermark Detection 477 Xingliang Huang, Bo Zhang

Adaptive Visual Regions Categorization with Sets of Points of Interest. . . 485 Hichem Houissa, Nozha Boujemaa, Hichem Frigui

A Publishing Framework for Digitally Augmented Paper Documents: Towards Cross-Media Information Integration 494

Xiaoqing Lu, Zhiwu Lu

Web-Based Semantic Analysis of Chinese News Video 502 Huamin Feng, Zongqiang Fang, Kun Qiu, Guosen Song

A Quality-Controllable Encryption for H.264/AVC Video Coding 510 Guang-Ming Hong, Chun Yuan, Yi Wang, Yu-Zhuo Zhong

XVIII Table of Contents

Texture Synthesis Based on Minimum Energy Cut and Its Applications 51g

Shuchang Xu, Xiuzi Ye, Yin Zhang, Sanyuan Zhang

Unifying Keywords and Visual Features Within One-Step Search for Web Image Retrieval 527

Ruhan He, Hai Jin, Wenbing Tao, Aobing Sun

Dynamic Bandwidth AUocation for Stored Video Under Renegotiation Frequency Constraint 537

Myeong-jin Lee, Kook-yeol Yoo, Dong-jun Lee

Online Selection of Discriminative Features Using Bayes Error Rate for Visual Tracking 547

Dawei Liang, Qingming Huang, Wen Gao, Hongxun Yao

Interactive Knowledge Integration in 3D Cloth Animation with Intelligent Learning System 556

Yujun Chen, Jiaxin Wang, Zehong Yang, Yixu Song

Multi -view Video Coding with Flexible View-Temporal Prediction Structure for Fast Random Access 564

Yanwei Liu, Qingming Huang, Xianggang Ji, Debin Zhao, Wen Gao

Squeezing the Auditory Space: A New Approach to Multi-channel Audio Coding 572

Bin Cheng, Christian Ritz, Ian Burnett

Video Coding by Texture Analysis and Synthesis Using Graph Cut 582 Yongbing Zhang, Xianggang Ji, Debin Zhao, Wen Gao

Multiple Description Coding Using Adaptive Error Recovery for Real-Time Video Transmission 590

Zhi Yang, Jiajun Bu, Chun Chen, Linjian Mo, Kangmiao Liu

An Improved Motion Vector Prediction Scheme for Video Coding 598 Da Liu, Debin Zhao, Qiang Wang, Wen Gao

Classifying Motion Time Series Using Neural Networks 606 Lidan Shou, Ge Gao, Gang Chen, Jinxiang Dong

Estimating Intervals of Interest During TV Viewing for Automatic Personal Preference Acquisition 615

Makoto Yamamoto, Naoko Nitta, Noboru Babaguchi

Table of Contents XIX

Image Annotations Based on Semi-supervised Clustering with Semantic Soft Constraints 624

Xiaoguang Rui, Pingbo Yuan, Nenghai Yu

Photo Retrieval from Personal Memories Using Generic Concepts 633 Rui M. Jesus, Arnaldo J. Abrantes, Nuno Correia

PanoWalk: A Remote Image-Based Rendering System for Mobile Devices 641

Zhongding Jiang, Yandong Mao, Qi Jia, Nan Jiang, Junyi Tao, Xiaochun Fang, Hujun Bao

A High Quality Robust Watermarking Scheme 650 Yu-Ting Pai, Shanq-Jang Ruan, Jürgen Götze

An Association Rule Mining Approach for Satellite Cloud Images and Rainfall 658

Xu Lai, Guo-hui Li, Ya-li Gan, Ze-gang Ye

AVAS: An Audio-Visual Attendance System 667 Dongdong Li, Yingchun Yang, Zhenyu Shan, Gang Pan, Zhaohui Wu

Improved POCS-Based Deblocking Technique Using Wavelet Transform in Block Coded Image 676

Goo-Rak Kwon, Hyo-Kak Kim, Chun-Soo Park, Yoon Kim, Sung-Jea Ko

Sketch Case Based Spatial Topological Data Retrieval 686 Zhen-ming Yuan, Liang Zhang, Hong Pan

Providing Consistent Service for Structured P2P Streaming System 695 Zhen Yang, Huadong Ma

Adaptive Search Range Scaling for B Pictures Coding 704 Zhigang Yang, Wen Gao, Yan Liu, Debin Zhao

Video QoS Monitoring and Control Framework over Mobile and IP Networks 714

Bingjun Zhang, Lifeng Sun, Xiaoyu Cheng

Extracting Moving / Static Objects of Interest in Video Sojung Park, Minhwan Kim

722

XX Table of Contents

Building a Personalized Music Emotion Prediction System 730 Chan-Chang Yeh, Shian-Shyong Tseng, Pei-Chin Tsai, Jui-Feng Weng

Video Segmentation Using Joint Space-Time-Range Adaptive Mean Shift 740

Irene Y.H. Gu, Vasile Gui, Zhifei Xu

EagleRank: A Novel Ranking Model for Web Image Search Engine 749 Kangmiao Liu, Wei Chen, Chun Chen, Jiajun Bu, Can Wang, Peng Huang

Color Image Enhancement Using the Laplacian Pyramid 760 Yeul-Min Baek, Hyoung-Joon Kim, Jin-Aeon Lee, Sang-Guen Oh, Whoi- Yul Kim

3D Mesh Construction from Depth Images with Occlusion 770 Jeung-Chul Park, Seung-Man Kim, Kwan-Heng Lee

An Eigenbackground Subtraction Method Using Recursive Error Compensation 779

Zhifei Xu, Pengfei Shi, Irene Yu-Hua Gu

Attention Information Based Spatial Adaptation Framework for Browsing Videos Via Mobile Devices 788

Yi Wang, Houqiang Li, Zhengkai Liu, Chang Wen Chen

Style Strokes Extraction Based on Color and Shape Information 798 Jianming Liu, Dongming Lu, Xiqun Lu, Xifan Shi

Requantization Transcoding of H.264/AVC Bitstreams for Intra 4x4 Prediction Modes 808

Stijn Notebaert, Jan De Cock, Koen De Wolf, Rik Van de Walle

Prediction Algorithms in Large Scale VOD Services on Grid Infrastructure 818

Bo Li, Depei Qian

A Hierarchical Framework for Fast Macroblock Prediction Mode Decision in H.264 827

Cheng-dong Shen, Si-kun Li

Compact Representation for Large-Scale Clustering and Similarity Search 835

Bin Wang, Yuanhao Chen, Zhiwei Li, Mingjing Li

Table of Contents XXI

Robust Recognition of Noisy and Partially Occluded Faces Using Iteratively Reweighted Fitting of Eigenfaces 844

Wangmeng Zuo, Kuanquan Wang, David Zhang

Pitching Shot Detection Based on Multiple Feature Analysis and Fuzzy Classification 852

Wen-Nung Lie, Guo-Shiang hin, Sheng-Lung Cheng

Color Changing and Fading Simulation for Frescoes Based on Empirical Knowledge from Artists 861

Xifan Shi, Dongming Lu, Jianming Liu

A Novel Spatial-Temporal Position Prediction Motion-Compensated Interpolation for Frame Rate Up-Conversion 870

Jianning Zhang, Lifeng Sun, Yuzhuo Zhong

Web Image Clustering with Reduced Keywords and Weighted Bipartite Spectral Graph Partitioning 880

Su Ming Koh, Liang-Tien Chia

An Architecture to Connect Disjoint Multimedia Networks Based on Node's Capacity 890

Jaime Lloret, Juan R. Diaz, Jose M. Jimenez, Fernando Boronat

Quantitative Measure of Inlier Distributions and Contour Matching for Omnidirectional Camera Calibration 900

Yongho Hwang, Hyunki Hong

High-Speed All-in-Focus Image Reconstruction by Merging Multiple Differently Focused Images 909

Kazuya Kodama, Hiroshi Mo, Akira Kubota

A Real-Time Video Deinterlacing Scheme for MPEG-2 to AVS Transcoding 919

Qian Huang, Wen Gao, Debin Zhao, Cliff Reader

Persian Text Watermarking 927 Ali Asghar Khodami, Khashayar Yaghmaie

Three Dimensional Reconstruction of Structured Scenes Based on Vanishing Points 935

Guanghui Wang, Shewei Wang, Xiang Gao, Yubing Li

Parallel Processing for Reducing the Bottleneck in Realtime Graphics Rendering 943

Mee Young Sung, Suk-Min Whang, Yonghee Yoo, Nam-Joong Kim, Jong-Seung Park, Wonik Choi

XXII Table of Contents

Distributed Data Visualization Tools for Multidisciplinary Design Optimization of Aero-crafts 953

Chunsheng Liu, Tianxu Zhang

An Efficient Clustering and Indexing Approach over Large Video Sequences 961

Yu Yang, Qing Li

An Initial Study on Progressive Filtering Based on Dynamic Programming for Query-by-Singing/Humming 971

Jyh-Shing Roger Jang, Hong-Ru Lee

Measuring Multi-modality Similarities Via Subspace Learning for Cross-Media Retrieval 979

Hong Zhang, Jianguang Weng

SNR-Based Bit Allocation in Video Quality Smoothing 989 Xiangui Kang, Junqiang Lan, Li Liu, Xinhua Zhuang

Shadow Removal in Sole Outdoor Image 999 Zhenlong Du, Xueying Qin, Wei Hua, Hujun Bao

3D Head Model Classification Using KCDA 1008 Bo Ma, Hui-yang Qu, Hau-san Wong, Yao Lu

Framework for Pervasive Web Content Delivery 1018 Henry N. Palit, Chi-Hung Chi, Lin Liu

Region-Based Semantic Similarity Propagation for Image Retrieval 1027 Weiming Lu, Hong Pan, Jiangqin Wu

Author Index 1037