9
Guozhu Dong Xuemin Lin Wei Wang Yun Yang Jeffrey Xu Yu (Eds.) Advances in Data and Web Management Joint 9th Asia-Pacific Web Conference, APWeb 2007 and 8th International Conference on Web-Age Information Management, WAIM 2007 Huang Shan, China, June 16-18, 2007 Proceedings Sprin ger

Advances in Data and Web Management - GBV

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Advances in Data and Web Management - GBV

Guozhu Dong Xuemin Lin Wei Wang Yun Yang Jeffrey Xu Yu (Eds.)

Advances in Data and Web Management

Joint 9th Asia-Pacific Web Conference, APWeb 2007 and 8th International Conference on Web-Age Information Management, WAIM 2007 Huang Shan, China, June 16-18, 2007 Proceedings

Sprin ger

Page 2: Advances in Data and Web Management - GBV

Table of Contents

Keynote

Data Mining Using Fractals and Power Laws 1 Christos Faloutsos

Exploring the Power of Links in Data Mining 2 Jiawei Han

Community Systems: The World Online 3 Raghu Ramakrishnan

A New DBMS Architecture for DB-IR Integration 4 Kyu-Young Whang

Invited Paper

Study on Efficiency and Effectiveness of KSORD 6 Shan Wang, Jun Zhang, Zhaohui Peng, Jiang Zhan, and Qiuyue Wang

Discovering Web Services Based on Probabilistic Latent Factor Model 18

Yanchun Zhang and Jiangang Ma

SCORE: Symbiotic Context Oriented Information Retrieval 30 Prasan Roy and Mukesh Mohania

Process Aware Information Systems: A Human Centered Perspective . . . 39 Clarence A. Ellis and Kwanghoon Kim

Data Mining and Knowledge Discovery I

IMCS: Incremental Mining of Closed Sequential Patterns 50 Lei Chang, Dongqing Yang, Tengjiao Wang, and Shiwei Tang

Mining Time-Shifting Co-regulation Patterns from Gene Expression Data 62

Ying Yin, Yuhai Zhao, Bin Zhang, and Guoren Wang

Tight Correlated Item Sets and Their Efficient Discovery 74 Lizheng Jiang, Dongqing Yang, Shiwei Tang, Xiuli Ma, and Dehui Zhang

Page 3: Advances in Data and Web Management - GBV

XVI Table of Contents

Information Retrieval I

Improved Prediction of Protein Secondary Structures Using Adaptively Weighted Profiles 83

Gouchol Pok, Keun Ho Ryu, and Yong J. Chung

Framework for Building a High-Quality Web Page Collection Considering Page Group Structure 95

Yuxin Wang and Keizo Oyama

Multi-document Summarization Using Weighted Similarity Between Topic and Clustering-Based Non-negative Semantic Feature 108

Sun Park, Ju-Hong Lee, Deok-Hwan Kim, and Chan-Min Ahn

P2P Systems

A Fair Load Balancing Algorithm for Hypercube-Based DHT Networks 116

Guowei Huang, Gongyi Wu, and Zhi Chen

LINP: Supporting Similarity Search in Unstructured Peer-to-Peer Networks 127

Bin Cui, Weining Qian, Linhao Xu, and Aoying Zhou

Generation and Matching of Ontology Data for the Semantic Web in a Peer-to-Peer Framework 136

Chao Wang, Jie Lu, and Guangquan Zhang

Sensor Networks

Energy-Efficient Skyline Queries over Sensor Network Using Mapped Skyline Filters 144

Junchang Xin, Guoren Wang, and Xiaoyi Zhang

An Adaptive Dynamic Cluster-Based Protocol for Target Tracking in Wireless Sensor Networks 157

WenCheng Yang, Zhen Fu, JungHwan Kim, and Myong-Soon Park

Distributed, Hierarchical Clustering and Summarization in Sensor Networks 168

Xiuli Ma, Shuangfeng Li, Qiong Luo, Dongqing Yang, and Shiwei Tang

Spatial and Temporal Databases I

A New Similarity Measure for Near Duplicate Video Clip Detection Xiangmin Zhou, Xiaofang Zhou, and Heng Tao Shen

176

Page 4: Advances in Data and Web Management - GBV

Table of Contents XVII

Efficient Algorithms for Historical Continuous fcNN Query Processing over Moving Object Trajectories 188

Yunjun Gao, Chun Li, Gencai Chen, Qing Li, and Chun Chen

Effective Density Queries for Moving Objects in Road Networks 200 Caifeng Lai, Ling Wang, Jidong Chen, Xiaofeng Meng, and Karine Zeitouni

An Efficient Spatial Search Method Based on SG-Tree 212 Yintian Hu, Changjie Tang, Lei Duan, Tao Zeng, and Chuan Li

Getting Qualified Answers for Aggregate Queries in Spatio-temporal Databases 220

Cheqing Jin, Weibin Guo, and Futong Zhao

Web Mining

Dynamic Adaptation Strategies for Long-Term and Short-Term User Profile to Personalize Search 228

Lin Li, Zhenglu Yang, Botao Wang, and Masaru Kitsuregawa

Using Structured Tokens to Identify Webpages for Data Extraction 241 Ling Lin, Lizhu Zhou, Qi Guo, and Gang Li

Honto? Search: Estimating Trustworthiness of Web Information by Search Results Aggregation and Temporal Analysis 253

Yusuke Yamamoto, Taro Tezuka, Adam Jatowt, and Katsumi Tanaka

A Probabilistic Reasoning Approach for Discovering Web Crawler Sessions 265

Athena Stassopoulou and Marios D. Dikaiakos

An Exhaustive and Edge-Removal Algorithm to Find Cores in Implicit Communities 273

Nan Yang, Songxiang Lin, and Qiang Gao

XML and Semi-structured Da ta I

Active Rules Termination Analysis Through Conditional Formula Containing Updatable Variable 281

Zhongmin Xiong, Wei Wang, and Jian Pei

Computing Repairs for Inconsistent XML Document Using Chase 293 Zijing Tan, Zijun Zhang, Wei Wang, and Baue Shi

An XML Publish/Subscribe Algorithm Implemented by Relational Operators 305

Jiakui Zhao, Dongqing Yang, Jun Gao, and Tengjiao Wang

Page 5: Advances in Data and Web Management - GBV

XVIII Table of Contents

Retrieving Arbitrary XML Fragments from Structured Peer-to-Peer Networks 317

Toshiyuki Amagasa, Chunhui Wu, and Hiroyuki Kitagawa

Data Mining and Knowledge Discovery II

Combining Smooth Graphs with Semi-supervised Learning 329 Liang Liu, Weijun Chen, and Jianmin Wang

Extracting Trend of Time Series Based on Improved Empirical Mode Decomposition Method 341

Hui-ting Liu, Zhi-wei Ni, and Jian-yang Li

Spectral Edit Distance Method for Image Clustering 350 Nian Wang, Jun Tang, Jiang Zhang, Yi-Zheng Fan, and Dong Liang

Mining Invisible Tasks from Event Logs 358 Lijie Wen, Jianmin Wang, and Jiaguang Sun

The Selection of Tunable DBMS Resources Using the Incremental/Decremental Relationship 366

Jeong Seok Oh, Hyun Woong Shin, and Sang Ho Lee

Hyperclique Pattern Based Off-Topic Detection 374 Tianming Hu, Qingui Xu, Huaqiang Yuan, Jiali Hou, and Chao Qu

Sensor Networks and Grids

An Energy Efficient Connected Coverage Protocol in Wireless Sensor Networks 382

Yingchi Mao, Zhuoming Xu, and Yi Liang

A Clustered Routing Protocol with Distributed Intrusion Detection for Wireless Sensor Networks 395

Lan Yao, Na An, Fuxiang Gao, and Ge Yu

Continuous Approximate Window Queries in Wireless Sensor Networks 407

Bin Wang, Xiaochun Yang, Guoren Wang, and Ge Yu

A Survey of Job Scheduling in Grids 419 Congfeng Jiang, Cheng Wang, Xiaohu Liu, and Yinghui Zhao

Query Processing and Optimization

Relational Nested Optional Join for EfHcient Semantic Web Query Processing 428

Artem Chebotko, Mustafa Atay, Shiyong Lu, and Farshad Fotouhi

Page 6: Advances in Data and Web Management - GBV

Table of Contents XIX

Efficient Processing of Relational Queries with Sum Constraints 440 Svetlozar Nestorov, Chuang Liu, and Ian Foster

A Theoretical Framework of Natural Computing - M Good Lattice Points (GLP) Method 452

Jia-xing Cheng, Ling Zhang, and Bo Zhang

Building Data Synopses Within a Known Maximum Error Bound 463 Chaoyi Fang, Qing Zhang, David Hansen, and Anthony Maeder

Exploiting the Structure of Update Fragments for Efficient XML Index Maintenance 471

Katharina Grün and Michael Schrefl

Information Retrieval II

Improvements of HITS Algorithms for Spam Links 479 Yasuhito Asano, Yu Tezuka, and Takao Nishizeki

Efficient Keyword Search over Data-Centric XML Documents 491 Guoliang Li, Jianhua Feng, Na Ta, and Lizhu Zhou

Promotional Ranking of Search Engine Results: Giving New Web Pages a Chance to Prove Their Values 503

Yizhen Zhu, Mingda Wu, Yan Zhang, and Xiaoming Li

Data Stream

Adaptive Scheduling Strategy for Data Stream Management Sys tem. . . . 511 Guangzhong Sun, Yipeng Zhou, Yu Huang, and Yinghua Zhou

A QoS-Guaranteeing Scheduling Algorithm for Continuous Queries over Streams 522

Shanshan Wu, Yanfei Lv, Ge Yu, Yu Gu, and Xiaojing Li

A Simple But Effective Event-Driven Model for Data Stream Queries . . . 534 Yu Gu, Ge Yu, Shanshan Wu, Xiaojing Li, Yanfei Lv, and Dejun Yue

Spatial and Temporal Databases II

Efficient Difference NN Queries for Moving Objects 542 Bin Wang, Xiaochun Yang, Guoren Wang, and Ge Yu

APCAS: An Approximate Approach to Adaptively Segment Time Series Stream 554

Li Junkui and Wang Yuanzhen

Page 7: Advances in Data and Web Management - GBV

XX Table of Contents

Continuous k-Nearest Neighbor Search Under Mobile Environment 566 Jun Feng, Linyan Wu, Yuelong Zhu, Naoto Mukai, and Toyohide Watanabe

Data Integration and Collaborative Systems

Record Extraction Based on User Feedback and Document Selection . . . 574 Jianwei Zhang, Yoshiharu Ishikawa, and Hiroyuki Kitagawa

Density Analysis of Winnowing on Non-uniform Distributions 586 Xiaoming Yu, Yue Liu, and Hongbo Xu

Error-Based Collaborative Filtering Algorithm for Top-N Recommendation 594

Heung-Nam Kim, Ae-Ttie Ji, Hyun-Jun Kim, and Geun-Sik Jo

A PLSA-Based Approach for Building User Profile and Implementing Personalized Recommendation 606

Dongling Chen, Daling Wang, Ge Yu, and Fang Yu

CoXML: A Cooperative XML Query Answering System 614 Shaorong Liu and Wesley W. Chu

Concept-Based Query Transformation Based on Semantic Centrality in Semantic Peer-to-Peer Environment 622

Jason J. Jung, Antoine Zimmerman, and Jeröme Euzenat

Data Mining and E-Learning

Mining Infrequently-Accessed File Correlations in Distributed File System 630

Lihua Yu, Gang Chen, and Jinxiang Dong

Learning-Based Trust Model for Optimization of Selecting Web Services 642

Janarbek Matai and Dong Soo Han

SeCED-FS: A New Approach for the Classification and Discovery of Significant Regions in Medical Images 650

Hui Li, Hanhu Wang, Mei Chen, Teng Wang, and Xuejian Wang

Context-Aware Search Inside e-Learning Materials Using Textbook Ontologies 658

Nimit Pattanasri, Adam Jatowt, and Katsumi Tanaka

Activate Interaction Relationships Between Students Acceptance Behavior and E-Learning 670

Fong-Ling Fu, Hung-Gi Chou, and Sheng-Chin Yu

Page 8: Advances in Data and Web Management - GBV

Table of Contents XXI

Semantic-Based Grouping of Search Engine Results Using WordNet . . . . 678 Reza Hemayati, Weiyi Meng, and Clement Yu

XML and Semi-structured Da ta II

Static Verification of Access Control Model for AXML Documents 687 Il-Gon Kim

SAM: An EfRcient Algorithm for F&B-Index Construction 697 Xianmin Liu, Jianzhong Li, and Hongzhi Wang

BUXMiner: An EfRcient Bottom-Up Approach to Mining XML Query Patterns 709

Yijun Bei, Gang Chen, and Jinxiang Dong

A Web Service Architecture for Bidirectional XML Updating 721 Yasushi Hayashi, Dongxi Liu, Kento Emoto, Kazutaka Matsuda, Zhenjiang Hu, and Masato Takeichi

Data Mining, Privacy, and Security

(a, fc)-anonymity Based Privacy Preservation by Lossy Join 733 Raymond Chi-Wing Wong, Yubao Liu, Jian Yin, Zhilan Huang, Ada Wai-Chee Fu, and Jian Pei

Achieving &-Anonymity Via a Density-Based Clustering Method 745 Hua Zhu and Xiaojun Ye

fc-Anonymization Without Q-S Associations 753 Weijia Yang and Shangteng Huang

Protecting and Recovering Database Systems Continuously 765 Yanlong Wang, Zhanhuai Li, and Juan Xu

Towards Web Services Composition Based on the Mining and Reasoning of Their Causal Relationships 777

Kun Yue, Weiyi Liu, and Weihua Li

Potpourr i

A Dynamically Adjustable Rule Engine for Agile Business Computing Environments 785

Yonghwan Lee, Junaid Ahsenali Chaudhry, Dugki Min, Sunyoung Han, and Seungkyu Park

A Formal Design of Web Community Interactivity 797 Chima Adiele

Page 9: Advances in Data and Web Management - GBV

XXII Table of Contents

Towards a Type-2 Fuzzy Description Logic for Semantic Search Engine 805

Ruixuan Li, Xiaolin Sun, Zhengding Lu, Kunmei Wen, and Yuhua Li

A Type-Based Analysis for Verifying Web Application 813 Woosung Jung, Eunjoo Lee, Kapsu Kim, and Chisu Wu

Homomorphism Resolving of XPath Trees Based on Automata 821 Ming Fu and Yu Zhang

An Efficient Overlay Multicast Routing Algorithm for Real-Time Multimedia Applications 829

Shan Jin, Yanyan Zhuang, Linfeng Liu, and Jiagao Wu

Novel NonGaussianity Measure Based BSS Algorithm for Dependent Signals 837

Fasong Wang, Hongwei Li, and Rui Li

Data Mining and Data Streams

HiBO: Mining Web's Favorites 845 Sofia Stamou, Lefteris Kozanidis, Paraskevi Tzekou, Nikos Zotos, and Dimitris Cristodoulakis

Frequent Variable Sets Based Clustering for Artificial Neural Networks Particle Classification 857

Xin Jin and Rongfang Bie

Attributes Reduction Based on GA-CFS Method 868 Zhiwei Ni, Fenggang Li, Shanling Yang, Xiao Liu, Weili Zhang, and Qin Luo

Towards High Performance and High Availability Clusters of Archived Stream 876

Kai Du, Huaimin Wang, Shuqiang Yang, and Bo Deng

Continuously Matching Episode Rules for Predicting Future Events over Event Streams

Chung-Wen Cho, Ying Zheng, and Arbee L.P. Chen

Author Index 893