PAKDD2006 Conference Program

Sunday, 9 April 2006

6:00 – 8:00

Reception and early registration

Monday, 10 April 2006

8:00 – 8:45

Registration

8:45 – 9:00

Opening Address

9:00 – 10:00

Keynote Address by Prabhakar Raghavan: “The Changing Face of Web Search”

Chair: Jaideep Srivastava

10:00 – 10:30

Break

10:30 – 12:00

Session 1A
Support Vector Machines

Session 1B
Classification I

Session 1C
Privacy Preservation

Session 1D
Tutorial

12:00 – 1:30

Lunch

1:30 – 3:05

Session 2A
Association Rule Mining I

Session 2B
Bio-data Mining

Session 2C
Clustering I

Session 2D
Tutorial

3:05 – 3:30

Break

3:30 – 5:10

Session 3A
Web Mining

Session 3B
Outlier Detection

Session 3C
Industrial I

Session 3D
(3:30-6:30)
Tutorial

Tuesday, 11 April 2006

8:30 – 9:00

Registration

9:00 – 10:00

Keynote Address by David Hand: “Protection or Privacy? Data Mining and Personal Data”

Chair: Hiroshi Motoda

10:00 – 10:30

Break

10:30 – 12:00

Session 4A
Panel Discussion

Session 4B
Novel Algorithms I

Session 4C
Invited Talk

Session 4D
Multimedia Mining

12:00 – 1:30

Lunch

1:30 – 3:05

Session 5A
Association Rule Mining II

Session 5B
Classification II

Session 5C
Clustering II

Session 5D
Tutorial

3:05 – 3:30

Break

3:30 – 5:05

Session 6A
Temporal Data Mining

Session 6B
Stream Data Mining

Session 6C
Industrial II

Session 6D
Tutorial

5:45

Banquet

Wednesday, 12 April 2006

8:30 – 10:00

Session 7A
Novel Algorithms II

Session 7B
Classification III

Session 7C
Clustering III

10:00 – 10:30

Break

10:30 – 12:00

Session 8A
Association Rule Mining III

Session 8B
Graph and Network Mining

Session 8C
Industrial III

12:00

Close of conference

Monday, 10 April 2006

Session 1A:
Support Vector Machines

Session chair:
Yasutoshi Yajima

Time: 10:30-12:00
Room: Ballroom 1A

Regular papers:

Parallel Randomized Support Vector Machine
Yumao Lu, Vwani Roychowdhury

One-Class Support Vector Machines for Recommendation Tasks
Yasutoshi Yajima

ε-tube based Pattern Selection for Support Vector Machines
Dongil Kim, Sungzoon Cho

Short paper:

Self-Adaptive Two-Phase Support Vector Clustering for Multi-Relational Data Mining
Ping Ling, Yan Wang, Chun-Guang Zhou

Session 1B:
Classification I

Session chair:
Tie-Yan Liu

Time: 10:30-12:00
Room: Vista 1

Regular papers:

Dynamic Category Profiling for Text Filtering and Classification
Rey-Long Liu

Regularized Semi-supervised Classification on Manifold
Lianwei Zhao, Siwei Luo, Yanchang Zhao, Lingzhi Liao, Zhihai Wang

Heterogeneous Information Integration in Hierarchical Text Classification
Huai-Yuan Yang, Tie-Yan Liu, Li Gao, Wei-Ying Ma

Short paper:

FISA: Feature-based Instance Selection for Imbalanced Text Classification
Aixin Sun, Ee-Peng Lim, Boualem Benatallah, Mahbub Hassan

Session 1C:
Privacy Preservation

Session chair:
Chengqi Zhang

Time: 10:30-12:00
Room: Ballroom 1B

Regular papers:

Achieving Private Recommendations Using Randomized Response Techniques
Huseyin Polat, Wenliang Du

On Robust and Effective K-Anonymity in Large Databases
Wen Jin, Rong Ge, Weining Qian

Privacy-Preserving SVM Classification on Vertically Partitioned Data
Hwanjo Yu, Jaideep Vaidya, Xiaoqian Jiang

Short paper:

Bias-free Hypothesis Evaluation in Multirelational Domains
Christine Koerner, Stefan Wrobel

Session 1D:
Tutorial (first half)

Session chair:
Sourav Bhowmick

Time: 10:30-12:00
Room: Vista 2

Outlier Detection, Principles, Techniques and Applications
Sanjay Chawla (University of Sydney)


Monday, 10 April 2006

Session 2A:
Association Rule Mining I

Session chair:
N. Balakrishnan

Time: 1:30-3:05
Room: Ballroom 1A

Regular papers:

Mining Temporal Indirect Associations
Ling Chen, Sourav S Bhowmick, Jinyan Li

SGPM: Static Group Pattern Mining using Apriori-like Sliding Window
John Goh, David Taniar, Ee-Peng Lim

Short papers:

Mining Top-K Frequent Closed Itemsets is Not in APX
Chienwen Wu

Is Frequency Enough for Decision Makers to Make Decisions?
Shichao Zhang, Jeffrey Xu Yu, Jingli Lu, Chengqi Zhang

Ramp: High Performance Frequent Itemset Mining with Efficient Bit-vector Projection Technique
Shariq Bashir, Abdul Rauf Baig

Session 2B:
Bio-data Mining

Session chair:
Sharma Chakravarthy

Time: 1:30-3:05
Room: Vista 1

Regular papers:

Scoring Method for Tumor Prediction from Microarray Data Using an Evolutionary Fuzzy Classifier
Shinn-Ying Ho, Chih-Hung Hsieh, Kuan-Wei Chen, Hui-Ling Huang, Hung-Ming Chen, Shinn-Jang Ho

Efficient Discovery of Structural Motifs from Protein Sequences with Combination of Flexible Intra- and Inter-block Gap Constraints
Chen-Ming Hsu, Chien-Yu Chen, Ching-Chi Hsu, Baw-Jhiune Liu

Short papers:

Finding Consensus Patterns in Very Scarce Biosequence Samples from Their Minimal Multiple Generalizations
Yen Kaow Ng, Takeshi Shinohara

Kernels on Lists and Sets over Relational Algebra: An Application to Classification of Protein Fingerprints
Adam Woznica, Alexandros Kalousis, Melanie Hilario

Mining Quantitative Maximal Hyperclique Patterns: A Summary of Results
Yaochun Huang, Hui Xiong, Weili Wu, Sam Y. Sung

Session 2C:
Clustering I

Session chair:
Richi Nayak

Time: 1:30-3:05
Room: Ballroom 1B

Regular papers:

DeLiClu: Boosting Robustness, Completeness, Usability, and Efficiency of Hierarchical Clustering by a Closest Pair Ranking
Elke Achtert, Christian Boehm, Peer Kroeger

XCLS: A Fast and Effective Clustering Algorithm for Heterogenous XML Documents
Richi Nayak, Sumei Xu 

Neighborhood Density Method for Selecting Initial Cluster Centers in K-means Clustering
Yunming Ye, Joshua Zhexue Huang, Xiaojun Chen, Shuigeng Zhou, Graham Williams, Xiaofei Xu

Short paper:

Uncertain Data Mining: An Example in Clustering Location Data
Michael Chau, Reynold Cheng, Ben Kao, Jackey Ng

Session 2D:
Tutorial (second half)

Session chair:
Sourav Bhowmick

Time: 1:30-3:05
Room: Vista 2

Outlier Detection, Principles, Techniques and Applications
Sanjay Chawla (University of Sydney)


Monday, 10 April 2006

Session 3A:
Web Mining

Session chair:
David Taniar

Time: 3:30-5:10
Room: Ballroom 1A

Regular papers:

Level-biased Statistics in the Hierarchical Structure of the Web
Guang Feng, Tie-Yan Liu, Xu-Dong Zhang, Wei-Ying Ma

iWED: An Integrated Multigraph Cut-based Approach for Detecting Events from A Website
Qiankun Zhao, Sourav S Bhowmick, Aixin Sun

CLEOPATRA: Evolutionary Pattern-based Clustering of Web Usage Data
Qiankun Zhao, Sourav S Bhowmick, Le Gruenwald

Enhancing Duplicate Collection Detection through Replica Boundary Discovery
Zhigang Zhang, Weijia Jia, Xiaoming Li

Session 3B:
Outlier Detection

Session chair:
Osmar Zaïane

Time: 3:30-5:10
Room: Vista 1

Regular papers:

A Nonparametric Outlier Detection Technique for Effectively Discovering Top-N Outliers from Engineering Data
Hongqin Fan, Osmar Zaïane, Andrew Foss, Junfeng Wu

A Fast Greedy Algorithm for Outlier Mining
Zengyou He, Shengchun Deng, Xiaofei Xu, Joshua Zhexue Huang

Ranking Outliers Using Symmetric Neighborhood Relationship
Wen Jin, Anthony K. H. Tung, Jiawei Han, Wei Wang

Construction of Finite Automata for Intrusion Detection from System Call Sequences Using Genetic Algorithms
Kyubum Wee, Sinjae Kim

Session 3C:
Industrial I

Session chair:
Limsoon Wong

Time: 3:30-5:10
Room: Ballroom 1B

Regular papers:

Extracting and Summarizing Hot Item Features across Different Auction Web Sites
Tak-Lam Wong, Wai Lam, Shing-Kit Chan

A Systematic Study of Parameter Correlations in Large Scale Duplicate Document Detection
Shaozhi Ye, Ji-Rong Wen, Wei-Ying Ma

Patterns of Influence in a Recommendation Network
Jurij Leskovec, Ajit Singh, Jon Kleinberg

Mining Unexpected Associations for Signalling Potential Adverse Drug Reactions from Administrative Health Databases
Huidong Jin, Jie Chen, Chris Kelman, Hongxing He, Damien McAullay, Christine M. O’Keefe

Session 3D:
Tutorial

Session chair:
Sourav Bhowmick

Time: 3:30-6:30
Room: Vista 2

Text Clustering: Algorithms, Semantics and Systems
Joshua Huang (University of Hong Kong, China)
Michael Ng (Hong Kong Baptist University, China)


Tuesday, 11 April 2006

Session 4A:
Panel Discussion

Moderator:
Wynne Hsu

Time: 10:30-12:00
Room: Ballroom 1A

The Bright and Dark Side of Data Mining Research
Panelists: Prabhakar Raghavan, Bhavani Thuraisingham, Limsoon Wong, Ke Wang

Session 4B:
Novel Algorithms I

Session chair:
Kyu-Young Whang

Time: 10:30-12:00
Room: Vista 1

Regular papers:

Neighbor Line-based Locally Linear Embedding
De-Chuan Zhan, Zhi-Hua Zhou

Evaluation of Attribute-aware Recommender System Algorithms on Data with Varying Characteristics
Karen H. L. Tso, Lars Schmidt-Thieme

Constructive Meta-Level Feature Selection Method Based on Method Repositories
Hidenao Abe, Takahira Yamaguchi

Short paper:

Predicting Rare Extreme Values
Luis Torgo, Rita Ribeiro

Session 4C:
Invited Talk

Session chair:
Ee-Peng Lim

Time: 10:30-12:00
Room: Ballroom 1B

Data Mining for Surveillance Applications
Bhavani Thuraisingham (University of Texas at Dallas)

Session 4D:
Multimedia Mining

Session chair:
David Cheung

Time: 10:30-12:00
Room: Vista 2

Regular papers:

A Machine Learning Application for Human Resource Data Mining Problem
Zhen Xu, Binheng Song

Multimedia Semantics Integration Using Linguistic Model
Bo Yang, Ali R. Hurson

A Novel Indexing Approach for Efficient and Fast Similarity Search of Captured Motions
Chuanjun Li, B. Prabhakaran

Short paper:

Mining Frequent Spatial Patterns in Image Databases
Wei-Ta Chen, Yi-Ling Chen, Ming-Syan Chen


Tuesday, 11 April 2006

Session 5A:
Association Rule Mining II

Session chair:
Hiroshi Motoda

Time: 1:30-3:05
Room: Ballroom 1A

Regular papers:

Quality-Aware Association Rule Mining
Laure Berti-Équille

IMB3-Miner: Mining Induced/Embedded Subtrees by Constraining the Level of Embedding
Henry Tan, Tharam S. Dillon, Fedja Hadzic, Elizabeth Chang, Ling Feng

Short papers:

Maintaining Frequent Itemsets over High-Speed Data Streams
James Cheng, Yiping Ke, Wilfred Ng

TRIPPER: Rule Learning Using Taxonomies
Flavian Vasile, Adrian Silvescu, Dae-Ki Kang, Vasant Honavar

Generalized Disjunction-Free Representation of Frequents Patterns with At Most k Negations
Marzena Kryszkiewicz

Session 5B:
Classification II

Session chair:
Zhi-Hua Zhou

Time: 1:30-3:05
Room: Vista 1

Regular papers:

RNBL-MN: Recursive Naive Bayes Learner for Sequence Classification
Dae-Ki Kang, Adrian Silvescu, Vasant Honavar

Using Weighted Nearest Neighbor to Benefit from Unlabeled Data
Kurt Driessens, Peter Reutemann, Bernhard Pfahringer, Claire Leschi

Short papers:

Further Improving Emerging Pattern Based Classifiers via Bagging
Hongjian Fan, Ming Fan, Kotagiri Ramamohanarao, Mengxu Liu

Comparison of Documents Classification Techniques to Classify Medical Reports
F. H. Saad, B. de la Iglesia, G. D. Bell

Similarity-based Sparse Feature Extraction Using Local Manifold Learning
Cheong Hee Park

Session 5C:
Clustering II

Session chair:
Longbing Cao

Time: 1:30-3:05
Room: Ballroom 1B

Regular papers:

Iterative Clustering Analysis for Grouping Missing Data in Gene Expression Profiles
Dae-Won Kim, Bo-Yeong Kang

Parallel Density-Based Clustering of Complex Objects
Stefan Brecheisen, Hans-Peter Kriegel, Martin Pfeifle

Clustering Large Collection of Biomedical Literature based on Ontology-enriched Bipartite Graph Representation and Mutual Refinement Strategy
Illhoi Yoo, Xiaohua Hu

Short paper:

Clustering Web Sessions by Levels of Page Similarity
Caren Moraes Nichelle, Karin Becker

Session 5D:
Tutorial (first half)

Session chair:
Sourav Bhowmick

Time: 1:30-3:05
Room: Vista 2

Database Mining: Bringing Algorithms to Data
Sharma Chakravarthy (University of Texas at Arlington, USA)


Tuesday, 11 April 2006

Session 6A:
Temporal Data Mining

Session chair:
Ho Tu Bao

Time: 3:30-5:05
Room: Ballroom 1A

Regular papers:

A Multi-Hierarchical Representation for Similarity Measurement of Time Series
Xinqiang Zuo, Xiaoming Jin

Multistep-Ahead Time Series Prediction
Haibin Cheng, Pang-Ning Tan, Jing Gao, Jerry Scripps

Short papers:

Sequential Pattern Mining with Time Interval
Yu Hirate, Hayato Yamana

A Wavelet Analysis Based Data Processing for Time Series of Data Mining Predicting
Weimin Tong, Yijun Li, Qiang Ye

Variable Support Mining of Frequent Itemsets over Data Streams Using Synopsis Vectors
Ming-Yen Lin, Sue-Chen Hsueh, Sheng-Kun Hwang

Session 6B:
Stream Data Mining

Session chair:
Ramamohanarao Kotagiri

Time: 3:30-5:05
Room: Vista 1

Regular papers:

Hardware Enhanced Mining for Association Rules
Wei-Chuan Liu, Ken-Hao Liu, Ming-Syan Chen

A Single Index Approach for Time-Series Subsequence Matching that Supports Moving Average Transform of Arbitrary Order
Yang-Sae Moon, Jinho Kim

Short papers:

Efficient Mining of Emerging Events in a Dynamic Spatiotemporal Environment
Yu Meng, Margaret H. Dunham 

Distributed Pattern Discovery in Multiple Streams
Jimeng Sun, Spiros Papadimitriou, Christos Faloutsos

COMET: Event-Driven Clustering over Multiple Evolving Streams
Mi-Yen Yeh, Bi-Ru Dai, Ming-Syan Chen

Session 6C:
Industrial II

Session chair:
Limsoon Wong

Time: 3:30-5:05
Room: Ballroom 1B

Regular papers:

Weighted Intra-Transactional Rule Mining for Database Intrusion Detection
Abhinav Srivastava, Shamik Sural, A. K. Majumdar

Image Classification via LZ78 based String Kernel: A Comparative Study
Ming Li, Yanong Zhu

Domain-Driven Actionable Knowledge Discovery in the Real World
Longbing Cao, Chengqi Zhang

Short paper:

Network Data Mining: Discovering Patterns of Interaction between Attributes
John Galloway, Simeon J. Simoff

Session 6D:
Tutorial (second half)

Session chair:
Sourav Bhowmick

Time: 3:30-5:05
Room: Vista 2

Database Mining: Bringing Algorithms to Data
Sharma Chakravarthy (University of Texas at Arlington, USA)


Wednesday, 12 April 2006

Session 7A:
Novel Algorithms II

Session chair:
Takashi Washio

Time: 8:30-10:00
Room: Ballroom 1A

Regular papers:

Improving on Bagging with Input Smearing
Eibe Frank, Bernhard Pfahringer

Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles
Yang Liu, Aijun An, Xiangji Huang

Intelligent Particle Swarm Optimization in Multi-objective Problems
Shinn-Jang Ho, Wen-Yuan Ku, Jun-Wun Jou, Ming-Hao Hung, Shinn-Ying Ho

Short paper:

Hidden Space Principal Component Analysis
Weida Zhou, Li Zhang, Licheng Jiao

Session 7B:
Classification III

Session chair:
Shamik Sural

Time: 8:30-10:00
Room: Vista 1

Regular papers:

Detecting Citation Types Using Finite State Machines
Minh Hoang Le, Tu Bao Ho, Yoshiteru Nakamori

Variable Randomness in Decision Tree Ensembles
Fei Tony Liu, Kai Ming Ting

Generalized Conditional Entropy and a Metric Splitting Criterion for Decision Trees
Dan A. Simovici, Szymon Jaroszewicz

Short paper:

A Multiclass Classification Method Based on the Design of Output Codes
Qi Qiang, Qinming He

Session 7C:
Clustering III

Session chair:
Le Thi Hoai An

Time: 8:30-10:00
Room: Ballroom 1B

Regular papers:

An EM-Approach for Clustering Multi-Instance Objects
Hans-Peter Kriegel, Alexey Pryakhin, Matthias Schubert

Mining Maximal Correlated Member Clusters in High Dimensional Database
Lizheng Jiang, Dongqing Yang, Shiwei Tang, Xiuli Ma, Dehui Zhang

Hierarchical Clustering Based on Mathematical Optimization
Le Hoai Minh, Le Thi Hoai An, Pham Dinh Tao

Short paper:

Clustering Multi-Represented Objects Using Combination Trees
Elke Achtert, Hans-Peter Kriegel, Alexey Pryakhin, Matthias Schubert


Wednesday, 12 April 2006

Session 8A:
Association Rule Mining III

Session chair:
Ho Tu Bao

Time: 10:30-12:00
Room: Ballroom 1A

Regular papers:

Mining Interesting Imperfectly Sporadic Rules
Yun Sing Koh, Nathan Rountree, Richard O’Keefe

Evaluating a Rule Evaluation Support Method Based on Objective Rule Evaluation Indices
Hidenao Abe, Shusaku Tsumoto, Miho Ohsaki, Takahira Yamaguchi

Improved Negative-Border Online Mining Approaches
Ching-Yao Wang, Shian-Shyong Tseng, Tzung-Pei Hong

Short paper:

Association-Based Dissimilarity Measures for Categorical Data: Limitation and Improvement
Si Quang Le, Tu Bao Ho, Le Sy Vinh

Session 8B:
Graph and Network Mining

Session chair:
Manoranjan Dash

Time: 10:30-12:00
Room: Vista 1

Regular papers:

Summarization and Visualization of Communication Patterns in a Large-Scale Social Network
Preetha Appan, Hari Sundaram, Belle Tseng

Constructing Decision Trees for Graph-Structured Data by Chunkingless Graph-Based Induction
Phu Chien Nguyen, Kouzou Ohara, Akira Mogi, Hiroshi Motoda, Takashi Washio

Combination of Smooth Graphs and Semi-Supervised Classification
Xueyuan Zhou, Chunping Li

Short paper:

Enhanced DB-Subdue: Supporting Subtle Aspects of Graph Mining Using a Relational Approach
Ramanathan Balachandran, Srihari Padmanabhan, Sharma Chakravarthy

Session 8C:
Industrial III

Session chair:
Limsoon Wong

Time: 10:30-12:00
Room: Ballroom 1B

Regular papers:

Data Mining Using Relational Database Management Systems
Beibei Zou, Xuesong Ma, Bettina Kemme, Glen Newton, Doina Precup

Towards Automated Design of Large-scale Circuits by Combining Evolutionary Design with Data Mining
Shuguang Zhao, Mingying Zhao, Jun Zhao, Licheng Jiao

An Adaptive Intrusion Detection Algorithm based on Clustering and Kernel-Method
Hansung Lee, Yongwha Chung, Daihee Park

Short paper:

An Intelligent System Based on Kernel Methods for Crop Yield Prediction
A. Majid Awan, Mohd. Noor Md. Sap