Long, regular and short paper presentations are denoted by [pink], [violet] and [green] respectively. Each long paper presentation is allocated with 30 minutes, with 25 minutes for presentation and 5 minutes for questions; each regular paper presentation is allocated with 20 minutes, with 17 minutes for presentation and 3 minutes for questions; and each short paper presentation is allocated with 15 minutes, with 12 minutes for presentation and 3 minutes for questions. 
         
May 21
8:30 -   9:00 Opening      
9:00 - 10:00 Keynote Speech      
  Chair: Einoshin Suzuki      
  Christos Faloutsos; Graph Mining: Laws, Generators and Tools    
  (Room A,B,C)      
10:00 - 10:20 Coffee Break      
10:20 - 11:50 Session 1A: Privacy Preserving Data Mining Session 1B: Web Mining Session 1C: Clustering 1 Session 1D: Network Mining
  (Room A) (Room B) (Room C) (Room D)
  Chair: Mei Kobayashi Chair: Vincent S. Tseng Chair: Saso Dzeroski Chair: Yifeng Zeng
  Protecting Privacy in Incremental Maintenance for Distributed Association Rule Mining SEM: Mining Spatial Events from the Web A Clustering-Oriented Star Coordinate Translation Method for Reliable Clustering Parameterization Mining Bulletin Board Systems Using Community Generation
  Wai Kit Wong, David Wai Lok Cheung, Edward Hung, and Huan Liu Kaifeng Xu, Rui Li, Shenghua Bao, Dingyi Han, and Yong Yu Chieh-Yuan Tsai and Chuang-Cheng Chiu Ming Li, Zhongfei (Mark) Zhang, and Zhi-Hua Zhou
  On Addressing Accuracy Concerns in Privacy Preserving Association Rule Mining Person Name Disambiguation in Web Pages using Social Network; Compound Words and Latent Topics Constrained Clustering for Gene Expression Data Mining Mining Changes in Patent Trends for Competitive Intelligence
  Ling Guo, Songtao Guo, and Xintao Wu Shingo Ono, Issei Sato, Minoru Yoshida, and Hiroshi Nakagawa Vincent S. Tseng, Lien-Chin Chen, and Ching-Pin Kao Meng-Jung Shih, Duen-Ren Liu, and Ming-Li Hsu
  On Privacy in Time Series Data Mining Fighting WebSpam: Detecting spam on the Graph via Content and Link Features G-TREACLE: A New Grid-based and Tree-alike Pattern Clustering Technique for Large Databases Structure-based hierarchical transformations for interactive visual exploration of social networks
  Ye Zhu, Yongjian Fu, and Huirong Fu Yu-Jiu Yang, Shuang-Hong Yang, and Bao-Gang Hu Cheng-Fa Tsai and Chia-Chen Yen Lisa Singh, Mitchell Beard, Brian Gopalan, and Gregory Nelson
    Using Ontology-Based User Preferences to Aggregate Rank Lists in Web Search R-map: Mapping Categorical Data for Clustering and Visualization based On Reference Sets Analyzing the Propagation of Influence and Concept Evolution in Enterprise Social Networks Through Centrality and Latent Semantic Analysis
    Lin Li, Zhenglu Yang, and Masaru Kitsuregawa Zhi-Yong Shen, Ming Li, Yi-Dong Shen, and Jun Sun Weizhong Zhu, Chaomei Chen, and Robert B. Allen
      Efficient Joint Clustering Algorithms in Optimization and Geography Domains A Framework for Discovering Spatio-Temporal Cohesive Networks
      Chia-Hao Lo and Wen-Chih Peng Jin Soung Yoo and Joengmin Hwang
11:50 - 13:20 Lunch      
13:20 - 14:20 Invited Talk      
  Chair: David Cheung      
  Hiroki Arimura; Efficient Algorithms for Mining Frequent and Closed Patterns from Semi-structured Data  
  (Room A,B,C)      
14:20 - 14:40 Coffee Break      
14:40 - 16:10 Session 2A: Feature Selection and Construction Session 2B: Clustering 2 Session 2C: Frequent Itemset 1 Session 2D: Sequence Data Mining 1
  (Room A) (Room B) (Room C) (Room D)
  Chair: Michel Verleysen Chair: Xintao Wu Chair: Chi Ming Kao Chair: Takeaki Uno
  Feature Selection by Nonparametric Bayes Error Minimization Large-scale k-means Clustering with User-Centric Privacy Preservation LCM over ZBDDs: Fast Generation of Very Large-Scale Frequent Itemsets Using a Compact Graph-Based Representation A Framework for Modeling Positive Class Expansion with Single Snapshot
  Shuang-Hong Yang and Bao-Gang Hu Jun Sakuma and Shigenobu Kobayashi Shin-ichi Minato, Takeaki Uno, and Hiroki Arimura Yang Yu and Zhi-Hua Zhou
  Feature construction based on closedness properties is not that simple Scaling Record Linkage to Non-Uniform Distributed Class Sizes Efficient Mining of High Utility Itemsets from Large Datasets Concept Lattice Based Mutation Control for Reactive Motifs Discovery
  Dominique Gay, Nazha Selmaoui, and Jean-Francois Boulicaut Steffen Rendle and Lars Schmidt-Thieme Alva Erwin, Raj P. Gopalan, and Narasimaha Achuthan Kitsana Waiyamai, Peera Liewlom, Thanapat Kangkachit, and Thanawin Rakthanmanon
  Generation of Globally Relevant Continuous Features for Classification Clustering Transaction Datasets Using Seeds FIsViz: A Frequent Itemset Visualizer A Simple Characterization on Serially Constructible Episodes
  Sylvain Letourneau, Stan Matwin, and A. Fazel Famili Yun Sing Koh and Russel Pears Carson Kai-Sang Leung, Pourang P. Irani, and Christopher L. Carmichael Takashi Katoh and Kouichi Hirata
    I/O Scalable Bregman Co-clustering A Tree-Based Approach for Frequent Pattern Mining from Uncertain Data Semantic Video Annotation by Mining Association Patterns from Visual and Speech Features
    Kuo-Wei Hsu, Arindam Banerjee, and Jaideep Srivastava Carson Kai-Sang Leung, Mark Anthony F. Mateo, and Dale A. Brajczuk Vincent S. Tseng, Ja-Hwung Su, Jhih-Hong Huang, and Chih-Jen Chen
16:10 - 16:30 Coffee Break      
16:30 - 18:00 Session 3A: Frequent Itemset 2 Session 3B: Subspace Clustering Session 3C: Decision Tree and Session 3D: Relational and Network Mining
  (Room A) (Room B) (Room C)    Class Imbalance Problem (Room D)
  Chair: Carson K. Leung Chair: Peer Kröger Chair: Zhi-Hua Zhou Chair: Akihiro Yamamoto
  Ambiguous Frequent Itemset Mining and Polynomial Delay Enumeration Mining Quality-Aware Subspace Clusters BOAI: Fast Alternating Decision Tree Induction based on Bottom-up Evaluation Tracking Topic Evolution in On-line Postings: 2006 IBM Innovation Jam data
  Takeaki Uno and Hiroki Arimura Ying-Ju Chen, Yi-Hong Chu, and Ming-Syan Chen Bishan Yang, Tengjiao Wang, Dongqing Yang, and Lei Chang Mei Kobayashi and Raylene Yung
  A Decremental Approach for Mining Frequent Itemsets from Uncertain Data SubClass: Classification of Multidimensional Noisy Data Using Subspace Clusters A comparison of different off-centered entropies to deal with class imbalance for decision trees Entity Network Prediction using Multitype Topic Models
  Chun-Kit Chui and Ben Kao Ira Assent, Ralph Krieger, Petra Welter, Jorg Herbers, and Thomas Seidl Philippe Lenca, Stephane Lallich, Thanh-Nghi Do, and Nguyen-Khang Pham Hitohiro Shiozaki, Koji Eguchi, and Takenao Ohkawa
  A Cluster-Based Genetic-Fuzzy Mining Approach for Items with Multiple Minimum Supports A Creditable Subspace Labeling Method based on D-S Evidence Theory Analyzing PETs on Imbalanced Datasets when Training and Testing Class Distributions Differ Relational pattern mining based on equivalent classes of properties extracted from samples
  Chun-Hao Chen, Tzung-Pei Hong, and Vincent S. Tseng Yu Zong, Xianchao Zhang, He Jiang, and Mingchu Li David Cieslak and Nitesh Chawla Nobuhiro Inuzuka, Jun-ichi Motoyama, Shinpei Urazawa, and Tomofumi Nakano
  CP-tree: A Tree Structure for Single-Pass Frequent Pattern Mining   A New Credit Scoring Method Based on Rough Sets and Decision Tree Exploiting Propositionalization based on Random Relational Rules for Semi-Supervised Learning
  Syed Khairuzzaman Tanbeer, Chowdhury Farhan Ahmed, Byeong-Soo Jeong, and Young-Koo Lee   XiYue Zhou, DeFu Zhang, and Yi Jiang Grant Anderson and Bernhard Pfahringer
         
May 22
9:00 - 10:00 Invited Talk      
  Chair: Graham Williams      
  Michael R. Berthold; Supporting Creativity: Towards Associative Discovery of New Insights  
  (Room A,B,C)      
10:00 - 10:20 Coffee Break      
10:20 - 11:55 Session 4A: Outlier Detection Session 4B: SVM and Regression Session 4C: Rule Discovery Session 4D: Feature and Instance Selection
  (Room A) (Room B) (Room C) (Room D)
  Chair: Arthur Zimek Chair: Masashi Sugiyama Chair: Bernhard Pfahringer Chair: Md Rafiul Hassan
  Unusual Pattern Detection in High Dimensions Extreme Support Vector Machine Minimum Variance Associations --- Discovering Relationships in Numerical Data Automatic Training Example Selection for Unsupervised Record Linkage
  Minh Nguyen, Leo Mark, and Edward Omiecinski  Qiuge Liu, Qing He, and Zhongzhi Shi Szymon Jaroszewicz Peter Christen
  Unsupervised Change Analysis using Supervised Learning A Minimal Description Length Scheme for Polynomial Regression Mining a Complete Set of both Positive and Negative Association Rules from Large Databases Sparse Kernel-based Feature Weighting
  Shohei Hido, Tsuyoshi Ide, Hisashi Kashima, Harunobu Kubo, and Hirofumi Matsuzawa Aleksandar Pekov, Saso Dzeroski, and Ljuptuo Todorovski Hao Wang, Xing Zhang, and Guoqing Chen Shuang-Hong Yang, Yu-Jiu Yang Yang, and Bao-Gang Hu
  Improving the Robustness to Outliers of Mixtures of Probabilistic PCAs Bootstrap based Pattern Selection for Support Vector Regression Combined Association Rule Mining A More Topologically Stable Locally Linear Embedding Algorithm Based on R*-Tree
  Nicolas Delannay, Cedric Archambeau, and Michel Verleysen Dongil Kim and Sungzoon Cho Huaifeng Zhang, Yanchang Zhao, Longbing Cao, and Chengqi Zhang Tian Xia, Jintao Li, Yongdong Zhang, and Sheng Tang
  Cell-based Outlier Detection Algorithm: A Fast Outlier Detection Algorithm for Large Datasets Customer Churn Time Prediction in Mobile Telecommunication Industry using Ordinal Regression Mining Non-Coincidental Rules Without A User Defined Support Threshold Locally Linear Online Mapping for Mining Low-Dimensional Data Manifolds
  You Wan and Fuling Bian Rupesh Gopal and Saroj Meher Yun Sing Koh Huicheng Zheng, Wei Shen, Qionghai Dai, and Sanqing Hu
      Rule Extraction with Rough-Fuzzy Hybridization Method A Selective Classifier for Incomplete Data
      Nan-Chen Hsieh Jingnian Chen, Houkuan Huang, Fengzhan Tian, and Shengfeng Tian
11:55 - 13:00 Lunch      
13:00 - 18:00 Excursion      
         
18:30 - 22:00 Banquet      
         
May 23
9:00 - 10:00 Invited Talk      
  Chair: Huan Liu      
  Genshiro Kitagawa; Prospective Scientific Methodology in Knowledge Society    
  (Room A,B,C)      
10:00 - 10:20 Coffee Break      
10:20 - 11:50 Session 5A: Spatial and Image Data Mining Session 5B: Sequence Data Mining 2 Session 5C: Semi-Supervised Learning Session 5D: Application 1
  (Room A) (Room B) (Room C) (Room D)
  Chair: Jin Soung Yoo Chair: Shin-ichi Minato Chair: Nitesh V. Chawla Chair: Masayuki Numao
  Towards Region Discovery in Spatial Datasets An Efficient Algorithm for Finding Similar Short Substrings from Large Scale String Data Semi-Supervised Local Fisher Discriminant Analysis for Dimensionality Reduction Data-Aware Clustering Hierarchy for Wireless Sensor Networks
  Wei Ding, Rachsuda Jiamthapthaksin, Rachana Parmar, Dan Jiang, Tomasz Stepinski, and Christoph Eick Takeaki Uno Masashi Sugiyama, Tsuyoshi Ide, Shinichi Nakajima, and Jun Sese Xiaochen Wu, Peng Wang, Wei Wang, and Baile Shi
  ANEMI: An Adaptive Neighborhood Expectation-Maximization Algorithm with Spatial Augmented Initialization Accurate and Efficient Retrieval of Multimedia Time Series Data under Uniform Scaling and Time Warping Using Supervised and Unsupervised Techniques to Determine Groups of Patients with Different Continuity of Care Learning User Purchase Intent From User-Centric Data
  Tianming Hu, Hui Xiong, Xueqing Gong, and Sam Yuan Sung Waiyawuth Euachongprasit and Chotirat Ann Ratanamahatana Eu-Gene Siew, Leonid Churilov, Kate A. Smith-Miles, and Joachim P. Sturmberg Rajan Lukose, Jiye Li, Jing Zhou, and Satyanarayana Raju Penmetsa
  A New Model for Image Annotation Characteristic-based Descriptors for Motion Sequence Recognition Forward Semi-Supervised Feature Selection Exploratory Hot Spot Profile Analysis using an Interactive Visual Drill-Down Self-Organizing Maps
  Sanparith Marukatat Liang Wang, Xiaozhe Wang, Christopher Leckie, and Ramamohanarao Kotagiri Jiangtao Ren, Zhengyuan Qiu, Wei Fan, Hong Cheng, and Philip S. Yu Denny, Graham Williams and Peter Christen
  Jumping Emerging Patterns with Occurrence Count in Image Classification   Active Learning with Misclassification Sampling Using Diverse Ensembles Enhanced by Unlabeled Instances Discovering New Orders of the Chemical Elements through Genetic Algorithms
  Lukasz Kobylinski and Krzysztof Walczak   Jun Long, Jianping Yin, En Zhu, and Wentao Zhao Alexandre Blansche and Shuichi Iwata
        Combining Context and Existing Knowledge When Recognizing Biological Entities -- Early results
        Mika Timonen and Antti Pesonen
11:50 - 13:20 Lunch      
13:20 - 14:20 Invited Talk      
  Chair: Kai Ming Ting      
  Robert C. Holte; Cost-sensitive Classifier Evaluation using Cost Curves    
  (Room A,B,C)      
14:20 - 14:40 Coffee Break      
14:40 - 16:10 Session 6A: Classification 1 Session 6B: Graph and Network Mining Session 6C: Statistical Methods Session 6D: Text Mining 1
  (Room A) (Room B) (Room C) (Room D)
  Chair: Ulf Johansson Chair: Michael Berthold Chair: Szymon Jaroszewicz Chair: Manabu Okumura
  Multi-Class Named Entity Recognition via Bootstrapping with Dependency Tree-based Patterns A Mixture Model for Expert Finding A Decomposition Algorithm for Learning Bayesian Network Structures from Data Applying Latent Semantic Indexing in Frequent Itemset Mining for Document Relation Discovery
  Van Dang and Akiko Aizawa Jing Zhang, Jie Tang, Liu Liu, and Juanzi Li Yifeng Zeng and Jorge Cordero Hernandez Thanaruk Theeramunkong, Kritsada Sriphaew, and Manabu Okumura
  An efficient unordered tree kernel and its application to glycan classification Mining Correlated Subgraphs in Graph Databases Tradeoff Analysis of Different Markov Blanket Local Learning Approaches Enriching WordNet with Folksonomies
  Tetsuji Kuboyama, Kouichi Hirata, and Kiyoko F. Aoki-Kinoshita Tomonobu Ozaki and Takenao Ohkawa Shunkai Fu and Michel C. Desmarais Hao Zheng, Xian Wu, and Yong Yu
  Learning Rules for Multiple Target Classification Efficient Mining of Minimal Distinguishing Subgraph Patterns from Graph Databases Query expansion for the language modelling framework using the naive Bayes assumption Automatic Extraction of Basis Expressions that Indicate Economic Trends
  Bernard Zenko and Saso Dzeroski Zhiping Zeng, Jianyong Wang, and Lizhu Zhou Laurence Park and Kotagiri Ramamohanarao Hiroki Sakaji, Hiroyuki Sakai, and Shigeru Masuyama
    What is Frequent in a Single Graph On Discrete Data Modeling Seeing several stars: a rating inference task for a document containing several evaluation criteria
    Bjoern Bringmann and Siegfried Nijssen Nizar Bouguila and Walid Elguebaly Kazutaka Shimada and Tsutomu Endo
16:10 - 16:30 Coffee Break      
16:30 - 18:00 Session 7A: Classification 2 Session 7B: Stream Mining Session 7C: Application 2 Session 7D: Text Mining 2
  (Room A) (Room B) (Room C) (Room D)
  Chair: Robert C. Holte Chair: Hiroki Arimura Chair: Takashi Okada Chair: Dirk E. Van den Poel
  Privacy-Preserving Linear Fisher Discriminant Analysis Handling Numeric Attributes in Hoeffding Trees Designing a system for a process parameter determined through modified PSO and fuzzy neural network Term Committee Based Event Identification Within News Topics
  Shuguo Han and Wee Keong Ng Bernhard Pfahringer, Geoff Holmes, and Richard Kirkby Jui-Tsung Wong, Kuei-Hsien Chen, and Chwen-Tzeng Su Kuo Zhang, JuanZi Li, Gang Wu, and KeHong Wang
  Evaluating Standard Techniques for Implicit Diversity Maintaining Optimal Multi-way Splits for Numerical Attributes in Data Streams Forecasting Urban Air Pollution Using HMM-fuzzy Model A New Framework for Taxonomy Discovery from Text
  Ulf Johansson, Tuve Lofstrom, and Lars Niklasson Tapio Elomaa and Petri Lehtinen M. Maruf Hossain, Md. Rafiul Hassan, and Michael Kirley Ahmad El Sayed, Hakim Hacid, and Djamel Zighed
  Local Projection in Jumping Emerging Patterns Discovery in Transaction Databases Connectivity Based Stream Clustering Using Localised Density Exemplars PAID: Packet Analysis for Anomaly Intrusion Detection Detecting Near-Duplicates in Large-Scale Short Text Databases
  Pawel Terlecki and Krzysztof Walczak Sebastian Luhr and Mihai Lazarescu Kuo-Chen Lee, Jason Chang, and Ming-Syan Chen Caichun Gong, Yulan Huang, Xueqi Cheng, and Shuo Bai
  Fast k Most Similar Neighbor Classifier for Mixed Data based on an Approximation and Elimination algorithm Fast on-line estimation of the joint probability distribution Unmixed Spectrum Clustering for Template Composition in Lung Sound Classification Text Categorization of Multilingual Web Pages on Specific Domain
  Selene Hernandez Rodriguez, J. Ariel Carrasco-Ochoa, and J. Fco. Martinez-Trinidad Jan Peter Patist Tomonari Masada, Senya Kiyasu, and Sueharu Miyahara Jicheng Liu and Chunyan Liang
      The Application of Echo State Network in Stock Data Mining  
      Xiaowei Lin, Zehong Yang, and Yixu Song