Conference Program
| Program Summary
| Program Details
|
Main Conference Paper Presentation
Regular paper presentations are indicated in the program by (Regular). Each regular paper presentation is allocated with 25 minutes, with 20 minutes for presentation and 5 minutes for questions; and each short paper presentation is allocated with 18 minutes, with 15 minutes for presentation and 3 minutes for questions. Please be at the presentation room 10 minutes before the session and contact your session chair. A staff will help you to connect your laptop with the LCD projector. To recognize session chairs, find the sign attached to the name badge.
Program Summary
Click on the workshop name to see the detailed program.
April 27, 2009 | |||
Room | Time | Workshop | Details |
Queen’s Park 4 (2nd floor) | 08:30 - 17:30 | PAISI'09 | http://www.fbe.hku.hk/~mchau/paisi/program.htm |
Queen’s Park 5 (2nd floor) | 08:30 -12:30 | ICEC'09 | http://www.nd.edu/~dial/Workshop2009/ workshop2009.html |
13:30 - 17:30 | QIMIE'09 | http://conferences.telecom-bretagne.eu/ qimie2009/program/ |
|
Queen’s Park 6 (2nd floor) | 08:30 - 12:00 | AIBDM'09 |
http://www.cs.usyd.edu.au/~aibdm09/ aibdm09_schedule.html |
13:00 - 18:30 | OSDM'09 | http://osdm09.togaware.com/ | |
Sakura (37th floor) |
19:00 - 22:00 | Workshop Reception |
April 28, 2009 | ||||
Room A | Room B | Room C | Minutes | |
08:30 - 09:00 | Opening | |||
09:00 - 10:00 | Keynote Speech | |||
10:00 - 10:20 | Coffee Break | |||
10:20 - 12:00 | Session 1A Classification 1 |
Session 1B Privacy Preserving Data Mining |
Session 1C Text Mining 1 |
100 |
12:00 - 13:00 | Lunch | |||
13:00 - 14:40 | Session 2A Semi-Supervised Learning and SVM |
Session 2B Clustering 1 |
Session 2C Sequence Data Mining |
100 |
14:40 - 15:00 | Coffee Break | |||
15:00 -17:00 | Session 3A Statistical Methods and Ensemble |
Session 3B Rule Discovery |
Session 3C Pattern Mining |
120 |
April 29, 2009 | ||||
Room A | Room B | Room C | ||
08:30 - 10:00 | Session 4A Clustering 2 |
Session 4B Web Mining 1 |
Session 4C Text Mining 2 |
90 |
10:00 - 10:20 | Coffee Break | |||
10:20 - 12:00 | Session 5A Outlier Detection |
Session 5B Statistical Methods |
Session 5C Recommendation Systems |
100 |
12:00 - 13:00 | Lunch | |||
13:00 - 14:00 | Invited Speech | |||
14:00 - 15:40 | Session 6A Outlier Detection and Spatial Data Mining |
Session 6B Ensemble Methods |
Session 6C Link Analysis |
100 |
15:40 - 16:00 | Coffee Break | |||
16:00 - 18:00 | Session 7A Feature Selection and Construction |
Session 7B Stream and Time-series Data Mining |
Session 7C Support Vector Machines |
120 |
18:30 - 20:30 | Banquet | |||
April 30, 2009 | ||||
Room A | Room B | Room C | ||
09:00 - 10:00 | Keynote Speech | |||
10:00 - 10:20 | Coffee Break | |||
10:20 - 12:00 | Session 8A Classification and Link Analysis |
Session 8B Web Mining 2 |
Session 8C Text Mining 3 |
100 |
12:00 - 13:00 | Lunch | |||
13:00 - 16:00 | Excursion |
Program Details
April 28, 2009 | ||||
Room A | Room B | Room C | Room D | |
08:30 - 09:00 |
Opening | |||
09:00 - 10:00 |
Keynote
Speech: KDD for BSN – towards the future of pervasive sensing Guang-Zhong Yang, PhD, Imperial College London
|
|||
10:00 - 10:20 |
Coffee Break | |||
10:20 - 12:00 |
Session 1A Classification 1 |
Session 1B Privacy Preserving Data Mining |
Session 1C Text Mining 1 |
Tutorial A |
DTU: A Decision Tree for Uncertain Data Biao Qin, Yuni Xia, and Fang Li |
Efficient Privacy-Preserving Link Discovery Xiaoyun He, Jaideep Vaidya, Basit Shafiq, Nabil Adam, Evimaria Terzi, and
Tyrone Grandison |
Sentence-Level Novelty Detection in English and
Malay
Agus Trisnajaya Kwee, Flora S Tsai, and Wenyin Tang |
Domain-Driven Data Mining: Empowering Actionable Knowledge Delivery Longbing Cao
| |
Safe-Level-SMOTE: Safe-Level-Synthetic Minority
Over-sampling TEchnique for handling the class imbalanced problem
Chumphol Bunkhumpornpat, Krung Sinapiromsaran, and Chidchanok Lursinsap |
On Link Privacy in Randomizing Social Networks
Xintao Wu, and Xiaowei Ying |
Information Extraction from Thai Text with Unknown
Phrase Boundaries
Peerasak Intarapaiboon, Ekawit Nantajeewarawat, and Thanaruk Theeramunkong |
||
Using Highly Expressive Contrast Patterns for
Classification - Is It Worthwhile?
Elsa Loekito, and James Bailey |
Accurate Synthetic Generation of Realistic Personal
Information
Peter Christen, and Agus Pudjijono |
A Corpus-based Approach for Automatic Thai Unknown
Word Recognition using Ensemble Learning Techniques
Jakkrit TeCho, Cholwich Nattee, and Thanaruk Theeramunkong
|
||
Arif Index for Predicting the Classification
Accuracy of Features and its Application in Heart Beat Classification Problem
Muhammad Arif, Fayyaz Afsar, Muhammad Usman Akram, and Adnan Fida
|
An Efficient Approximate Protocol for
Privacy-Preserving Association Rule Mining
Jaideep Vaidya, Murat Kantarcioglu, and Robert Nix |
A Hybrid Approach to Improve Bilingual Multiword
Expression Extraction
Jianyong Duan, Mei Zhang, Lijing Tong, and Feng Guo |
||
UCI++: Improved Support for Algorithm Selection
Using Datasetoids
Carlos Soares |
Addressing the Variability of Natural Language
Expression in Sentence Similarity with Semantic Structure of the
Sentences
Palakorn Achananuparp, Xiaohua Hu, and Christopher C. Yang |
|||
12:00 - 13:00 |
Lunch | |||
13:00 - 14:40 |
Session 2A Semi-supervised Learning and SVM |
Session 2B Clustering 1 |
Session 2C Sequence Data Mining |
Tutorial B |
Robust Graph Hyperparameter Learning for Graph Based
Semi-Supervised Classification
Krikamol Muandet, Sanparith Marukatat, and Cholwich Nattee |
Regularized Local Reconstruction for Clustering
Jun Sun, Zhiyong Shen, Bai Su, and Yi-Dong Shen |
A Polynomial-Delay Polynomial-Space Algorithm for
Extracting Frequent Diamond Episodes from Event Sequences
Takashi Katoh, Hiroki Arimura, and Kouichi Hirata |
Mining Evolution of Complex Structured Data Sourav S Bhowmick |
|
Budget Semi-supervised Learning
Zhi-Hua Zhou, Michael Ng, Qiao-Qiao She, and Yuan Jiang
|
Clustering with lower bound on Similarity
Mohammad Al Hasan, Saeed Salem, Benjarath Pupacdi, and Mohammed Zaki |
Computing Substitution Matrices for Genomic
Comparative Analysis
Minh Duc Cao, Trevor Dix, and Lloyd Allison |
||
When does Co-Training Work in Real Data?
Charles X. Ling, Jun Du, and Zhi-Hua Zhou |
Approximate Spectral Clustering
Liang Wang, Christopher Leckie, Rao Kotagiri, and James Bezdek |
Mining Both Positive and Negative Impact-Oriented
Sequential Rules From Transactional Data: a Case Study in Social Security
Yanchang Zhao, Huaifeng Zhang, Longbing Cao, Chengqi Zhang, and Hans
Bohlscheid |
||
Classification of Audio Signals Using a
Bhattacharyya Kernel-based Centroid Neural Network
Dong-Chul Park, Yunsik Lee, and Dong-Min Woo |
Pairwise Constrained Clustering for Sparse and High
Dimensional Feature Spaces
Su Yan, Hai Wang, Dongwon Lee, and C. Lee Giles |
Aggregated Subset Mining
Albrecht Zimmermann, and Björn Bringmann |
||
Sparse Kernel Learning and the Relevance Units
Machine
Junbin Gao, and Jun Zhang |
Hot Item Detection in Uncertain Data
Matthias Renz, Andreas Zuefle, Hans-Peter Kriegel, and Thomas Bernecker |
|||
14:40 - 15:00 |
Coffee Break | |||
15:00 - 17:00 |
Session 3A Statistical Methods and Ensemble |
Session 3B Rule Discovery |
Session 3C Pattern Mining |
Tutorial B (cont.) |
A Statistical Approach for Binary Vectors Modeling
and Clustering
Nizar Bouguila, and Khalid Daoudi |
Interval Data Classification under Partial
Information: A Chance-Constraint Approach
Sahely Bhadra, Saketha Nath Jagarlapudi, Aharon Ben-Tal, and Chiranjib
Bhattacharyya |
The Studies of Mining Frequent Patterns Based on
Frequent Pattern Tree
Show-Jane Yen, Yue-Shi Lee, Chiu-Kuang Wang, and Jung-Wei Wu |
Mining Evolution of Complex Structured Data Sourav S Bhowmick |
|
Multi-Resolution Boosting for Classification and
Regression
Chandan K Reddy, and Jin-Hyeong Park |
Negative Encoding Length as a Subjective
Interestingness Measure for Groups of Rules
Einoshin Suzuki |
Discovering Periodic-Frequent Patterns in
Transactional Databases
Syed Khairuzzaman Tanbeer, Chowdhury Ahmed, Byeong-Soo Jeong, and Young-Koo
Lee |
||
Spanning Tree Based Attribute Clustering
Yifeng Zeng, and Jorge Cordero H. |
On optimal rules discovery: a framework and a
necessary and sufficient condition for optimality
Yannick Le Bras, Philippe Lenca, and Stéphane Lallich |
Trace Mining from Distributed Assembly Databases for
Causal Analysis
Shohei Hido, Hirofumi Matsuzawa, Fumihiko Kitayama, and Masayuki Numao |
||
The effect of parameter tuning and focusing on bus
travel time prediction
João Moreira, Carlos Soares, Alipio Jorge, and Jorge Freire de Sousa |
Discovering Action Rules that are Highly Achievable
from Massive Data
Einoshin Suzuki
| Let's Tango - Finding the Right Couple for
Feature-Opinion Association in Sentiment Analysis
Kam Tong Chan, and Irwin King |
||
Transfer Learning Action Models by Measuring the
Similarity of Different Domains
Hankui Zhuo, and Qiang Yang |
Extracting Fuzzy Rules for Detecting Ventricular
Arrhythmias Based on NEWFM
Dong-Kun Shin, Sang-Hong Lee, and Joon S. Lim |
An Efficient Candidate Pruning Technique for High
Utility Pattern Mining
Chowdhury Farhan Ahmed, Syed Tanbeer, Byeong-Soo Jeong, and Young-Koo Lee |
||
April 29, 2009 | ||||
Room A | Room B | Room C | Room D | |
8:30 - 10:00 |
Session 4A Clustering 2 |
Session 4B Web Mining 1 |
Session 4C Text Mining 2 |
Tutorial C |
An Integration of Fuzzy Association Rules and
WordNetfor Document Clustering
Chun-Ling Chen, Frank S.C. Tseng, Tyne Liang, and Tyne Liang
| Quantifying Asymmetric Semantic Relations from Query
Logs by Resource Allocation
Zhiyuan Liu, Yabin Zheng, and Maosong Sun |
Text Categorization using Fuzzy Proximal SVM and
Distributional Clustering of words
Arunkumar Mani, and Madan Gopal |
Issues of Mining for Heterogeneous Social Networks Shou-de Lin, Hung-Yi Lo, and Cheng-Te Li
|
|
Nonlinear Data Analysis Using A New Hybrid Data
Clustering Algorithm
Ureerat Wattanachon |
Acquiring Semantic Relations using the Web as
Background Knowledge
Wilson Wong, Wei Liu, and Mohammed Bennamoun |
Cool Blog Classification from Positive and Unlabeled
Examples
Kritsada Sriphaew, Hiroya Takamura, and Manabu Okumura |
||
Clustering Documents using a Wikipedia-based Concept
Representation
Anna Huang, David Milne, Eibe Frank, and Ian H. Witten |
Grouped ECOC Conditional Random Fields for
Prediction of Web User Behavior
Yong Zhen Guo, Kotagiri Ramamohanarao, and Laurence A.F. Park |
Newistic: a distributed news gathering and analysis
platform
Horatiu Mocian, and Ovidiu Dan |
||
An Instantiation of Hierarchical Distance-based
Conceptual Clustering for Propositional Learning
Maria Jose Ramirez-Quintana, Ana Funes, Jose Hernandez-Orallo, and Cesar
Ferri |
CLHQS: Hierarchical Query Suggestion by Mining
Clickthrough Log
Depin Chen, Jun Yan, Zhijun Yin, and Yan Xiong |
Building a Text Classifier by a Keyword and
Unlabeled Documents
Qiu Qiang, Yang Zhang, and Junping Zhu |
||
10:00 - 10:20 |
Coffee Break | |||
10:20 - 12:00 |
Session 5A Outlier Detection |
Session 5B Statistical Methods |
Session 5C Recommendation and Rating Systems |
Tutorial C (cont.) |
Detecting abnormal events via Hierarchical Dirichlet
Processes
Xian-Xing Zhang, Hua Liu, Yang Gao, and Derek Hao Hu |
Active Learning for Causal Bayesian Network
Structure with Non-symmetrical Entropy
Guoliang Li, and Tze Yun Leong |
Long-Term Relevance Feedback for Content-Based Image
Suggestion
Sabri Boutemedjet, and Djemel Ziou |
Issues of Mining for Heterogeneous Social Networks Shou-de Lin, Hung-Yi Lo, and Cheng-Te Li |
|
A New Local Distance-based Outlier Detection
Approach for Scattered Real-World Data
Ke Zhang, Marcus Hutter, and Huidong Jin |
A Comparative Study of Bandwidth Choice in Kernel
Density Estimation for Naive Bayesian Classification
Bin Liu, Geoff Webb, Ying Yang, and Janice Boughton |
COMUS: Ontological and Rule-based Reasoning for
Music Recommendation System
Seungmin Rho |
||
Mining Outliers with Faster Cutoff Update and Space
Utilization
Chi Cheong Szeto, and Edward Hung |
Analysis of Variational Bayesian Matrix
Factorization
Shinichi Nakajima, and Masashi Sugiyama |
Spatial Weighting for Bag-of-Visual-Words
Representation and Its Application in Content-Based Image Retrieval
Xin Chen, Xiaohua Hu, and Xiajiong Shen |
||
Outlier Detection in Axis-Parallel Subspaces of High
Dimensional Data
Hans-Peter Kriegel, Peer Kröger, Erich Schubert, and Arthur Zimek |
An Effective Boosting Mehod for Naïve Bayesian
Classifiers by Local Accuracy Estimation
Zhipeng Xie |
Item Preference Parameters from Grouped Ranking
Observations
Hideitsu Hino, Yu Fujimoto, and Noboru Murata |
||
k-Dominant Skyline Computation by using
Sort-Filtering Method
Md. Anisuzzaman Siddique, and Yasuhiko Morimoto |
Cross-Channel Query Recommendation on Commercial
Mobile Search Engine: Why, How and Empirical Evaluation
Shunkai Fu, Bingfeng Pi, Ying Zhou, Micheal Desmarais, Weilei Wang, Song
Han, and Xunrong Rao |
|||
12:00 - 13:00 |
Lunch | |||
13:00 - 14:00 |
Invited Speech: Finding Hidden Structures in Relational Databases
Yu Xu, Jeffrey, BE, ME, PhD
|
|||
14:00 - 15:40 |
Session 6A Outlier Detection and Spatial Data Mining |
Session 6B Ensemble Methods |
Session 6C Link Analysis |
Tutorial D |
Detecting Link Hijacking by Web Spammers
YOUNGJOO CHUNG, Masashi Toyoda, and Masaru Kitsuregawa |
A Data Driven Ensemble Classifier for Credit Scoring
Analysis
Nan-Chen Hsieh, Lun-Ping Hung, and Chia-Ling Ho |
Exploiting the Block Structure of Link Graph for
Efficient Similarity Computation
Pei Li, Yuanzhe Cai, Hongyan Liu, Jun He, and Xiaoyong Du |
Outlier Detection Techniques Hans-Peter Kriegel, Peer Kröger, Arthur Zimek
|
|
Data Mining for Intrusion Detection: from Outliers
to True Intrusions
Florent MASSEGLIA, Goverdhan Singh, Celine Fiot, Alice Marascu, and Pascal
Poncelet |
A Multi-Partition Multi-Chunk Ensemble Technique to
Classify Concept-Drifting Data Streams
Mohammad Mehedy Masud, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani
Thuraisingham |
Growth Analysis of Neighbor Network for Evaluation
of Damage Progress
Ken-ichi Fukui, Kazuhisa Sato, Junichiro Mizusaki, Kazumi Saito, Masahiro
Kimura, and Masayuki Numao |
||
A Multi-Resolution Approach for Atypical Behaviour
Mining
Florent MASSEGLIA, and Alice Marascu |
Parameter Estimation in Semi-Random Decision Tree
Ensembling on Streaming Data
PeiPei Li, Qianhui Liang, Xindong Wu, and Xuejang Hu |
Link Structure Ranking Algorithm for Trading
Networks
Andri Mirzal |
||
Change Analysis in Spatial Data by Combining
Contouring Algorithms with Supervised Density Functions
Chun Sheng Chen, Vadeerat Rinsurongkawong, Christoph Eick, and Michael Twa |
Diversity in Combinations of Heterogeneous
Classifiers
Kuo-Wei Hsu, and Jaideep Srivastava |
A Parallel Algorithm for Finding Related Pages in
the Web by using Segmented Link Structures
Shen Xiaoyan, Chen Junliang, Meng Xiangwu, Zhang Yujie, and Liu Chuanchag |
||
Centroid Neural Network with Spatial Constraints
Dong-Chul Park |
Boosting Biomedical Information Retrieval
Performance through Citation Graph: An Empirical Study
Xiaoshi Yin, Xiangji Huang, Qinmin Hu, and Zhoujun Li |
|||
15:40 - 16:00 |
Coffee Break | |||
16:00 - 18:00 |
Session 7A Feature Selection and Construction |
Session 7B Stream and Time-series Data Mining |
Session 7C Support Vector Machines |
Tutorial D (cont.) |
Online Feature Selection Algorithm with Bayesian l1
Regularization
Yunpeng Cai, Yijun Sun, and Steve Goodison |
Speeding up Similarity Search on Large Time Series
Dataset Under Time Warping Distance
Pongsakorn Ruengronghirunya, Vit Niennattrakul, and Chotirat Ann
Ratanamahatana
|
Ranking Vector Machine: An Efficient Method for
Learning Ranking SVM
Hwanjo Yu, Youngdae Kim, and Seungwon Hwang |
Outlier Detection Techniques Hans-Peter Kriegel, Peer Kröger, Arthur Zimek
|
|
On Joint Feature Selection and Local Learning Based
Clustering
Yiu-ming Cheung, and Hong Zeng |
A Novel Fractal Representation for Dimensionality
Reduction of Large Time Series Data
Poat Sajjipanon, and Chotirat Ann Ratanamahatana |
A kernel framework for protein residue annotation
Huzefa Rangwala, Christopher Kauffman, and George Karypis |
||
Similarity-based Feature Selection for Learning From
Examples with Continuous Values
Yun Li, Sun-Jun Hu, Wen-Jie Yang, Guo-Zi Sun, Fang-Wu Yao, and Geng Yang |
Clustering Data Streams in Optimization and
Geography Domains
Wen-Chih Peng |
On Pairwise Kernels: An Efficient Alternative and
Generalization Analysis
Hisashi Kashima, Satoshi Oyama, Yoshihiro Yamanishi, and Koji Tsuda |
||
Application-independent feature construction from
noisy samples
Dominique Gay, Nazha Selmaoui, and Jean-FranÃÅòois Boulicaut |
CBDT: A Concept Based Approach to Data Stream Mining
Stefan Hoeglinger, Russel Pears, and Yun Sing Koh |
A Family-based Evolutional Approach for Kernel Tree
Selectionin SVMs
Ithipan Methasate, and Thanaruk Theeramunkong |
||
Estimating Optimal Feature Subsets Using Mutual
Information Feature Selector and Rough Sets
Sombut Foitong, Pornthep Rojanavasu, Boonwat Attachoo, and Ouen Pinngern |
Meaningful Subsequence Matching under
Distance for Data Stream
Vit Niennattrakul, and Chotirat Ann Ratanamahatana |
An Online Incremental Learning Vector Quantization
Ye Xu, Furao Shen, Osamu Hasegawa, and Jinxi Zhao |
||
An Aggregate Ensemble for Mining Data Streams with
both Concept Drifting and Noise
Peng Zhang, Xingquan Zhu, Yong Shi, and Xindong Wu |
||||
18:30 - 20:30 |
Banquet | |||
April 30, 2009 | ||||
Room A | Room B | Room C | Room D | |
9:00 - 10:00 |
Keynote Speech: The future of search: an online content perspective Andrew Tomkins, PhD (Yahoo! Research)
|
|||
10:00 - 10:20 |
Coffee Break | |||
10:20 - 12:00 |
Session 8A Classification and Link Analysis |
Session 8B Web Mining 2 |
Session 8C Text Mining 3 |
|
Dynamic Exponential Family Matrix Factorization
Kohei Hayashi, Jun-ichiro Hirayama, and Shin Ishii |
X-tracking the Changes of Web Navigation Patterns
Long Wang, and Christoph Meinel |
Thai Word Segmentation with Hidden Markov Model and
Decision Tree
Poramin Bheganan, Richi Nayak, and Yue Xu |
||
A Nonparametric Bayesian Learning Model: Application
to Text and Image Categorization
Nizar Bouguila, and Djemel Ziou |
Website Classification using Extended Hidden Markov
Models
Majid Yazdani, Milad Eftekhar, and Hassan Abolhassani |
An efficient method for generating, storing and
matching features for text mining
Chan Shing Kit, and Wai Lam |
||
On Mining Rating Dependencies in Online
Collaborative Rating Networks
Hady W. Lauw, Ee-Peng Lim, and Ke Wang |
Emotion Recognition of Pop Music Based on Maximum
Entropy with Priors
Hui He, Bo Chen, and Jun Guo |
A Discriminative Approach to Topic-based Citation
Suggestion
Jie Tang, and Jing Zhang |
||
Learning to Extract Relations for Relational
Classification
Steffen Rendle, Christine Preisach, and Lars Schmidt-Thieme |
Simultaneously Finding Fundamental Articles and New
Topics Using a Community Tracking Method
Tieyun Qian, Jaideep Srivastava, Zhiyong Peng, and Phillip Sheu |
Romanization of Thai Proper Names Based On
Popularity Of Usages
Akegapon Tangverapong |
||
Towards a Novel Association Measure via Web Search
Results Mining
Xiaojun Wan |
||||
12:00 - 13:00 |
Lunch | |||
13:00 - 16:00 |
Excursion |