headerphoto

The 14th Pacific-Asia Conference on Knowledge Discovery and Data Mining

21-24 June, 2010 - Hyderabad, India

Conference Program

22nd June 2010
SESSION 1A
CLUSTERING
Session Chair: Prabin Panigrahi, Indian Institute of Management
95 mins
1.
A Set Correlation Model for Partitional Clustering
Xuan Vinh Nguyen and Michael E. Houle
Regular
2.
iVAT and aVAT: Enhanced Visual Analysis for Cluster Tendency Assessment
Liang Wang, Uyen Nguyen, James Bezdek, Christopher Leckie and Rao Kotagiri
Regular
3.
A Robust Seedless Algorithm for Correlation Clustering
Mohammad Aziz and Chandan Reddy
Short
4.
Integrative Parameter-free Clustering of Data with Mixed Type Attributes
Christian Bohm, Sebastian Goebl, Annahita Oswald, Claudia Plant, Michael Plavinski and Bianca Wackersreuther
Short
5.
Data Transformation for Sum Squared Residue
Hyuk Cho
Short
 
SESSION 1B
SOCIAL NETWORKS
Session Chair: Kamalakar Karlapalem, International Institute of Information Technology, Hyderabad
95 mins
1.
A Better Strategy of Discovering Link-Pattern based Communities by Classical Clustering Methods
Chen-Yi Lin, Jia-Ling Koh and Arbee L. P. Chen.
Regular
2.
Mining Antagonistic Communities from Social Networks
Kuan Zhang, David Lo and Ee-Peng Lim
Regular
3.
As Time Goes By: Discovering Eras in Evolving Social Networks
Michele Berlingerio, Michele Coscia, Fosca Giannotti, Anna Monreale and Dino Pedreschi
Short
4.
Online Sampling of High Centrality Individuals in Social Networks
Arun Maiya and Tanya Berger-Wolf
Short
5.
Estimate on Expectation for Influence Maximization in Social Networks
Yao Zhang, Qing Gu, Jun Zheng and Daoxu Chen
Short
 
SESSION 2A
PRIVACY
Session Chair: Nitesh Chawla, University of Notre Dame
100 mins
1.
Hiding Emerging Patterns with Local Recoding Generalization
Michael Cheng, William Kwok Wai Cheung and Byron Koon Kau Choi
Regular
2.
Anonymizing Transaction Data by Integrating Suppression and Generalization
Junqiang Liu and Ke Wang
Short
3.
Satisfying Privacy Requirements: One Step Before Anonymization
Xiaoxun Sun, Hua Wang, and Jiuyong Li
Short
4.
Computation of Ratios of Secure Summations in Multi-Party Privacy-Preserving Latent Dirichlet Allocation
Bin Yang and Hiroshi Nakagawa
Short
5.
Privacy-Preserving Network Aggregation
Troy Raeder, Marina Blanton, Nitesh Chawla and Keith Frikken
Short
6.
Multivariate Equi-width Data Swapping For Private Data Publication
Yidong Li and Hong Shen
Short
 
SESSION 2B
NOVEL APPLICATIONS
Session Chair: Ashish V. Tendulkar, IIT Madras
85 mins
1.
Ontology-based Mining of Brainwaves: A Sequence Similarity Technique for Mapping Alternative Features in Event-Related Potentials (ERP) Data
Haishan Liu, Gwen Frishkoff, Robert Frank and Dejing Dou
Regular
2.
Combining Support Vector Machines and the t-Statistic for Gene Selection in DNA Microarray Data Analysis
Tao Yang, Vojislav Kecman, Longbing Cao and Chengqi Zhang
Short
3.
Satrap: Data and network heterogeneity aware P2P data-mining
Hock Hee Ang, Vivekanand Gopalkrishnan, Anwitaman Datta, Wee Keong Ng and Steven C.H. Hoi
Short
4.
Player Performance Prediction in Massively Multiplayer Online Role-Playing Games (MMORPGs)
Kyong Jin Shim, Richa Sharan and Jaideep Srivastava
Short
5.
Relevant Gene Selection Using Normalized Cut Clustering With Maximal Compression Similarity Measure
Rajni Bala, Ramesh Agarwal and Manju Sardana
Short
 
SESSION 2C
CLASSIFICATION I
Session Chair: Chandan Reddy, Wayne State University
95 mins
1.
A Novel Scalable Multi-class ROC for Effective Visualization and Computation
Md Rafiul Hassan, James Bailey, Kotagiri Ramamohanarao and M Maruf Hossain
Regular
2.
Efficiently finding the best parameter for the emerging pattern-based classifier PCL
Thanh-Son Ngo, Mengling Feng, Guimei Liu and Limsoon Wong
Regular
3.
Rough Margin based Core Vector Machines
Gang Niu, Bo Dai, Lin Shang and Yangsheng Ji
Short
4.
BoostML: An Adaptive Metric Learning for Nearest Neighbor Classification
Nayyar Abbas Zaidi, David McG Squire and David Suter
Short
5.
A New Emerging Pattern Mining Algorithm and its Application in Supervised Classification
Milton Garcia-Borroto, Jose Francisco Martinez-Trinidad and Jesus Ariel Carrasco-Ochoa
Short
 
SESSION 3A
PATTERN MINING
Session Chair:Thanaruk Theeramunkong, Sirindhorn International Institute of Technology
120 mins
1.
An Efficient GA-Based Algorithm for Mining Negative Sequential Patterns
zhigang zheng, Yanchang Zhao, Ziye Zuo and Longbing Cao
Regular
2.
Valency based Weighted Association Rule Mining
Yun Sing Koh, Russel Pears and Wai Yeap
Regular
3.
Ranking Sequential Patterns with Respect to Significance
Robert Gwadera and Fabio Crestani
Regular
4.
Mining Association Rules in Long Sequences
Boris Cule and Bart Goethals
Short
5.
Mining Closed Episodes from Event Sequences Efficiently
Wenzhi Zhou, Hongyan Liu and Hong Cheng
Short
5.
Most Significant Substring Mining Based On Chi-square Measure
Sourav Dutta and Arnab Bhattacharya
Short
 
SESSION 3B
RECOMMENDATIONS/ANSWERS
Session Chair:Christine Preisach, University of Hildesheim
115 mins
1.
Probabilistic User Modeling in the Presence of Drifting Concepts
Vikas Bhardwaj and Ramaswamy Devarajan
Regular
2.
Using Association Rules to Solve the Cold-Start Problem in Recommender Systems
Gavin Shaw, Yue Xu and Shlomo Geva
Short
3.
Semi-Supervised Tag Recommendation - Using Untagged Resources to Mitigate Coldstart Problems
Christine Preisach, Leandro Balby Marinho and Lars Schmidt-Thieme
Short
4.
Cost-sensitive Listwise Ranking Approach
Min Lu, MaoQiang Xie, Yang Wang, Jie Liu and YaLou Huang
Short
5.
Mining Wikipedia and Yahoo! Answers for Question Expansion in Opinion QA
Yajie Miao and Chunping Li
Short
6.
Answer Diversification for Complex Question Answering on the Web
Palakorn Achananuparp, Xiaohua Hu, Tingting He, Christopher C. Yang, Yuan An and Lifan Guo
Short
7.
Vocabulary Filtering for Term Weighting in Archived Question Search
Zhao-Yan Ming, Kai Wang, and Tat-Seng Chua
Short
 
SESSION 3C
TOPIC MODELING/INFO EXTRACTION
Session Chair: B. Ravindran, IIT Madras
115 mins
1.
On Finding the Natural Number of Topics with Latent Dirichlet Allocation: Some Observations
Arun R, Suresh V, Veni Madhavan C E and Narasimha Murty M
Regular
2.
Supervising Latent Topic Model for Maximum-Margin Text Classification and Regression
Wanhong Xu
Regular
3.
Topic Decomposition and Summarization
Wei Chen, Can Wang, Chun Chen, Lijun Zhang and Jiajun Bu
Short
4.
Resource-bounded Information Extraction: Acquiring Missing Feature Values On Demand
Pallika Kanani, Andrew McCallum and Shaohan Hu
Regular
5.
Efficient Deep Web Crawling Using Reinforcement Learning
Lu Jiang, Zhaohui Wu, Qian Feng, Jun Liu and Qinghua Zheng
Regular
23nd June 2010
 
SESSION 4A
SKYLINES/UNCERTAINTY
Session Chair:Arbee L.P. Chen, National Chengchi University
85 mins
1.
SkyDist: Data Mining on Skyline Objects
Christian Bohm Annahita Oswald, Claudia Plant, Michael Plavinski and Bianca Wackersreuther
Short
2.
Multi-Source Skyline Queries Processing in Multi-Dimensional Space
Cuiping Li, Wenlin He and Hong Chen
Short
3.
UNN: A Neural Network for uncertain data classification
Jiaqi Ge, Yuni Xia and Chandima Nadungodage
Regular
4.
Efficient Pattern Mining from Uncertain Data with Sampling
Toon Calders, Calin Garboni and Bart Goethals
Short
5.
Classifier Ensemble for Uncertain Data Stream Classification
Shirui Pan, Kuan Wu, Yang Zhang and Xue Li
Short
 
SESSION 4B
DIMENSIONALITY-REDUCTION/PARALLELISM
Session Chair: Arnab Bhattacharya, Indian Institute of Technology, Kanpur
80 mins
1.
Subclass-oriented Dimension Reduction with Constraint Transformation and Manifold Regularization
Bin Tong and Einoshin Suzuki
Regular
2.
Distributed Knowledge Discovery with Non Linear Dimensionality Reduction
Panagis Magdalinos, Michalis Vazirgiannis and Dialecti Valsamou
Regular
3.
DPSP: Distributed Progressive Sequential Pattern Mining on the Cloud
Jen-Wei Huang, Su-Chen Lin and Ming-Syan Chen
Short
4.
An Approach for Fast Hierarchical Agglomerative Clustering using Graphics Processors with CUDA
Arul Shalom S.A., Manoranjan Dash and Minh Tue
Short
 
SESSION 5A
SPATIO-TEMPORAL MINING
Session Chair: R. Rajesh, Bharathiar University
85 mins
1.
Correspondence Clustering: An Approach to Cluster Multiple Related Spatial Datasets
Vadeerat Rinsurongkawong and Christoph F. Eick
Regular
2.
Mining Trajectory Corridors Using Frechet Distance and Meshing Grids
Haohan Zhu, Jun Luo, Hang Yin, Xiaotao Zhou, Joshua Zhexue Huang, Benjamin Zhan
Short
3.
Subseries Join: A Similarity-Based Time Series Match Approach
Yi Lin and Michael D. McCool
Short
4.
TWave: High-Order Analysis of Spatiotemporal Data
Michael Barnathan, Vasileios Megalooikonomou, Christos Faloutsos, Feroze Mohamed and Scott Faro
Short
5.
Spatial Clustering with Obstacles Constraints Using Dynamic Piecewise Linear Chaotic Map and Dynamic Nonlinear PSO
XuePing ZHANG, Haohua Du, and Jiayao Wang
Short
 
SESSION 5B
FEATURE-SELECTION/VISUALIZATION
Session Chair: Raju Bapi, University of Hyderabad
90 mins
1.
A Novel Prototype Reduction Method for the K-Nearest Neighbor Algorithm with K>=1
Tao Yang, Longbing Cao and Chengqi Zhang
Regular
2.
Generalized Two-Dimensional FLD Method for Face Feature Extraction: An Application to Face Recognition
Shiladitya Chowdhury, Jamuna Kanta Sing, Dipak Kumar Basu and Mita Nasipuri
Regular
3.
Learning Gradients with Gaussian Processes
Xinwei Jiang, Junbin Gao, Tianjiang Wang and Paul W. Kwan
Regular
4.
Analyzing the Role Of Dimension Arrangement For Data Visualization in Radviz
Luigi Di Caro, Vanessa Frias Martinez and Enrique Frias Martinez
Short
 
SESSION 6A
GRAPH MINING
Session Chair: Liu Guimei, National University of Singapore
95 mins
1.
Subgraph Mining on Directed and Weighted Graphs
Stephan Gunnemann and Thomas Seidl
Regular
2.
Finding Itemset-Sharing Patterns in a Large Itemset-Associated Graph
Mutsumi Fukuzaki, Mio Seki, Hisashi Kashima and Jun Sese
Regular
3.
A Framework for SQL-based Mining of Large Graphs on Relational Databases
Sriganesh Srihari, Shruti Chandrashekar and Srinivasan Parthasarathy
Short
4.
Fast discovery of reliable k-terminal subgraphs
Melissa Kasari, Hannu Toivonen and Petteri Hintsanen
Short
5.
GTRACE2: Improving Performance Using Labeled Union Graphs
Akihiro Inokuchi and Takashi Washio
Short
 
SESSION 6B
CLUSTERING
Session Chair: Latifur Khan, University of Texas at Dallas
95 mins
1.
Orthogonal Nonnegative Matrix Tri-factorization for Semi-supervised Document Co-clustering
Huifang Ma, Weizhong Zhao, Qing Tan and Zhongzhi Shi
Regular
2.
Fast Orthogonal Nonnegative Matrix Tri-Factorization for Simultaneous Clustering
Zhao Li, Xindong Wu and Zhenyu Lu
Short
3.
Hierarchical Clustering of Webpages via Cross-Page and In-Page Link Structures
Cindy Xide Lin, Yintao Yu, Jiawei Han and Bing Liu
Short
4.
Mining Numbers in Text Using Suffix Arrays and Clustering Based on Dirichlet Process Mixture Models
Minoru Yoshida, Issei Sato, Hiroshi Nakagawa and Akira Terada
Short
5.
Rule Synthesizing from Multiple Related Databases
Dan He, Xindong Wu and Xingquan Zhu
Regular
 
SESSION 7A
OPINION/SENTIMENT MINING
Session Chair: Longbin Cao, University of Technology Sydney
110 mins
1.
Opinion-Based Imprecise Query Answering
Muhammad Abulaish, Tanvir Ahmad, Jahiruddin and Mohammad Najmud Doja
Regular
2.
Blog Opinion Retrieval based on Topic-opinion Mixture Model
Peng Jiang, Chunxia Zhang, Qing Yang and Zhendong Niu
Regular
3.
Feature Subsumption for Sentiment Classification in Multiple Languages
Zhongwu Zhai, Hua Xu, Jun Li and Peifa Jia
Short
4.
Decentralisation of ScoreFinder: A Framework for Credibility Management on User-Generated Contents
Yang Liao, Aaron Harwood and Ramamohanarao Kotagiri
Short
5.
Classification and Pattern Discovery of Mood in Weblogs
Thin Nguyen, Dinh Phung, Brett Adams, Truyen Tran and Svetha Venkatesh
Short
6.
Capture of Evidence for Summarization: An application of enhanced Subjective Logic
Sukanya Manna, B. Sumudu U. Mendis and Tom Gedeon
Short
 
SESSION 7B
STREAM MINING
Session Chair: Dr. M. Saravanan, Ericsson India R&D.
110 mins
1.
Fast Perceptron Decision Tree Learning from Evolving Data Streams
Albert Bifet, Geoff Holmes, Bernhard Pfahringer and Eibe Frank
Regular
2.
Classification and Novel Class Detection in Data Streams with Active Mining
Mohammad M. Masud, Jing Gao, Latifur Khan, Jiawei Han and Bhavani Thuraisingham
Regular
3.
Bulk loading Hierarchical Mixture Models for Efficient Stream Classification
Philipp Kranen, Ralph Krieger, Stefan Denker and Thomas Seidl
Short
4.
Summarizing Multidimensional Data Stream: A Hierarchy-Graph-Based Approach
Yoann Pitarch, Anne Laurent and Pascal Poncelet
Short
5.
Efficient trade-off between speed processing and accuracy in summarizing data stream
Nesrine Gabsi, Fabrice Clerot and Georges Hebrail
Short
6.
Subsequence Matching of StreamSynopses under the Time Warping Distance
Su-Chen Lin, Mi-Yen Yeh and Ming-Syan Chen
Short
24nd June 2010
 
SESSION 8A
SIMILARITY & KERNELS
Session Chair: Vikram Pudi, International Institute of Information Technology Hyderabad
100 mins
1.
Normalized kernels and similarity indices
Julien Ah-Pine
Regular
2.
Adaptive Matching Based Kernels for Labelled Graphs
Adam Woznica, Alexandros Kalousis and Melanie Hilario
Regular
3.
A New Framework for Dissimilarity and Similarity Learning
Adam Woznica, Alexandros Kalousis and Melanie Hilario
Regular
4.
Semantic-Distance Based Clustering for XML Keyword Search
Weidong Yang and Hao Zhu
Regular
 
SESSION 8B
GRAPH ANALYSIS
Session Chair: Sanjay Chawla, University of Sydney
100 mins
1.
OddBall: Spotting Anomalies in Weighted Graphs
Leman Akoglu, Mary McGlohon and Christos Faloutsos
Regular
2.
Robust Outlier Detection Using Commute Time with Eigenspace Embedding
Nguyen Lu Dang Khoa and Sanjay Chawla
Regular
3.
EigenSpokes: Surprising Patterns and Scalable Community Chipping in Large Graphs
B. Aditya Prakash, Ashwin Sridharan, Mukund Seshadri, Sridhar Machiraju and Christos Faloutsos
Regular
4.
Basset: Scalable Gateway Finder in Large Graphs
Hanghang Tong, Spiros Papadimitriou, Christos Faloutsos, Philip Yu and Tina Eliassi-Rad
Regular
 
SESSION 8C
CLASSIFICATION II
Session Chair: Jaideep Srivastava, Universty of Minnesota
105 mins
1.
Ensemble Learning based on Multi-Task Class Labels
Qing Wang and Liang Zhang
Regular
2.
Supervised Learning with Minimal Effort
Eileen A. Ni and Charles X. Ling
Regular
3.
Generating Diverse Ensembles to Counter the Problem of Class Imbalance
T. Ryan Hoens and Nitesh V. Chawla
Regular
4.
Relationship Between Diversity and Correlation in Multi-Classifier Systems
Kuo-Wei Hsu and Jaideep Srivastava
Short
5.
Compact Margin Machine
Bo Dai and Gang Niu
Short