List Of Accepted Papers

This year we received 327 submissions, a 37% increase over PAKDD-04 and the highest number of submissions since the first PAKDD in 1997. The submitted papers went through a rigorous reviewing process, in which the Program Committee members were closely involved and reviewers discussed the papers among themselves. As a result, the PAKDD-05 Program Committee accepted 49 regular papers and 49 short papers for publication and oral presentation, representing a 29.8% acceptance rate.


REGULAR PAPERS
  • 104, A Framework for Incorporating Class Priors into Discriminative Classification, by Rong Jin and Yi Liu
  • 109, Subspace Clustering of Text Documents with Feature Weighting K-Means Algorithm, by Liping Jing, Michael K. Ng and Jun Xu
  • 128, Threshold Tuning for Improved Classification Association Rule Mining, by Frans Coenen, Paul Leng and Lu Zhang
  • 131, Increasing Classification Accuracy by Combining Adaptive Sampling and Convex Pseudo-Data, by Chia Huey Ooi and Madhu Chetty
  • 145, WLPMiner: Weighted Frequent Pattern Mining with Length-decreasing support constraints, by Unil Yun and John Leggett
  • 154, Kernels over relational algebra structures, by Adam Woznica, Alexandros Kalousis and Melanie Hilario
  • 156, Bayesian Sequence Learning For Predicting Protein Cleavage Points, by Michael Mayo
  • 162, Rule Extraction from Trained Support Vector Machines, by Ying Zhang, Hongye Su and Jian Chu
  • 171, Gene expression microarray: data mining via weighted prefix trees, by Tran Trang, Nguyen Cam Chi and Hoang Ngoc Minh
  • 200, Unifying Signature-based and Anomaly-based Intrusion Detection, by Zhuowei Li and Amitabha Das
  • 215, A MPAA-Based Iterative Clustering Algorithm augmented by Nearest Neighbors Search for Time-Series Data Streams, by Jian-Wei Liu
  • 243, Where are the motifs in time-series data, by Zheng Liu, Jeffrey Xu Yu, Xuemin Lin, Hongjun Lu and Wei Wang
  • 282, A Kernel Function Method in Clustering, by Ling Zhang, Tao Wu and Yanping Zhang
  • 284, An Efficient Framework for Mining Flexible Constraints, by Arnaud Soulet and Bruno Cremilleux
  • 285, SETRED: Self-Training with Editing, by Ming Li and Zhi-Hua Zhou
  • 291, Adjusting Mixture Weights of Gaussian Mixture Model via Regularized Probabilistic Latent Semantic Analysis, by Luo Si and Rong Jin
  • 299, Improvements of IncSpan: Incremental Mining of Sequential Patterns in Large Database, by Son Nguyen, Xingzhi Sun and Maria Orlowska
  • 312, Pruning Derivative Partial Rules During Impact Rule Discovery, by Shiying Huang and Geoffrey Webb
  • 315, Cl-GBI: A Novel Approach for Extracting Typical Patterns from Graph-Structured Data, by Phu Chien Nguyen, Kouzou Ohara, Hiroshi Motoda and Takashi Washio
  • 318, NBC: A Neighborhood-Based Clustering Algorithm, by Shuigeng Zhou, Yue Zhao, Jihong Guan and Joshua Z. Huang
  • 321, Using Consensus Susceptibility and Consistency Measures for Inconsistent Knowledge Management, by Ngoc Thanh Nguyen and Michal Malowiecki
  • 325, Automatic Occupation Coding with Combination of Machine Learning and Hand-Crafted Rules, by Kazuko Takahashi, Hiroya Takamura and Manabu Okumura
  • 327, A DNA Index Structure Using Frequency and Position Information, by Woo-Cheol Kim, Sanghyun Park, Jung-Im Won, Sang-Wook Kim and Jee-Hee Yoon
  • 332, A new informative generic base of association rules, by Ghada Gami, Sadok Ben Yahia, Engelbert Mephu Nguifo and Yahya Slimani
  • 343, Support Oriented Discovery of Generalized Disjunction-Free Representation of Frequent Patterns with Negation, by Marzena Kryszkiewicz and Katarzyna Cichon
  • 347, Mining Frequent Trees with Node-Inclusion Constraints, by Atsuyoshi Nakamura
  • 351, Retrieval Based on Language Model with Relative Entropy and Feedback, by Hua Huo
  • 367, Improved Bayesian Spam Filtering Based on Co-weighted Multi-area Information, by Raju Shrestha and Ya Ping Lin
  • 369, Finding Sporadic Rules Using Apriori-Inverse, by Yun Sing Koh and Nathan Rountree
  • 372, A Novel Indexing Method for Efficient Sequence Matching in Large DNA Database Environment, by Jung-Im Won, Jee-Hee Yoon, Sanghyun Park and Sang-Wook Kim
  • 412, Efficient Sampling: Application to Image Data, by Surong Wang, Manoranjan Dash and Liang-Tien Chia
  • 432, Extraction of Frequent Few-Overlapped Monotone DNF Formulas with Depth-First Pruning, by Yoshikazu Shima, Kouichi Hirata and Masateru Harao
  • 438, Text Classification for DAG-Structured Categories, by Cao Nguyen, Tran Dung and Tru Cao
  • 448, ADenTS: An Adaptive Density-based Tree Structure for Approximating Aggregate Queries over Real Attributes, by Tianyi Wu, Jian Xu, Chen Wang, Wei Wang and Baile Shi
  • 461, Sentiment Classification using Word Sub-Sequences and Dependency Sub-Trees, by Shotaro Matsumoto, Hiroya Takamura and Manabu Okumura
  • 466, Improving Rough Classifiers Using Concept Ontology, by Sinh Hoa Nguyen and Hung Son Nguyen
  • 472, Pushing Tougher Constraints in Frequent Pattern Mining, by Francesco Bonchi and Claudio Lucchese
  • 473, An Efficient Compression Technique for Frequent Itemset Generation in Association Rule Mining, by Mafruz Ashrafi, David Taniar and Kate Smith
  • 480, Approximated Clustering of Distributed High-Dimensional Data, by Peter Kunath, Hans-Peter Kriegel, Martin Pfeifle and Matthias Renz
  • 483, Speeding-up hierarchical agglomerative clustering in presence of expensive metrics, by Mirco Nanni
  • 489, Dynamic Cluster Formation using Level Set Methods, by Andy Yip, Chris Ding and Tony Chan
  • 493, Improving Mining Quality by Exploiting Data Dependency, by Fang Chu, Yizhou Wang, Carlo Zaniolo and D. Stott Parker
  • 494, A vector field visualization technique for Self-Organizing Maps, by Georg Polzlbauer, Andreas Rauber and Michael Dittenbach
  • 508, PatZip: Pattern-Preserved Spatial Data Compression, by Yu Qian, Kang Zhang and D. T. Huynh
  • 513, Visualization of Cluster Changes by Comparing Self-Organizing Maps, by Denny and David Squire
  • 516, Cluster-based Rough Set Construction, by Qiang Li
  • 519, Progressive Sampling for Association Rules based on Sampling Error Estimation, by Ming-Syan Chen, Kun-Ta Chuang and Wen-Chieh Yang
  • 529, Computing Cyclic Pattern Kernel for Predictive Graph Mining, by Tamas Horvath
  • 533, QED: An Efficient Framework for Temporal Region Query Processing, by Yi-Hong Chu, Kun-Ta Chuang and Ming-Syan Chen


SHORT PAPERS
  • 110, Using Term Clustering and Supervised Term Affinity Construction to Boost Text Classification, by Chong Wang and Wenyuan Wang
  • 111, Feature Selection for High Dimensional Face Image Using Self-Organizing Maps, by Xiaoyang Tan, Songcan Chen, Zhi-hua Zhou and Fuyan Zhang
  • 113, A likelihood ratio distance measure for the similarity between the fourier transform of time series, by Anthony Bagnall, Gareth Janacek and Michael Powell
  • 130, CLeVer: A Feature Subset Selection Technique for Multivariate Time Series, by Kiyoung Yang, Hyunjin Yoon and Cyrus Shahabi
  • 146, The TIMERS II Algorithm for the Discovery of Causality, by Kamran Karimi and Howard Hamilton
  • 173, Using Rough Set in Feature Selection and Reduction in Face Recognition Problem, by Bac Le Hoai and Tuan Nguyen Anh
  • 180, Analysis of company growth data using genetic algorithms on binary trees, by Gerrit K. Janssens, Kenneth Sorensen, Arthur Limere and Koen Vanhoof
  • 193, Adaptive Nonlinear Auto-associate Modeling through Manifold Learning with Applications for Character and Digit Recognition, by Junping Zhang and Stan Z. Li
  • 216, Dynamic Mining Hierarchical Topic from Web News Stream Data using Divisive-Agglomerative Clustering Method, by Jian-Wei Liu
  • 223, Considering Re-occurring Features in Associative Classifiers, by Rafal Rak, Wojciech Stach, Osmar Zaiane and Maria-Luiza Antonie
  • 230, Collecting Topic-related Web Pages for Link Structure Analysis by Using a Potential Hub and Authority First Approach, by Leuo-hong Wang and Tong-wen Lee
  • 237, A New Evolutionary Neural Network Classifier, by Arit Thammano and Asavin Meengen
  • 246, A Privacy-Preserving Classification Mining Algorithm, by Weiping Ge, Wei Wang, Xiaorong Li and Baile Shi
  • 259, Maximizing Tree Diversity by Building Complete-Random Decision Trees, by Fei Tony Liu, Kai Ming Ting and Wei Fan
  • 260, Conditional Random Fields for Transmembrane Helix Prediction, by Lior Lukov, Sanjay Chawla and W. Bret Church
  • 278, A Recent-Biased Technique for Dimension Reduction, by Yanchang Zhao, Chengqi Zhang and Shichao Zhang
  • 289, Stochastic local clustering for massive graphs, by Satu Elisa Schaeffer
  • 292, Combining Classifiers with Multi-Representation of Context in Word Sense Disambiguation, by Cuong Anh Le, Van Nam Huynh and Akira Shimazu
  • 294, Learning Bayesian Networks Structures from Incomplete Data: An Efficient Approach Based on Extended Evolutionary Programming, by Li Xiaolin
  • 295, Performance Measurements for Privacy Preserving Data Mining, by Nan Zhang, Wei Zhao and Jianer Chen
  • 304, Training Support Vector Machines Using Greedy Stagewise Algorithm, by Liefeng Bo, Ling Wang and Licheng Jiao
  • 317, Covariance and PCA for Categorical Variables, by Hirotaka Niitsuma and Takashi Okada
  • 326, Technology Trends Analysis from the Internet Resources, by Shin-ichi Kobayashi, Yasuyuki Shirai, Kazuo Hiyane, Fumihiro Kumeno, Hiroshi Inujima and Noriyoshi Yamauchi
  • 330, Graph Partition Model for Robust Temporal Data Segmentation, by Yuan Jinhui, Zhang Bo and Lin Fuzong
  • 333, A Divide and Conquer approach for deriving partially ordered sub-structures, by Sadok Ben Yahia, Yahya Slimani and Jihen Rezgui
  • 334, Accurate Symbolization of Time Series, by Xinqiang Zuo and Xiaoming Jin
  • 341, A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering, by Chotirat Ann Ratanamahatana, Eamonn Keogh, Anthony Bagnall and Stefano Lonardi
  • 342, Mining Mobile Group Patterns: A Trajectory-based Approach, by San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu and Ee-Peng Lim
  • 345, An Automatic Unsupervised Querying Algorithm for Efficient Information Extraction in Biomedical Domain, by Min Song, Il-Yeol Song, Xiaohua Hu and Robert Allen
  • 373, Voting fuzzy k-NN to predict protein subcellular localization from normalized amino acid pair compositions, by Quang Tung Thai, Doheon Lee, Dae-Won Kim and Jong-Tae Lim
  • 376, Feature Selection Algorithm for Data with Both Nominal and Continuous Features, by Wenyin Tang and Kezhi Mao
  • 379, A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets, by Ying Liu, Wei-keng Liao and Alok Choudhary
  • 380, Automatic View Selection: An Application to Image Mining, by Manoranjan Dash and Deepak Kolipakkam
  • 396, Improved Self-Splitting Competitive Learning Algorithm, by Jun Liu and Rao Kotagiri
  • 442, Finding temporal features of event-oriented patterns, by Xingzhi Sun, Maria Orlowska and Xue Li
  • 451, Comparison of Tree based methods on mammography data, by Richard De Veaux and Thu Hoang
  • 460, A Top-down Algorithm for Web Log Sequential Pattern Mining, by Guo Jian-kui
  • 481, Visual Interactive Evolutionary Algorithm for High Dimensional Data Clustering and Outlier Detection, by Boudjeloud Lydia and Poulet Francois
  • 485, On Multiple Query Optimization in Data Mining, by Maciej Zakrzewicz and Marek Wojciechowski
  • 488, Mining Time-Profiled Associations, by Jin Soung Yoo, Pusheng Zhang and Shashi Shekhar
  • 490, Frequent Itemset Mining with Parallel RDBMS, by Xuequn Shang
  • 497, Online Algorithms for Mining Inter-Stream Associations From Large Sensor Networks, by Kin Kong Loo, Ivy Tong and Ben Kao
  • 505, Can We Apply Projection Based Frequent Pattern Mining Paradigm to Spatial Co-location Mining?, by Yan Huang, Liqin Zhang and Ping Yu
  • 506, Mining Frequent Ordered Patterns, by Zhi-Hong Deng, Cong-Rui Ji, Ming Zhang and Shi-Wei Tang
  • 507, Automatic Extraction of Low Frequency Bilingual Word Pairs from Parallel Corpora with Various Languages, by Hiroshi Echizen-ya
  • 510, An Anomaly Detection Method for Spacecraft using Relevance Vector Learning, by Ryohei Fujimaki, Takehisa Yairi and Kazuo Machida
  • 514, An Incremental Data Stream Clustering Algorithm Based on Dense Units Detection, by Jing Gao, Jianzhong Li, Zhaogong Zhang and Pang-Ning Tan
  • 524, Kernel principal component analysis for content based image retrieval, by Guang-Ho Cha
  • 526, Dynamic Fuzzy Clustering for Recommender Systems, by Sung-Hwan Min

