PAKDD-05 - Accepted Papers

List Of Accepted Papers

This year we received 327 submissions, a 37% increase over PAKDD-04, which is the highest number of submissions since the first PAKDD in 1997. The submitted papers went through a rigorous reviewing process. The Program Committee members were deeply involved in a highly engaging selection process with discussions among reviewers. As a result, the PAKDD-05 Program Committee accepted for publication and oral presentation 49 regular papers and 49 short papers, representing a 29.8% acceptance rate.

REGULAR PAPERS

104, A Framework for Incorporating Class Priors into Discriminative Classification, by Rong Jin and Yi Liu
109, Subspace Clustering of Text Documents with Feature Weighting K-Means Algorithm, by Liping Jing, Michael K. Ng and Jun Xu
128, Threshold Tuning for Improved Classification Association Rule Mining, by Frans Coenen, Paul Leng and Lu Zhang
131, Increasing Classification Accuracy by Combining Adaptive Sampling and Convex Pseudo-Data, by Chia Huey Ooi and Madhu Chetty
145, WLPMiner: Weighted Frequent Pattern Mining with Length-decreasing support constraints, by Unil Yun and John Leggett
154, Kernels over relational algebra structures, by Adam Woznica, Alexandros Kalousis and Melanie Hilario
156, Bayesian Sequence Learning For Predicting Protein Cleavage Points, by Michael Mayo
162, Rule Extraction from Trained Support Vector Machines, by Ying Zhang, Hongye Su and Jian Chu
171, Gene expression microarray : data mining via weighted prefix trees, by Tran Trang, Nguyen Cam Chi and Hoang Ngoc Minh
200, Unifying Signature-based and Anomaly-based Intrusion Detection, by Zhuowei Li and Amitabha Das
215, A MPAA-Based Iterative Clustering Algorithm augmented by Nearest Neighbors Search for Time-Series Data Streams, by Jian-Wei Liu
243, Where are the motifs in time-series data, by Zheng Liu, Jeffrey Xu Yu, Xuemin Lin, Hongjun Lu and Wei Wang
282, A Kennel Function Method in Clustering, by Ling Zhang, Tao Wu and Yanping Zhang
284, An Efficient Framework for Mining Flexible Constraints, by Arnaud Soulet and Bruno Cremilleux
285, SETRED: Self-Training with Editing, by Ming Li and Zhi-Hua Zhou
291, Adjusting Mixture Weights of Gaussian Mixture Model via Regularized Probabilistic Latent Semantic Analysis, by Luo Si and Rong Jin
299, Improvements of IncSpan: Incremental Mining of Sequential Patterns in Large Database, by Son Nguyen, Xingzhi Sun and Maria Orlowska
312, Pruning Derivative Partial Rules During Impact Rule Discovery, by Shiying Huang and Geoffrey Webb
315, Cl-GBI: A Novel Approach for Extracting Typical Patterns from Graph-Structured Data, by Phu Chien Nguyen, Kouzou Ohara, Hiroshi Motoda and Takashi Washio
318, NBC: A Neighborhood-Based Clustering Algorithm, by Shuigeng Zhou, Yue Zhao, Jihong Guan and Joshua Z. Huang
321, Using Consensus Susceptibility and Consistency Measures for Inconsistent Knowledge Management, by Ngoc Thanh Nguyen and Michal Malowiecki
325, Automatic Occupation Coding with Combination of Machine Learning and Hand-Crafted Rules, by Kazuko Takahashi, Hiroya Takamura and Manabu Okumura
327, A DNA Index Structure Using Frequency and Position Information, by Woo-Cheol Kim, Sanghyun Park, Jung-Im Won, Sang-Wook Kim and Jee-Hee Yoon
332, A new informative generic base of association rules, by Ghada Gami, Sadok Ben Yahia, Engelbert Mephu Nguifo and Yahya Slimani
343, Support Oriented Discovery of Generalized Disjunction-Free Representation of Frequent Patterns with Negation, by Marzena Kryszkiewicz and Katarzyna Cichon
347, Mining Frequent Trees with Node-Inclusion Constraints, by Atsuyoshi Nakamura
351, Retrieval Based on Language Model with Relative Entropy and Feedback, by Hua Huo
367, Improved Bayesian Spam Filtering Based on Co-weighted Multi-area Information, by Raju Shrestha and Ya Ping Lin
369, Finding Sporadic Rules Using Apriori-Inverse, by Yun Sing Koh and Nathan Rountree
372, A Novel Indexing Method for Efficient Sequence Matching in Large DNA Database Environment, by Junglm Won, Jee-Hee Yoon, Sanghyun Park and Sang-Wook Kim
412, Efficient Sampling: Application to Image Data, by Surong Wang, Manoranjan Dash and Liang-Tien Chia
432, Extraction of Frequent Few-Overlapped Monotone DNF Formulas with Depth-First Pruning, by Yoshikazu Shima, Kouichi Hirata and Masateru Harao
438, Text Classification for DAG-Structured Categories, by Cao Nguyen, Tran Dung and Tru Cao
448, ADenTS: An Adaptive Density-based Tree Structure for Approximating Aggregate Queries over Real Attributes, by Tianyi Wu, Jian Xu, Chen Wang, Wei Wang and Baile Shi
461, Sentiment Classification using Word Sub-Sequences and Dependency Sub-Trees, by Shotaro Matsumoto, Hiroya Takamura and Manabu Okumura
466, Improving Rough Classifiers Using Concept Ontology, by Sinh Hoa Nguyen and Hung Son Nguyen
472, Pushing Tougher Constraints in Frequent Pattern Mining, by Francesco Bonchi and Claudio Lucchese
473, An Efficient Compression Technique for Frequent Itemset Generation in Association Rule Mining, by Mafruz Ashrafi, David Taniar and Kate Smith
480, Approximated Clustering of Distributed High-Dimensional Data, by Peter Kunath, Hans-Peter Kriegel, Martin Pfeifle and Matthias Renz
483, Speeding-up hierarchical agglomerative clustering in presence of expensive metrics, by Mirco Nanni
489, Dynamic Cluster Formation using Level Set Methods, by Andy Yip, Chris Ding and Tony Chan
493, Improving Mining Quality by Exploiting Data Dependency, by Fang Chu, Yizhou Wang, Carlo Zaniolo and D.Stott Parker
494, A vector field visualization technique for Self-Organizing Maps, by Georg Polzlbauer, Andreas Rauber and Michael Dittenbach
508, PatZip: Pattern-Preserved Spatial Data Compression, by Yu Qian, Kang Zhang and D.T.Huynh
513, Visualization of Cluster Changes by Comparing Self-Organizing Maps, by Denny and David Squire
516, Cluster-based Rough Set Construction, by Qiang Li
519, Progressive Sampling for Association Rules based on Sampling Error Estimation, by Ming-Syan Chen, Kun-Ta Chuang and Wen-Chieh Yang
529, Computing Cyslic Pattern Kernel for Predictive Graph Mining, by Tamas Horvath
533, QED : An Efficient Framework for Temporal Region Query Processing , by Yi-Hong Chu, Kun-Ta Chuang and Ming-Syan Chen

SHORT PAPERS

110, Using Term Clustering and Supervised Term Affinity Construction to Boost Text Classification, by Chong Wang and Wenyuan Wang
111, Feature Selection for High Dimensional Face Image Using Self-Organizing Maps, by Xiaoyang Tan, Songcan Chen, Zhi-hua Zhou and Fuyan Zhang
113, A likelihood ratio distance measure for the similarity between the fourier transform of time series, by Anthony Bagnall, Gareth Janacek and Michael Powell
130, CLeVer: A Feature Subset Selection Technique for Multivariate Time Series, by Kiyoung Yang, Hyunjin Yoon and Cyrus Shahabi
146, The TIMERS II Algorithm for the Discovery of Causality, by Kamran Karimi and Howard Hamilton
173, Using Rough Set in Feature Selection and Reduction in Face Recognition Problem, by Bac Le Hoai and Tuan Nguyen Anh
180, Analysis of company growth data using genetic algorithms on binary trees, by Gerrit K. Janssens, Kenneth Sorensen, Arthur Limere and Koen Vanhoof
193, Adaptive Nonlinear Auto-associate Modeling through Manifold Learning with Applications for Character and Digit Recognition, by Junping Zhang and Stan Z. Li
216, Dynamic Mining Hierarchical Topic from Web News Stream Data using Divisive-Agglomerative Clustering Method, by Jian-Wei Liu
223, Considering Re-occurring Features in Associative Classifiers, by Rafal Rak, Wojciech Stach, Osmar Zaiane and Maria-Luiza Antonie
230, Collecting Topic-related Web Pages for Link Structure Analysis by Using a Potential Hub and Authority First Approach, by Leuo-hong Wang and Tong-wen Lee
237, A New Evolutionary Neural Network Classifier, by Arit Thammano and Asavin Meengen
246, A Privacy-Preserving Classification Mining Algorithm, by Weiping Ge, Wei Wang, Xiaorong Li and Baile Shi
259, Maximizing Tree Diversity by Building Complete-Random Decision Trees, by Fei Tony Liu, Kai Ming Ting and Wei Fan
260, Conditional Random Fields for Transmembrane Helix Prediction, by Lior Lukov, Sanjay Chawla and W. Bret Church
278, A Recent-Biased Technique for Dimension Reduction, by Yanchang Zhao, Chengqi Zhang and Shichao Zhang
289, Stochastic local clustering for massive graphs, by Satu Elisa Schaeffer
292, Combining Classifiers with Multi-Representation of Context in Word Sense Disambiguation, by Cuong Anh Le, Van Nam Huynh and Akira Shimazu
294, Learning Bayesian Networks Structures from Incomplete Data: An Efficient Approach Based on Extended Evolutionary Programming, by Li Xiaolin
295, Performance Measurements for Privacy Preserving Data Mining, by Nan Zhang, Wei Zhao and Jianer Chen
304, Training Support Vector Machines Using Greedy Stagewise Algorithm, by Liefeng Bo, Ling Wang and Licheng Jiao
317, Covariance and PCA for Categorical Variables, by Hirotaka Niitsuma and Takashi Okada
326, Technology Trends Analysis from the Internet Resources, by Shin-ichi Kobayashi, Yasuyuki Shirai, Kazuo Hiyane, Fumihiro Kumeno, Hiroshi Inujima and Noriyoshi Yamauchi
330, Graph Partition Model for Robust Temporal Data Segmentation, by Yuan Jinhui , Zhang Bo and Lin Fuzong
333, A Divide and Conquer approach for deriving partially ordered sub-structures, by Sadok Ben Yahia, Yahya Slimani and Jihen Rezgui
334, Accurate Symbolization of Time Series, by Xinqiang Zuo and Xiaoming Jin
341, A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering, by Chotirat Ann Ratanamahatana, Eamonn Keogh, Anthony Bagnall and Stefano Lonardi
342, Mining Mobile Group Patterns: A Trajectory-based Approach, by San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu and Ee-Peng Lim
345, An Automatic Unsupervised Querying Algorithm for Efficient Information Extraction in Biomedical Domain, by Min Song, Il-Yeol Song, Xiaohua Hu and Robert Allen
373, Voting fuzzy k-NN to predict protein subcellular localization from normalized amino acid pair compositions, by Quang Tung Thai, Doheon Lee, Dae-Won Kim and Jong-Tae Lim
376, Feature Selection Algorithm for Data with Both Nominal and Continuous Features, by Wenyin Tang and Kezhi Mao
379, A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets, by Ying Liu, Wei-keng Liao and Alok Choudhary
380, Automatic View Selection: An Application to Image Mining, by Manoranjan Dash and Deepak Kolipakkam
396, Improved Self-Splitting Competitive Learning Algorithm, by Jun Liu and Rao Kotagiri
442, Finding temporal features of event-oriented patterns, by Xingzhi Sun, Maria Orlowska and Xue Li
451, Comparison of Tree based methods on mammography data, by Richard De Veaux and Thu Hoang
460, A Top-down Algorithm for Web Log Sequential Pattern Mining, by Guo Jian-kui
481, Visual Interactive Evolutionary Algorithm for High Dimensional Data Clustering and Outlier Detection, by Boudjeloud Lydia and Poulet Francois
485, On Multiple Query Optimization in Data Mining, by Maciej Zakrzewicz and Marek Wojciechowski
488, Mining Time-Profiled Associations, by Jin Soung Yoo, Pusheng Zhang and Shashi Shekhar
490, Frequent Itemset Mining with Parallel RDBMS, by Xuequn Shang
497, Online Algorithms for Mining Inter-Stream Associations From Large Sensor Networks, by Kin Kong Loo, Ivy Tong, Ben Kao
505, Can We Apply Projection Based Frequent Pattern Mining Paradigm to Spatial Co-location Mining?, by Yan Huang, Liqin Zhang and Ping Yu
506, Mining Frequent Ordered Patterns, by Zhi-Hong Deng, Cong-Rui Ji, Ming Zhang and Shi-Wei Tang
507, Automatic Extraction of Low Frequency Bilingual Word Paris from Parallel Corpora with Various Languages, by Hiroshi Echizen-ya
510, An Anomaly Detection Method for Spacecraft using Relevance Vector Learning, by Ryohei Fujimaki, Takehisa Yairi and Kazuo Machida
514, An Incremental Data Stream Clustering Algorithm Based on Dense Units Detection, by Jing Gao, Jianzhong Li, Zhaogong Zhang and Pang-Ning Tan
524, Kernel principal component analysis for content based image retrieval, by Guang-Ho Cha
526, Dynamic Fuzzy Clustering for Recommender Systems, by Sung-Hwan Min

[ HOME ]