Session Details
Announced on April 27, 2015
Workshop Day
May 19, 2015 (Tuesday) |
|
08:30 - 09:30 |
|
BigPMA: The 2nd Workshop on Pattern Mining and Application of Big Data Location: Sunflower Ballroom A |
|
Session 1 Chair: Shih-Hao Chang |
|
08:30 08:50 |
From Cluster-based Outlier Detection to Time Series Discord Discovery Nguyen Huy Kha, Duong Tuan Anh (Ho Chi Minh City University of Technology) |
08:50 09:10 |
ProbitUCB: A Novel Method for Review Ranking Wanying Ding, Yue Shang (Drexel University), Dae Hoon Park (University of Illinois at Urbana-Champaign), Lifan Guo (TCL Research America), Xiaohua Hu (Drexel University) |
09:10 09:30 |
Web site audience segmentation using hybrid alignment techniques Vinh-Trung Luu, Germain Forestier, Fr ed eric Fondement, Pierre-Alain Muller (Universite de Haute Alsace) |
VLSP: The 3rd Workshop on Vietnamese Language and Speech Processing Location: Sunflower Ballroom B |
|
08:30 09:00 |
Author Profiling for Vietnamese Forum Posts |
09:00 09:30 |
Syntax-Based Statistical machine translation approach for solving the diacritization problem |
PAISI: Pacific Asia Workshop on Intelligence and Security Informatics Location: Daisy Room |
|
08:30 09:30 |
Opening and Keynote Speech TBC |
QIMIE: Workshop on Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models Location: Lotus Room |
|
08:30 08:35 |
Opening |
08:35 09:30 |
Keynote: Classifier Evaluation Under Changing Scenarios Nitesh Chawla |
10:00 - 12:00 |
|
BigPMA: The 2nd Workshop on Pattern Mining and Application of Big Data Location: Sunflower Ballroom A |
|
Session 1 (cont.) Chair: Shih-Hao Chang |
|
10:00 10:20 |
Mining Massive-Scale Spatiotemporal Trajectories in Parallel: A Survey |
10:20 10:40 |
Manifold Regularized Symmetric Joint Link Model for Overlapping Community Detection |
10:40 11:00 |
High Dimensional Explicit Feature Biased Matrix Factorization Recommendation |
11:00 11:20 |
A Simhash-based Generalized Framework for Citation Matching in MapReduce |
11:20 11:40 |
A Dynamic Feature Selection Based LDA Approach to Baseball Pitch Prediction |
VLSP: The 3rd Workshop on Vietnamese Language and Speech Processing Location: Sunflower Ballroom B |
|
10:00 10:30 |
Utilizing Vietnamese Sentiment Analysis for Online Reputation Management Platform |
10:30 11:00 |
An improved method for Vietnamese speech segmentation based on method of waveform image representation |
11:00 11:30 |
Modeling Vietnamese speech prosody: A step-by-step approach towards an expressive speech synthesis system |
11:30 12:00 |
Hybrid Deep Neural Network-Hidden Markov Model for Vietnamese Large Vocabulary Continuous Speech Recognition System |
PAISI: Pacific Asia Workshop on Intelligence and Security Informatics Location: Daisy Room |
|
Session 1: Social Media Intelligence Chair: TBC |
|
10:00 10:30 |
Media REVEALr: A Social Multimedia Monitoring and Intelligence System for Web Multimedia Verification Katerina Andreadou, Symeon Papadopoulos, Lazaros Apostolidis, Anastasia Krithara, and Yiannis Kompatsiaris |
10:30 11:00 |
Geotagging Social Media Content with a Refined Language Modelling Approach Giorgos Kordopatis-Zilos, Symeon Papadopoulos, and Yiannis Kompatsiaris |
11:00 11:30 |
Predicting Vehicle Recalls with User-Generated Contents: A Text Mining Approach Xuan Zhang, Shuo Niu, Da Zhang, G. Alan Wang, and Weiguo Fan |
11:30 12:00 |
GCM: A Greedy-based Cross-Matching Algorithm for Identifying Users across Multiple Online Social Networks Wenxin Liang, Bo Meng, Xiaosong He, and Xianchao Zhang |
QIMIE: Workshop on Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models Location: Tulip Room |
|
Session 1: Network and Community Analysis Chair: TBC |
|
10:00 10:30 |
Analyzing User Behaviors Based on Temporal Patterns of Sequential Pattern Evaluation Indices on Twitter Hidenao Abe |
10:30 11:00 |
Evaluation of Community Mining Algorithms in the Presence of Attributes Reihaneh Rabbany and Osmar Zaïane |
Session 2: Clustering Chair: TBC |
|
11:00 11:30 |
Internal Clustering Evaluation of Data Streams Marwan Hassani and Thomas Seidl |
11:30 12:00 |
Feature maximization based clustering quality evaluation: a promising approach Jean-Charles Lamirel, Shadi Al Shehabi |
13:30 - 15:00 |
|
BigPMA: The 2nd Workshop on Pattern Mining and Application of Big Data Location: Sunflower Ballroom A |
|
Session 2 Chair: Yi-Cheng Chen |
|
13:30 13:50 |
A Cloud Based Diabetes Lifestyle Management System |
13:50 14:10 |
Big Data Generation:Application of Mobile Healthcare |
14:10 14:30 |
Construction of a prediction model for nephropathy among obese patients using genetic and clinical features |
VLSP: The 3rd Workshop on Vietnamese Language and Speech Processing Location: Sunflower Ballroom B |
|
13:30 14:00 |
Fast Dependency Parsing using Distributed Word Representations |
14:00 14:30 |
Building a LTAG syntax - semantic interface for Vietnamese |
PAISI: Pacific Asia Workshop on Intelligence and Security Informatics Location: Daisy Room |
|
Session 2: Fraud Detection Chair: TBC |
|
13:30 14:00 |
P2P Lending Fraud Detection: A Big Data Approach Jennifer Xu, Yong Lu, and Michael Chau |
14:00 14:30 |
Drug Anti-forgery and Tracing System Based on Lightweight Asymmetric Identities Shenghui Su, Na Li, and Shuwang Lu |
QIMIE: Workshop on Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models Location: Tulip Room |
|
Session 3: Interestingness Measures Chair: TBC |
|
13:30 14:00 |
A Study of Interestingness Measures for Associative Classification on Imbalanced Data Guangfei Yang and Xuejiao Cui |
14:00 14:30 |
Model Selection of Symbolic Regression to Improve the Accuracy of PM2.5 Concentration Prediction Guangfei Yang and Jian Huang |
14:30 15:00 |
Leveraging the Common Cause of Errors for Constraint-based Data Cleansing Ayako Hoshino, Hiroki Nakayama, Chihiro Ito, Kyota Kanno and Kenshi Nishimura |
15:00 - 18:10 |
|
VLSP: The 3rd Workshop on Vietnamese Language and Speech Processing Location: Sunflower Ballroom B |
|
15:00 15:30 |
Semantic Role Labeling in Bilingual English-Vietnamese Corpus |
15:30 16:00 |
Named Entity Alignment in an English-Vietnamese Bilingual Corpus |
16:00 17:00 |
General Discussion - Closing |
PAISI: Pacific Asia Workshop on Intelligence and Security Informatics Location: Daisy Room |
|
Session 3: Text Mining Chair: TBC |
|
15:00 15:30 |
Chinese Word POS Tagging with Markov Logic Zhihua Liao |
15:30 16:00 |
In Search of Plagiarism Behaviors: An Empirical Study of Online Reviews Zhuolan Bao and Michael Chau |
DAEBH: Workshop on Data Analytics for Evidence-Based Healthcare Location: Tulip Room |
|
15:00 15:10 |
Workshop Opening Dr. Xujuan Zhou |
15:10 16:10 |
Keynote I: Evidence Mining Systems Dr. Guy Tsafnat ( Chair: Dr. Wei Liu ) |
16:10 17:10 |
Keynote II: From Big Data Analytics in Healthcare to New Generic Data Mining Approaches Prof. Osmar Zaiane ( Chair: Dr. Xujuan Zhou ) |
|
Presentation Session ( Chair: Dr. Wei Liu ) |
17:10 17:30 |
Learning Entry Profiles of Children with ASD from Multivariate Treatment Information using Restricted Boltzmann Machines Pratibha Vellanki, Dinh Phung, Thi Duong and Svetha Venkatesh |
17:30 17:50 |
Citation Enrichment Improves Deduplication of Primary Evidence Miew Keen Choong, Sarah Thorning and Guy Tsafnat |
17:50 18:10 |
Integrating Content Centric Networking and Web Content Mining: A Future Efficient Internet Architecture for Healthcare Rabia Bashir and Sajjad Akbar |
Main Conference
May 20, 2015 (Wednesday) |
||
09:00 - 10:00 |
||
KEYNOTE I: Online and Batch Learning with Interventions Thorsten Joachims |
||
10:20 - 12:30 |
||
10:20 12:30 |
TUTORIAL I: Crowdsourcing for Big Data Analytics Presenters: Hisashi Kashima, Satoshi Oyama, and Yukino Baba Location: Tulip Room |
|
10:20 12:30 |
TUTORIAL II: Differential Privacy and Its Applications Presenters: Gang Li, Tianqing Zhu, and Wanlei Zhou Location:Daisy Room |
|
SESSION 1A: Social Networks and Social Media |
||
10:20 10:45 |
Maximizing Friend-Making Likelihood for Social Activity Organization (L) Chih-Ya Shen, De-Nian Yang, Wang-Chien Lee, and Ming-Syan Chen |
|
10:45 11:10 |
What Is New in Our City? A Framework for Event Extraction Using Social Media Posts (L) Chaolun Xia, Jun Hu, Yan Zhu, and Mor Naaman |
|
11:10 11:35 |
Link Prediction in Aligned Heterogeneous Networks (L) Fangbing Liu and Shu-Tao Xia |
|
11:35 11:55 |
Scale-Adaptive Group Optimization for Social Activity Planning (R) Hong-Han Shuai, De-Nian Yang, Philip S. Yu, and Ming-Syan Chen |
|
11:55 12:15 |
Influence Maximization across Partially Aligned Heterogenous Social Networks (R) Qianyi Zhan, Jiawei Zhang, Senzhang Wang, Philip S. Yu,and Junyuan Xie |
|
SESSION 1B: Classification |
||
10:20 10:40 |
Double Ramp Loss Based Reject Option Classifier (R) Naresh Manwani, Kalpit Desai, Sanand Sasidharan, and Ramasubramanian Sundararajan |
|
10:40 11:00 |
Efficient Methods for Multi-label Classification (R) Chonglin Sun, Chunting Zhou, Bo Jin, and Francis C.M. Lau |
|
11:00 11:20 |
A Coupled k-Nearest Neighbor Algorithm for Multi-label Classification (R) Chunming Liu and Longbing Cao |
|
11:20 11:40 |
Learning Topic-Oriented Word Embedding for Query Classification (R) Hebin Yang, Qinmin Hu, and Liang He |
|
11:40 12:00 |
Reliable Early Classification on Multivariate Time Series with Numerical and Categorical Attributes (R) Yu-Feng Lin, Hsuan-Hsu Chen, Vincent S. Tseng, and Jian Pei |
|
12:00 12:20 |
Document Classification based on Distributed Document Representation: A Supervised Deep Learning Framework (R) Rumeng Li and Hiroyuki Shindo |
|
SESSION 1C: Machine Learning |
||
10:20 10:45 |
Collaborating Differently on Different Topics: A Multi-Relational Approach to Multi-Task Learning (L) Sunil Kumar Gupta, Santu Rana, Dinh Phung, and Svetha Venkatesh |
|
10:45 11:05 |
A Bayesian Nonparametric Approach to Multilevel Regression (R) Vu Nguyen, Dinh Phung, Svetha Venkatesh, and Hung H. Bui |
|
11:05 11:25 |
Learning Conditional Latent Structures from Multiple Data Sources (R) Viet Huynh, Dinh Phung, Long Nguyen, Svetha Venkatesh,and Hung H. Bui |
|
11:25 11:45 |
Collaborative Multi-view Learning with Active Discriminative Prior for Recommendation (R) Qing Zhang and Houfeng Wang |
|
11:45 12:05 |
Online and Stochastic Universal Gradient Methods for Minimizing Regularized Holder Continuous Finite Sums in Machine Learning (R) Ziqiang Shi and Rujie Liu |
|
12:05 12:25 |
Multi-Task Metric Learning on Network Data (R) Chen Fang and Daniel N. Rockmore |
|
SESSION 1D: Applications |
||
10:20 10:45 |
On Damage Identification in Civil Structures Using Tensor Analysis (L) Nguyen Lu Dang Khoa, Bang Zhang, Yang Wang, Wei Liu,Fang Chen, Samir Mustapha, and Peter Runcie |
|
10:45 11:10 |
Predicting Smartphone Adoption in Social Networks (L) Le Wu, Yin Zhu, Nicholas Jing Yuan, Enhong Chen, Xing Xie, and Yong Rui |
|
11:10 11:30 |
Discovering the Impact of Urban Traffic Interventions Using Contrast Mining on Vehicle Trajectory Data (R) Xiaoting Wang, Christopher Leckie, Hairuo Xie, and Tharshan Vaithianathan |
|
11:30 11:50 |
Locating Self-collection Points for Last-mile Logistics using Public Transport Data (R) Huayu Wu, Dongxu Shao, and Wee Siong Ng |
|
11:50 12:10 |
A Stochastic Framework for Short-term Solar Irradiance Forecasting (R) Jin Xu, Shinjae Yoo, Dantong Yu, Hao Huang, Dong Huang, John Heiser, and Paul Kalb |
|
12:10 12:30 |
Online Prediction of Chess Match Result (R) Mohammad M. Masud, Ameera Al-Shehhi, Eiman Al-Shamsi,Shamma Al-Hassani, Asmaa Al-Hamoudi, and Latifur Khan |
|
14:00 - 16:10 |
||
14:00 16:10 |
TUTORIAL III: Behavior Computing: Deep Behavior Analytics and Active Behavior Management Presenter: Longbing Cao Location: Tulip Room |
|
CONTEST: Gender Prediction Based on E-commerce Data Chairs: Hung Son Nguyen, Nitesh Chawla, and Nguyen Duc Dung Location: Daisy Room |
||
14:00 |
Welcome and Introduction Hung Son Nguyen, Poland |
|
14:00 14:20 |
FRDC's approach at PAKDD’15 Data Mining Competition Miao Qingliang, Fujitsu Research & Development Center Co.,LTD., China |
|
14:20 14:40 |
The combination of supervised and unsupervised approach Xia Yingju, Fujitsu Research & Development Center Co.,LTD., China |
|
14:40 15:00 |
Random forrest based classification on heterogeneous generated features. Jan Kralj, Jozef Stefan Institute, Slovenia |
|
15:00 15:20 |
TTI's Gender Prediction System using Bootstrapping and Identical-Hierarchy Mohammad Golam Sohrab, Toyota Technological Institute, Japan |
|
15:20 15:40 |
Factor Models for Gender Prediction Based on E-commerce Data Immanuel Bayer, University of Konstanz, Germany |
|
15:40 16:00 |
A Granular Classifier for PAKDD 2015 Data Mining Competition. Wojtek Swieboda, University of Warsaw, Poland |
|
16:00 16:20 |
Gender Prediction based on counting with weight method Pham Ngoc An, FPT University, Vietnam |
|
16:20 16:25 |
Summary and closing remarks
|
|
SESSION 2A: Opinion Mining and Sentiment Analysis |
||
14:00 14:20 |
Emotion Cause Detection for Chinese Micro-blogs based on ECOCC Model (R) Kai Gao, Hua Xu, and Jiushuo Wang |
|
14:20 14:40 |
Parallel Recursive Deep Model for Sentiment Analysis (R) Changliang Li, Bo Xu, Gaowei Wu, Saike He, Guanhua Tian,and Yujun Zhou |
|
14:40 15:00 |
Sentiment Analysis in Transcribed Utterances (R) Nir Ofek, Gilad Katz, Bracha Shapira, and Yedidya Bar-Zev |
|
15:00 15:20 |
HierRating: A Hierarchical Generative Bayesian Model for Entity Latent Aspect Rating Analysis (R) Xun Wang, Katsuhito Sudoh, and Masaaki Nagata |
|
15:20 15:40 |
Sentiment Analysis on Microblogging by Integrating Text and Image Features (R) Yaowen Zhang, Lin Shang, and Xiuyi Jia |
|
15:40 16:00 |
TSum4act: A Framework for Retrieving and Summarizing Actionable Tweets during a Disaster for Reaction (R) Minh-Tien Nguyen, Asanobu Kitamoto, and Tri-Thanh Nguyen |
|
SESSION 2B: Clustering |
||
14:00 14:25 |
Evolving Chinese Restaurant Process for Modeling Evolutionary Trace in Temporal Data (L) Peng Wang, Chuan Zhou, Peng Zhang, Weiwei Feng, Li Guo, and Binxing Fang |
|
14:25 14:50 |
Small-Variance Asymptotics for Bayesian Nonparametric Models with Constraints (L) Cheng Li, Santu Rana, Dinh Phung, and Svetha Venkatesh |
|
14:50 15:10 |
Spectral Clustering for Large-Scale Social Networks via a Pre-Coarsening Sampling based NystrÖm Method (R) Ying Kang, Bo Yu, Weiping Wang, and Dan Meng |
|
15:10 15:30 |
pcStream: A Stream Clustering Algorithm for Dynamically Detecting and Managing Temporal Contexts (R) Yisroel Mirsky, Bracha Shapira, Lior Rokach, and Yuval Elovici |
|
15:30 15:50 |
Clustering over Data Streams based on Growing Neural Gas (R) Mohammed Ghesmoune, Mustapha Lebbah, and Hanene Azzag |
|
15:50 16:10 |
ClustCube Cubes: A Novel OLAP-based Mining Structure for Clustering Complex Database Objects (R) Alfredo Cuzzocrea |
|
SESSION 2C: Novel Methods and Algorithms |
||
14:00 14:25 |
Principal Sensitivity Analysis (L) Sotetsu Koyamada, Masanori Koyama, Ken Nakae, and Shin Ishii |
|
14:25 14:50 |
SocNL: Bayesian Label Propagation with Confidence (L) Yuto Yamaguchi, Christos Faloutsos, and Hiroyuki Kitagawa |
|
14:50 15:15 |
An Incremental Local Distribution Network for Unsupervised Learning (L) Youlu Xing, Tongyi Cao, Ke Zhou, Furao Shen, and Jinxi Zhao |
|
15:15 15:35 |
Trend-based Citation Count Prediction for Research Articles (R) Cheng-Te Li, Yu-Jen Lin, Rui Yan, and Mi-Yen Yeh |
|
15:35 15:55 |
Mining text enriched heterogeneous citation networks (R) Jan Kralj, Anita Valmarska, Marko Robnik-Šikonja, and Nada Lavrac |
|
SESSION 2D: Outlier and Anomaly Detection Session Chair: Chih-Ya Shen |
||
14:00 14:25 |
Contextual Anomaly Detection Using Log-linear Tensor Factorization (L) Alpa Jayesh Shah, Christian Desrosiers, and Robert Sabourin |
|
14:25 14:50 |
A Semi-supervised Framework for Social Spammer Detection (L) Zhaoxing Li, Xianchao Zhang, Hua Shen, Wenxin Liang, and Zengyou He |
|
14:50 15:10 |
Fast One-Class Support Vector Machine for Novelty Detection (R) Trung Le, Dinh Phung, Khanh Nguyen, and Svetha Venkatesh |
|
15:10 15:30 |
ND-SYNC: Detecting Synchronized Fraud Activities (R) Maria Giatsoglou, Despoina Chatzakou, Neil Shah, Alex Beutel,Christos Faloutsos, and Athena Vakali |
|
15:30 15:50 |
An Embedding Scheme for Detecting Anomalous Block Structured Graphs (R) Lida Rashidi, Sutharshan Rajasegarar, and Christopher Leckie |
|
15:50 16:10 |
A Core-attach Based Method for Identifying Protein Complexes in Dynamic PPI Networks (R) Jiawei Luo, Chengchen Liu, and Hoang Tu Nguyen |
|
May 21, 2015 (Thursday) |
||
09:00 - 10:00 |
||
KEYNOTE II: Topic Modeling with More Confidence - A Theory and Some Algorithms Xuan Long Nguyen |
||
10:20 - 12:30 |
||
SESSION 3A: Social Networks and Social Media |
||
10:20 10:45 |
Multiple Factors-Aware Diffusion in Social Networks (L) Chung-Kuang Chou and Ming-Syan Chen |
|
10:45 11:05 |
Understanding Community Effects on Information Diffusion (R) Shuyang Lin, Qingbo Hu, Guan Wang, and Philip S. Yu |
|
11:05 11:25 |
On Burst Detection and Prediction in Retweeting Sequence (R) Zhilin Luo, Yue Wang, Xintao Wu, Wandong Cai, and Ting Chen |
|
11:25 11:45 |
Few Things About Idioms: Understanding Idioms and its users in the Twitter Online Social Network (R) Koustav Rudra, Abhijnan Chakraborty, Manav Sethi, Shreyasi Das, Niloy Ganguly, and Saptarshi Ghosh |
|
11:45 12:05 |
Retweeting activity on Twitter: Signs of Deception (R) Maria Giatsoglou, Despoina Chatzakou, Neil Shah, Christos Faloutsos, and Athena Vakali |
|
12:05 12:25 |
Resampling-based Gap Analysis for Detecting Nodes with High Centrality on Large Social Network (R) Kouzou Ohara, Kazumi Saito, Masahiro Kimura, and Hiroshi Motoda |
|
SESSION 3B: Classification |
||
10:20 10:40 |
Prediciton of Emergency Events: A Multi-task Multi-label learning Approach (R) Budhaditya Saha, Sunil Kumar Gupta, and Svetha Venkatesh |
|
10:40 11:00 |
Nearest Neighbor Method Based on Local Distribution for Classification (R) Chengsheng Mao, Bin Hu, Philip Moore, Yun Su, and Manman Wang |
|
11:00 11:20 |
Immune Centroids Over-Sampling Method for Multi-Class Classification (R) Xusheng Ai, Jian Wu, Victor S. Sheng, Pengpeng Zhao, Yufeng Yao, and Zhiming Cui |
|
11:20 11:40 |
Optimizing Classifiers for Hypothetical Scenarios(R) Reid A. Johnson, Troy Raeder, and Nitesh V. Chawla |
|
11:40 12:00 |
Repulsive-SVDD Classification (R) Phuoc Nguyen and Dat Tran |
|
12:00 12:20 |
Centroid-Means-Embedding: an Approach to Infusing Word Embeddings into Features for Text Classification (R) Mohammad Golam Sohrab, Makoto Miwa, and Yutaka Sasaki |
|
SESSION 3C: Machine Learning |
||
10:20 10:45 |
Context-aware Detection of Sneaky Vandalism on Wikipedia across Multiple Languages (L) Khoi-Nguyen Tran, Peter Christen, Scott Sanner, and Lexing Xie |
|
10:45 11:05 |
Uncover the Latent Structures of Crowd Labeling (R) Tian Tian and Jun Zhu |
|
11:05 11:25 |
Use Correlation Coefficients in Gaussian Process to Train Stable ELM Models (R) Yulin He, Joshua Zhexue Huang, Xizhao Wang, and Rana Aamir Raza |
|
11:25 11:45 |
Local Adaptive and Incremental Gaussian Mixture for Online Density Estimation (R) Tianyu Qiu, Furao Shen, and Jinxi Zhao |
|
11:45 12:05 |
Latent Space Tracking from Heterogeneous Data with an Application for Anomaly Detection (R) Jiaji Huang and Xia Ning |
|
12:05 12:25 |
A Learning-rate Schedule for Stochastic Gradient Methods to Matrix Factorization (R) Wei-Sheng Chin, Yong Zhuang, Yu-Chin Juan, and Chih-Jen Lin |
|
SESSION 3D: Applications |
||
10:20 10:45 |
Learning of Performance Measures from Crowd-sourced Data with Application to Ranking of Investments (L) Greg Harris, Anand Panangadan, and Viktor K. Prasanna |
|
10:45 11:10 |
Hierarchical Dirichlet Process for Tracking Complex Topical Structure Evolution and Its Application to Autism Research Literature (L) Adham Beykikhoshk, Ognjen Arandjelovic ´, Svetha Venkatesh,and Dinh Phung |
|
11:10 11:30 |
Automated Detection for Probable Homologous Foodborne Disease Outbreaks (R) Xiao Xiao, Yong Ge, Yunchang Guo, Danhuai Guo, Yi Shen,Yuanchun Zhou, and Jianhui Li |
|
11:30 11:50 |
Identifying Hesitant and Interested Customers for Targeted Social Marketing (R) Guowei Ma, Qi Liu, Le Wu, and Enhong Chen |
|
11:50 12:10 |
Activity-Partner Recommendation (R) Wenting Tu, David W. Cheung, Nikos Mamoulis, Min Yang,and Ziyu Lu |
|
12:10 12:30 |
Iterative Use of Weighted Voronoi Diagrams to Improve Scalability in Recommender Systems (R) Joydeep Das, Subhashis Majumder, Debarshi Dutta, and Prosenjit Gupta |
|
14:00 - 15:00 |
||
KEYNOTE III: Direct Change Detection without Identification Masashi Sugiyama |
||
15:20 - 17:30 |
||
SESSION 4A: Mining Uncertain and Imprecise Data |
||
15:20 15:45 |
Mining Uncertain Sequential Patterns in iterative MapReduce (L) Jiaqi Ge, Yuni Xia, and Jian Wang |
|
15:45 16:10 |
Quality Control for Crowdsourced POI Collection (L) Shunsuke Kajimura, Yukino Baba, Hiroshi Kajino, and Hisashi Kashima |
|
16:10 16:30 |
Towards Efficient Sequential Pattern Mining in Temporal Uncertain Databases (R) Jiaqi Ge, Yuni Xia, and Jian Wang |
|
16:30 16:50 |
Preference-based top-k representative skyline queries on uncertain databases (R) Ha Thanh Huynh Nguyen and Jinli Cao |
|
16:50 17:10 |
Cluster Sequence Mining: Causal Inference with Time and Space Proximity under Uncertainty (R) Yoshiyuki Okada, Ken-ichi Fukui, Koichi Moriyama,and Masayuki Numao |
|
17:10 17:30 |
Achieving Accuracy Guarantee for Answering Batch Queries with Differential Privacy (R) Dong Huang, Shuguo Han, and Xiaoli Li |
|
SESSION 4B: Mining Temporal and Spatial Data |
||
15:20 15:45 |
Automated Classification of Passing in Football (L) Michael Horton, Joachim Gudmundsson, Sanjay Chawla, and Joël Estephan |
|
15:45 16:10 |
Stabilizing Sparse Cox Model using Statistic and Semantic Structures in Electronic Medical Records (L) Shivapratap Gopakumar, Tu Dinh Nguyen, Truyen Tran, Dinh Phung, and Svetha Venkatesh |
|
16:10 16:30 |
Semi Supervised Adaptive Framework for Classifying Evolving Data Stream (R) Ahsanul Haque, Latifur Khan, and Michael Baron |
|
16:30 16:50 |
Predicting Next Locations with Object Clustering and Trajectory Clustering (R) Meng Chen, Yang Liu, and Xiaohui Yu |
|
16:50 17:10 |
A Plane Moving Average Algorithm for Short-Term Traffic Flow Prediction (R) Lei Lv, Meng Chen, Yang Liu, and Xiaohui Yu |
|
17:10 17:30 |
Recommending Profitable Taxi Travel Routes based on Big Taxi Trajectories Data (R) Wenxin Yang, Xin Wang, Seyyed Mohammadreza Rahimi,and Jun Luo |
|
SESSION 4C: Novel Methods and Algorithms |
||
15:20 15:45 |
Boosting via Approaching Optimal Margin Distribution (L) Chuan Liu and Shizhong Liao |
|
15:45 16:05 |
o-HETM: An Online Hierarchical Entity Topic Model for News Streams (R) Linmei Hu, Juanzi Li, Jing Zhang, and Chao Shao |
|
16:05 16:25 |
Modeling User Interest and Community Interest in Microbloggings: An Integrated Approach (R) Tuan-Anh Hoang |
|
16:25 16:45 |
Minimal Jumping Emerging Patterns: Computation and Practical Assessment (R) Bamba Kane, Bertrand Cuissart, and Bruno Crémilleux |
|
16:45 17:05 |
Rank matrix factorisation (R) Thanh Le Van, Matthijs van Leeuwen, Siegfried Nijssen,and Luc De Raedt |
|
17:05 17:25 |
An Empirical Study of Personal Factors and Social Effects on Rating Prediction (R) Zhijin Wang, Yan Yang, Qinmin Hu, and Liang He |
|
SESSION 4D: Feature Extraction and Selection |
||
15:20 15:45 |
Cost-sensitive Feature Selection on Heterogeneous Data (L) Wenbin Qian, Wenhao Shu, Jun Yang, and Yinglong Wang |
|
15:45 16:05 |
A Feature Extraction Method for Multivariate Time Series Classification Using Temporal Patterns (R) Pei-Yuan Zhou and Keith C.C. Chan |
|
16:05 16:25 |
Scalable Outlying-Inlying Aspects Discovery via Features Ranking (R) Nguyen Xuan Vinh, Jeffrey Chan, James Bailey, Christopher Leckie, Kotagiri Ramamohanarao, and Jian Pei |
|
16:25 16:45 |
A DC Programming Approach for Sparse Optimal Scoring (R) Hoai An Le Thi and Duy Nhat Phan |
|
16:45 17:05 |
Graph Based Relational Features for Collective Classification (R) Immanuel Bayer, Uwe Nagel, and Steffen Rendle |
|
17:05 17:25 |
A New Feature Sampling Method in Random Forests for Prediction High Dimensional Data (R) Thanh-Tung Nguyen, He Zhao, Joshua Zhexue Huang, Thuy Thi Nguyen, and Mark Junjie Li |
|
May 22, 2015 (Friday) |
||
09:00 - 11:05 |
||
SESSION 5A: Mining Heterogeneous, High Dimensional, and Sequential Data |
||
09:00 09:20 |
Seamlessly Integrating Effective Links with Attributes for Networked Data Classification (R) Yangyang Zhao, Zhengya Sun, Changsheng Xu, and Hongwei Hao |
|
09:20 09:40 |
Clustering on Multi-source Incomplete Data via Tensor Modeling and Factorization (R) Weixiang Shao, Lifang He, and Philip S. Yu |
|
09:40 10:00 |
Locally Optimized Hashing for Nearest Neighbor Search (R) Seiya Tokui, Issei Sato, and Hiroshi Nakagawa |
|
10:00 10:20 |
Do-Rank: DCG Optimization for Learning-to-Rank in Tag-based Item Recommendation Systems (R) Noor Ifada and Richi Nayak |
|
10:20 10:40 |
Efficient Discovery of Recurrent Routine Behaviours in Smart Meter Time Series by Growing Subsequences (R) Jin Wang, Rachel Cardell-Oliver, and Wei Liu |
|
10:40 11:00 |
Convolutional Nonlinear Neighbourhood Components Analysis for Time Series Classification (R) Yi Zheng, Qi Liu, Enhong Chen, J. Leon Zhao, Liang He, and Guangyi Lv |
|
SESSION 5B: Entity Resolution and Topic Modelling |
||
09:00 09:20 |
Clustering-based Scalable Indexing for Multi-party Privacy-preserving Record Linkage (L) Thilina Ranbaduge, Dinusha Vatsalan, and Peter Christen |
|
09:20 09:40 |
Efficient Interactive Training Selection for Large-scale Entity Resolution (R) Qing Wang, Dinusha Vatsalan, and Peter Christen |
|
09:40 10:00 |
Unsupervised Blocking Key Selection for Real-Time Entity Resolution (R) Banda Ramadan and Peter Christen |
|
10:00 10:20 |
Incorporating Probabilistic Knowledge into Topic Models (R) Liang Yao, Yin Zhang, Baogang Wei, Hongze Qian, and Yibing Wang |
|
10:20 10:40 |
Learning Focused Hierarchical Topic Models with Semi-Supervision in Microblogs (R) Anton Slutsky, Xiaohua Hu, and Yuan An |
|
10:40 11:00 |
Predicting Future Links Between Disjoint Research Areas Using Heterogeneous Bibliographic Information Network (R) Yakub Sebastian, Eu-Gene Siew, and Sylvester Olubolu Orimaye |
|
SESSION 5C: Itemset and High Performance Data Mining |
||
09:00 09:20 |
CPT+: Decreasing the time/space complexity of the Compact Prediction Tree (L) Ted Gueniche, Philippe Fournier-Viger, Rajeev Raman,and Vincent S. Tseng |
|
09:20 09:40 |
Mining Association Rules in Graphs based on Frequent Cohesive Itemsets (R) Tayena Hendrickx, Boris Cule, Pieter Meysman, Stefan Naulaerts, Kris Laukens, and Bart Goethals |
|
09:40 10:00 |
Mining High Utility Itemsets in Big Data (R) Ying Chun Lin, Cheng-Wei Wu, and Vincent S. Tseng |
|
10:00 10:20 |
Decomposition Based SAT Encodings for Itemset Mining Problems (R) Said Jabbour, Lakhdar Sais, and Yakoub Salhi |
|
10:20 10:40 |
A Comparative Study on Parallel LDA Algorithms in MapReduce Framework (R) Yang Gao, Zhenlong Sun, Yi Wang, Xiaosheng Liu, Jianfeng Yan, and Jia Zeng |
|
10:40 11:00 |
Distributed Newton Methods for Regularized Logistic Regression (R) Yong Zhuang, Wei-Sheng Chin, Yu-Chin Juan, and Chih-Jen Lin |
|
SESSION 5D: Recommendation |
||
09:00 09:25 |
Coupled Matrix Factorization within Non-IID Context (L) Fangfang Li, Guandong Xu, and Longbing Cao |
|
09:25 09:50 |
Complementary usage of Tips and Reviews for Location Recommendation in Yelp (L) Saurabh Gupta, Sayan Pathak, and Bivas Mitra |
|
09:50 10:15 |
Coupling Multiple Views of Relations for Recommendation (L) Bin Fu, Guandong Xu, Longbing Cao, Zhihai Wang, and Zhiang Wu |
|
10:15 10:35 |
Pairwise one class recommendation algorithm (R) Huimin Qiu, Chunhong Zhang, and Jiansong Miao |
|
10:35 10:55 |
RIT: Enhancing Recommendation with Inferred Trust (R) Guo Yan, Yuan Yao, Feng Xu, and Jian Lu |