Automated Dimension Determination for NMF-based Incremental Collaborative Filtering

Xiwei Wang; Jun Zhang; Ruxin Dai

Research Article

Automated Dimension Determination for NMF-based Incremental Collaborative Filtering

Download951 downloads

Cite: BibTeX Plain Text

@ARTICLE{10.4108/eai.17-12-2015.150804,
    author={Xiwei Wang and Jun Zhang and Ruxin Dai},
    title={Automated Dimension Determination for NMF-based Incremental Collaborative Filtering},
    journal={EAI Endorsed Transactions on Collaborative Computing},
    volume={1},
    number={5},
    publisher={EAI},
    journal_a={CC},
    year={2015},
    month={12},
    keywords={auxiliary information, incremental clustering, data growth, collaborative Filtering, NMF},
    doi={10.4108/eai.17-12-2015.150804}
}

Xiwei Wang
Jun Zhang
Ruxin Dai
Year: 2015
Automated Dimension Determination for NMF-based Incremental Collaborative Filtering
CC
EAI
DOI: 10.4108/eai.17-12-2015.150804

Xiwei Wang¹^,*, Jun Zhang², Ruxin Dai³

1: Department of Computer Science, Northeastern Illinois University, Chicago, Illinois 60625, USA
2: Department of Computer Science, University of Kentucky, Lexington, Kentucky 40506-0633, USA
3: Department of Computer Science and Information Systems, University of Wisconsin River Falls, River Falls,Wisconsin 54022, USA

*Contact email: xwang9@neiu.edu

Abstract

The nonnegative matrix factorization (NMF) based collaborative filtering t e chniques h a ve a c hieved great success in product recommendations. It is well known that in NMF, the dimensions of the factor matrices have to be determined in advance. Moreover, data is growing fast; thus in some cases, the dimensions need to be changed to reduce the approximation error. The recommender systems should be capable of updating new data in a timely manner without sacrificing the prediction accuracy. In this paper, we propose an NMF based data update approach with automated dimension determination for collaborative filtering purposes. The approach can determine the dimensions of the factor matrices and update them automatically. It exploits the nearest neighborhood based clustering algorithm to cluster users and items according to their auxiliary information, and uses the clusters as the constraints in NMF. The dimensions of the factor matrices are associated with the cluster quantities. When new data becomes available, the incremental clustering algorithm determines whether to increase the number of clusters or merge the existing clusters. Experiments on three different datasets (MovieLens, Sushi, and LibimSeTi) were conducted to examine the proposed approach. The results show that our approach can update the data quickly and provide encouraging prediction accuracy.

Keywords: auxiliary information, incremental clustering, data growth, collaborative Filtering, NMF

Received: 2014-12-27
Accepted: 2015-06-30
Published: 2015-12-17
Publisher: EAI

: http://dx.doi.org/10.4108/eai.17-12-2015.150804

Copyright © 2015 Xiwei Wang et al., licensed to EAI. This is an open access article distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/), which permits unlimited use, distribution and reproduction in any medium so long as the original work is properly cited. doi:10.4108/eai.17-12-2015.150804