A NOVEL FUZZY-BASED SIMILARITY MEASURE FOR COLLABORATIVE FILTERING TO ALLEVIATE THE SPARSITY PROBLEM

Document Type: Research Paper

Authors

School of Electrical and Computer Engineering, Shiraz University, Shiraz, Iran

Abstract

Memory-based collaborative filtering is the most popular approach to build recommender systems. Despite its success in many applications, it still suffers from several major limitations, including data sparsity. Sparse data affect the quality of the user similarity measurement and consequently the quality of the recommender system. In this paper, we propose a novel user similarity measure based on fuzzy set theory along with default voting technique aimed to provide a valid similarity measurement between users wherever the available ratings are relatively rare. The main idea of this research is to model the rating behaviour of each user by a fuzzy set, and use this model to determine the user's degree of interest on items. Experimental results on the MovieLens and Netflix datasets show the effectiveness of the proposed algorithm in handling data sparsity problem. It also outperforms some state-of-the-art collaborative filtering algorithms in terms of prediction quality.

Keywords


[1] G. Adomavicius and A. Tuzhilin, Toward the next generation of recommender systems: A survey of
the state-of-the-art and possible extensions, Knowledge and Data Engineering, IEEE Transactions,
17(6) (2005), 734{749.
[2] C. C. Aggarwal, Recommender Systems: The Textbook, Springer, 2016.
[3] D. Anand and K. K. Bharadwaj, Utilizing various sparsity measures for enhancing accuracy of
collaborative recommender systems based on local and global similarities, Expert Systems with
Applications, 38(5) (2011), 5101{5109.
[4] J. Bobadilla, F. Ortega, A. Hernando and A. Gutirrez, Recommender systems survey, Knowledge-
Based Systems, 46 (2013), 109{132.
[5] J. S. Breese, D. Heckerman and C. Kadie, Empirical analysis of predictive algorithms for
collaborative ltering, In Proceedings of the 14th Conference on Uncertainty in Arti cial
Intelligence, (1998), 43{52.
[6] L. Chen, G. Chen and F. Wang, Recommender systems based on user reviews: the state of the art,
User Modeling and User-Adapted Interaction, 25(2) (2015), 99{154.
[7] C. Cornelis, X. Guo, J. Lu and G. Zhang, A Fuzzy Relational Approach to Event Recommendation,
In Proceedings of the 2nd Indian International Conference on Arti cial Intelligence, 5 (2005), 2231{
2242.
[8] C. Desrosiers and G. Karypis, A novel approach to compute similarities and its application to item
recommendation, In Paci c Rim International Conference on Arti cial Intelligence, (2010), 39{51.
[9] C. Desrosiers and G. Karypis, A comprehensive survey of neighborhood-based recommendation
methods, In Recommender Systems Handbook, (2011), 107{144.
[10] M. D. Ekstrand, J. T. Riedl and J. A. Konstan, Collaborative ltering recommender systems,
Foundations and Trends in Human-Computer Interaction, 4(2) (2011), 81{173.
[11] F. Fouss, A. Pirotte, J. M. Renders and M. Saerens, Random-walk computation of similarities
between nodes of a graph with application to collaborative recommendation, IEEE Transactions on
Knowledge and Data Engineering, 19(3) (2007), 355{369.
[12] M. A. Ghazanfar and A. Prugel-Bennett, Leveraging clustering approaches to solve the gray-sheep
users problem in recommender systems, Expert Systems with Applications, 41(7) (2014), 3261{
3275.
[13] K. Goldberg, T. Roeder, D. Gupta and C. Perkins, Eigentaste: A constant time collaborative
ltering algorithm, Information Retrieval, 4(2) (2001), 133{151.
[14] J. L. Herlocker, J. A. Konstan, A. Borchers and J. Riedl, An algorithmic framework for performing
collaborative ltering, In Proceedings of the 22nd Annual International ACM SIGIR Conference
on Research and Development in Information Retrieval, (1999), 230{237.
[15] J. L. Herlocker, J. A. Konstan, L. G. Terveen and J. T. Riedl, Evaluating collaborative ltering
recommender systems, ACM Transactions on Information Systems (TOIS), 22(1) (2004), 5{53.
[16] R. J. Hyndman and A. B. Koehler, Another look at measures of forecast accuracy, International
Journal of Forecasting, 22(4) (2006), 679-688.
[17] M. Jamali and M. Ester, Trustwalker: a random walk model for combining trust-based and item-
based recommendation, In Proceedings of the 15th ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, (2009), 397{406.
[18] D. Jannach, M. Zanker, A. Felfernig and G. Friedrich, Recommender systems: an introduction,
Cambridge University Press, 2010.
[19] G. Karypis, Evaluation of item-based top-n recommendation algorithms, In Proceedings of the 10th
International Conference on Information and Knowledge Management, (2001), 247{254.

[20] J. A. Konstan and J. Riedl, Recommender systems: from algorithms to user experience, User
Modeling and User-Adapted Interaction, 22(1-2) (2012), 101{123.
[21] G. Koutrika, B. Bercovitz and H. Garcia-Molina, FlexRecs: expressing and combining
exible
recommendations, In Proceedings of the 2009 ACM SIGMOD International Conference on
Management of Data, (2009), 745{758.
[22] A. S. Lampropoulos and G. A. Tsihrintzis, Machine Learning Paradigms, Springer, 2015.
[23] C. W. Leung, S. C. Chan and F. Chung, A collaborative ltering framework based on fuzzy
association rules and multiple-level similarity, Knowledge and Information Systems, 10(3) (2006),
357{381.
[24] H. Luo, C. Niu, R. Shen and C. Ullrich, A collaborative ltering framework based on both local
user similarity and global user similarity, Machine Learning, 72(3) (2008), 231{245.
[25] H. Ma, I. King and M. R. Lyu, E ective missing data prediction for collaborative ltering,
In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval, (2007), 39{46.
[26] P. Massa and P. Avesani, Trust metrics in recommender systems, In Computing with Social Trust,
(2009), 259{285.
[27] B. K. Patra, R. Launonen, V. Ollikainen and S. Nandi, A new similarity measure using
Bhattacharyya coecient for collaborative ltering in sparse data, Knowledge-Based Systems, 82
(2015), 163{177.
[28] P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom and J. Riedl, Grouplens: an open architecture
for collaborative ltering of netnews, In Proceedings of the 1994 ACM Conference on Computer
Supported Cooperative Work, (1994), 175-186.
[29] F. Ricci, L. Rokach and B. Shapira, Introduction to recommender systems handbook, Springer,
(2011), 1{35.
[30] B. Sarwar, G. Karypis, J. Konstan and J. Riedl, Analysis of recommendation algorithms for e-
commerce, In Proceedings of the 2nd ACM Conference on Electronic Commerce, (2000), 158{167.
[31] B. Sarwar, G. Karypis, J. Konstan and J. Riedl, Item-based collaborative ltering recommendation
algorithms, In Proceedings of the 10th International Conference on World Wide Web, (2001),
285-295.
[32] J. B. Schafer, J. Konstan and J. Riedl, Recommender systems in e-commerce, In Proceedings of
the 1st ACM Conference on Electronic commerce, (1999), 158{166.
[33] G. Shani and A. Gunawardana, Evaluating recommendation systems, In Recommender Systems
Handbook, Springer US, (2011), 257{297.
[34] U. Shardanand and P. Maes, Social information ltering: Algorithms for automating word of
mouth, In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, (1995),
210-217.
[35] Y. Shi, M. Larson and A. Hanjalic, Collaborative ltering beyond the user-item matrix: A survey
of the state of the art and future challenges, ACM Computing Surveys (CSUR), 47(1) (2014), 3.
[36] X. Su and T. M. Khoshgoftaar, A survey of collaborative ltering techniques, Advances in Arti cial
Intelligence, (2009), 2{19.
[37] R. Yera, J. Castro and L. Martnez, A fuzzy model for managing natural noise in recommender
systems, Applied Soft Computing, 40 (2016), 187{198.
[38] L. A. Zadeh, Fuzzy sets, Information and Control, 8(3) (1965), 338{353.
[39] Z. Zhang, X. Tang and D. Chen, Applying user-favorite-item-based similarity into slope one scheme
for collaborative ltering, Computing and Communication Technologies (WCCCT), 2014 World
Congress on, (2014), 5{7.