TOTAL PRIVACY PRESERVATION AND SEARCH QUALITY IMPROVEMENT IN PERSONALIZED WEB SEARCH
Keywords:
Personalized search, Topic model, Relevance ranking, Vector Space Model, Homomorphic encryptionAbstract
Personalized Web Search (PWS) has dramatically improved the quality and effectiveness of web search nowadays. Its effectiveness totally depends on the user personnel data collection. But collecting personnel data possess a major threat of privacy risk. Even the search engine side can misuse this personnel information. Evidences show that users are more concerned about their privacy than search quality. We here propose a new system which offers improved search quality and complete privacy protection. A pattern based user interested topic presentation enhances the personalized search. When the user submits a query, our system creates a vector space model with a weight of query related frequent pattern words and then sent to the server side. To avoid the vulnerability of eavesdropping of this vector space model, a modified version of fully homomorphic encryption scheme is used. Experimental results of performance analysis show that our system improves the effectiveness of PWS and execution time.
Downloads
References
Lidan Shou, He Bai, Ke Chen, and Gang Chen, ”Supporting Privacy Protection in Personalized Web Search”, In Proc. IEEE Trans.on Knowledge and Data Eng., vol.26, issue 2, February 2014.
Pitokow James, Hinrich Schütze, Todd Cass, Rob Cooley, Don Turnbull, Andy Edmonds, Eytan Adar, and Thomas Breuel (2002), "Personalized search", Communications of the ACM (CACM) 45 (9): 50–55.
Jaime Teevan, Susan T. Dumais and Eric Horvitz (2005). "Personalizing search via automated analysis of interests and activities", In Proc. 28th Annual Int‟l ACM SIGIR Conf. on Research and Development in Information Retrieval, August 2005..
Zhicheng Dou, Ruihua Song and Ji-Rong Wen(2007). "A large-scale evaluation and analysis of personalized search strategies", In Proc. 16th Int‟l Conf. World Wide Web, ACM Press.
Paul Alexandru Chirita, Claudiu S. Firan, Wolfgang Nejdl (2006). "Summarizing local context to personalize global Web search", In Proc.15th ACM Int‟l Conf. on Information and knowledge management.
Harry, David. "Search Personalization and the User Experience". Retrieved 29 April 2014.
K. Sugiyama, K. Hatano, and M. Yoshikawa, “Adaptive Web Search Based on User Profile Constructed without any Effort from Users”, In Proc. 13th Int‟l Conf. World Wide Web (WWW), 2004.
Xuehua Shen, Bin Tan, and ChengXiang Zhai, “Privacy Protection in Personalized Search”, ACM SIGIR Forum, vol. 41, no. 1, pp. 4-17, June 2007.
J.Castelli-Roca, A. Viejo, and J. Herrera-Joancomarti´, “Preserving User‟s Privacy in Web Search Engines”, Computer Comm., vol. 32, no. 13/14, pp. 1541-1551, 2009. 10. Spetka and Scott, "The WWW Robot: Beyond Browsing", NCSA. Archived from the original on 3 September 2004. Retrieved 21 November 2010. 11. Castillo and Carlos, “Effective Web Crawling (Ph.D. thesis)”, University of Chile, Retrieved 2010-08-03.
C. D. Manning, P. Raghavan and H. Schutze, (2008). "Scoring, term weighting, and the vector space model", Introduction to Information Retrieval . p. 100.
Marten van Dijk, Craig Gentry, Shai Halevi and Vinod Vaikuntanathan. "Fully Homomorphic Encryption over the Integers", June 8, 2010.
David M. Blei, Andrew Y. Ng, and Michael I. Jordan, “Latent dirichlet allocation”, Journal of Machine Learning Research 3 (2003) 993-1022.
X. Wang, A. McCallum, and X. Wei, “Topical n-grams: Phrase and topic discovery, with an application to information retrieval”, In Proc. 7th IEEE Int. Conf. Data Min., 2007, pp. 697–702.
Singhal and Amit, "Modern Information Retrieval: A Brief Overview",Bulletin of the IEEE Computer Society Technical Committee on Data Engineering 24 (4): 35–43.
D.E. Rose, “Why Is Web Search So Hard... to Evaluate?”, Rinton Press,Vol.3, No.3&4 December, (2004) 171-181.
J.C.C. Pun and F.H. Lochovsky,” Ranking Search Results by Web Quality Dimensions”, Rinton Press., Vol.3 No.3&4 December, (2004) 216-235.