WEB PAGE PREDICTION ENHANCED WITH CONFIDENCE MECHANISM
Keywords:
Web page prediction, prefetching, Markov chains, Hidden Markov Models, graph algorithms, browser extensionAbstract
In this work we comparatively present and evaluate different prediction techniques used to anticipate and prefetch web pages and files accessed via browsers. The goal is to reduce the delays necessary to load the web pages and files visited by the users. We have included into our analysis Markov chains, Hidden Markov Models and graph algorithms. We have enhanced all these predictors with confidence mechanism which classifies dynamically web pages as predictable or unpredictable. A prediction is generated only if the confidence counter attached to the current web page is in a predictable state, improving thus the accuracy. Based on the results we have also developed a hybrid predictor consisting in a Hidden Markov Model and a graph-based predictor. The experiments show that this hybrid predictor provides the best prediction accuracies, an average of 85.45% on the “Ocean Group Research” dataset from the University of Boston and 87.28% on the dataset collected from the educational web server of our university, being thus the most appropriate to efficiently predict and prefetch web pages.
Downloads
References
Baum, L.E. An Inequality and Associated Maximization Technique in Statistical Estimation for
Probabilistic Functions of Markov Processes. Inequalities, Vol. 3, 1-8, 1972.
Cambazoglu, B.B. Algorithmic Techniques for Reducing Energy Consumption of Commercial
Web Search engines. Yahoo! Research Barcelona, Spain, Training School, University of Balearic
Islands, Palma de Mallorca, 2012.
Canali, C., Colajanni, M., Lancellotti, R. Adaptive algorithms for efficient content management in
social network services. 10th International Conference on Computer and Information Technology,
-75, 2010.
Ciobanu, D., Dinuca, C.E. Predicting the next page that will be visited by a web surfer using Page
Rank algorithm. International Journal of Computers and Communications, Issue 1, Vol. 6, 60-67,
Cormen, T., Leiserson, C., Rivest, R., Stein, C. Introduction to Algorithms. MIT Press, Third
Edition, 2009.
Cunha, C. A., Bestavros, A., Crovella, M. E. Characteristics of WWW Client Traces. Boston
University Department of Computer Science, Technical Report TR-95-010, 1995.
Deshpande, M., Karypis, G. Selective Markov Models for Predicting Web-Page Accesses. ACM
Transactions on Internet Technology, Vol. 4, Issue 2, 163-184, 2004.
Domènech, J., Pont, A., Sahuquillo, J., Gil, J. A. An experimental framework for testing web
prefetching techniques. The 30th EUROMICRO Conference, 214-221, 2004.
Domènech, J., Sahuquillo, J., Pont, A., Gil, J. A. How current web generation affects prediction
algorithms performance. Proceedings of SoftCOM International Conference on Software,
Telecommunications and Computer Networks, 2005.
Dubey, S., Mishra, N. Web Page Prediction using Hybrid Model. International Journal on
Computer Science & Engineering, Vol. 3 Issue 5, 2170-2176, 2011.
Gellert, A., Vintan, L. Person Movement Prediction Using Hidden Markov Models. Studies in
Informatics and Control, Vol. 15, No. 1, National Institute for Research and Development in
Informatics, Bucharest, 17-30, 2006.
Guo, Y. Z., Ramamohanarao, K., Park, L. A. F. Web Page Prediction Based on Conditional
Random Fields. The 18th European Conference on Artificial Intelligence, 251-255, 2008.
Hasan, M. A., Chaoji, V., Salem, S., Zaki, M. Link Prediction using Supervised Learning.
Proceedings of SDM 06 Workshop on Link Analysis, Counterterrorism and Security, 2006.
Huang, Q, Yang, Q., Huang, J. Z., Ng, M. K. Mining of Web-Page Visiting Patterns with
Continuous-Time Markov Models. Springer-Verlag Berlin Heidelberg, 549-558, 2004.
Huang, Z. Link Prediction Based on Graph Topology: The Predictive Value of Generalized
Clustering Coefficient. Proceedings of the Workshop on Link Analysis: Dynamics and Static of
Large Networks, 2006.
Jin, X., Xu, H. An Approach to Intelligent Web Pre-fetching Based on Hidden Markov Model.
Proceedings of the 42nd Conference on Decision and Control, Maui, Hawaii, USA, 2954-2958,
Vol. 3, 2003.
Kaushal, P. Hybrid Markov model for better prediction of web page. International Journal of
Scientific and Research Publications, Vol. 2, Issue 8, 2012.
Khalil, F., Li, J., Wang, H. Integrating Recommendation Models for Improved Web Page
Prediction Accuracy. Proceedings of the 31st Australasian Conference on Computer Science, Vol.
, 91-100, 2008.
Khalil, F., Li, J., Wang, H. An Integrated Model for Next Page Access Prediction. Internation
Journal of Knowledge and Web Intelligence, Vol. 1, No. 1/2, 48-80, 2009.
Khanchana, R., Punithavalli, M. Web Page Prediction for Web Personalization: A Review. Global
Journal of Computer Science and Technology, Vol. 11, Issue 7, 39-44, 2011.
Liben-Nowell, D., Kleinberg, J. The Link-Prediction Problem for Social Networks, Journal of the
American Society for Information Science and Technology, Vol. 58, Issue 7, pages 1019-1031,
Murata, T., Moriyasu, S. Link Prediction of Social Networks Based on Weighted Proximity
Measures. IEEE/WIC/ACM International Conference on Web Intelligence, 85-88, 2007.
Petzold, J. Augsburg Indoor Location Tracking Benchmarks. Technical Report 2004-9, Institute
of Computer Science, University of Augsburg, Germany, 2004.
Rabiner, L.R. A Tutorial on Hidden Markov Models and Selected Applications in Speech
Recognition. Proceedings of the IEEE, Vol 77, No. 2, 257-286, 1989.
Schmidt, E. Every 2 days we create as much information as we did up to 2003. Lake Tahoe, CA.
URL http://techcrunch.com/2010/08/04/schmidt-data/, 2010.
Singhai, N., Nigam R.K. A Novel Technique to Predict Oftenly Used Web Pages from Usage
Patterns. International Journal of Emerging Trends & Technology in Computer Science, Vol. 1,
Issue 4, 49-55. (2012)
Stamp, M. A Revealing Introduction to Hidden Markov Models.
http://www.cs.sjsu.edu/faculty/stamp/RUA/HMM.pdf, 2004.
Su, Z., Yang, Q., Zhang, H. J. A Prediction System for Multimedia Pre-fetching in Internet.
Proceedings of the eighth ACM international conference on Multimedia, 3-11, 2000.
Temgire, S., Gupta. P. Review on Web Prefetching Techniques. International Journal of
Technology Enhancements and Emerging Engineering Research, Vol. 1, Issue 4, 100-105, 2013.
Vintan, L., Gellert, A., Petzold, J., Ungerer, T. Person Movement Prediction Using Neural
Networks. Proceedings of the KI2004 International Workshop on Modeling and Retrieval of
Context (MRC 2004), Vol-114, Ulm, Germany, 2004.
Vintan, L., Florea, A., Gellert, A. Random Degrees of Unbiased Branches. Proceedings of the
Romanian Academy, Series A, No. 3, 259-268, 2008.
Wan, M., Jönsson, A., Wang, C., Li, L., Yang, Y Web user clustering and Web prefetching using
Random Indexing with weight functions. Knowledge and Information Systems, Vol. 33, Issue 1,
-115, 2012.
Zhu, J., Hong, J., Hughes, J. G. Using Markov Chains for Link Prediction in Adaptive Web Sites.
Springer-Verlag Berlin Heidelberg, 60-73, 2002.