WEB PAGE PREDICTION ENHANCED WITH CONFIDENCE MECHANISM

Authors

  • ARPAD GELLERT Computer Science and Electrical Engineering Department, Lucian Blaga University of Sibiu, Romania
  • ADRIAN FLOREA Computer Science and Electrical Engineering Department, Lucian Blaga University of Sibiu, Romania

Keywords:

Web page prediction, prefetching, Markov chains, Hidden Markov Models, graph algorithms, browser extension

Abstract

In this work we comparatively present and evaluate different prediction techniques used to anticipate and prefetch web pages and files accessed via browsers. The goal is to reduce the delays necessary to load the web pages and files visited by the users. We have included into our analysis Markov chains, Hidden Markov Models and graph algorithms. We have enhanced all these predictors with confidence mechanism which classifies dynamically web pages as predictable or unpredictable. A prediction is generated only if the confidence counter attached to the current web page is in a predictable state, improving thus the accuracy. Based on the results we have also developed a hybrid predictor consisting in a Hidden Markov Model and a graph-based predictor. The experiments show that this hybrid predictor provides the best prediction accuracies, an average of 85.45% on the “Ocean Group Research” dataset from the University of Boston and 87.28% on the dataset collected from the educational web server of our university, being thus the most appropriate to efficiently predict and prefetch web pages.

 

Downloads

Download data is not yet available.

References

Baum, L.E. An Inequality and Associated Maximization Technique in Statistical Estimation for

Probabilistic Functions of Markov Processes. Inequalities, Vol. 3, 1-8, 1972.

Cambazoglu, B.B. Algorithmic Techniques for Reducing Energy Consumption of Commercial

Web Search engines. Yahoo! Research Barcelona, Spain, Training School, University of Balearic

Islands, Palma de Mallorca, 2012.

Canali, C., Colajanni, M., Lancellotti, R. Adaptive algorithms for efficient content management in

social network services. 10th International Conference on Computer and Information Technology,

-75, 2010.

Ciobanu, D., Dinuca, C.E. Predicting the next page that will be visited by a web surfer using Page

Rank algorithm. International Journal of Computers and Communications, Issue 1, Vol. 6, 60-67,

Cormen, T., Leiserson, C., Rivest, R., Stein, C. Introduction to Algorithms. MIT Press, Third

Edition, 2009.

Cunha, C. A., Bestavros, A., Crovella, M. E. Characteristics of WWW Client Traces. Boston

University Department of Computer Science, Technical Report TR-95-010, 1995.

Deshpande, M., Karypis, G. Selective Markov Models for Predicting Web-Page Accesses. ACM

Transactions on Internet Technology, Vol. 4, Issue 2, 163-184, 2004.

Domènech, J., Pont, A., Sahuquillo, J., Gil, J. A. An experimental framework for testing web

prefetching techniques. The 30th EUROMICRO Conference, 214-221, 2004.

Domènech, J., Sahuquillo, J., Pont, A., Gil, J. A. How current web generation affects prediction

algorithms performance. Proceedings of SoftCOM International Conference on Software,

Telecommunications and Computer Networks, 2005.

Dubey, S., Mishra, N. Web Page Prediction using Hybrid Model. International Journal on

Computer Science & Engineering, Vol. 3 Issue 5, 2170-2176, 2011.

Gellert, A., Vintan, L. Person Movement Prediction Using Hidden Markov Models. Studies in

Informatics and Control, Vol. 15, No. 1, National Institute for Research and Development in

Informatics, Bucharest, 17-30, 2006.

Guo, Y. Z., Ramamohanarao, K., Park, L. A. F. Web Page Prediction Based on Conditional

Random Fields. The 18th European Conference on Artificial Intelligence, 251-255, 2008.

Hasan, M. A., Chaoji, V., Salem, S., Zaki, M. Link Prediction using Supervised Learning.

Proceedings of SDM 06 Workshop on Link Analysis, Counterterrorism and Security, 2006.

Huang, Q, Yang, Q., Huang, J. Z., Ng, M. K. Mining of Web-Page Visiting Patterns with

Continuous-Time Markov Models. Springer-Verlag Berlin Heidelberg, 549-558, 2004.

Huang, Z. Link Prediction Based on Graph Topology: The Predictive Value of Generalized

Clustering Coefficient. Proceedings of the Workshop on Link Analysis: Dynamics and Static of

Large Networks, 2006.

Jin, X., Xu, H. An Approach to Intelligent Web Pre-fetching Based on Hidden Markov Model.

Proceedings of the 42nd Conference on Decision and Control, Maui, Hawaii, USA, 2954-2958,

Vol. 3, 2003.

Kaushal, P. Hybrid Markov model for better prediction of web page. International Journal of

Scientific and Research Publications, Vol. 2, Issue 8, 2012.

Khalil, F., Li, J., Wang, H. Integrating Recommendation Models for Improved Web Page

Prediction Accuracy. Proceedings of the 31st Australasian Conference on Computer Science, Vol.

, 91-100, 2008.

Khalil, F., Li, J., Wang, H. An Integrated Model for Next Page Access Prediction. Internation

Journal of Knowledge and Web Intelligence, Vol. 1, No. 1/2, 48-80, 2009.

Khanchana, R., Punithavalli, M. Web Page Prediction for Web Personalization: A Review. Global

Journal of Computer Science and Technology, Vol. 11, Issue 7, 39-44, 2011.

Liben-Nowell, D., Kleinberg, J. The Link-Prediction Problem for Social Networks, Journal of the

American Society for Information Science and Technology, Vol. 58, Issue 7, pages 1019-1031,

Murata, T., Moriyasu, S. Link Prediction of Social Networks Based on Weighted Proximity

Measures. IEEE/WIC/ACM International Conference on Web Intelligence, 85-88, 2007.

Petzold, J. Augsburg Indoor Location Tracking Benchmarks. Technical Report 2004-9, Institute

of Computer Science, University of Augsburg, Germany, 2004.

Rabiner, L.R. A Tutorial on Hidden Markov Models and Selected Applications in Speech

Recognition. Proceedings of the IEEE, Vol 77, No. 2, 257-286, 1989.

Schmidt, E. Every 2 days we create as much information as we did up to 2003. Lake Tahoe, CA.

URL http://techcrunch.com/2010/08/04/schmidt-data/, 2010.

Singhai, N., Nigam R.K. A Novel Technique to Predict Oftenly Used Web Pages from Usage

Patterns. International Journal of Emerging Trends & Technology in Computer Science, Vol. 1,

Issue 4, 49-55. (2012)

Stamp, M. A Revealing Introduction to Hidden Markov Models.

http://www.cs.sjsu.edu/faculty/stamp/RUA/HMM.pdf, 2004.

Su, Z., Yang, Q., Zhang, H. J. A Prediction System for Multimedia Pre-fetching in Internet.

Proceedings of the eighth ACM international conference on Multimedia, 3-11, 2000.

Temgire, S., Gupta. P. Review on Web Prefetching Techniques. International Journal of

Technology Enhancements and Emerging Engineering Research, Vol. 1, Issue 4, 100-105, 2013.

Vintan, L., Gellert, A., Petzold, J., Ungerer, T. Person Movement Prediction Using Neural

Networks. Proceedings of the KI2004 International Workshop on Modeling and Retrieval of

Context (MRC 2004), Vol-114, Ulm, Germany, 2004.

Vintan, L., Florea, A., Gellert, A. Random Degrees of Unbiased Branches. Proceedings of the

Romanian Academy, Series A, No. 3, 259-268, 2008.

Wan, M., Jönsson, A., Wang, C., Li, L., Yang, Y Web user clustering and Web prefetching using

Random Indexing with weight functions. Knowledge and Information Systems, Vol. 33, Issue 1,

-115, 2012.

Zhu, J., Hong, J., Hughes, J. G. Using Markov Chains for Link Prediction in Adaptive Web Sites.

Springer-Verlag Berlin Heidelberg, 60-73, 2002.

Downloads

Published

2014-06-27

How to Cite

GELLERT, A. ., & FLOREA, A. (2014). WEB PAGE PREDICTION ENHANCED WITH CONFIDENCE MECHANISM. Journal of Web Engineering, 13(5-6), 507–524. Retrieved from https://journals.riverpublishers.com/index.php/JWE/article/view/3917

Issue

Section

Articles