Web Usage Mining by Neural Hybrid Prediction with Markov Chain Components





Web prefetching, Webpage prediction, Markov chains, neural networks, browser extension


This paper presents and evaluates a two-level web usage prediction technique, consisting of a neural network in the first level and contextual component predictors in the second level. We used Markov chains of different orders as contextual predictors to anticipate the next web access based on specific web access history. The role of the neural network is to decide, based on previous behaviour, whose predictor’s output to use. The predicted web resources are then prefetched into the cache of the browser. In this way, we considerably increase the hit rate of the web browser, which shortens the load times. We have determined the optimal configuration of the proposed hybrid predictor on a real dataset and compared it with other existing web prefetching techniques in terms of prediction accuracy. The best configuration of the proposed neural hybrid method provides an average web access prediction accuracy of 86.95%.


Download data is not yet available.

Author Biography

Arpad Gellert, Computer Science and Electrical Engineering Department, Lucian Blaga University of Sibiu, Romania

Arpad Gellert obtained his MSc (2003) and the PhD (2008) in Computer Science at Lucian Blaga University of Sibiu. He is currently working as an associate professor in the Computer Science and Electrical Engineering Department of the same university. He also worked as visiting researcher in Barcelona and Milano. Previously he was a Java developer at Multimedia Capital Romania. His research interests include computer architecture, smart buildings & factories, web mining and image processing. He published 5 books and over 50 scientific papers in some prestigious journals and international top conferences and acquired more than 300 citations. He was member of several research grants. He developed as a project manager a research grant supported by the Romanian National Council of Academic Research and three internal LBUS grants. Currently he is member of an ongoing Hasso Plattner Excellence Research Grant. He received in 2010 the “Ad Augusta Per Angusta” prize awarded by Lucian Blaga University of Sibiu for excellence in scientific research. His webpage can be found at http://webspace.ulbsibiu.ro/arpad.gellert.


T.T. Adelaja, S.O. Akinola, ‘An Enhanced Web Page Recommendation System Using Hidden Markov Model and Page Rank Technique’, Journal of Science and Logics in ICT Research, Vol. 1, 57–63, June 2017.

M. Brala, M. Dhanda, ‘An Improved Markov Model Approach to Predict Web Page Caching’, International Journal of Computer Science and Communication Networks, Vol. 2, 393–399, 2012.

L. Chun-jie, L. Hong-bo, L. Hua, Z. Li-ming, F. Chun-yan, G. Ji-ping, ‘Research on Polynomial Regression Prefetching Model’, IOP Conference Series: Earth and Environmental Science, Volume 332, Issue 2, 2019.

C.A. Cunha, A. Bestavros, M.E. Crovella, ‘Characteristics of WWW Client Traces’, Boston University Department of Computer Science, Technical Report TR-95-010, 1995.

N.T. Da, T. Hanh, P.H. Duy, ‘Improving Webpage Access Predictions Based on Sequence Prediction and PageRank Algorithm’, Interdisciplinary Journal of Information, Knowledge, and Management, Vol. 14, 27–44, 2019.

S. Dubey, N. Mishra, ‘Web Page Prediction using Hybrid Model’, International Journal on Computer Science & Engineering, Vol. 3 Issue 5, 2170–2176, 2011.

W. Ertel, ‘Introduction to Artificial Intelligence’, Second Edition, Springer, 2017.

W. Feng, T.H. Kazi, G. Hu, ‘Web Prefetching by ART1 Neural Network’, Software and Network Engineering. Studies in Computational Intelligence, Vol. 413, 29–40, 2012.

A. Gellert, A. Florea, ‘Investigating a New Design Pattern for Efficient Implementation of Prediction Algorithms’, Journal of Digital Information Management, Vol. 11, Issue 5, 366–377, 2013.

A. Gellert, A. Florea, ‘Web Page Prediction Enhanced with Confidence Mechanism’, Journal of Web Engineering, Vol. 13, Issue 5–6, 507–524, 2014.

A. Gellert, A. Florea, ‘Web Prefetching through Efficient Prediction by Partial Matching’, World Wide Web: Internet and Web Information Systems, Vol. 19, Issue 5, 921–932, September 2016.

A. Gellert, ‘Web Access Mining through Dynamic Decision Trees with Markovian Features’, Journal of Web Engineering, Vol. 16, Issue 5–6, 524–536, 2017.

S. Hong, K. Kim, T. Kim, ‘A Prefetching Scheme for Improving the Web Page Loading Time with NVRAM’, Journal of Semiconductor Technology and Science, Vol. 18, No. 1, 20–28, February 2018.

T. Ibrahim, C.-Z. Xu, ‘Neural Nets Based Predictive Pre-fetching to Tolerate WWW Latency’, 20th IEEE International Conference on Distributed Computing Systems, 636–643, Taipe, Taiwan, April 2000.

M. Joo, Y. An, H. Roh, W. Lee, ‘Predictive Prefetching Based on User Interaction for Web Applications’, IEEE Communication Letters, Vol. 25, Issue 3, 821–824, March 2021.

A.S. Kakar, M.S. Rohie, ‘A Review of Probabilistic Techniques Used for Web Browsers’ Caching’, European Journal of Engineering research and Science, Vol. 5, Issue 7, 773–780, July 2020.

P. Kaushal, ‘Hybrid Markov model for better prediction of web page’, International Journal of Scientific and Research Publications, Vol. 2, Issue 8, 2012.

K.-C. Kim, ‘Framework for Efficient Web Page Prediction using Deep Learning’, Journal of the Korea Society of Computer and Information, Vol. 25, Issue 12, 165–172, 2020.

M. Kubat, ‘An Introduction to Machine Learning’, Second Edition, Springer, 2017.

O.P. Mandal, H.K. Azad, ‘Web Access Prediction Model using Clustering and Artificial Neural Network’, International Journal of Engineering Research & Technology, Vol. 3, Issue 9, 195–199, 2014.

M. Narvekar, S.S. Banu, ‘Predicting User’s Web Navigation Behavior Using Hybrid Approach’, International Conference on Advanced Computing Technologies and Applications, Vol. 45, 3–12, Mumbai, India, March 2015.

T. Pamutha, S. Chimphlee, C. Kimpan, P. Sanguansat, ‘Web Page Access Prediction on Server Side’, Journal of Convergence Information Technology, Vol. 9, No. 5, September 2014.

K. Radinsky, K.M. Svore, S.T. Dumais, M. Shokouhi, J. Teevan, A. Bocharov, E. Horvitz, ‘Behavioral Dynamics on the Web: Learning, Modeling, and Prediction’, ACM Transactions on Information Systems, Vol. 31, Issue 3, July 2013.

S. Setia, Jyoti, N. Duhan, ‘Neural Network Based Prefetching Control Mechanism’, International Journal of Engineering and Advanced Technology, Vol. 9, Issue 2, 2019.

K. Shyamala, S. Kalaivani, ‘Application of Monte Carlo Search for Performance Improvement of Web Page Prediction’, International Journal of Engineering & Technology, Vol. 7, No. 3.4, 133–137, 2018.

K. Shyamala, S. Kalaivani, ‘Enhanced Webpage Prediction Using Rank Based Feedback Process’, New Trends in Computational Vision and Bio-inspired Computing, Springer, 567–576, September 2020.

N. Singhai, R.K. Nigam, ‘A Novel Technique to Predict Oftenly Used Web Pages from Usage Patterns’, International Journal of Emerging Trends & Technology in Computer Science, Vol. 1, Issue 4, 49–55, 2012.

D.S. Sisodia, V. Khandal, R. Singhal, ‘Fast Prediction of Web User BrowsingBehaviours Using Most Interesting Patterns’, Journal of Information Science, Vol. 44, No. 1, February 2018.

W. Tian, B. Choi, V. Phoha, ‘An Adaptive Web Cache Access Predictor Using Neural Network’, Development in Applied Artificial Intelligence, Lecture Notes in Artificial Intelligence, Vol. 2358, 450–459, 2002.

L. Vintan, A. Gellert, J. Petzold, T. Ungerer, ‘Person Movement Prediction Using Neural Networks’, Proceedings of the KI2004 International Workshop on Modeling and Retrieval of Context (MRC 2004), Vol. 114, Ulm, Germany, September 2004.

C.-Z. Xu, T. Ibrahim, ‘A Keyword-Based Semantic Prefetching Approach in Internet News Services’, IEEE Transactions on Knowledge and Data Engineering, Vol. 16, Issue 5, 601–611, 2004.

W. Zou, J. Won, J. Ahn, K. Kang, ‘Intentionality-related Deep Learning Method in Web Prefetching’, Proceedings of the 27th International Conference on Network Protocols, Chicago, USA, October 2019.