WEB ACCESS MINING THROUGH DYNAMIC DECISION TREES WITH MARKOVIAN FEATURES

Authors

  • ARPAD GELLERT Computer Science and Electrical Engineering Department, Lucian Blaga University of Sibiu, Romania

Keywords:

Web page prediction, web prefetching, Markov chains, dynamic decision tree, browser extension

Abstract

In this work we propose a hybrid web access prediction method consisting in a dynamic decision tree and different order Markov predictors as components. The predictions generated by the Markov chain components are used as features within the dynamic decision tree. Our goal is to use this hybrid technique in order to anticipate and prefetch the web pages and files accessed by the users through browsers, reducing thus the load times. We use a decision tree to select the most predictive features from a considered feature set and based on those selected features we generate predictions. In our application, the feature set includes the current link, the type of the current link as well as the predictions of different order Markov chains. The optimal configuration of the proposed hybrid technique provides an average web page prediction accuracy of 72.57%.

Downloads

Download data is not yet available.

References

Brala, M., Dhanda, M. An Improved Markov Model Approach to Predict Web Page Caching.

International Journal of Computer Science and Communication Networks, Vol. 2, 393-399, 2012.

Canali, C., Colajanni, M., Lancellotti, R. Adaptive algorithms for efficient content management in

social network services. 10th International Conference on Computer and Information Technology,

-75, 2010.

Ciobanu, D., Dinuca, C.E. Predicting the next page that will be visited by a web surfer using Page

Rank algorithm. International Journal of Computers and Communications, Issue 1, Vol. 6, 60-67,

Cunha, C. A., Bestavros, A., Crovella, M. E. Characteristics of WWW Client Traces. Boston

University Department of Computer Science, Technical Report TR-95-010, 1995.

Dubey, S., Mishra, N. Web Page Prediction using Hybrid Model. International Journal on Computer

Science & Engineering, Vol. 3 Issue 5, 2170-2176, 2011.

Gellert, A, Florea A. Investigating a New Design Pattern for Efficient Implementation of Prediction

Algorithms. Journal of Digital Information Management, Vol. 11, Issue 5, 366-377, 2013.

Gellert, A., Florea, A. Web Page Prediction Enhanced with Confidence Mechanism, Journal of Web

Engineering, Vol. 13, Issue 5-6, 507-524, 2014.

Gellert, A., Florea, A. Web Prefetching through Efficient Prediction by Partial Matching, World

Wide Web: Internet and Web Information Systems, Vol. 19, Issue 5, 921-932, September 2016.

Kaushal, P. Hybrid Markov model for better prediction of web page. International Journal of

Scientific and Research Publications, Vol. 2, Issue 8, 2012.

Khalil, F., Li, J., Wang, H. Integrating Recommendation Models for Improved Web Page Prediction

Accuracy. Proceedings of the 31st Australasian Conference on Computer Science, Vol. 74, 91-100,

Khalil, F., Li, J., Wang, H. An Integrated Model for Next Page Access Prediction. International

Journal of Knowledge and Web Intelligence, Vol. 1, No. 1/2, 48-80, 2009.

Khanchana, R., Punithavalli, M. Web Page Prediction for Web Personalization: A Review. Global

Journal of Computer Science and Technology, Vol. 11, Issue 7, 39-44, 2011.

Kundu A., Guha S., Mitra A., Mukherjee T. A New Approach in Dynamic Prediction for User based

Web Page Crawling. Proceedings of the International Conference on Management of Emergent

Digital Ecosystems, 166-173, Bangkok, Thailand, October 2010.

Lowd, D., Davis, J. Improving Markov Network Structure Learning Using Decision Trees. Journal

of Machine Learning Research, Vol. 15, 501-532, 2014.

Mitchell, T. Machine Learning. McGraw-Hill, 1997.

Narvekar, M., Banu, S.S. Predicting User’s Web Navigation Behavior Using Hybrid Approach.

International Conference on Advanced Computing Technologies and Applications, Vol. 45, 3-12,

Mumbai, India, March 2015.

Pabarskaite Z. Decision Trees for Web Log Mining. Journal of Intelligent Data Analysis, Vol. 7,

Issue 2, 141-154, April 2003.

Pamutha, T., Chimphlee, S., Kimpan, C., Sanguansat, P. Web Page Access Prediction on Server

Side. Journal of Convergence Information Technology, Vol. 9, No. 5, September 2014.

Quinlan, J.R. Induction of Decision Trees. Machine Learning, Vol. 1, Issue 1, 81-106, 1986.

Radinsky K., Svore, K.M., Dumais, S.T., Shokouhi, M., Teevan, J., Bocharov, A., Horvitz, E.

Behavioral Dynamics on the Web: Learning, Modeling, and Prediction. ACM Transactions on

Information Systems, Vol. 31, Issue 3, July 2013.

Singhai, N., Nigam R.K. A Novel Technique to Predict Oftenly Used Web Pages from Usage

Patterns. International Journal of Emerging Trends & Technology in Computer Science, Vol. 1,

Issue 4, 49-55, 2012.

Temgire, S., Gupta. P. Review on Web Prefetching Techniques. International Journal of Technology

Enhancements and Emerging Engineering Research, Vol. 1, Issue 4, 100-105, 2013.

Umapathi, C., Aramuthan M., Raja, K., Enhancing Web Services Using Predictive Caching,

International Journal of Research and Reviews in Information Sciences, Vol. 1, No. 3, September

Wan, M., Jönsson, A., Wang, C., Li, L., Yang, Y. Web user clustering and Web prefetching using

Random Indexing with weight functions. Knowledge and Information Systems, Vol. 33, Issue 1,

-115, 2012

Downloads

Published

2017-05-25

Issue

Section

Articles