FINDING UNEXPECTED NAVIGATION BEHAVIOUR IN CLICKSTREAM DATA FOR WEBSITE DESIGN IMPROVEMENT
Keywords:
Navigation behaviour, web usage mining, clickstream data, sequential mining, website design, website designersAbstract
This paper describes a novel web usage mining approach to discover patterns in the navigation of websites known as Unexpected Navigation Behaviours (UNBs). The approach provides a web designer with a means of identifying and classifying patterns of browsing and, by reviewing these patterns, the designer can then choose to modify the design of their site or redesign it completely. UNB mining is based on the Consecutive Common Subsequence (CCS), a special instance of Common Subsequence (CS), which is used to define a set of expected routes. The predefined expected routes are then treated as rules and stored in a rule base. By using the predefined route and the UNB mining algorithm, interesting navigation behaviours can be discovered. This paper will introduce the format of the expected route and describe the UNB algorithms. It will also describe a tool that a website designer can use to define the expected route more efficiently, which can help the website designer to make decision about where and how the design of website can be improved. The paper concludes with a series of experiments designed to evaluate how well the UNB mining algorithms work and demonstrate how UNB mining can be useful for improving website design.
Downloads
References
Banerjee, A. and Ghosh, J., Clickstream clustering using weighted longest common
subsequences, In Proceedings of the 1st SIAM International Conference on Data Mining:
Workshop on Web Mining, 2001.
Borges, J. and Levene, M., Data mining of user navigation patterns, In Masand B. M. and
Spiliopoulu, M. eds. Web Usage Analysis and User Profiling, International WEBKDD'99
Workshop, San Diego, California, US, LNCS 1836, 2000, 92-111.
Canter, D., River, R. and Storrs, G., Characterizing user navigation through complex data
structure, Behaviour and Information Technology, 4 (2), 1985, 93-102.
Clark, L., Ting, I. H., Kimble, C., Wright, P. and Kudenko, D., Combining ethnographic and
clickstream data to identify user web navigation strategies, Information Research, Vol.11 No.2,
paper 249, January 2006 [Available at http://InformationR.net/ir/11-2/paper249.html].
Cooley, R., Mobasher, B. and Srivastave, J., Web mining: information and pattern discovery on
the World Wide Web, In Proceedings of the 9th IEEE International Conference on Tool with
Artificial Intelligence, Newport Beach, CA, USA, 1997, 558-567.
Cooley, R., Tan, P. N. and Srivastava, J., Discovery of interesting usage patterns from web data,
In Masand B. M. and Spiliopoulu, M. eds. Web Usage Analysis and User Profiling, International
WEBKDD'99 Workshop, San Diego, California, USA, August 15, 2000, LNCS 1836, Springer-
Verlag, 2000, 163-182.
Dömel, P., WebMap - a graphical hypertext navigation tool, In Proceedings of the Second
International WWW Conference, Chicago, USA, 1994.
Fan, X. and Holsapple, C. W., An empirical study of web site navigation structures’ impacts on
web site usability, Decision Support Systems, 43, 2007, 476-491.
Eirinaki, M. and Vazirgiannis, M., Web mining for web personalization, ACM Transactions on
Internet Technology, 3 (1), 2003, 1-27.
Fu, Y., Creado, M. and Ju, C., Reorganizing web sites based on user access patterns, In
Proceedings of the 10th International Conference on Information and Knowledge Management
(CIKM’01), Atlanta Georgia, USA, November 5-10, 2001, 583-585.
G. Hooker, G. and Finkelman, M., Sequential analysis for learning models of navigation, In
Proceedings of WebKDD 2004 Workshop on Web Mining and Web Usage Analysis, Seattle,
WA, USA, 2004.
Kohavi, R., Mining e-commerce data: the good, the bad, and the ugly, In Proceedings of the
seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,
April 16-18, Hong Kong, 2001, 8-13.
Kohavi, R., Mason, L. and Zheng, Z., Lessons and challenges from mining retail e-commerce
data, Machine Learning, 57, 2001, 83-113.
Kothari, R., Mittal, R., Jain, V. and Mohania, M., On using page co-occurrences for computing
clickstream similarity, In Proceedings of SIAM International Conference on Data Mining, San
Francisco, CA, USA, May 1-3 2003.
Lee, W. P., Liu, C. H. and Lu, C. C., Intelligent Agent-based systems for personalized
recommendation in internet commerce, Expert Systems with Applications, 22, 2002, 275-284.
Mobasher, B., Dai, H., Luo, T. and Nakagawa, M., Discovery and Evaluation of Aggregate
Usage Profile for Web Personalization, Data Mining and Knowledge Discovery, 6, 2002, 61-82.
Nasraoui, O., Cardona, C. and Rojas, C., Mining evolving web clickstreams with explicit
retrieval similarity measures”, In Proceedings of International Web Dynamics Workshop,
International World Wide Web Conference, New York, NY, USA, May 2004.
Nasraoui, O. and Pavuluri, M., Complete this puzzle: a connectionist approach to accurate web
recommendation based on a committee of predictors, In Proceedings of WebKDD 2004
workshop on Web Mining and Web Usage Analysis, Seattle, WA USA, 2004, 47-60.
Nielsen, J., Farrell, S., Molich, R., and Snyder, C., E-Commerce User Experience, Nielsen
Norman Group, 2001.
Pather, S., Erwin, G. and Remenyi, D., Measuring e-commerce effectiveness: a conceptual
model, In Proceedings of SAICSIT Conference, 2003, 143-152.
Perkowitz, M. and Etzioni, O., Adaptive web sites, Communications of the ACM, 143 (8), 2000,
-158.
Raphael, A. and Brower, G., Usability testing: Think-aloud protocol, User-Centered Information
Design Workbook, [Available at http://www.washington.edu/webguides/workbook/] (Access
date: 7 March 2008).
Ristad, E. S. and Yianilos, P. N., Learning string edit distance, IEEE Transactions on Pattern
Analysis and Machine Intelligence, 20 (2), 1998, 522-532.
Shahabi, C. and Kashani, F. B., Efficient and anonymous web usage mining based on client-side
tracking”, In Proceedings of WEBKDD 2001, 2001, 113-144.
Srivastava, J., Cooley, R., Deshpande, M. and Tan, P. N., Web usage mining: discovery and
applications of usage patterns from web data, SIGKDD Explorations, 1(2), 2000,12-23.
Tan, P. N. and Kumar, V., Discovery of web robot sessions based on their navigation patterns,
Data Mining and Knowledge Discovery, 6, 2000, 9-35.
Ting, I. H., Kimble, C. and Kudenko, D., Visualizing and Classifying the Pattern of User's
Browsing Behaviour for Website Design Recommendation, In Proceedings of the First
International Workshop on Knowledge Discovery in Data Stream (ECML/ PKDD 2004) Pisa,
Italy, 24 September 2004, 101-102.
Ting, I. H., Kimble, C. and Kudenko, D., A pattern restore method for restoring missing patterns
in server side clickstream data, In Zhang, Y. et al. eds. APWeb 2005, LNCS 3399, Springer-
Verlag, 2005, 501-512.
Wu, H., Gordon, M., Demaagd, K. and Fan, W., Mining web navigations for intelligence,
Decision Support Systems, 41, 2006, 574-591.
Yen, P.-C., The design and evaluation of accessibility on web navigation, Decision Support
Systems, 42, 2007, 2219-2235.