BEHAVIOR BASED WEB PAGE EVALUATION
Keywords:
Web-human interaction, browser interface, navigation, web usage mining, user modelingAbstract
This paper describes our efforts to investigate factors in user browsing behavior to automatically evaluate Web pages that the user shows interest in. To evaluate Web pages automatically, we developed a clientside logging/analyzing tool: the GINIS Framework. We do not focus on just clicking, scrolling, navigation, or duration of visit alone, but we propose integrating these patterns of interaction to recognize and evaluate user response to a given Web page. Unlike most previous Web studies analyzing access through proxies or servers, this work focuses primarily on client-side user behavior using a customized Web browser. First, GINIS unobtrusively gathers logs of user behavior through the user’s natural interaction with the Web browser. Then, it analyses the logs and extracts effective rules to evaluate Web pages using a C4.5 machine learning system. Eventually, GINIS becomes able to automatically evaluate Web pages using these learned rules, after which the evaluation can be utilized for a variety of user profiling applications. We successfully confirmed, for example, that time spent on a Web page is not the most important factor in predicting interest from behavior, which conflicts with the findings of most previous studies.
Downloads
References
Gulli, A. and Signorini, A. (2005) The Indexable Web is more than 11.5 Billion Pages, In
Proceedings of the WWW Conference 2005, 902-903.
Sullivan, D. (2005) Search Engine Sizes,
http://searchenginewatch.com/showPage.html?page=2156481
Google Toolbar, http://toolbar.google.com/
Internet Explorer, http://www.microsoft.com/windows/products/winfamily/ie/default.mspx
Firefox, http://www.mozilla.com/
Opera, http://www.opera.com/
Nagino, N. and Yamada, S. (2003) Future View: Web Navigation Based on Learning User’s
Browsing Patterns, In Proceedings of the International Conference on Web Intelligence, 541-
Kato, H., Nakayama, T. and Yamane, Y. (2000) Navigation Analysis Tool based on the
Correlation between Contents Distribution and Access Patterns, In Proceedings of WebKDD
Workshop on Web Mining for E-Commerce at the 6th ACM SIGKDD, 95-104.
Drummond, C., Ionescu, D., Holte, R., Georganas, N. and Petriu, E. (1996) Intelligent
Browsing for Multimedia Applications, In Proceedings of International Conference on
Multimedia Computing and Systems.
Milic-Frayling, N., Sommerer, R. and Rodden, K. (2003) WebScout: Support for Revisitation of
Web Pages within a Navigation Session, In Proceedings of the International Conference on
Web Intelligence.
Catledge, L. D. and Pitkow, J. E. (1995) Characterizing Browsing Strategies in the World-Wide
Web, In Proceedings of the Third international World-Wide Web Conference on Technology,
Tools and Applications, 1065-1073.
Morita, M. and Shinoda, Y. (1994) Information Filtering Based on User Behavior Analysis and
Best Match Text Retrieval, In Proceedings of the 17th Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval, 272-281.
Quilan, R. (1993) C4.5: Programs for Machine Learning, Morgan Kaufmann.
Lim, T., Loh, W. and Shih, Y. (2000) A Comparison of Prediction Accuracy, Complexity, and
Training Time of Thirty-Three Old and New Classification Algorithms, Machine Learning
Journal, Vol. 40, 203-228.
Weinreich, H., Obendort, H., Herder, E. and Mayer, M. (2006) Off the Beaten Tracks:
Exploring Three Aspects of Web Navigation, In Proceedings of the WWW Conference 2006,
ACM Press, 133-142.
Speretta, M., and Gauch, S. (2004) Personalizing Search Based on User Search Histories, In
Proceedings of the 13th International Conference on Information and Knowledge Management.
Kim, H., and Chan, K. (2005) Implicit Indicators for Interesting Web Pages, Web Information
System and Technologies, 270-277.
Chaffee, J., Gauch, S. (2000) Personal Ontologies for Web Navigation, In Proceedings of the
th International Conference on Information and Knowledge Management, 227-234.
Chen, C. C., Chen, M. C., and Sun, Y. (2001) PVA: A Self-Adaptive Personal View Agent, In
Proceedings of the 7th International Conference on Knowledge Discovery and Data Mining,
-262.
Claypool, M., Le, P., Waseda, M., and Brown, D. (2001) Implicit Interest Indicators, In
Proceedings of the 6th International Conference on Intelligent User Interfaces, 33-40.
Kelly, D. and Teevan, J. (2003) Implicit Feedback for Inferring User Preference: A
Bibliography, SIGIR Forum, 18-28.
White, R. W., Ruthven, I. and Jose, J. M. (2002) Finding Relevant Documents using Top
Ranking Sentences: An Evaluation of Two Alternative Schemes, In Proceedings of the 25th
Annual International ACM SIGIR Conference on Research and Development in Information
Retrieval, 57-64.
Oard, D.W. and Kim, J. (2001) Modeling Information Content Using Observable Behavior, In
Proceedings of the 64th Annual Meeting of American Society for Information Science and
Technology.
Seo, Y. and Zhang, B. (2000)Learning User's Preferences by Analyzing Web-browsing
Behaviors, In Proceedings of the Fourth International Conference on Autonomous Agents, 381-
Jansen, B. J. and M. D. McNeese (2005) Evaluating the Effectiveness of and Patterns of
Interactions with Automated Searching Assistance, Journal of the American Society for
Information Science and Technology, Vol. 56(14), 1480-1503.
Jansen, B. J., Spink, A., and Saracevic, T. (2000) Real Life, Real Users, and Real Needs: A
Study and Analysis of User Queries on the Web, Information Processing and Management, Vol.
(2), 207-227.
Jansen, B. J. and Pooch, U. (2001) A Review of Web Searching Studies and a Framework for
Future Research, Journal of the American Society for Information Science and Technology, Vol.
(3), 235-246.
Fox, S., Karnawat, K., Mydland, M., Dumais, S. T. and White, T. (2005) Evaluating Implicit
Measures to Improve the Search Experience, ACM Transactions on Information Systems, Vol.
(2), 147-168.
Kelly, D. (2005) Implicit Feedback: Using Behavior to Infer Relevance, New Directions in
Cognitive Information Retrieval, Vol. 19.
Kelly, D. and Belkin, N. J. (2001) Reading Time, Scrolling and Interaction: Exploring Implicit
Sources of User Preferences for Relevance Feedback, In Proceedings of the 24th Annual
International ACM SIGIR Conference on Research and Development in Information Retrieval,
-409.
Hilbert, D.M. and Redmiles D.F. (2000) Extracting Usability Information from User Interface
Events, ACM Computing Surveys, Vol. 32(4), 384-421.
Firefox Continues to Erode Microsoft Dominance, http://www.netapplications.com/news.asp
Browser Market Share White Paper, http://www.e-janco.com/browser.htm
Manber, U., Patel, A. and Robison, J. (2000) Experience with Personalization of Yahoo!,
Communications of the ACM, ACM Vol. 43(8), 35-39.
Personalization is not Technology: Using Web Personalization to Promote your Business Goal,
http://www.boxesandarrows.com/view/personalization_is_not_technology_using_web_personal
ization_to_promote_your_business_goal.
Eirinaki, M. and Vazirgiannis, M. (2003) Web Mining for Web Personalization, ACM
Transactions on Internet Technology, Vol. 3(1), 1-27.
Mobasher, B., Cooley, R. and Srivastava, J. (2000) Automatic Personalization Based on Web
Usage Mining, Communications of the ACM, Vol. 43(8), 142-151.
Sugiyama, K., Hatano, K. and Yoshikawa, M. (2004) Adaptive Web Search Based on User
Profile Constructed Without Any Effort From Users, In Proceedings of the 13th International
Conference on World Wide Web, 675-684.
Sugiyama, K., Hatano, K., Yoshikawa, M. and Uemura, S. (2004) User-Oriented Adaptive Web
Information Retrieval based on Implicit Observations, In Proceedings of the 6th Asia Pacific
Web Conference, 636-634.
Yan, T.W. and Garcia-Molina, H. (1995) SIFT - A Tool for Wide-Area Information
Dissemination, In Proceedings of the USENIX Technical Conference, 177-186.
My Yahoo, http://my.yahoo.com/
My Netscape, http://my.netscape.com/
Google Personalized Search, Google Labs, http://labs.google.com/
Shahabi, C. and Chen, Y. (2003) An Adaptive Recommendation System without Explicit
Acquisition of User Relevance Feedback, Distributed and Parallel Databases Vol. 14(2), 173-
Collewijn, H. (1999) Eye Movement Recording, In R. H. S. Carpenter & J. G. Robson (Eds.),
Vision research: A practical guide to laboratory methods, Oxford: Oxford University Press,
-287.
Pazzani, M. and Billsus, D. (1997) Learning and Revising User Profiles: The Identification of
Interesting Web Sites. Machine Learning, Vol. 27(3), 313-331.
Kosala, R. and Blockeel, H. (2000) Web Mining Research: A Survey, SIGKDD ACM SIGKDD
Explorations Newsletter, Vol. 2(1), 1-15.
Sun, J., Zeng, H., Liu, H., Lu, Y., and Chen, Z. (2005) CubeSVD: A Novel Approach to
Personalized Web Search, In Proceedings of the 14th International Conference on World Wide
Web, 382-390.
White, R. W. and Drucker, S. M. (2007) Investigating behavioral variability in web search, In
Proceedings of the 16th International Conference on World Wide Web, WWW '07. ACM Press,
New York, NY, 21-30.
Hu, J., Zeng, H., Li, H., Niu, C., and Chen, Z. (2007) Demographic prediction based on user's
browsing behavior, In Proceedings of the 16th International Conference on World Wide Web,
WWW '07, ACM Press, New York, NY, 151-160.