TECHNIQUES AND METRICS FOR IMPROVING WEBSITE STRUCTUREa
Keywords:
Web Metrics, Web Organization, Log File ProcessingAbstract
Evaluation of the link structure of a web site and its redefinition to achieve increased efficiency with regard to easier information retrieval is a common problem in website development. Nevertheless much effort has been devoted in order to analyze the overall statistical properties of a web site, rather than to assess the actual value of its pages. In this paper two distinct metrics are proposed, which aim to quantify the importance of a web page based on the visits it receives by the users and its location within the website. Subsequently, certain guidelines are presented, which can be used to reorganize the website, taking into account the optimization of these metrics. Finally we evaluate the proposed algorithms using real-world website data and verify that they exhibit more elaborate behavior than a related simpler technique.
Downloads
References
Berkhin, P., Becher, J.D. & Randall, D.J.: Interactive path analysis of web site traffic. Proceedings of
KDD01 pp. 414-419, 2001.
Bose, P., Kranakis, E., Krizanc, D., Vargas Martin, M., Czyzowicz, J., Pelc, A. & Gasieniec, L.: Strategies
for Hotlink Assignments. In Proceedings of the International Symposium on Algorithms and computation,
(ISAAC 2000), pp 23-34, LNCS 2223, Springer Verlag, 2000
Botafogo, R.A., Rivlin, E. & Shneiderman, B.: Structural Analysis of Hypertext: Identifying Hierarchies and
Useful Metrics, ACM Transactions on Information Systems, vol. 10, no 2, pp. 142-180, April 1992
Brin S. & Page L., The anatomy of a large-scale hypertextual Web search engine, Computer Networks and
ISDN Systems, 30(1-7): 107:117, 1998
Chen Ming-Syan, Jong Soo Park, & Philip S. Yu. Data mining for path traversal patterns in a web
environment. In Proc. of the 16th International Conference on Distributed Computing Systems, pp. 385-392,
Christopoulou, E., Garofalakis, J, Makris, C., Panagis, Y., Sakkopoulos, E. & Tsakalidis, A. Automating
restructuring of web applications, poster presentation in ACM HT 02, (available through the following link
http://mmlab.ceid.upatras.gr/HT02/ht2002.pdf).
Drott M.C. Using web server logs to improve site design Proceedings of ACM SIGDOC 98 pp.43-50, 1998.
Extended Log File Format W3C. http://www.w3.org/pub/WWW/TR/WD-logfile.html.
Garofalakis, J.D., Kappos, P. & Mourloukos, D.: Web Site Optimization Using Page Popularity. IEEE
Internet Computing 3(4): 22-29 (1999)
Hypertext Transfer Protocol, RFC 2616, W3C. http://www.w3.org/Protocols/rfc2616/rfc2616.html.
Inline Frames, HTML 4 W3C recommendation http://w3c.org/TR/REC-html40/present/frames.html
Kleinberg, J.M.: Authoritative Sources in a Hyperlinked Environment. JACM 46(5): 604-632 (1999)
Spiliopoulou, M., Mobasher, B., Berendt, B. & Nakagawa, M.: A framework for the evaluation of session
reconstruction heuristics in the web usage analysis. INFORMS Journal on Computing. Special Issue on
Mining Web-Based Data for E-Business Applications, Vol. 15 No. 2, 2003.
Srikant, R., Yang, Y.: Mining web logs to improve website organization. In Proceedings of the WWW10,
Hong-Kong, pp 430-437, 5/2001
Zhou Baoyao, Jinlin Chen, Jin Shi, HongJiang Zhang, Qiufeng Wu: Website link structure evaluation and
improvement based on user visiting patterns. In Proceedings of the 12th ACM Conference on Hypertext
(HT01), pp. 241-244, 2001
Web reference:Analog. http://www.analog.cx
Web reference:SurfStats http://www.surfstats.com
Web reference:Web Trends http://www.webtrends.com
Web reference:WebLogs http://www.cape.com