Abstract
As regards session identification method on web mining, an improved one has been put forward. Firstly, considering website structure and its content, page access time threshold will be reached after collecting access time of each page, which should be used to divide sessions into various sets. Then, the session sets will be optimized further, with the help of session reconstruction, namely union and rupture. It has been proved through experiment that the session set which is attained by the above method is more faithful.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jia-wei, H., Xiao-feng, M., Jing, A.: Research on web mining. Journal of computer research & development 38(4), 405–414 (2001)
Facca, M., Lanzi, P.L.: Mining Interstiong Knowledge from Weblogs: A Survey. Data and Knowledge Engineering 53(3), 225–241 (2005)
Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Min2ing World Wide Web Browsing Patterns. Knowledge and Information system 1(1), 5–32 (1999)
Fu, Y., Sandhu, K., Shih, M.: A generalization - Based Approachto Clustering of Web Usage Session. In: Masand, B., Spiliopoulou, M. (eds.) WebKDD 1999. LNCS (LNAI), vol. 1836, pp. 21–28. Springer, Heidelberg (2000)
Spiliopoulou, M., Mobasher, B., Berendt, B., et al.: A Framework for the Evaluation of Session Reconstruction Heuristics in WebUsage Analysis. INFORMS Journal of Computing 15(2), 171–179 (2004)
Chen, M.S., Park, J.S., Yu, P.S.: Data Mining for Path Traversal Patterns in a Web Environment. In: Proc 16th Int ’l Conf. Distributed Computing System ( ICDCS 1996), pp. 385–392. IEEE CS Press, Los Alamitos (1996)
Xianliang, Y., Wei, Z.: An Improved Session Identification Method in Web Mining. Huazhong University of Science and Technology Journal (natural science edition) 7, 33–35 (2006)
Fang, Y., Lijuan, W.: Practical Data Mining. Electronic Industry Press, Beijing (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fang, Y., Huang, Z. (2010). An Improved Algorithm for Session Identification on Web Log. In: Wang, F.L., Gong, Z., Luo, X., Lei, J. (eds) Web Information Systems and Mining. WISM 2010. Lecture Notes in Computer Science, vol 6318. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16515-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-16515-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16514-6
Online ISBN: 978-3-642-16515-3
eBook Packages: Computer ScienceComputer Science (R0)