Abstract
As document map creation algorithms like WebSOM are computationally expensive, and hardly reconstructible even from the same set of documents, new methodology is urgently needed to allow to construct document maps to handle streams of new documents entering document collection. This challenge is dealt with within this paper. In a multi-stage process, incrementality of a document map is warranted.1 The quality of map generation process has been investigated based on a number of clustering and classification measures. Conclusions concerning the impact of incremental, topic-sensitive approach on map quality are drawn.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
1. CC Aggarwal, F Al-Garawi, and PS Yu. Intelligent crawling on theWorld Wide Web with arbitrary predicates. In Proc. 10th International World Wide Web Conference, pages 96–105, 2001.
2. M.W.Berry, Large scale singular value decompositions, International Journal of Supercomputer Applications, 6(1), 1992, pp.13–49
3. L.N. De Castro, F.J. von Zuben, An evolutionary immune network for data clustering, SBRN′2000, IEEE Computer Society Press, 2000
4. M. Dittenbach, A. Rauber, D. Merkl, Discovering Hierarchical Structure in Data Using the Growing Hierarchical Self-Organizing Map, Neurocomputing, Elsevier, ISSN 0925–2312, 48 (1–4) 2002, pp. 199–216
5. B. Fritzke, Some competitive learning methods, draft available from http://www.neuroinformatik.ruhr-uni-bochum.de/ini/VDM/research/ gsn/JavaPaper
6. B. Fritzke, A growing neural gas network learns topologies, in: G. Tesauro, D.S. Touretzky, and T.K. Leen (Eds.) Advances in Neural Information Processing Systems 7, MIT Press Cambridge, MA, 1995, pp. 625–632
7. B. Fritzke, A self-organizing network that can follow non-stationary distributions, in: Proceedings of the International Conference on Artificial Neural Net works ′97, Springer, 1997, pp. 613–618
8. C. Hung, S.Wermter, A Constructive and Hierarchical Self-Organising Model in A Non-Stationary Environment, International Joint Conference in Neural Networks, 2005
9. M. Kłopotek, M. Dramiński, K. Ciesielski, M. Kujawiak, S.T. Wierzchoń, Mining document maps, in Proceedings of Statistical Approaches toWeb Mining Workshop (SAWM) at PKDD′04, M. Gori, M. Celi, M. Nanni eds., Pisa, 2004, 87–98
10. K. Ciesielski, M. Dramiński, M. Kłopotek, M. Kujawiak, S. Wierzchoń: Architecture for graphical maps of Web contents, in Proc.WISIS′2004, Warsaw, 2004
11. K. Ciesielski, M. Dramiński, M. Kłopotek, M. Kujawiak, S. Wierzchoń: Mapping document collections in non-standard geometries. B. De Beats, R. De Caluwe, G. de Tre, J. Fodor, J. Kacprzyk, S. Zadroÿzny (eds): Current Issues in Data and Knowledge Engineering. Akademicka Warszawa, 2004, pp.122–132
12. K. Ciesielski, M. Dramiński, M. Kłopotek, M. Kujawiak, S.T. Wierzchoń, On some clustering algorithms for Document Maps Creation, to appear in: Proceedings of the Intelligent Information Processing and Web Mining (IIS:IIPWM- 2005), Gdansk, 2005
13. M. Kłopotek, S. Wierzchoń, K. Ciesielski, M. Dramiński, D. Czerski, M. Kujawiak: Understanding Nature of Map Representation of Document Collections Map Quality Measurements, to appear in Proceeding of International Conference on Artificial Intelligence, Siedlce, September 2005
14. M. Kłopotek: A New Bayesian Tree Learning Method with Reduced Time and Space Complexity, Fundamenta Informaticae, 49(4)2002, IOS Press, 349–367
15. T. Kohonen, Self-Organizing Maps. Springer Series in Information Sciences, Vol. 30, Springer, Berlin, Heidelberg, New York, 2001. Third Extended Edition, 501 pages. ISBN 3-540-67921-9, ISSN 0720-678X
16. B. C., K. Livesay, and K. Lund. Explorations in context space: Words, sentences, discourse Discourse Processes, 25(2-3):211–257, 1998
17. A. Rauber, Cluster Visualization in Unsupervised Neural Networks, Diplomarbeit, Technische Universitt Wien, Austria, 1996
18. J. Timmis, aiVIS: Artificial ImmuneNetwork Visualization, in: Proceedings of EuroGraphics UK 2001 Conference, Univeristy College London 2001, pp.61–69
19. T. Zhang, R. Ramakrishan, M. Livny, BIRCH: E.cient Data Clustering Method for Large Databases, in: Proceedings of ACM SIGMOD International Conference on Data Management, 1997
20. Y. Zhao, G. Karypis, Criterion functions for document Clustering: Experiments and analysis, available at URL: http://www-users.cs.umn.edu/~karypis/ publications/ir.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer
About this paper
Cite this paper
Ciesielski, K., Dramiński, M., Kłopotek, M.A., Czerski, D., Wierzchoń, S.T. (2006). Adaptive Document Maps. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds) Intelligent Information Processing and Web Mining. Advances in Soft Computing, vol 35. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-33521-8_11
Download citation
DOI: https://doi.org/10.1007/3-540-33521-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33520-7
Online ISBN: 978-3-540-33521-4
eBook Packages: EngineeringEngineering (R0)