Abstract
With rapid growth of the Internet, how to get information from this huge information space becomes an even important problem. In this paper, An Intelligence Document Semantic Indexing System: IDSIS is proposed. Some new technologies are integrated in IDSIS to obtain good performance. IDSIS is composed of four key procedures. A parallel, distributed and configurable Spider is used for information gathering; A multi-hierarchy document classification approach combining the information gain initially processes gathered web documents; A swarm intelligence based document clustering method is used for information organization; A concept-based retrieval interface is applied for user interactive retrieval. IDSIS is a concept-associated document semantic indexing system for information retrieval on Internet.
This research supported by NSFC grant 60073019, 90104021, 60173017 and NSFB grant 4011003.
The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-0-387-35602-0_35
Chapter PDF
Similar content being viewed by others
References
E.Bonabeau,, M.Dorigo, & G. Theraulaz, Swarm Intelligence: From Natural to Artificial Systems, Oxford Univ. Press, New York, 1999
H. Chen, K. J. Lynch, K. Basu, and D. T. Ng. Generating, integrating, and activating thesauri for concept-based document retrieval. IEEE EXPERT, Special Series on Artificial Intelligence in Text-based Information Systems, 8 (2): 25–34, April 1993.
Dong Mingkai, Tian Qijia, Shi Zhongzhi, Web Spider Based on Intelligent Agent. SCl2001, Orlando, P292–296, 2001.
Shaohui Liu, Mingkai Dong, Haijun Zhang, Rong Li, Zhongzhi Shi, An Approach of Multi-hierarchy Text Classification, 2001 International Conferences on Info-tech and Info-net PPOCEEDINGS,Conferences C:95–100
G. Salton, B. Buckley. Term-weighting Approaches in Automatic Text Retrieval. Information Processing and Management, 1998, 24 (5): 513–523
Wu Bin Zheng Yi Liu Shaohui Shi Zhongzhi,CSIM: A Document Clustering Algorithm Based On Swarm Intelligence, To be appeared In Proceedings of Congress on Evolutionary Computation,2002
Zhendong Dong, Qiang Dong, HowNet, http://www.keenage.com
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 IFIP International Federation for Information Processing
About this paper
Cite this paper
Shi, Z., Wu, B., He, Q., Gong, X., Liu, S., Zheng, Y. (2002). IDSIS: Intelligent Document Semantic Indexing System. In: Musen, M.A., Neumann, B., Studer, R. (eds) Intelligent Information Processing. IIP 2002. IFIP — The International Federation for Information Processing, vol 93. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35602-0_33
Download citation
DOI: https://doi.org/10.1007/978-0-387-35602-0_33
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-1031-1
Online ISBN: 978-0-387-35602-0
eBook Packages: Springer Book Archive