Abstract
In this paper we propose a clustering-based hybrid approach for multi-document summarization which integrates sentence clustering, local recommendation and global search. For sentence clustering, we adopt a stability-based method which can determine the optimal cluster number automatically. We weight sentences with terms they contain for local sentence recommendation of each cluster. For global selection, we propose a global criterion to evaluate overall performance of a summary. Thus the sentences in the final summary are determined by not only the configuration of individual clusters but also the overall performance. This approach successfully gets top-level performance running on corpus of DUC04.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barzilay, R., McKeown, K.R., Elhadad, M.: Information fusion in the context of multi-document summarization. In: ACL 1999, Maryland (1999)
Blair-Goldensohn, S., Evans, D.: Columbia University at DUC 2004. In: DUC 2004 Workshop, Boston, MA (2004)
Boros, E., Kantor, P.B., Neu, D.J.: A Clustering Based Approach to Creating Multi-Document Summaries. In: DUC 2001 workshop (2001)
Hardy, H., Shimizu, N.: Cross-Document Summarization by Concept Classification. In: SIGIR 2002, pp. 121–128.
Lange, T., Braun, M., Roth, V., Buhmann, J.M.: Stability-Based Model Selection. In: Advances in Neural Information Processing Systems, vol. 15 (2002)
Levine, E., Domany, E.: Resampling Method for Unsupervised Estimation of Clus-ter Calidity. Neural Computation 13, 2573–2593 (2001)
Lin, C.-Y., Hovy, E.: Automatic Evaluation of Summaries Using N-gram Co- Occurrence Statistics. In: Proceedings of the Human Technology Conference (HLTNAACL- 2003), Edmonton, Canada (2003)
Niu, Z., Ji, D., Tan, C.L.: Document Clustering Based on Clus-ter Validation. In: CIKM 2004, Washington, DC, USA, November 8-13 (2004)
Radev, D., Allison, T., Blair-Goldensohn, S., Blitzer, J., Çelebi, A., Dimitrov, S., Drabek, E., Hakim, A., Lam, W., Liu, D., Otterbacher, J., Qi, H., Saggion, H., Teufel, S., Topper, M., Winkel, A., Zhu, Z.: MEAD - a platform for multidocument multilingual text summarization. In: Proceedings of LREC 2004, Lisbon, Portugal (May 2004)
Roth, V., Lange, T.: Feature Selection in Clustering Problems. In: NIPS 2003 workshop (2003)
Siddharthan, A., Nenkova, A., McKeown, K.: Syntactic Simplication for Improving Content Selection in Multi-Document Summarization. In: Proceeding of COLING 2004, Geneva, Switzerland (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nie, Y., Ji, D., Yang, L., Niu, Z., He, T. (2006). Multi-document Summarization Using a Clustering-Based Hybrid Strategy. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_53
Download citation
DOI: https://doi.org/10.1007/11880592_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45780-0
Online ISBN: 978-3-540-46237-8
eBook Packages: Computer ScienceComputer Science (R0)