Skip to main content

Automatic Text Summarization: A New Hybrid Model Based on Vector Space Modelling, Fuzzy Logic and Rhetorical Structure Analysis

  • Conference paper
  • First Online:
Computational Collective Intelligence (ICCCI 2019)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11684))

Included in the following conference series:

Abstract

In this paper, we present a new hybrid system for automatic text summarization. First, vector space modelling is used to compute two original metrics of coverage and fidelity. The latter metrics are combined onto a unified Fidelity-Coverage (F-C) score using fuzzy logic theory. Then, a rhetorical analysis is performed on top of sentences having the highest F-C scores in order to achieve coherence. Conducted experiments on the Timeline17 dataset show that the proposed system outperforms state of the art extractive summarization models. Also, generated abstracts generally satisfy the three criteria of a good summary, namely coverage, fidelity and coherence.

Supported by the Canadian Social Sciences and Humanities Research Council.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mehdi A., et al.: Text summarization techniques: a brief survey. J. Comput. Lang. abs/1707.02268 (2017)

    Google Scholar 

  2. Andhale, N., Bewoor, L.A.: An overview of Text Summarization techniques. In: 2016 International Conference on Computing Communication Control and Automation (ICCUBEA), Pune 2016, pp. 1–7 (2016)

    Google Scholar 

  3. Yogan, J.K., Ong Sing, G., Halizah, B., Ngo, H.C., Puspalata, C.S.: A review on automatic text summarization approaches. J. Comput. Sci. 12(4), 178–190 (2016)

    Article  Google Scholar 

  4. Nenkova, A., Vanderwende, L., McKeown, K.: A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization. In: the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, USA 2006, pp. 573–580 (2006)

    Google Scholar 

  5. Nenkova, A., Vanderwende, L.: The impact of frequency on summarization. Microsoft Research (2005)

    Google Scholar 

  6. Filatova, E., Hatzivassiloglou, V.: A formal model for information selection in multi-sentence text extraction. In: The 20th International Conference on Computational Linguistics, 2004, pp. 397–403 (2004)

    Google Scholar 

  7. Fung, P., Ngai, G.: One story, one flow: hidden Markov story models for multilingual multidocument summarization. In: The ACM Transactions on Speech and Language Processing, 2006, pp. 1–16 (2006)

    Article  Google Scholar 

  8. Galley, M.: A skip-chain conditional random field for ranking meeting utterances by importance. In: The Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, (NLP 2006), pp. 364–372 (2006)

    Google Scholar 

  9. Gupta, V., Lehal, G.S.: A survey of text summarization extractive techniques. J. Emerg. Technol. Web Intell. 2(3), 258–268 (2010)

    Google Scholar 

  10. Svore, K.M., Vanderwende, L., Burges, C.J.: Enhancing single-document summarization by combining RankNet and third-party sources. Microsoft Corporation (2007)

    Google Scholar 

  11. Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M.: Learning to rank using gradient descent. In: Proceedings of the 22nd International Conference on Machine Learning, (CML 2005), ACM, pp. 89–96 (2005)

    Google Scholar 

  12. Hannah, M.E., Mukherjee, S.: A classification-based summarisation model for summarising text documents. Int. J. Inf. Commun. Technol. 6, 292–308 (2014)

    Google Scholar 

  13. Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Mani, I., Maybury, M.T. (eds.) Advances in Automatic Text Summarization, pp. 111–121. The MIT Press, Cambridge (1999)

    Google Scholar 

  14. Kundi, F.M., Ahmad, S., Khan, A., Asghar, M.Z.: Detection and scoring of internet slangs for sentiment analysis using SentiWordNet. Life Sci. J. 11, 66–72 (2014)

    Google Scholar 

  15. Mann, W.C., Thompson, S.A.: Rhetorical structure theory: toward a functional theory of text organization. Interdisc. J. Study Discourse 8(3), 243–281 (1988)

    Google Scholar 

  16. Tran, G.B., Tran, T.A., Tran, N.K., Alrifai, M., Kanhabua, N.: Leverage learning to rank in an optimization framework for timeline summarization. In: TAIA workshop, SIGIR 2013 (2013)

    Google Scholar 

  17. Luhn, H.P.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)

    Article  MathSciNet  Google Scholar 

  18. Mihalcea, R., Tarau, P.: TextRank: bringing order into texts. In: Lin, D., Wu, D. (eds.) Proceedings of EMNLP 2004, Association for Computational Linguistics, Barcelona, Spain, Association for Computational Linguistics, Barcelona, Spain, pp. 404–411 (2004)

    Google Scholar 

  19. Erkan, G., Radev, D.R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22(5), 457–479 (2004)

    Article  Google Scholar 

  20. Torres-Moreno, J.-M.: Automatic Text Summarization. Wiley, London (2014)

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alaidine Ben Ayed .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ben Ayed, A., Biskri, I., Meunier, JG. (2019). Automatic Text Summarization: A New Hybrid Model Based on Vector Space Modelling, Fuzzy Logic and Rhetorical Structure Analysis. In: Nguyen, N., Chbeir, R., Exposito, E., Aniorté, P., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2019. Lecture Notes in Computer Science(), vol 11684. Springer, Cham. https://doi.org/10.1007/978-3-030-28374-2_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-28374-2_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-28373-5

  • Online ISBN: 978-3-030-28374-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics