Skip to main content

Entity-Based Short Text Classification Using Convolutional Neural Networks

  • Conference paper
  • First Online:
Knowledge Engineering and Knowledge Management (EKAW 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12387))

Abstract

It is beyond human capabilities to analyze a huge amount of short text produced on the World Wide Web in the form of search queries, social media platforms, etc. Due to many difficulties underlying short text for automated processing, i.e, sparsity and insufficient context, the traditional text classification approaches cannot easily be applied to short text. This study discusses a Convolutional Neural Network (CNN) based approach for short text classification. Given a short text, the model generates the text representation by leveraging words together with the entities. To validate the effectiveness of the model, several experiments have been conducted on different datasets. The results suggest that the proposed model is capable of performing short text classification with a high accuracy and outperforms the baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://tagme.d4science.org/tagme/.

  2. 2.

    https://www.cs.york.ac.uk/semeval-2013/task2/.

  3. 3.

    http://www.cs.cornell.edu/people/pabo/movie-review-data/.

  4. 4.

    http://cogcomp.cs.illinois.edu/Data/QA/QC/.

  5. 5.

    http://groups.di.unipi.it/~gulli/AG_corpus_of_news_articles.html.

  6. 6.

    https://wikipedia2vec.github.io/wikipedia2vec/pretrained/.

References

  1. Chen, J., Hu, Y., Liu, J., Xiao, Y., Jiang, H.: Deep short text classification with knowledge powered attention. In: AAAI (2019)

    Google Scholar 

  2. Chen, Z., Tang, Y., Zhang, Z., Zhang, C., Wang, L.: Sentiment-aware short text classification based on convolutional neural network and attention. In: IEEE - ICTAI (2019)

    Google Scholar 

  3. Ferragina, P., Scaiella, U.: TAGME: on-the-fly annotation of short text fragments (by Wikipedia entities). In: CIKM (2010)

    Google Scholar 

  4. Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, October 2014, pp. 1746–1751. Association for Computational Linguistics (2014)

    Google Scholar 

  5. Kowsari, K., Meimandi, K.J., Heidarysafa, M., Mendu, S., Barnes, L.E., Brown, D.E.: Text classification algorithms: a survey. Information 10(4), 150 (2019)

    Article  Google Scholar 

  6. Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: ICML (2014)

    Google Scholar 

  7. Li, X., Roth, D.: Learning question classifiers: the role of semantic information. Nat. Lang. Eng. 12(3), 229–249 (2006)

    Article  Google Scholar 

  8. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS (2013)

    Google Scholar 

  9. Nakov, P., Kozareva, Z., Ritter, A., Rosenthal, S., Stoyanov, V., Wilson, T.: Semeval-2013 task 2: sentiment analysis in Twitter. CoRR, abs/1912.06806 (2019)

    Google Scholar 

  10. Oshikawa, R., Qian, J., Wang, W.Y.: A survey on natural language processing for fake news detection. CoRR, abs/1811.00770 (2018)

    Google Scholar 

  11. Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L.: Deep contextualized word representations. In: NAACL-HLT (2018)

    Google Scholar 

  12. Rashkin, H., Choi, E., Jang, J.Y., Volkova, S., Choi, Y.: Truth of varying shades: analyzing language in fake news and political fact-checking. In: EMNLP (2017)

    Google Scholar 

  13. Song, Y., Wang, H., Wang, Z., Li, H., Chen, W.: Short text conceptualization using a probabilistic knowledgebase. In: IJCAI. IJCAI/AAAI (2011)

    Google Scholar 

  14. Wang, J., Wang, Z., Zhang, D., Yan, J.: Combining knowledge with deep convolutional neural networks for short text classification. In: Sierra, C. (ed.) IJCAI (2017)

    Google Scholar 

  15. Wang, Z., Wang, H.: Understanding short texts. In: The Association for Computational Linguistics (ACL) (Tutorial), August 2016

    Google Scholar 

  16. Wu, W., Li, H., Wang, H., Zhu, K.Q.: Probase: a probabilistic taxonomy for text understanding. In: ACM SIGMOD (2012)

    Google Scholar 

  17. Xu, J., Cai, Y.: Incorporating context-relevant knowledge into convolutional neural networks for short text classification. In: AAAI (2019)

    Google Scholar 

  18. Yamada, I., et al.: Wikipedia2Vec: an efficient toolkit for learning and visualizing the embeddings of words and entities from Wikipedia. arXiv preprint 1812.06280v3 (2020)

    Google Scholar 

  19. Zeng, J., Li, J., Song, Y., Gao, C., Lyu, M.R., King, I.: Topic memory networks for short text classification. In: EMNLP (2018)

    Google Scholar 

  20. Zhang, X., LeCun, Y.: Text understanding from scratch. CoRR, abs/1502.01710 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mehwish Alam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Alam, M., Bie, Q., Türker, R., Sack, H. (2020). Entity-Based Short Text Classification Using Convolutional Neural Networks. In: Keet, C.M., Dumontier, M. (eds) Knowledge Engineering and Knowledge Management. EKAW 2020. Lecture Notes in Computer Science(), vol 12387. Springer, Cham. https://doi.org/10.1007/978-3-030-61244-3_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61244-3_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61243-6

  • Online ISBN: 978-3-030-61244-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics