Skip to main content

An Empirical Study of Text Features for Identifying Subjective Sentences in Portuguese

  • Conference paper
  • First Online:
Intelligent Systems (BRACIS 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13074))

Included in the following conference series:

Abstract

Studies in sentiment analysis have examined how different features are effective in identifying subjective sentences. Several studies on sentiment analysis exist in the literature that have already performed some evaluation of NLP classifications. However, the vast majority of them did not handle texts in the Brazilian Portuguese language, and there is no one to consider the combination of sets of text features of NLP tasks with classifiers. Therefore, in our investigation, we combined empirical features to identify subjective sentences in Portuguese and provide a comprehensive analysis of each set of features’ relative importance using a representative set of user reviews.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://auto.gluon.ai.

  2. 2.

    The Python package version was v3.0.0 with model pt_core_news_lg, except for the COMP and SUP POS tags, which were extracted with v2.2.0 and model pt_core_news_sm.

  3. 3.

    https://universaldependencies.org/u/pos.

  4. 4.

    https://www.ime.usp.br/~pf/dicios/index.html.

  5. 5.

    https://scikit-learn.org/stable.

References

  1. Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.J.: Sentiment analysis of twitter data. In: Proceedings of the Workshop on Language in Social Media (LSM 2011), pp. 30–38 (2011)

    Google Scholar 

  2. Almatarneh, S., Gamallo, P.: Linguistic features to identify extreme opinions: an empirical study. In: Yin, H., Camacho, D., Novais, P., Tallón-Ballesteros, A.J. (eds.) IDEAL 2018. LNCS, vol. 11314, pp. 215–223. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03493-1_23

    Chapter  Google Scholar 

  3. Almatarneh, S., Gamallo, P.: Comparing supervised machine learning strategies and linguistic features to search for very negative opinions. Information 10(1), 16 (2019)

    Article  Google Scholar 

  4. Balage Filho, P., Pardo, T.A.S., AluĂ­sio, S.: An evaluation of the Brazilian Portuguese liwc dictionary for sentiment analysis. In: Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology (2013)

    Google Scholar 

  5. Belisário, L.B., Ferreira, L.G., Pardo, T.A.S.: Evaluating richer features and varied machine learning models for subjectivity classification of book review sentences in portuguese. Information 11(9), 437 (2020)

    Article  Google Scholar 

  6. Cambria, E., Li, Y., Xing, F.Z., Poria, S., Kwok, K.: Senticnet 6: ensemble application of symbolic and subsymbolic ai for sentiment analysis. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 105–114 (2020)

    Google Scholar 

  7. Carvalho, C.M.A., Nagano, H., Barros, A.K.: A comparative study for sentiment analysis on election Brazilian news. In: Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology, pp. 103–111 (2017)

    Google Scholar 

  8. Chai, Y., Lei, C., Yin, C.: Study on the influencing factors of online learning effect based on decision tree and recursive feature elimination. In: Proceedings of the 10th International Conference on E-Education, E-Business, E-Management and E-Learning, pp. 52–57 (2019)

    Google Scholar 

  9. Chaturvedi, I., Cambria, E., Welsch, R.E., Herrera, F.: Distinguishing between facts and opinions for sentiment analysis: survey and challenges. Inf. Fusion 44, 65–77 (2018)

    Article  Google Scholar 

  10. Chenlo, J.M., Losada, D.E.: A machine learning approach for subjectivity classification based on positional and discourse features. In: Lupu, M., Kanoulas, E., Loizides, F. (eds.) IRFC 2013. LNCS, vol. 8201, pp. 17–28. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41057-4_3

    Chapter  Google Scholar 

  11. Chenlo, J.M., Losada, D.E.: An empirical study of sentence features for subjectivity and polarity classification. Inf. Sci. 280, 275–288 (2014)

    Article  Google Scholar 

  12. Freitas, C., Motta, E., Milidiú, R., César, J.: Vampiro que brilha... rá! desafios na anotaçao de opiniao em um corpus de resenhas de livros. Encontro de Linguística de Corpus 11, 22 (2012)

    Google Scholar 

  13. Freitas, L.A.D., Vieira, R.: Ontology-based feature-level sentiment analysis in Portuguese reviews. Int. J. Bus. Inf. Syst. 32(1), 30–55 (2019)

    Google Scholar 

  14. Ge, P.: Analysis on approaches and structures of automated machine learning frameworks. In: 2020 International Conference on Communications, Information System and Computer Engineering (CISCE), pp. 474–477. IEEE (2020)

    Google Scholar 

  15. Hogenboom, A., Bal, D., Frasincar, F., Bal, M., De Jong, F., Kaymak, U.: Exploiting emoticons in polarity classification of text. J. Web Eng. 14(1 & 2), 22–40 (2015)

    Google Scholar 

  16. Honnibal, M., Montani, I.: spacy 2: natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing. To appear 7(1), 411–420 (2017)

    Google Scholar 

  17. Kimura, M., Katsurai, M.: Investigating the consistency of emoji sentiment lexicons constructed using different languages. In: Proceedings of the 20th International Conference on Information Integration and Web-based Applications & Services, pp. 310–313 (2018)

    Google Scholar 

  18. Mansour, R., Hady, M.F.A., Hosam, E., Amr, H., Ashour, A.: Feature selection for twitter sentiment analysis: an experimental study. In: Gelbukh, A. (ed.) CICLing 2015. LNCS, vol. 9042, pp. 92–103. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18117-2_7

    Chapter  Google Scholar 

  19. Moraes, S.M.W., Santos, A.L.L., Redecker, M., Machado, R.M., Meneguzzi, F.R.: Comparing approaches to subjectivity classification: a study on Portuguese tweets. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds.) PROPOR 2016. LNCS (LNAI), vol. 9727, pp. 86–94. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-41552-9_8

    Chapter  Google Scholar 

  20. Oliveira, M., Melo, T.: Investigando features de sentenças para classificação de subjetividade e polaridade em português do brasil. In: Anais do XVII Encontro Nacional de Inteligência Artificial e Computacional, pp. 270–281. SBC (2020)

    Google Scholar 

  21. Palshikar, G., Apte, M., Pandita, D., Singh, V.: Learning to identify subjective sentences. In: Proceedings of the 13th International Conference on Natural Language Processing, pp. 239–248 (2016)

    Google Scholar 

  22. Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic inquiry and word count: Liwc 2001. Mahway: Lawrence Erlbaum Assoc. 71(2001), 2001 (2001)

    Google Scholar 

  23. Pereira, D.A.: A survey of sentiment analysis in the Portuguese language. Artif. Intell. Rev. 54(2), 1087–1115 (2021)

    Article  Google Scholar 

  24. Poria, S., Cambria, E., Winterstein, G., Huang, G.B.: Sentic patterns: dependency-based rules for concept-level sentiment analysis. Knowl.-Based Syst. 69, 45–63 (2014)

    Article  Google Scholar 

  25. Turney, P.: Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 417–424 (2002)

    Google Scholar 

Download references

Acknowledgments

The authors are thankful for the financial and material support provided by the Amazonas State Research Support Foundation (FAPEAM) through Project PPP 04/2017 and for the assistance of the Intelligent Systems Laboratory (LSI) of the Amazonas State University (UEA). We also gratefully acknowledge the support provided by the Gratificação de Produtividade Acadêmica (GPA) of the Amazonas State University (Portaria 086/2021).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Miguel de Oliveira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

de Oliveira, M., de Melo, T. (2021). An Empirical Study of Text Features for Identifying Subjective Sentences in Portuguese. In: Britto, A., Valdivia Delgado, K. (eds) Intelligent Systems. BRACIS 2021. Lecture Notes in Computer Science(), vol 13074. Springer, Cham. https://doi.org/10.1007/978-3-030-91699-2_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-91699-2_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-91698-5

  • Online ISBN: 978-3-030-91699-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics