A Machine Learning Approach to Comment Toxicity Classification

Chakrabarty, Navoneel

doi:10.1007/978-981-13-9042-5_16

Navoneel Chakrabarty¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 999))

2223 Accesses
26 Citations

Abstract

Nowadays, derogatory comments are often made by one another, not only in offline environment but also immensely in online environments like social networking websites and online communities. So, an Identification combined with Prevention System in all social networking websites and applications, including all the communities, existing in the digital world is a necessity. In such a system, the Identification Block should identify any negative online behavior and should signal the Prevention Block to take action accordingly. This study aims to analyze any piece of text and detect different types of toxicity like obscenity, threats, insults and identity-based hatred. The labeled Wikipedia Comment Dataset prepared by Jigsaw is used for the purpose. A 6-headed Machine Learning tf–idf Model has been made and trained separately, yielding a Mean Validation Accuracy of 98.08% and Absolute Validation Accuracy of 91.64%. Such an Automated System should be deployed for enhancing the healthy online conversation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Coversation AI Team. https://conversationai.github.io/
Perspective API. https://perspectiveapi.com/#/
Georgakopoulos, S.V., Tasoulis, S.K., Vrahatis, A.G., Plagianakos, V.P.: Convolutional neural networks for toxic comment classification. In: 10th Hellenic Conference on Artificial Intelligence (2018)
Google Scholar
Khieu, K., Narwal, N.: Detecting and classifying toxic comments. https://web.stanford.edu/class/cs224n/reports/6837517.pdf
Chu, T., Jue, K., Wang, M.: “Comment abuse classification with deep learning. https://web.stanford.edu/class/cs224n/reports/2762092.pdf
Kohli, M., Kuehler, E., Palowitch, J.: Paying attention to toxic comments online. https://web.stanford.edu/class/cs224n/reports/6856482.pdf
https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data

Download references

Author information

Authors and Affiliations

Jalpaiguri Government Engineering College, Jalpaiguri, West Bengal, India
Navoneel Chakrabarty

Authors

Navoneel Chakrabarty
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Navoneel Chakrabarty .

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Howrah, West Bengal, India
Asit Kumar Das
Department of Computer Science and Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
Janmenjoy Nayak
Department of Computer Application, Veer Surendra Sai University of Technology, Burla, Sambalpur, Odisha, India
Bighnaraj Naik
Department of Bioinformatics, Maulana Abul Kalam Azad University of Technology, Kolkata, West Bengal, India
Soumen Kumar Pati
Faculty of Communication Sciences, University of Teramo, Teramo, Italy
Danilo Pelusi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chakrabarty, N. (2020). A Machine Learning Approach to Comment Toxicity Classification. In: Das, A., Nayak, J., Naik, B., Pati, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 999. Springer, Singapore. https://doi.org/10.1007/978-981-13-9042-5_16

Download citation

DOI: https://doi.org/10.1007/978-981-13-9042-5_16
Published: 18 August 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9041-8
Online ISBN: 978-981-13-9042-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics