Skip to main content

The Use of Machine Learning in the Classification of Electronic Lawsuits: An Application in the Court of Justice of Minas Gerais

  • Conference paper
  • First Online:
Intelligent Systems (BRACIS 2020)

Abstract

With the abundance of electronic lawsuits already implemented throughout Brazil, courts have a valuable source of information in text format that constitute attractive bases for the application of Artificial Intelligence (AI) and machine learning (ML). In this research, supervised learning approaches were explored for the automatic classification of types of documents in electronic court proceedings of the Court of Justice of Minas Gerais (TJMG). The methodology is composed of cross-validation within the specific corpus of the legal domain, comparing traditional classifiers and more recent methods based on neural networks and deep learning models, using Glove word vectors generated for the Portuguese Language and Convolutional Neural Network (CNN). This work achieved high precision in the results and if implemented in the courts it can provide significant savings in financial and human resources, allowing lawsuits classification activities, currently done manually by employees, to be performed in seconds by the machine. The result of this experiment shows that the hit rates for the CNN and SVM classifiers exceed 93% and is considered a high result. Based on the assumption that Glove brings extra semantic resources that can help in classifying texts from court proceedings, this work demonstrates Glove’s effectiveness by showing that a CNN with Glove surpasses SVM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The first instance is the first hierarchical jurisdiction, i.e., the first body of Justice to which the citizen must address a dispute resolution request.

  2. 2.

    Legal term referring to the manifestations in processes such as: Initial Petition, Contestation (defense), Embargos, Sentence.

  3. 3.

    Judicial circumscription, under the jurisdiction of one or more judges of law.

  4. 4.

    The Natural Language Toolkit is a set of libraries and programs for symbolic and statistical processing of natural language written in the Python programming language.

  5. 5.

    Open source machine learning library for the Python programming language.

  6. 6.

    https://github.com/adrianocapanema/ClassificationOfElectronicLawsuitsAnApplicationInTheTJMG.

References

  1. Evaluation: from Precision, Recall and F-measure to ROC, Informedness, Markedness and Correlation (2011)

    Google Scholar 

  2. Sulea, O.-M., Zampieri, M., Malmasi, S., Vela, M., Dinu, L.P., van Genabith, J.: Exploring the Use of Text Classification in the Legal Domain (2017)

    Google Scholar 

  3. Mastropaolo, A., Pallante, F., Radicioni, D.P.: Legal documents categorization by compression. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law - ICAIL’13, p. 92. ACM Press, Rome (2013)

    Google Scholar 

  4. Francesconi, E., Passerini, A.: Automatic classification of provisions in legislative texts. Artif. Intell. Law 15, 1–17 (2007). https://doi.org/10.1007/s10506-007-9038-0

    Article  Google Scholar 

  5. Lilleberg, J., Zhu, Y., Zhang, Y.: Support vector machines and Word2vec for text classification with semantic features. In: 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC), pp. 136–140. IEEE, Beijing (2015)

    Google Scholar 

  6. Wei, F., Qin, H., Ye, S., Zhao, H.: Empirical study of deep learning for text classification in legal document review. In: 2018 IEEE International Conference on Big Data (Big Data), pp. 3317–3320 (2018). https://doi.org/10.1109/BigData.2018.8622157

  7. Howe, J.S.T., Khang, L.H., Chai, I.E.: Legal Area Classification: A Comparative Study of Text Classifiers on Singapore Supreme Court Judgments (2019). arXiv:1904.06470 [cs]

  8. Chalkidis, I., Fergadiotis, M., Malakasiotis, P., Aletras, N., Androutsopoulos, I.: Extreme Multi-Label Legal Text Classification: A case study in EU Legislation (2019). arXiv:1905.10892 [cs]

  9. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  10. Kim, Y.: Convolutional neural networks for sentence classification (2014). arXiv preprint arXiv:1408.5882

  11. Agarwal, A., Negahban, S., Wainwright, M.: A simple way to prevent neural networks from overfitting. Ann. Stat. 40, 1171–1197 (2012)

    Article  Google Scholar 

  12. Hartmann, N., Fonseca, E., Shulby, C., Treviso, M., Rodrigues, J., Aluisio, S.: Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks (2017). arXiv:1708.06025 [cs]

  13. Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)

    MathSciNet  MATH  Google Scholar 

  14. Abadi, M., et al.: Tensorflow: A system for large-scale machine learning. In: Symposium on Operating Systems Design and Implementation, pp. 265–283 (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adriano Capanema Silva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Silva, A.C., Maia, L.C.G. (2020). The Use of Machine Learning in the Classification of Electronic Lawsuits: An Application in the Court of Justice of Minas Gerais. In: Cerri, R., Prati, R.C. (eds) Intelligent Systems. BRACIS 2020. Lecture Notes in Computer Science(), vol 12319. Springer, Cham. https://doi.org/10.1007/978-3-030-61377-8_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61377-8_43

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61376-1

  • Online ISBN: 978-3-030-61377-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics