Skip to main content

Miscoding Alerts Within Hospital Datasets: An Unsupervised Machine Learning Approach

  • Conference paper
  • First Online:
Trends and Advances in Information Systems and Technologies (WorldCIST'18 2018)

Abstract

The appropriate funding of hospital services may depend upon grouping hospital episodes into Diagnosis Related Groups (DRGs). DRGs rely on the quality of clinical data held in administrative healthcare databases, mainly proper diagnoses and procedure codes. This work proposes a methodology based on unsupervised machine learning and statistical methods to generate alerts of suspect cases of up- and under-coding in healthcare administrative databases. The administrative database, with a DRG assigned to each hospital episode, was split into homogeneous patient subgroups by applying decision tree-based algorithms. The proportions of specific diagnosis and procedure codes were compared within targeted subgroups to identify hospitals with abnormal distributions. Preliminary results indicate that the proposed methodology has the potential to automatically identify upcoding and under-coding suspect cases, as well as other relevant types of discrepancies regarding coding practices. Nevertheless, additional evaluation under the medical perspective need to be incorporated in the methodology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cheng, P., Gilchrist, A., Robinson, K.M., Paul, L.: The risk and consequences of clinical miscoding due to inadequate medical documentation: a case study of the impact on health services funding. Health Inf. Manage. J. 38, 35–46 (2009)

    Google Scholar 

  2. Mathauer, I., Wittenbecher, F.: Hospital payment systems based on diagnosis-related groups: experiences in low- and middle-income countries. Bull. World Health Organ. 91(10), 746–756 (2013)

    Article  Google Scholar 

  3. Busse, R., Geissler, A., Quentin, W., Wiley, M.: Diagnosis Related Group in Europe. Moving towards transparency, efficiency and quality in hospitals. McGraw Hill, New York (2011)

    Google Scholar 

  4. Fetter, R.B., Thompson, J.D., Mills, R.E.: A system for cost and reimbursement control in hospitals. Yale J Biol Med. 49, 123–136 (1976)

    Google Scholar 

  5. Luo, W., Gallagher, M.: Unsupervised DRG upcoding detection in healthcare databases. In: IEEE International Conference on Data Mining Workshops (ICDMW), pp. 600–605. IEEE (2010)

    Google Scholar 

  6. Dafny L.S: How Do Hospitals Respond to Price Changes? In: National Bureau of Economic Research, Inc., NBER Working Papers: 9972 (2003)

    Google Scholar 

  7. Reid, B., Palmer, G.R., Aisbett, C.: Under-coding in Australia limits the performance of DRG groupers. Health Inf. Manage. 29, 113–117 (1999)

    Google Scholar 

  8. Freitas, A., Lema, I., da Costa-Pereira, A.: Comorbidity coding trends in hospital administrative databases. In: New Advances in Information Systems and Technologies, pp, 609–617. Springer, Cham (2016)

    Chapter  Google Scholar 

  9. Bauder, R., Khoshgoftaar, T.M., Seliya, N.: A survey on the state of healthcare upcoding fraud analysis and detection. Health Serv. Outcomes Res. Method. 17, 31–55 (2017)

    Article  Google Scholar 

  10. Bauder, R.A., Khoshgoftaar, T.M.: A probabilistic programming approach for outlier detection in healthcare claims. In: 15th IEEE International Conference on Machine Learning And Applications (ICMLA), New York: IEEE, pp. 347–54 (2016)

    Google Scholar 

  11. Suresh, N., de Traversay, J., Gollamudi, H., Pathria, A., Tyler, M.: Detection of Upcoding and Code Gaming Fraud and Abuse in Prospective Payment Healthcare Systems. US Patent 8,666,757 (2014)

    Google Scholar 

  12. Averill, R.F., Goldfield, N., Hughes, J.S., et al.: All Patient Refined Diagnosis Related Groups Methodology Overview 3 M Health Information Systems. https://www.hcup- us.ahrq.gov/db/nation/nis/APR-DRGsV20MethodologyOverviewandBibliography.pdf (2003)

  13. Administração Central do Sistema de Saúde (ACSS). https://www.acss.min-saude.pt/wp-content/uploads/2016/12/Portaria_207_2017-1.pdf. (last accessed 2018/01/07)

  14. Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)

    Google Scholar 

  15. Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization. In: 15th International Conference on Machine Learning, pp. 144–151 (1998)

    Google Scholar 

  16. Shi, H.: Best-first Decision Tree Learning. Hamilton, New Zealand (2007)

    Google Scholar 

  17. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth International Group, Belmont (1984)

    MATH  Google Scholar 

  18. Kalmegh, S.: Analysis of WEKA data mining algorithm REPTree, simple cart and randomtree for classification of indian news. Int. J. Innov. Sci. Eng. Technol. 2, 438–446 (2015)

    Google Scholar 

  19. Stiglic, G., et al.: Comprehensive Decision Tree Models in Bioinformatics. PLosONE 7(3), e33812 (2012)

    Article  Google Scholar 

  20. Podgorelec, V., Kokol, P., Stiglic, B., Rozman, I.: Decision trees: an overview and their use in medicine. J. Med. Syst. 26, 445–463 (2002)

    Article  Google Scholar 

  21. Azar, A.T., El-Metwally, S.M.: Decision tree classifiers for automated medical diagnosis. Neural Comput. Appl. 23, 2387–2403 (2013)

    Article  Google Scholar 

  22. Warrens, M.J.: On Association Coefficients for 2 × 2 Tables and Properties That Do Not Depend on the Marginal Distributions. Psychometrika 73, 777–789 (2008)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgments

Project NORTE-01-0145-FEDER-000016 (NanoSTIMA) is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF). The authors would also like to thank the Central Authority for Health Services, I.P. (ACSS) for providing access to the data.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Julio Souza .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Souza, J., Santos, J.V., Lopes, F., Viana, J., Freitas, A. (2018). Miscoding Alerts Within Hospital Datasets: An Unsupervised Machine Learning Approach. In: Rocha, Á., Adeli, H., Reis, L., Costanzo, S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 746. Springer, Cham. https://doi.org/10.1007/978-3-319-77712-2_115

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77712-2_115

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77711-5

  • Online ISBN: 978-3-319-77712-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics