Skip to main content

Extracting Diagnostic Knowledge from Hepatitis Dataset by Decision Tree Graph-Based Induction

  • Conference paper
Active Mining

Abstract

We have proposed a method called Decision Tree Graph-Based Induction (DT-GBI), which constructs a classifier (decision tree) for graph-structured data while simultaneously constructing attributes for classification. Graph-Based Induction (GBI) is utilized in DT-GBI for efficiently extracting typical patterns from graph-structured data by stepwise pair expansion (pairwise chunking). Attributes, i.e., substructures useful for classification task, are constructed by GBI on the fly while constructing a decision tree in DT-GBI. We applied DT-GBI to four classification tasks of hepatitis data using only the time-series data of blood inspection and urinalysis, which was provided by Chiba University Hospital. In the first and second experiments, the stages of fibrosis were used as classes and a decision tree was constructed for discriminating patients with F4 (cirrhosis) from patients with the other stages. In the third experiment, the types of hepatitis (B and C) were used as classes, and in the fourth experiment the effectiveness of interferon therapy was used as class label. The preliminary results of experiments, both constructed decision trees and their predictive accuracies, are reported in this paper. The validity of extracted patterns is now being evaluated by the domain experts (medical doctors). Some of the patterns match experts’ experience and the overall results are encouraging.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth & Brooks/Cole Advanced Books & Software (1984)

    Google Scholar 

  2. Fortin, S.: The graph isomorphism problem (1996)

    Google Scholar 

  3. Ho, T.B., Nguyen, T.D., Kawasaki, S., Le, S.Q., Nguyen, D.D., Yokoi, H., Takabayashi, K.: Mining hepatitis data with temporal abstraction. In: Proc. of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2003, pp. 369–377 (2003)

    Google Scholar 

  4. Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Knowledge discovery from structured data by beam-wise graph-based induction. In: Ishizuka, M., Sattar, A. (eds.) PRICAI 2002. LNCS (LNAI), vol. 2417, pp. 255–264. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Ohsaki, M., Sato, Y., Yokoi, H., Yamaguchi, T.: A rule discovery support system for sequential medical data - in the case study of a chronic hepatitis dataset -. In: Working note of International Workshop on Active Mining (AM2002), pp. 97–102 (2002)

    Google Scholar 

  6. Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)

    Google Scholar 

  7. Quinlan, J.R.: C4.5:Programs For Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)

    Google Scholar 

  8. Read, R.C., Corneil, D.G.: The graph isomorphism disease. Journal of Graph Theory 1, 339–363 (1977)

    Article  MATH  MathSciNet  Google Scholar 

  9. Tsumoto, S., Takabayashi, K., Nagira, M., Hirano, S.: Trend-evaluation multiscale analysis of the hepatitis dataset. In: Project “Realization of Active Mining in the Era of Information Flood” Report, pp. 191–197 (March 2003)

    Google Scholar 

  10. Warodom, G., Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Classifier construction by graph-based induction for graph-structured data. In: Whang, K.-Y., Jeon, J., Shim, K., Srivastava, J. (eds.) PAKDD 2003. LNCS (LNAI), vol. 2637, pp. 52–62. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  11. Warodom, G., Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Performance evaluation of decision tree graph-based induction. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds.) DS 2003. LNCS (LNAI), vol. 2843, pp. 128–140. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  12. Yamada, Y., Suzuki, E., Yokoi, H., Takabayashi, K.: Decision-tree induction from time-series data based on a standard-example split test. In: Proc. of the 12th International Conference on Machine Learning, August 2003, pp. 840–847 (2003)

    Google Scholar 

  13. Yoshida, K., Motoda, H.: Clip: Concept learning from inference pattern. Journal of Artificial Intelligence 75(1), 63–92 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Geamsakul, W. et al. (2005). Extracting Diagnostic Knowledge from Hepatitis Dataset by Decision Tree Graph-Based Induction. In: Tsumoto, S., Yamaguchi, T., Numao, M., Motoda, H. (eds) Active Mining. Lecture Notes in Computer Science(), vol 3430. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423270_8

Download citation

  • DOI: https://doi.org/10.1007/11423270_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26157-5

  • Online ISBN: 978-3-540-31933-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics