Skip to main content

KBIS: Using Domain Knowledge to Guide Instance Selection

  • Chapter
Instance Selection and Construction for Data Mining

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 608))

Abstract

Prior to mining data for knowledge, selecting a potentially useful set of target data is necessary. Mining with missing attribute values increases uncertainty and decreases discovery accuracy. We present an instance selection method that determines the mining usability of an instance based on knowledge about which attributes are missing and the relative significance of the various attributes as defined by a domain expert. Knowledge-based instance selection (KbIS) is an instance utility metric that incorporates domain knowledge into a multi-criteria decision-making technique for instance selection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Bosc, P. and Lietard, M. 1997. Quantified statements and some interpretations for the OWA operator. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp. 241–257.

    Chapter  Google Scholar 

  • Brachman, R. J. and T. Anand. 1996. The process of knowledge discovery in databases: A human-centered approach. In Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). Menlo Park, CA: AAAI Press, pp. 37–57.

    Google Scholar 

  • Cheeseman, P., J. Kelly, M. Self, J. Stutz, W. Taylor, and D. Freeman. 1988. AutoClass: A Bayesian classification system. Proc. 5th Int. Conf. on Machine Learning, pp. 54–64.

    Google Scholar 

  • Famili, A. <Fazel.Famili©iit.nrc.ca>. Personal communication about 20% heuristic method. (3 November 1997).

    Google Scholar 

  • Famili, A., W. M. Shen, R. Weber, and E. Simoudis. 1997. Data preprocessing and intelligent data analysis. Intelligent Data Analysis 1(1): <http://www.elsevier.com/locate/da> (21 January 1997).

    Google Scholar 

  • Fayyad, U. M., G. Piatetsky-Shapiro, and P. Smyth. 1996. From data mining to knowledge discovery: An overview. In Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). Menlo Park, CA: AAAI Press, pp. 1–34.

    Google Scholar 

  • Grinstein, G. G. 1996. Harnessing the human in knowledge discovery. In KDD-96, Proc. 2nd Int. Conf. on Knowledge Discovery and Data Mining, Portland, Oregon, pp. 384–385.

    Google Scholar 

  • Holsheimer, M. and A. Siebes. 1994. Data mining: The search for knowledge in databases. Amsterdam, The Netherlands: CWI, Report No. CS-R9406.

    Google Scholar 

  • Huber, P. J. 1997. From large to huge: A statistician’s reactions to KDD and DM. KDD-97, Proc. 3rd Int. Conf. on Knowledge Discovery and Data Mining, Newport Beach, California., pp. 304–308.

    Google Scholar 

  • Kacprzyk, J., M. Fedrizzi, and H. Nurmi. 1997. OWA operators in group decision making and consensus reaching under fuzzy preferences and fuzzy majority. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp.193–206.

    Chapter  Google Scholar 

  • Liu, B. and W. Hsu. 1996. Post-Analysis of learned rules. Proc. 13th Natl. Conf. on AI and 8th Innovative Appl. of AI, pp. 164–168.

    Google Scholar 

  • Mitchell, T. M. 1997. Machine Learning. Boston, MA: McGraw-Hill.

    MATH  Google Scholar 

  • Owrang, M. M. and F. H. Grupe. 1996. Using domain knowledge to guide database knowledge discovery. Exp. Syst. with Appl. 10(2):173–180.

    Article  Google Scholar 

  • Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. San Mateo, CA: Morgan Kaufmann.

    Google Scholar 

  • Weiss, S. M. and N. Indurkhya. 1997. Predictive Data Mining: A Practical Guide. San Francisco, CA: Morgan Kaufmann.

    Google Scholar 

  • Yager, R. R. 1998. On ordered weighted averaging aggregation operators in multi-criteria decision making. IEEE Trans. on Syst., Man, and Cybern. 18:183–190.

    Article  MathSciNet  Google Scholar 

  • Yager, R. R. 1993. On ordered weighted averaging aggregation operators. In Readings in Fuzzy Sets for Intelligent Systems, D. Dubois, H. Prade, and R. R. Yager (Eds.). San Mateo, CA: Morgan Kaufmann, pp. 80–87.

    Google Scholar 

  • Yager, R. R. 1997a. Criteria importances in OWA aggregation. Proc. 6th Int. Conf. on Fuzzy Syst., Barcelona Spain, pp. 1677–1682.

    Google Scholar 

  • Yager, R. R. 1997b. On the inclusion of importances in OWA aggregations. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp. 41–59.

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Wright, P., Hodges, J. (2001). KBIS: Using Domain Knowledge to Guide Instance Selection. In: Liu, H., Motoda, H. (eds) Instance Selection and Construction for Data Mining. The Springer International Series in Engineering and Computer Science, vol 608. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3359-4_15

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-3359-4_15

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-4861-8

  • Online ISBN: 978-1-4757-3359-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics