Skip to main content

Mining Natural Language Programming Directives with Class-Oriented Bayesian Networks

  • Conference paper
Advanced Data Mining and Applications (ADMA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5139))

Included in the following conference series:

Abstract

Learning a programming language is a painstaking process, as it requires knowledge of its syntax, apart from knowing the basic process of representing logical sequences to programming stages. This fact deteriorates the coding process and expels most users from programming. Particularly for novice users or persons with vision problems, learning of how to program and tracing the syntax errors could be improved dramatically by using the most natural of all interfaces, i.e. natural language. Towards this orientation, we suggest a wider framework for allowing programming using natural language. The framework can be easily extended to support different object-oriented programming languages such as C, C++, Visual Basic or Java. Our suggested model is named “Language Oriented Basic” and it concerns an intelligent interface that supports code creation, modification and control in Visual Basic. Users can use simple-structured Greek sentences in natural language and the system can output the corresponding syntactic tree. When users declare end of input, the system transforms the syntactic trees to source code. Throughout the whole interaction process, users can check the under-development code in order to verify its correspondence to their expectations. Due to the fact that using natural language can cause a great degree of ambiguity, Bayesian networks and learning from examples have been utilized as an attempt to reason on the most probable programming representation, given a natural language input sentence. In order to enhance the classifier, we propose a novel variation of Bayesian networks that favor the classification process. Experimental results have depicted precision and recall measures in a range of 73% and 70% respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ballard, B., Bierman, A.: Programming in natural language: NLC as a prototype. In: Proceedings of the 1979 Annual Conference of ACM/CSC-ER (1979)

    Google Scholar 

  2. Balzer, R.M.: A Global View Of Automatic Programming. In: Proc. 3rd Joint Conference On Artificial Intelligence, pp. 494–499 (August 1993)

    Google Scholar 

  3. Cooper, J., Herskovits, E.: A Bayesian method for the induction of probabilistic networks from data. Machine Learning 9, 309–347 (1992)

    MATH  Google Scholar 

  4. Dijkstra, E.W.: On The Foolishness Of Natural Language Programming. unpublished report (1978)

    Google Scholar 

  5. Duda, R., Hart, P.: Pattern Classification and Scene Analysis. John Wiley & Sons, New York (1973)

    MATH  Google Scholar 

  6. Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Machine Learning 29, 131–163 (1997)

    Article  MATH  Google Scholar 

  7. Kate, R., Wong, Y., Mooney, R.: Learning to transform natural to formal languages. In: Proceedings of the Twentieth National Conference on Artificial Intelligence (AAAI 2005), Pittsburgh (2005)

    Google Scholar 

  8. Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classifiers. In: Proceedings, Tenth National Conference on Artificial Intelligence, pp. 223–228. AAAI, Menlo Park (1992)

    Google Scholar 

  9. Lieberman, H., Liu, H.: Feasibility studies for programming in natural language. Kluwer Academic Publishers, Dordrecht (2005)

    Google Scholar 

  10. Liu, H., Lieberman, H.: Metafor: Visualizing stories as code. In: ACM Conference on Intelligent User Interfaces (2005)

    Google Scholar 

  11. Mitchell, T.: Machine Learning. Mc Graw-Hill, New York (1997)

    MATH  Google Scholar 

  12. Pane, J., Ratanamahatana, C., Meyers, B.: Studying the language and structure in non-programmers’ solutions to programming problems. International Journal of Human Computer Studies 54(2) (2001)

    Google Scholar 

  13. Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Mateo (1988)

    Google Scholar 

  14. Petrick, S.R.: On Natural Language Based Computer Systems. IBM J. Res. Develop., 314–325 (July 1976)

    Google Scholar 

  15. Simmons, R.F.: Personal Communication at TINLAP-2 Conference, Univ. of Illinois (July 1978)

    Google Scholar 

  16. Suzuki, J.: A construction of Bayesian networks from databases on a MDL scheme. In: Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence, San Francisco, CA, pp. 266–273 (1993)

    Google Scholar 

  17. Woods, W.A.: A Personal View Of Natural Language Understanding. In: Natural Language Interfaces, SIGART, Newsletter, pp. 17–20 (1977)

    Google Scholar 

  18. Voutilainen, A., Heikkila, J., Antitil, A.: Constraint grammar of English. Publication 21, Department of General Linguistics, University of Helsinki, Finland (1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Maragoudakis, M., Cosmas, N., Garbis, A. (2008). Mining Natural Language Programming Directives with Class-Oriented Bayesian Networks. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-88192-6_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88191-9

  • Online ISBN: 978-3-540-88192-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics