Linguistic Habit Graphs Used for Text Representation and Correction

Gadamer, Marcin

doi:10.1007/978-3-319-59060-8_22

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10246)

Included in the following conference series:

International Conference on Artificial Intelligence and Soft Computing

Abstract

This paper introduces a novel associative way of storing, compressing, and processing sentences. Linguistic Habit Graphs (LHG) are introduced as graph models that could be used for spell checking, text correction, proofreading, and compression of sentences. All of these functions are always available with constant computational complexity as a result of the associative way of text processing and of special kinds of connections and graph nodes that make it possible to activate various important relations between letters and words simultaneously for any given context. Furthermore, new algorithms have been developed on top of the proposed graph structure to provide effective text analysis and contextual text correction. These algorithms can properly locate, and often automatically correct, typical mistakes in texts written in the language for which the graph was built.
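
The idea of correcting a word by combining letter-level similarity with word-level context can be illustrated with a small toy model. The sketch below is not the LHG structure or any algorithm from the paper, whose details are not given in this abstract. It is a minimal illustration under stated assumptions: the class `ToyHabitGraph`, the helper `within_one_edit`, and the choice of a one-edit similarity test are all hypothetical. Unlike the constant complexity claimed for LHG, this toy scans the whole vocabulary for every unknown word.

```python
from collections import defaultdict


def within_one_edit(a, b):
    """True if a and b differ by at most one insertion, deletion, or substitution."""
    if abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) string
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1  # skip the common prefix
    if len(a) == len(b):
        return a[i + 1:] == b[i + 1:]  # at most one substituted letter
    return a[i:] == b[i + 1:]  # exactly one extra letter in b


class ToyHabitGraph:
    """Hypothetical toy model: words are nodes, edges count observed word successions."""

    def __init__(self):
        # successors[w1][w2] = number of times w2 directly followed w1
        self.successors = defaultdict(lambda: defaultdict(int))
        self.vocabulary = set()

    def add_sentence(self, sentence):
        words = sentence.lower().split()
        self.vocabulary.update(words)
        for prev, nxt in zip(words, words[1:]):
            self.successors[prev][nxt] += 1

    def correct(self, sentence):
        result = []
        for word in sentence.lower().split():
            if word in self.vocabulary:
                result.append(word)
                continue
            prev = result[-1] if result else None
            seen_after_prev = self.successors.get(prev, {})
            # Letter-level step: known words one edit away from the unknown word.
            candidates = [w for w in self.vocabulary if within_one_edit(word, w)]
            # Word-level step: prefer the candidate seen most often after prev.
            # Ties are broken alphabetically to keep the result deterministic.
            best = max(
                candidates,
                key=lambda w: (seen_after_prev.get(w, 0), w),
                default=word,  # no candidate: leave the word unchanged
            )
            result.append(best)
        return " ".join(result)


graph = ToyHabitGraph()
graph.add_sentence("the cat sat")
graph.add_sentence("a bat flew")
print(graph.correct("the zat sat"))
print(graph.correct("a zat flew"))
```

In this toy, the misspelling "zat" is one edit away from "cat", "bat", and "sat", and it is the preceding word that decides between them. This is meant only to show what contextual correction means here.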

Author information

Authors and Affiliations

Department of Automatics and Biomedical Engineering, AGH University of Science and Technology, Mickiewicza Av. 30, 30-059, Krakow, Poland
Marcin Gadamer

Corresponding author

Correspondence to Marcin Gadamer.

Editor information

Editors and Affiliations

Częstochowa University of Technology, Częstochowa, Poland
Leszek Rutkowski
Częstochowa University of Technology, Częstochowa, Poland
Marcin Korytkowski
Częstochowa University of Technology, Częstochowa, Poland
Rafał Scherer
AGH University of Science and Technology, Kraków, Poland
Ryszard Tadeusiewicz
University of California, Berkeley, California, USA
Lotfi A. Zadeh
University of Louisville, Louisville, Kentucky, USA
Jacek M. Zurada

Cite this paper

Gadamer, M. (2017). Linguistic Habit Graphs Used for Text Representation and Correction. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L., Zurada, J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2017. Lecture Notes in Computer Science, vol 10246. Springer, Cham. https://doi.org/10.1007/978-3-319-59060-8_22

DOI: https://doi.org/10.1007/978-3-319-59060-8_22
Published: 24 May 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59059-2
Online ISBN: 978-3-319-59060-8
eBook Packages: Computer Science, Computer Science (R0)