Reinforcement Learning Based Interactive Agent for Personalized Mathematical Skill Enhancement

Islam, Muhammad Zubair; Mehmood, Kashif; Kim, Hyung Seok

doi:10.1007/978-3-030-43120-4_31

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11989))

Included in the following conference series:

International Conference on Mathematical Aspects of Computer and Information Sciences

593 Accesses

Abstract

Traditional intelligent systems recommend a teaching sequence to individual students without monitoring their ongoing learning attitude. It causes frustrations for students to learn a new skill and move them away from their target learning goal. As a step to make the best teaching strategy, in this paper a Personalized Skill-Based Math Recommender (PSBMR) framework has been proposed to automatically recommend pedagogical instructions based on a student’s learning progress over time. The PSBMR utilizes an adversarial bandit in contrast to the classic multi-armed bandit (MAB) problem to estimate the student’s ability and recommend the task as per his skill level. However, this paper proposes an online learning approach to model a student concept learning profile and used the Exp3 algorithm for optimal task selection. To verify the framework, simulated students with different behavioral complexity have been modeled using the Q-matrix approach based on item response theory. The simulation study demonstrates the effectiveness of this framework to act fairly with different groups of students to acquire the necessary skills to learn basic mathematics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Miliband, D.: Personalised Learning: Building a New Relationship with Schools, Speech to the North of England Education Conference, Belfast, January 2004
Google Scholar
Yago, H., Clemente, J., Rodriguez, D.: Competence-based recommender systems: a systematic literature review. Behav. Inf. Technol. 37(10–11), 958–977 (2018)
Article Google Scholar
Ricci, F., Rokach, L., Shapira, B.: Introduction to Recommender Systems Handbook. In: Ricci, F., Rokach, L., Shapira, B., Kantor, P. (eds.) Recommender Systems Handbook. Springer, Boston (2011). https://doi.org/10.1007/978-0-387-85820-3_1
Chapter MATH Google Scholar
Zaphiris, P., Ioannou, A. (eds.): LCT 2015. LNCS, vol. 9192. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-20609-7
Book Google Scholar
Slivkins, A.: Introduction to Multi-Armed Bandits, April 2019. arXiv:1904.07272v3
Clement, B., Roy, D., Oudeyer, P.-Y., Lopes, M.: Online optimization of teaching sequences with multi-armed bandits. In: 7th International Conference on Education Data Mining, London, UK (2014)
Google Scholar
Serrano-Guerrero, J., Romero, F.P., Olivas, J.A.: Hiperion: a fuzzy approach for recommending educational activities based on the acquisition of competencies. Inf. Sci. (NY) 248, 114–129 (2013)
Article Google Scholar
Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32(1), 48–77 (2003)
Article MathSciNet Google Scholar
Wang, Y., Heffernan, N.: Extending knowledge tracing to allow partial credit: using continuous versus binary nodes. In: Lane, H.C., Yacef, K., Mostow, J., Pavlik, P. (eds.) AIED 2013. LNCS (LNAI), vol. 7926, pp. 181–188. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39112-5_19
Chapter Google Scholar
Reckase, M.D.: Multidimensional Item Response Theory Models. Springer, New York (2009). https://doi.org/10.1007/978-0-387-89976-3
Book MATH Google Scholar
Rupp, A.A., Templin, J.L.: Unique characteristics of diagnostic classification models: a comprehensive review of the current state-of-the-art. Meas. Interdiscip. Res. Perspect. 6(4), 219–262 (2008)
Article Google Scholar
Chen, Y., Li, X., Liu, J., Ying, Z.: Recommendation system for adaptive learning. Appl. Psychol. Meas. 42(1), 24–41 (2018)
Article Google Scholar
Koedinger, K.R., Brunskill, E., Baker, R.S., Mclaughlin, E.A., Stamper, J.: New potentials for data-driven intelligent tutoring system development and optimization. AI Mag. 34(3), 27–41 (2013)
Article Google Scholar
Leighton, J.P., Gierl, M.J., Hunka, S.M.: The attribute hierarchy method for cognitive assessment: a variation on Tatsuoka’s rule-space approach. J. Educ. Meas. 41(3), 205–237 (2004)
Article Google Scholar
Nino-Mora, J.: Stochastic scheduling. In: Floudas, C.A., Pardalos, P.M. (Eds.) Encyclopedia of Optimization, pp. 3818–3824 (2009)
Google Scholar
Burtini, G., Loeppky, J., Lawrence, R.: A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit. arXiv:1510.00757v4
Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: Gambling in a rigged casino: the adversarial multi-armed bandit problem. In: Proceedings of Annual Symposium Foundations of Computer Science, Milwaukee, WI, pp. 322–331 (1995)
Google Scholar
Lattimore, T., Szepesvá, C.: Bandit Algorithms. Cambridge University Press, Cambridge (2018). Draft of 28th July, Revision 1016
Google Scholar
Schapire, R.E., Freund, Y.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 119–139 (1997)
Article MathSciNet Google Scholar
Mertz, Jr., J.: Using a simulated student for instructional design. In: Proceedings of the Seventh World Conference on Artificial Intelligence in Education (1995)
Google Scholar
Beck, J.: Modeling the student with reinforcement learning. Paper presented at the 6th Annual Conference on User Modelling, Sardina, Italy (1997)
Google Scholar
Meneghetti, D.D.R., Junior, P.TA.: Application and Simulation of Computerized Adaptive Tests Through the Package catsim (2017). arXiv:1707.03012v2

Download references

Acknowledgment

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2019R1A4A1023746, No. 2019R1F1A1060799).

Author information

Authors and Affiliations

Department of Information and Communication Engineering, Sejong University, Seoul, 05006, Republic of Korea
Muhammad Zubair Islam, Kashif Mehmood & Hyung Seok Kim

Authors

Muhammad Zubair Islam
View author publications
You can also search for this author in PubMed Google Scholar
Kashif Mehmood
View author publications
You can also search for this author in PubMed Google Scholar
Hyung Seok Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hyung Seok Kim .

Editor information

Editors and Affiliations

AIT Austrian Institute of Technology, Vienna, Austria
Daniel Slamanig
IMJ-PRG, Sorbonne University, Paris, France
Elias Tsigaridas
Institute of Information Technologies, Gebze Technical University, Gebze, Turkey
Zafeirakis Zafeirakopoulos

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Islam, M.Z., Mehmood, K., Kim, H.S. (2020). Reinforcement Learning Based Interactive Agent for Personalized Mathematical Skill Enhancement. In: Slamanig, D., Tsigaridas, E., Zafeirakopoulos, Z. (eds) Mathematical Aspects of Computer and Information Sciences. MACIS 2019. Lecture Notes in Computer Science(), vol 11989. Springer, Cham. https://doi.org/10.1007/978-3-030-43120-4_31

Download citation

DOI: https://doi.org/10.1007/978-3-030-43120-4_31
Published: 18 March 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-43119-8
Online ISBN: 978-3-030-43120-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics