Skip to main content

Factored MDPs for Optimal Prosumer Decision-Making in Continuous State Spaces

  • Conference paper
  • First Online:
Multi-Agent Systems and Agreement Technologies (EUMAS 2015, AT 2015)

Abstract

The economic profitability of Smart Grid prosumers (i.e., producers that are simultaneously consumers) depends on their tackling of the decision-making problem they face when selling and buying energy. In previous work, we had modelled this problem compactly as a factored Markov Decision Process, capturing the main aspects of the business decisions of a prosumer corresponding to a community microgrid of any size. Though that work had employed an exact value iteration algorithm to obtain a near-optimal solution over discrete state spaces, it could not tackle problems defined over continuous state spaces. By contrast, in this paper we show how to use approximate MDP solution methods for taking decisions in this domain without the need of discretizing the state space. Specifically, we employ fitted value iteration, a sampling-based approximation method that is known to be well behaved. By so doing, we generalize our factored MDP solution method to continuous state spaces. We evaluate our approach using a variety of basis functions over different state sample sizes, and compare its performance to that of our original “exact” value iteration algorithm. Our generic approximation method is shown to exhibit stable performance in terms of accumulated reward, which for certain basis functions reaches 98 % of that gathered by the exact algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    See http://www.powertac.org/node/11 for a list of related publications.

  2. 2.

    States on the x axis in these figures are ranked in reverse order wrt. steps-to-go in the horizon: states with small indices occur early in the day-ahead, and the ones to the right late.

References

  1. Ackermann, T. (ed.): Wind Power in Power Systems. Wiley, Chichester (2005)

    Google Scholar 

  2. Angelidakis, A., Chalkiadakis, G.: Factored MDPs for optimal prosumer decision-making. In: Proceedings of AAMAS-2015, pp. 503–511 (2015)

    Google Scholar 

  3. Asmus, P.: Microgrids, virtual power plants and our distributed energy future. Electr. J. 23(10), 72–82 (2010)

    Article  Google Scholar 

  4. Boutilier, C., Dean, T., Hanks, S.: Decision-theoretic planning: structural assumptions and computational leverage. J. Artif. Intell. Res. (JAIR) 11, 1–94 (1999)

    MathSciNet  MATH  Google Scholar 

  5. Busoniu, L., Babuska, R., De Schutter, B., Ernst, D.: Reinforcement Learning and Dynamic Programming Using Function Approximators. CRC Press, Boca Raton (2010)

    Book  MATH  Google Scholar 

  6. Chalkiadakis, G., Robu, V., Kota, R., Rogers, A., Jennings, N.: Cooperatives of distributed energy resources for efficient virtual power plants. In: Proceedings of AAMAS-2011, pp. 787–794 (2011)

    Google Scholar 

  7. DeGroot, M., Schervish, J.: Probability and Statistics. Addison-Wesley, Reading (2002)

    Google Scholar 

  8. Gordon, G.J.: Stable function approximation in dynamic programming. In: Proceedings of the 12th International Conference on Machine Learning, pp. 261–268 (1995)

    Google Scholar 

  9. Guestrin, C., Koller, D., Parr, R., Venkataraman, S.: Efficient solution algorithms for factored MDPs. J. Artif. Intell. Res. (JAIR) 19, 399–468 (2003)

    MathSciNet  MATH  Google Scholar 

  10. Kanchev, H., Lu, D., Colas, F., Lazarov, V., Francois, B.: Energy management and operational planning of a microgrid with a PV-based active generator for Smart Grid applications. IEEE Trans. Ind. Electron. 58(10), 4583–4592 (2011)

    Article  Google Scholar 

  11. Kirschen, D., Strbac, G.: Fundamentals of Power System Economics. Wiley, Chichester (2005)

    Google Scholar 

  12. Munos, R., Szepesvári, C.: Finite-time bounds for fitted value iteration. J. Mach. Learn. Res. 9, 815–857 (2008)

    MathSciNet  MATH  Google Scholar 

  13. Nikovski, D., Zhang, W.: Factored markov decision process models for stochastic unit commitment. In: IEEE Conference on Innovative Technologies for an Efficient and Reliable Electricity Supply (CITRES), pp. 28–35 (2010)

    Google Scholar 

  14. Ramchurn, S.D., Vytelingum, P., Rogers, A., Jennings, N.: Putting the ‘smarts’ into the Smart Grid: a grand challenge for artificial intelligence. Commun. ACM 55(4), 86–97 (2012)

    Article  Google Scholar 

  15. Rogers, A., Ramchurn, S., Jennings, N.: Delivering the Smart Grid: challenges for autonomous agents and multi-agent systems research. In: Proceedings of AAAI-2012, pp. 2166–2172 (2012)

    Google Scholar 

  16. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)

    Google Scholar 

  17. Zhao, B., Zhang, X., Chen, J., Wang, C., Guo, L.: Operation optimization of standalonemicrogrids considering lifetime characteristics of battery energy storage system. IEEE Trans.Sustain. Energ. 4(4), 934–943 (2013)

    Article  Google Scholar 

  18. Federation of European renewable energy cooperatives. http://www.rescoop.eu

Download references

Acknowledgements

The work presented in this paper was supported by the Greek General Secretariat for Research and Technology (GSRT) through the funding of research project “AFORMI – Reconfigurable Systems for scientific research” with proposal code 2427 within the context of action “ARISTEIA” of the Lifelong Learning Program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Angelos Angelidakis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Angelidakis, A., Chalkiadakis, G. (2016). Factored MDPs for Optimal Prosumer Decision-Making in Continuous State Spaces. In: Rovatsos, M., Vouros, G., Julian, V. (eds) Multi-Agent Systems and Agreement Technologies. EUMAS AT 2015 2015. Lecture Notes in Computer Science(), vol 9571. Springer, Cham. https://doi.org/10.1007/978-3-319-33509-4_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-33509-4_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-33508-7

  • Online ISBN: 978-3-319-33509-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics