Skip to main content

Designing Bounded Min-Knapsack Bandits Algorithm for Sustainable Demand Response

  • Conference paper
  • First Online:
PRICAI 2021: Trends in Artificial Intelligence (PRICAI 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13031))

Included in the following conference series:

  • 2228 Accesses

Abstract

Recent trends focus on incentivizing consumers to reduce their demand consumption during peak hours for sustainable demand response. To minimize the loss, the distributor companies should target the right set of consumers and demand the right amount of electricity reductions. Almost all the existing algorithms focus on demanding single unit reductions from the selected consumers and thus have limited practical applicability. Even for single unit reductions, none of the work provides a polynomial time constant approximation factor algorithm to minimize the loss to the distributor company. This paper proposes a novel bounded integer min-knapsack algorithm (MinKPDR) and shows that the algorithm, while allowing for multiple unit reduction, also optimizes the loss to the distributor company within a factor of two (multiplicative) and a problem dependant additive constant. The loss is a function of the cost of buying the electricity from the market, costs incurred by the consumers, and compliance probabilities of the consumers. When the compliance probabilities of the consumers are not known, the problem can be formulated as a combinatorial multi-armed bandit (CMAB) problem. Existing CMAB algorithms fail to work in this setting due to the non-monotonicity of a reward function and time varying optimal sets. We propose a novel algorithm (Twin-MinKPDR-CB) to learn these compliance probabilities efficiently. Twin-MinKPDR-CB works for non-monotone reward functions, bounded min-knapsack constraints, and time-varying optimal sets. We theoretically show that Twin-MinKPDR-CB achieves sub-linear regret of \(O(\log T)\) with T being the number of rounds for which demand response is run.

A. Singh and P. M. Reddy—contributed equally.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agrawal, S., Devanur, N.R.: Bandits with concave rewards and convex knapsacks. In: Proceedings of the Fifteenth ACM Conference on Economics and Computation, pp. 989–1006 (2014)

    Google Scholar 

  2. Akasiadis, C., et al.: Incentives for rescheduling residential electricity consumption to promote renewable energy usage. In: 2015 SAI Intelligent Systems Conference (IntelliSys), pp. 328–337, November 2015

    Google Scholar 

  3. Anderson, R., Fuloria, S.: On the security economics of electricity metering. In: Proceedings of the WEIS (2010)

    Google Scholar 

  4. Chao, H.: Competitive electricity markets with consumer subscription service in a smart grid. J. Regul. Econ. 1–26 (2012)

    Google Scholar 

  5. Chen, W., Hu, W., Li, F., Li, J., Liu, Y., Lu, P.: Combinatorial multi-armed bandit with general reward functions. Adv. Neural Inf. Process. Syst. 29, 1659–1667 (2016)

    Google Scholar 

  6. Chen, W., Wang, Y., Yuan, Y.: Combinatorial multi-armed bandit: General framework and applications. In: International Conference on Machine Learning, pp. 151–159 (2013)

    Google Scholar 

  7. Chen, X., Nie, Y., Li, N.: Online residential demand response via contextual multi-armed bandits. arXiv preprint arXiv:2003.03627 (2020)

  8. Csirik, J.: Heuristics for the 0–1 min-knapsack problem. Acta Cybernetica 10(1–2), 15–20 (1991)

    Google Scholar 

  9. Gai, Y., Krishnamachari, B., Jain, R.: Combinatorial network optimization with unknown variables: multi-armed bandits with linear rewards and individual observations. IEEE/ACM Trans. Netw. 20(5), 1466–1478 (2012)

    Google Scholar 

  10. Gurobi Optimization L: Gurobi optimizer reference manual (2021). http://www.gurobi.com

  11. Hsu, Y.Y., Su, C.C.: Dispatch of direct load control using dynamic programming. IEEE Trans. Power Syst. 6(3), 1056–1061 (1991)

    Google Scholar 

  12. Jain, S., Balakrishnan, N., Narahari, Y., Hussain, S.A., Voo, N.Y.: Constrained tâtonnement for fast and incentive compatible distributed demand management in smart grids. In: Proceedings of the Fourth International Conference on Future Energy Systems, pp. 125–136. ACM (2013)

    Google Scholar 

  13. Jain, S., Gujar, S., Bhat, S., Zoeter, O., Narahari, Y.: A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing. Artif. Intell. 254, 44–63 (2018)

    Google Scholar 

  14. Jain, S., Narayanaswamy, B., Narahari, Y.: A multiarmed bandit incentive mechanism for crowdsourcing demand response in smart grids (2014)

    Google Scholar 

  15. Li, Y., Hu, Q., Li, N.: Learning and selecting the right customers for reliability: a multi-armed bandit approach (2018)

    Google Scholar 

  16. Ma, H., Parkes, D.C., Robu, V.: Generalizing demand response through reward bidding. In: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, pp. 60–68. AAMAS 2017, International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC (2017)

    Google Scholar 

  17. Ma, H., Robu, V., Li, N.L., Parkes, D.C.: Incentivizing reliability in demand-side response. In: the proceedings of The 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), pp. 352–358 (2016)

    Google Scholar 

  18. Methenitis, G., Kaisers, M., La Poutré, H.: Forecast-based mechanisms for demand response. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1600–1608. International Foundation for Autonomous Agents and Multiagent Systems (2019)

    Google Scholar 

  19. Mittal, S., Schulz, A.S.: An FPTAS for optimizing a class of low-rank functions over a polytope. Math. Program. 141(1–2), 103–120 (2013)

    Google Scholar 

  20. Park, S., Jin, Y., Song, H., Yoon, Y.: Designing a critical peak pricing scheme for the profit maximization objective considering price responsiveness of customers. Energy 83, 521–531 (2015)

    Article  Google Scholar 

  21. Ramchurn, S., Vytelingum, P., Rogers, A., Jennings, N.: Agent-based control for decentralised demand side management in the smart grid. In: AAMAS, pp. 5–12, February 2011

    Google Scholar 

  22. Robu, V., Chalkiadakis, G., Kota, R., Rogers, A., Jennings, N.R.: Rewarding cooperative virtual power plant formation using scoring rules. Energy 117, 19–28 (2016). https://doi.org/10.1016/j.energy.2016.10.077

    Article  Google Scholar 

  23. Shweta, J., Sujit, G.: A multiarmed bandit based incentive mechanism for a subset selection of customers for demand response in smart grids. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 2046–2053 (2020)

    Google Scholar 

  24. Xu, H., Liu, Y., Lau, W.C., Li, R.: Combinatorial multi-armed bandits with concave rewards and fairness constraints. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, pp. 2554–2560 (2020)

    Google Scholar 

  25. Zhang, Q., Wang, X., Fu, M.: Optimal implementation strategies for critical peak pricing. In: 2009 6th International Conference on the European Energy Market, pp. 1–6. IEEE (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shweta Jain .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Singh, A., Reddy, P.M., Jain, S., Gujar, S. (2021). Designing Bounded Min-Knapsack Bandits Algorithm for Sustainable Demand Response. In: Pham, D.N., Theeramunkong, T., Governatori, G., Liu, F. (eds) PRICAI 2021: Trends in Artificial Intelligence. PRICAI 2021. Lecture Notes in Computer Science(), vol 13031. Springer, Cham. https://doi.org/10.1007/978-3-030-89188-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-89188-6_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-89187-9

  • Online ISBN: 978-3-030-89188-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics