Designing Bounded Min-Knapsack Bandits Algorithm for Sustainable Demand Response

Singh, Akansha; Reddy, P. Meghana; Jain, Shweta; Gujar, Sujit

doi:10.1007/978-3-030-89188-6_1

Akansha Singh¹²,
P. Meghana Reddy¹²,
Shweta Jain¹² &
…
Sujit Gujar¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13031))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

2228 Accesses

Abstract

Recent trends focus on incentivizing consumers to reduce their demand consumption during peak hours for sustainable demand response. To minimize the loss, the distributor companies should target the right set of consumers and demand the right amount of electricity reductions. Almost all the existing algorithms focus on demanding single unit reductions from the selected consumers and thus have limited practical applicability. Even for single unit reductions, none of the work provides a polynomial time constant approximation factor algorithm to minimize the loss to the distributor company. This paper proposes a novel bounded integer min-knapsack algorithm (MinKPDR) and shows that the algorithm, while allowing for multiple unit reduction, also optimizes the loss to the distributor company within a factor of two (multiplicative) and a problem dependant additive constant. The loss is a function of the cost of buying the electricity from the market, costs incurred by the consumers, and compliance probabilities of the consumers. When the compliance probabilities of the consumers are not known, the problem can be formulated as a combinatorial multi-armed bandit (CMAB) problem. Existing CMAB algorithms fail to work in this setting due to the non-monotonicity of a reward function and time varying optimal sets. We propose a novel algorithm (Twin-MinKPDR-CB) to learn these compliance probabilities efficiently. Twin-MinKPDR-CB works for non-monotone reward functions, bounded min-knapsack constraints, and time-varying optimal sets. We theoretically show that Twin-MinKPDR-CB achieves sub-linear regret of \(O(\log T)\) with T being the number of rounds for which demand response is run.

A. Singh and P. M. Reddy—contributed equally.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, S., Devanur, N.R.: Bandits with concave rewards and convex knapsacks. In: Proceedings of the Fifteenth ACM Conference on Economics and Computation, pp. 989–1006 (2014)
Google Scholar
Akasiadis, C., et al.: Incentives for rescheduling residential electricity consumption to promote renewable energy usage. In: 2015 SAI Intelligent Systems Conference (IntelliSys), pp. 328–337, November 2015
Google Scholar
Anderson, R., Fuloria, S.: On the security economics of electricity metering. In: Proceedings of the WEIS (2010)
Google Scholar
Chao, H.: Competitive electricity markets with consumer subscription service in a smart grid. J. Regul. Econ. 1–26 (2012)
Google Scholar
Chen, W., Hu, W., Li, F., Li, J., Liu, Y., Lu, P.: Combinatorial multi-armed bandit with general reward functions. Adv. Neural Inf. Process. Syst. 29, 1659–1667 (2016)
Google Scholar
Chen, W., Wang, Y., Yuan, Y.: Combinatorial multi-armed bandit: General framework and applications. In: International Conference on Machine Learning, pp. 151–159 (2013)
Google Scholar
Chen, X., Nie, Y., Li, N.: Online residential demand response via contextual multi-armed bandits. arXiv preprint arXiv:2003.03627 (2020)
Csirik, J.: Heuristics for the 0–1 min-knapsack problem. Acta Cybernetica 10(1–2), 15–20 (1991)
Google Scholar
Gai, Y., Krishnamachari, B., Jain, R.: Combinatorial network optimization with unknown variables: multi-armed bandits with linear rewards and individual observations. IEEE/ACM Trans. Netw. 20(5), 1466–1478 (2012)
Google Scholar
Gurobi Optimization L: Gurobi optimizer reference manual (2021). http://www.gurobi.com
Hsu, Y.Y., Su, C.C.: Dispatch of direct load control using dynamic programming. IEEE Trans. Power Syst. 6(3), 1056–1061 (1991)
Google Scholar
Jain, S., Balakrishnan, N., Narahari, Y., Hussain, S.A., Voo, N.Y.: Constrained tâtonnement for fast and incentive compatible distributed demand management in smart grids. In: Proceedings of the Fourth International Conference on Future Energy Systems, pp. 125–136. ACM (2013)
Google Scholar
Jain, S., Gujar, S., Bhat, S., Zoeter, O., Narahari, Y.: A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing. Artif. Intell. 254, 44–63 (2018)
Google Scholar
Jain, S., Narayanaswamy, B., Narahari, Y.: A multiarmed bandit incentive mechanism for crowdsourcing demand response in smart grids (2014)
Google Scholar
Li, Y., Hu, Q., Li, N.: Learning and selecting the right customers for reliability: a multi-armed bandit approach (2018)
Google Scholar
Ma, H., Parkes, D.C., Robu, V.: Generalizing demand response through reward bidding. In: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, pp. 60–68. AAMAS 2017, International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC (2017)
Google Scholar
Ma, H., Robu, V., Li, N.L., Parkes, D.C.: Incentivizing reliability in demand-side response. In: the proceedings of The 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), pp. 352–358 (2016)
Google Scholar
Methenitis, G., Kaisers, M., La Poutré, H.: Forecast-based mechanisms for demand response. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1600–1608. International Foundation for Autonomous Agents and Multiagent Systems (2019)
Google Scholar
Mittal, S., Schulz, A.S.: An FPTAS for optimizing a class of low-rank functions over a polytope. Math. Program. 141(1–2), 103–120 (2013)
Google Scholar
Park, S., Jin, Y., Song, H., Yoon, Y.: Designing a critical peak pricing scheme for the profit maximization objective considering price responsiveness of customers. Energy 83, 521–531 (2015)
Article Google Scholar
Ramchurn, S., Vytelingum, P., Rogers, A., Jennings, N.: Agent-based control for decentralised demand side management in the smart grid. In: AAMAS, pp. 5–12, February 2011
Google Scholar
Robu, V., Chalkiadakis, G., Kota, R., Rogers, A., Jennings, N.R.: Rewarding cooperative virtual power plant formation using scoring rules. Energy 117, 19–28 (2016). https://doi.org/10.1016/j.energy.2016.10.077
Article Google Scholar
Shweta, J., Sujit, G.: A multiarmed bandit based incentive mechanism for a subset selection of customers for demand response in smart grids. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 2046–2053 (2020)
Google Scholar
Xu, H., Liu, Y., Lau, W.C., Li, R.: Combinatorial multi-armed bandits with concave rewards and fairness constraints. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, pp. 2554–2560 (2020)
Google Scholar
Zhang, Q., Wang, X., Fu, M.: Optimal implementation strategies for critical peak pricing. In: 2009 6th International Conference on the European Energy Market, pp. 1–6. IEEE (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Indian Institute of Technology, Ropar, India
Akansha Singh, P. Meghana Reddy & Shweta Jain
International Institute of Information Technology, Hyderabad, India
Sujit Gujar

Authors

Akansha Singh
View author publications
You can also search for this author in PubMed Google Scholar
P. Meghana Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Shweta Jain
View author publications
You can also search for this author in PubMed Google Scholar
Sujit Gujar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shweta Jain .

Editor information

Editors and Affiliations

MIMOS Berhad, Kuala Lumpur, Malaysia
Duc Nghia Pham
Sirindhorn International Institute of Science and Technology, Thammasat University, Mueang Pathum Thani, Thailand
Thanaruk Theeramunkong
Data61, CSIRO, Brisbane, QLD, Australia
Guido Governatori
Department of Philosophy, Tsinghua University, Beijing, China
Fenrong Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Singh, A., Reddy, P.M., Jain, S., Gujar, S. (2021). Designing Bounded Min-Knapsack Bandits Algorithm for Sustainable Demand Response. In: Pham, D.N., Theeramunkong, T., Governatori, G., Liu, F. (eds) PRICAI 2021: Trends in Artificial Intelligence. PRICAI 2021. Lecture Notes in Computer Science(), vol 13031. Springer, Cham. https://doi.org/10.1007/978-3-030-89188-6_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-89188-6_1
Published: 25 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89187-9
Online ISBN: 978-3-030-89188-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics