Skip to main content

Task Scheduling of GPU Cluster for Large-Scale Data Process with Temperature Constraint

  • Conference paper
  • First Online:
The 8th International Conference on Computer Engineering and Networks (CENet2018) (CENet2018 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 905))

Included in the following conference series:

  • 765 Accesses

Abstract

With the development of GPU general-purpose computing, GPU heterogeneous cluster has become a widely used parallel processing solution for Large-scale data. Considering temperature management and controlling becomes a new research topic in high-performance computing field. A novel task scheduling model for GPU cluster with temperature limitation was built to balance the heat distribution and prevent the temperature hotspots occur. The scheduling index was introduced by combining the utilization of GPU and temperature. And the state matrix was designed to monitor the GPU cluster and provided status information for scheduler. When the temperature exceeds specific threshold value, the scheduler can improve the speed of fans to reduce the temperature. The experimental results show that the proposed scheduler can balance the heat distribution and prevent the temperature hotspots. Compared with the benchmark scheduling model, the loss of scheduling performance is in the acceptable range.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kaur, T., Chana, I.: Energy efficiency techniques in cloud computing: a survey and taxonomy. ACM Comput. Surv. 48(2), 22–54 (2016)

    Google Scholar 

  2. Lena, M., Mahyar, M.N., Zhang, Q., Shi, W.: Energy-aware scheduling of MapReduce jobs for big data applications. IEEE Trans. Parallel Distrib. Syst. 26(10), 2720–2733 (2015)

    Article  Google Scholar 

  3. Neetesh, K., Deo, P.: An energy aware cost effective scheduling framework for heterogeneous cluster system. Future Gener. Comput. Syst. 71, 73–88 (2017)

    Article  Google Scholar 

  4. Shi, Y.L., Zhang, K.H., Cui, L.Z., Liu, L., Zheng, Y.Q., Zhang, S.D., Yu, H.: MapReduce short jobs optimization based on resource reuse. Microprocess. Microsyst. 47, 178–187 (2016)

    Article  Google Scholar 

  5. Wang, H.F., Cao, Y.P.: GPU power consumption optimization control model of GPU clusters. Acta Electronica Sin. 43(10), 1904–1910 (2015). (in Chinese)

    Google Scholar 

  6. Huo, H.P., Hu, X.M., Sheng, C.C., Wu, B.F.: An energy efficient task scheduling scheme for node-layer heterogeneous GPU clusters. Comput. Appl. Softw. 30(3), 283–286 (2013). (in Chinese)

    Google Scholar 

  7. Lee, Y., Kulkarni, I., Pompili, D., Parashar, M.: Proactive thermal management in green data centers. J. Supercomputing 60(2), 165–195 (2012)

    Article  Google Scholar 

  8. Zhang, S., Chatha, K.S.: Approximation algorithm for the temperature-aware scheduling problem. In: IEEE/ACM ACM International Conference on Computer-Aided Design, San Jose, USA, pp. 281–288 (2007)

    Google Scholar 

  9. Li, X., Jiang, X.H., Wu, Z.H., Ye, K.J.: Research of thermal management methods for green data centers. Chin. J. Comput. 37(5), 1–21 (2014). (in Chinese)

    Google Scholar 

  10. Vanderster, D.C., Baniasadi, A., Dimopoulos, N.J.: Exploiting task temperature profiling in temperature-aware task scheduling for computational clusters. In: Choi, L., Paek, Y., Cho, S. (eds.) Advances in Computer Systems Architecture, pp. 175–185. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  11. Liu, H., Wang, J.G., Ge, Z.Z., Gu, Q., Chen, Q., Du, J.C.: Self-learning load balancing scheduling algorithm for GPU heterogeneous cluster. J. Xi’an Shiyou Univ. (Nat. Sci. Ed.) 30(3), 105–111 (2015). (in Chinese)

    Google Scholar 

Download references

Acknowledgement

This project is supported by Shandong Provincial Natural Science Foundation, China (No. ZR2017MF050), Project of Shandong Province Higher Educational Science and technology program (No. J17KA049), Shandong Province Key Research and Development Program of China (No. 2018GGX101005, 2017CXGC0701, 2016GGX109001) Shandong Province Independent Innovation and Achievement Transformation, China (No. 2014ZZCX02702).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yunpeng Cao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, H., Cao, Y. (2020). Task Scheduling of GPU Cluster for Large-Scale Data Process with Temperature Constraint. In: Liu, Q., Mısır, M., Wang, X., Liu, W. (eds) The 8th International Conference on Computer Engineering and Networks (CENet2018). CENet2018 2018. Advances in Intelligent Systems and Computing, vol 905. Springer, Cham. https://doi.org/10.1007/978-3-030-14680-1_13

Download citation

Publish with us

Policies and ethics