Abstract
The rapidly increasing complexity and scale of optimization problems pose challenges to search ability and performance of traditional evolutionary algorithms which could be only executed sequentially without scalability and are difficult to obtain an ideal solution in a reasonable time. In this paper, Spark-ITGO, a parallel and scalable invasive tumor growth optimization algorithm on Spark, is presented based on the serial invasive tumor growth optimization (ITGO). In Spark-ITGO, a parallel multiple-tumor evolution model is proposed to search the optimal solution of problems. A balanced multi-island optimal migration strategy is designed to increase diversity of population and prevent converging into a local optimum. Additionally, a universal parallel evolutionary algorithm framework is implemented based on resilient distributed dataset (RDD) and central broadcast mechanism. Spark-ITGO is evaluated on benchmark experiments of CEC2013 and CEC2010 LSGO and the results show that it achieves great scalability and performs better than other evolutionary algorithms.
Similar content being viewed by others
Data availability
The data used to support the findings of this study are available from the corresponding author upon request.
Code availability
The code of this study are available from the corresponding author upon request.
References
Gong, Y.-J., Chen, W.-N., Zhan, Z.-H., Zhang, J., Li, Y., Zhang, Q., Li, J.-J.: Distributed evolutionary algorithms and their models: A survey of the state-of-the-art. Appl. Soft Comput. 34, 286–300 (2015)
Teijeiro, D., Pardo, X.C., Penas, D.R., González, P., Banga, J.R., Doallo, R.: Evaluation of parallel differential evolution implementations on MapReduce and Spark. In: European Conference on Parallel Processing, pp. 397–408. Springer(2016)
Alba, E., Luque, G., Nesmachnow, S.: Parallel metaheuristics: Recent advances and new trends. Int. Trans. Oper. Res. 20(1), 1–48 (2013)
Ren, J., Zhang, D., He, S., Zhang, Y., Li, T.: A survey on end-edge-cloud orchestrated network computing paradigms: Transparent computing, mobile edge computing, fog computing, and cloudlet. ACM Comput. Surveys (CSUR) 52(6), 1–36 (2019)
Zaharia, M., Xin, R.S., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M.J.: Apache spark: A unified engine for big data processing. Commun. ACM 59(11), 56–65 (2016)
Tang, D., Dong, S., Jiang, Y., Li, H., Huang, Y.: ITGO: Invasive tumor growth optimization algorithm. Appl. Soft Comput. 36, 670–698 (2015)
Tang, D., Dong, S., He, L., Jiang, Y.: Intrusive tumor growth inspired optimization algorithm for data clustering. Neural Comput. Appl. 27(2), 349–374 (2016)
Dubreuil, M., Gagné, C., Parizeau, M.: Analysis of a master-slave architecture for distributed evolutionary computations. IEEE Trans. Syst. Man Cybern. Part B 36(1), 229–235 (2006)
Giacobini, M., Tomassini, M., Tettamanzi, A.G.B., Alba, E.: Selection intensity in cellular evolutionary algorithms for regular lattices. IEEE Trans. Evol. Comput. 9(5), 489–505 (2005)
Herrera, F., Lozano, M.: Gradual distributed real-coded genetic algorithms. IEEE Trans. Evol. Comput. 4(1), 43–63 (2000)
Radenski, A.: Distributed simulated annealing with MapReduce. In: European conference on the applications of evolutionary computation, pp. 466–476. Springer (2002)
Ferrucci, F., Salza, P., Kechadi, M.-T., Sarro, F.: A parallel genetic algorithms framework based on Hadoop MapReduce. In: Proceedings of the 30th Annual ACM Symposium on Applied Computing, pp. 1664–1667 (2015)
Lu, H.-C., Hwang, F.J., Huang, Y.-H.: Parallel and distributed architecture of genetic algorithm on Apache Hadoop and Spark. Appl. Soft Comput. 95, 106497 (2020)
Ochi, L.S., Drummond, L.M., Figueiredo, R.M.: Design and implementation of a parallel genetic algorithm for the travelling purchaser problem. In: Proceedings of the 1997 ACM symposium on applied computing, pp. 257–262 (1997)
Ferrucci, F., Salza, P., Sarro, F.: Using hadoop mapreduce for parallel genetic algorithms: A comparison of the global, grid and island models. Evol. Comput. 26(4), 535–567 (2018)
Khezr, S.N., Navimipour, N.J.: MapReduce and its application in optimization algorithms: a comprehensive study. Majlesi J Multimed Process 4(3) (2015).
Deng, C., Tan, X., Dong, X., Tan, Y.: A parallel version of differential evolution based on resilient distributed datasets model. In: Bio-inspired computing-theories and applications, pp. 84–93. Springer (2015)
Teijeiro, D., Pardo, X.C., González, P., Banga, J.R., Doallo, R.: Implementing parallel differential evolution on Spark. In: european conference on the applications of evolutionary computation, pp. 75–90. Springer (2016)
Qi, R.-Z., Wang, Z.-J., Li, S.-Y.J.: Technology: A parallel genetic algorithm based on spark for pairwise test suite generation. J. Comput. Sci. Technol. 31(2), 417–427 (2016)
Maqbool, F., Razzaq, S., Lehmann, J., Jabeen, H.: scalable distributed genetic algorithm using apache spark (S-GA). In: International conference on intelligent computing, pp. 424–435. Springer (2019)
He, Z., Peng, H., Chen, J., Deng, C., Wu, Z.: A Spark-based differential evolution with grouping topology model for large-scale global optimization. Clust. Comput. 24(1), 515–535 (2021). https://doi.org/10.1007/s10586-020-03124-z
Gaifang, D., Xueliang, F., Honghui, L., Pengfei, X.: Cooperative ant colony-genetic algorithm based on spark. Comput. Electr. Eng. 60, 66–75 (2017)
AlJame, M., Ahmad, I., Alfailakawi, M.: Apache Spark Implementation of Whale Optimization Algorithm. Clust. Comput. 23(3), 2021–2034 (2020)
Yuan, J.: An anomaly data mining method for mass sensor networks using improved PSO algorithm based on spark parallel framework. J. Grid Comput. 18(2), 251–261 (2020)
Acknowledgements
This work was supported by National Natural Science Foundation of China (61976239), Science and Technology Project in Guangzhou of China (201903010046) and Innovation Foundation of High-end Scientific Research Institutions in Zhongshan of China (2019AG031).
Author information
Authors and Affiliations
Contributions
The algorithm was conceived by AX and JL. The parallel program was developed by AX and JL. The experiments were designed and performed by SD, AX, JZ and JL. JL, AX and SD wrote, reviewed and revised the manuscript. SD guided the work. All authors read and approved the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lin, J., Xiao, A., Dong, S. et al. Spark-ITGO: a parallel invasive tumor growth optimization algorithm on spark. Cluster Comput 25, 2633–2660 (2022). https://doi.org/10.1007/s10586-021-03396-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-021-03396-z