Skip to main content

Cost Analysis for Big Geospatial Data Processing in Public Cloud Providers

  • Conference paper
  • First Online:
Cloud Computing and Service Science (CLOSER 2017)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 864))

Included in the following conference series:

  • 557 Accesses

Abstract

Cloud computing is a suitable platform for running applications to process large volumes of data. Currently, with the growth of geographic and spatial data volume, conceptualized as Big Geospatial Data, some tools have been developed to allow the processing of this data efficiently. This work presents a cost-efficient method for processing geospatial data, optimizing the number of data nodes in a SpatialHadoop cluster according to dataset size. With this, it is possible to analyse and compare the costs for this type of application on public cloud providers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://aws.amazon.com.

  2. 2.

    http://azure.microsoft.com/.

  3. 3.

    https://cloud.google.com.

  4. 4.

    www.gartner.com/doc/reprints?id=12G2O5FC&ct=150519.

  5. 5.

    www.cloudbus.org/cloudsim.

References

  1. Alarabi, L., Eldawy, A., Alghamdi, R., Mokbel, M.F.: TAREEG: a MapReduce-based web service for extracting spatial data from OpenStreetMap. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 897–900. ACM (2014)

    Google Scholar 

  2. Bachiega, J., Reis, M., Araujo, A., Holanda, M.: Cost optimization on public cloud provider for big geospatial data: a case study using Open Street Map. In: Proceedings of the 7th International Conference on Cloud Computing and Services Science, pp. 54–62 (2017)

    Google Scholar 

  3. Chaisiri, S., Lee, B.S., Niyato, D.: Optimization of resource provisioning cost in cloud computing. IEEE Trans. Serv. Comput. 5(2), 164–177 (2012)

    Article  Google Scholar 

  4. Das, J., Dasgupta, A., Ghosh, S.K., Buyya, R.: A geospatial orchestration framework on cloud for processing user queries. In: 2016 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM), pp. 1–8. IEEE (2016)

    Google Scholar 

  5. Eldawy, A., Li, Y., Mokbel, M.F., Janardan, R.: CG\__Hadoop: computational geometry in MapReduce. In: Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 294–303. ACM (2013)

    Google Scholar 

  6. Eldawy, A., Mokbel, M.F.: A demonstration of SpatialHadoop: an efficient mapreduce framework for spatial data. Proc. VLDB Endowment 6(12), 1230–1233 (2013)

    Article  Google Scholar 

  7. Eldawy, A., Mokbel, M.F.: Pigeon: a spatial MapReduce language. In: 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 1242–1245. IEEE (2014)

    Google Scholar 

  8. Eldawy, A., Mokbel, M.F.: SpatialHadoop: a MapReduce framework for spatial data. In: 2015 IEEE 31st International Conference on Data Engineering (ICDE), pp. 1352–1363. IEEE (2015)

    Google Scholar 

  9. Eldawy, A., Mokbel, M.F., Jonathan, C.: HadoopViz: a MapReduce framework for extensible visualization of big spatial data. In: 2016 IEEE 32nd International Conference on Data Engineering (ICDE), pp. 601–612. IEEE (2016)

    Google Scholar 

  10. Ji, C., Li, Y., Qiu, W., Awada, U., Li, K.: Big data processing in cloud computing environments. In: Pervasive Systems, Algorithms and Networks (ISPAN), 2012 12th International Symposium on. pp. 17–23. IEEE (2012)

    Google Scholar 

  11. Kambatla, K., Pathak, A., Pucha, H.: Towards optimizing hadoop provisioning in the cloud. HotCloud 9, 12 (2009)

    Google Scholar 

  12. Krämer, M., Senner, I.: A modular software architecture for processing of big geospatial data in the cloud. Comput. Graph. 49, 69–81 (2015)

    Article  Google Scholar 

  13. Li, A., Yang, X., Kandula, S., Zhang, M.: CloudCmp: comparing public cloud providers. In: Proceedings of the 10th ACM SIGCOMM Conference on Internet Measurement, pp. 1–14. ACM (2010)

    Google Scholar 

  14. Mell, P., Grance, T., et al.: The NIST definition of cloud computing (2011)

    Google Scholar 

  15. Mokbel, M.F., Alarabi, L., Bao, J., Eldawy, A., Magdy, A., Sarwat, M., Waytas, E., Yackel, S.: A demonstration of MNTG-a web-based road network traffic generator. In: 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 1246–1249. IEEE (2014)

    Google Scholar 

  16. Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: a not-so-foreign language for data processing. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of data, pp. 1099–1110. ACM (2008)

    Google Scholar 

  17. Rosa, M., Moura, B., Vergara, G., Santos, L., Ribeiro, E., Holanda, M., Walter, M.E., Araújo, A.: BioNimbuZ: a federated cloud platform for bioinformatics applications. In: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 548–555. IEEE (2016)

    Google Scholar 

  18. Sagiroglu, S., Sinanc, D.: Big data: a review. In: 2013 International Conference on Collaboration Technologies and Systems (CTS), pp. 42–47. IEEE (2013)

    Google Scholar 

  19. Yang, C., Goodchild, M., Huang, Q., Nebert, D., Raskin, R., Xu, Y., Bambacus, M., Fay, D.: Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing? Int. J. Digital Earth 4(4), 305–329 (2011)

    Article  Google Scholar 

  20. Yang, C., Yu, M., Hu, F., Jiang, Y., Li, Y.: Utilizing cloud computing to address big geospatial data challenges. Comput. Environ. Urban Syst. 61, 120–128 (2017)

    Article  Google Scholar 

  21. Zhao, Y., Calheiros, R.N., Bailey, J., Sinnott, R.: SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. In: 2016 IEEE International Conference on Big Data (Big Data), pp. 432–441. IEEE (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to João Bachiega Jr. , Marco Sousa Reis , Aletéia P. F. Araújo or Maristela Holanda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bachiega, J., Reis, M.S., Araújo, A.P.F., Holanda, M. (2018). Cost Analysis for Big Geospatial Data Processing in Public Cloud Providers. In: Ferguson, D., Muñoz, V., Cardoso, J., Helfert, M., Pahl, C. (eds) Cloud Computing and Service Science. CLOSER 2017. Communications in Computer and Information Science, vol 864. Springer, Cham. https://doi.org/10.1007/978-3-319-94959-8_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-94959-8_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-94958-1

  • Online ISBN: 978-3-319-94959-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics