Skip to main content

MRFM: An Efficient Approach to Spatial Join Aggregate

  • Conference paper
Web-Age Information Management (WAIM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7419))

Included in the following conference series:

Abstract

Spatial join aggregate(SJA) is a commonly used but time-consuming operation in spatial database. Since it involves both the spatial join and the aggregate operation, performing SJA is a challenging task especially facing the deluge of spatial data. A popular model nowadays for massive data processing is the shared-nothing cluster using MapReduce. Thus, to explore SJA in MapReduce, a Map-Reduce-Filter-Merge(MRFM) algorithm is proposed.Map step divides the total SJA task into disjoint sets, then Reduce step aggregate each set individually, a Filter operation will filter those aggregate results of single assignment spatial objects.Finally, Merge step further aggregate the partial results of multiple assignment spatial objects using an efficient merge algorithm. Extensive experiments in large real spatial data have demonstrated the efficiency, effectiveness and scalability of the proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Tao, Y., Papadias, D.: Range aggregate processing in spatial databases. IEEE Transactions on Knowledge and Data Engineering 16(12), 1555–1570 (2004)

    Article  Google Scholar 

  2. Jurgens, M., Lenz, H.: The Ra*-Tree: An Improved R-Tree with Materialized Data for Supporting Range Queries on OLAP-Data. In: Proc. DEXA Workshop (1998)

    Google Scholar 

  3. Papadias, D., Kalnis, P., Zhang, J., Tao, Y.: Efficient OLAP Operations in Spatial Data Warehouses. In: Jensen, C.S., Schneider, M., Seeger, B., Tsotras, V.J. (eds.) SSTD 2001. LNCS, vol. 2121, p. 443. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  4. Gray, J., Bosworth, A., Layman, A., Pirahesh, H.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross- Tabs and Subtotals. In: Proc. Intl. Conf. Data Eng. (1996)

    Google Scholar 

  5. Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)

    Article  Google Scholar 

  6. Dittrich, J.P., Seeger, B.: Data redundancy and duplicate detection in spatial join processing. In: ICDE, pp. 535–546 (2000)

    Google Scholar 

  7. UC Bureau, Census 2010 Tiger/Line data (2010)

    Google Scholar 

  8. Pavlo, A., Paulson, E., et al.: A comparison of approaches to large-scale data analysis. In: SIGMOD Conference, pp. 165–178 (2009)

    Google Scholar 

  9. Jiang, D., Ooi, B.C., Shie, L., Wu, S.: The Performance of MapReduce: An Indepth Study. In: VLDB 2010 (2010)

    Google Scholar 

  10. Yang, H., Dasdan, A., et al.: Map-Reduce-Merge: simplified relational data processing on large clusters. In: SIGMOD Conference, pp. 1029–1040 (2007)

    Google Scholar 

  11. White, T.: Hadoop: The Definitive Guide. Yahoo! Press, Sebastopol (2009)

    Google Scholar 

  12. Wu, X., Carceroni, R., et al.: Automatic alignment of large-scale aerial rasters to road-maps, Geographic Information Systems. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Article No. 17 (2007)

    Google Scholar 

  13. Cary, A., Sun, Z., Hristidis, V., Rishe, N.: Experiences on Processing Spatial Data with MapReduce. In: Winslett, M. (ed.) SSDBM 2009. LNCS, vol. 5566, pp. 302–319. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  14. Zhang, S., Han, J., Lin, Z., et al.: SJMR: Parallelizing Spatial Join with MapReduce on Clusters. In: SSDBM Conference (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, Y., Chen, L., Jing, N., Xiong, W. (2012). MRFM: An Efficient Approach to Spatial Join Aggregate. In: Bao, Z., et al. Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7419. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33050-6_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33050-6_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33049-0

  • Online ISBN: 978-3-642-33050-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics