Skip to main content

Horizontally Partitioned Data

  • Reference work entry
  • First Online:
Encyclopedia of Database Systems
  • 31 Accesses

Synonyms

Homogeneously distributed data

Definition

Data is said to be horizontally partitioned when several organizations own the same set of attributes for different sets of entities. More formally, horizontal partitioning of data can be defined as follows: given a dataset DB = (E, I) (e.g., hospital discharge data for state of Texas) where E is the set of entities about whom the information is collected (e.g., the set of patients) and I is the set of attributes that is collected about entities (e.g., set of features collected about patients), DB is said to be horizontally partitioned among k sites where each site owns DBi = (Ei, Ii), 1 ≤ ik if E = E1E2…∪ Ek, EiEj = ∅, 1 ≤ ijk and I = I1 = I2… = In. In relational terms, with horizontal partitioning, the relation to be mined is the union of the relations at the sites.

Historical Background

Cheap data storage and abundant network capacity have revolutionized data collection and data dissemination. At the same time,...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 4,499.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 6,499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Agrawal R, Srikant R.. Privacy-preserving data mining. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2000. p. 439–50.

    Google Scholar 

  2. Clifton C, Marks D. Security and privacy implications of data mining. In: Proceedings of the Workshop on Data Mining and Knowledge Discovery; 1996. p. 15–9.

    Google Scholar 

  3. Friedman A, Wolff R, Schuster A. Providing k-anonymity in data mining. VLDB J. 2008;17(4):789–804.

    Article  Google Scholar 

  4. Han J, Kamber M. Data mining: concepts and techniques. San Francisco: Morgan Kaufmann; 2000.

    MATH  Google Scholar 

  5. Jagannathan G, Wright R.N. Privacy-preserving distributed k-means clustering over arbitrarily partitioned data. In: Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2005. p. 593–9.

    Google Scholar 

  6. Kantarcioglu M, Vaidya J. Privacy preserving naive bayes classifier for horizontally partitioned data. In: Proceedings of the Workshop on Privacy Preserving Data Mining; 2003.

    Google Scholar 

  7. Kantarcıoğlu M, Clifton C. Privately computing a distributed k-nn classifier. In: Proceedings of the 8th European Conference on Principles of Data Mining And Knowledge Discovery; 2004. p. 279–0.

    Google Scholar 

  8. Kantarcıoğlu M, Jin J, Clifton C. When do data mining results violate privacy? In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2004; p. 599–604.

    Google Scholar 

  9. Lin X, Clifton C, Zhu M. Privacy preserving clustering with distributed EM mixture modeling. Knowl Inform Syst. 2005;8(1):68–81.

    Article  Google Scholar 

  10. Lindell Y, Pinkas B. Privacy preserving data mining. In: Advances in Cryptology: Proceedings of the 20th Annual International Cryptology Conference; 2000. p. 36–54.

    Chapter  Google Scholar 

  11. Yu H, Jiang X, Vaidya J. Privacy-preserving svm using nonlinear kernels on horizontally partitioned data. In: Proceedings of the 2006 ACM Symposium on Applied Computing; 2006. p. 603–10.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Murat Kantarcıoğlu .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Kantarcıoğlu, M. (2018). Horizontally Partitioned Data. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_1391

Download citation

Publish with us

Policies and ethics