Abstract
Built specifically for the Semantic Web, triple stores must accommodate large numbers of RDF triples, yet they remain primarily centralized. As triple stores grow and evolve over time, there is a pressing need for scalable techniques that remove resource and performance bottlenecks in such systems. To this end, we propose a fully decentralized peer-to-peer architecture for large-scale triple stores, in which triples are maintained by individual stakeholders, together with a semantics-directed search protocol, mediated by topology reorganization, for locating triples of interest. We evaluate our design through simulations, and the results show the anticipated improvements over existing techniques for distributed triple stores. Beyond guiding the engineering of future large-scale triple stores, our work will in particular benefit the federation of today's stand-alone triple stores to achieve the desired scalability.
Additional information
This research was primarily conducted while Jing Zhou was affiliated with the School of Electronics and Computer Science, University of Southampton, U.K. and is supported in part by the Leading Academic Discipline Program, 211 Project for Communication University of China (the 3rd phase).
About this article
Cite this article
Zhou, J., Hall, W. & De Roure, D. Building a Distributed Infrastructure for Scalable Triple Stores. J. Comput. Sci. Technol. 24, 447–462 (2009). https://doi.org/10.1007/s11390-009-9236-1
DOI: https://doi.org/10.1007/s11390-009-9236-1