Reading List on Data Management Issues Over P2P  

Overview

N. Daswani, H. Garcia-Molina, and B. Yang. Open problems in data-sharing peer-to-peer systems. In Proc. 9th Int. Conf. on Database Theory, 2003.

P. A. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data management for peer-to-peer computing: A vision. In Proc. 5th Int. Workshop on the World Wide Web and Databases (WebDB), 2002.

B. Yang and H. Garcia-Molina. Improving search in peer-to-peer networks. In Proc. 22nd Int. Conf. on Distributed Computing Systems, pages 5-12, 2002.

M. Schlosser, M. Sintek, S. Decker, and W. Nejdl. A scalable and ontology-based P2P infrastructure for semantic web services. In Peer-to-Peer Computing, pages 104-111, 2002.

P. Valduriez and Pacitti E. Data management in large-scale P2P systems. In High Performance Computing for Computational Science - VECPAR 2004, 6th International Conference, pages 104-118, 2004.

S. D. Gribble, A. Y. Halevy, Z. G. Ives,  M. Rodrig, and D. Suciu. What can database do for peer-to-peer? In Proc. 4th Int. Workshop on the World Wide Web and Databases (WebDB), pages 31-36, 2001.

J. M. Hellerstein. Architectures and algorithms for Internet-scale (P2P) data management. In Proc. 30th Int. Conf. on Very Large Data Bases, 2004.

M. Castro, M. Costa, and A. Rowstron. Peer-to-peer overlays: structured, unstructured, or both? Technical Report MSR-TR-2004-73, Microsoft Research, Cambridge, 2004.

K. Aberer and M. Hauswirth. An overview on peer-to-peer information systems. In Workshop on Distributed Data and Structures, 2002.

 

Unstructured P2P

V. Kantere, D. Tsoumakos, and N. Roussopoulos. Querying structured data in an unstructured p2p system. In International Workshop on Web Information and Data Management, pages 64-71, 2004.

Y. Petrakis and E. Pitoura. On constructing small worlds in unstructured peer-to-peer systems. In EDBT Workshops 2004, 2004.

 

Super peer-based P2P

A. Montresor. A robust protocol for building superpeer overlay topologies. In Proceedings of the 4th International Conference on Peer-to-Peer Computing, August 2004.

M. T. Schlosser, M. Sintek, S. Decker, and W. Nejdl. Hypercup - hypercubes, ontologies, and efficient search on peer-to-peer

A. Montresor. A robust protocol for building superpeer overlay topologies. In Proceedings of the 4th International Conference on Peer-to-Peer Computing, August 2004.

M. Schlosser, M. Sintek, S. Decker, and W. Nejdl. A scalable and ontology-based P2P infrastructure for semantic web services. In Peer-to-Peer Computing, pages 104-111, 2002.

 

Structured P2P (DHT)

S. Ratnasamy, P. Francis, M. Handley, R. Karp, and S. Shenker. A scalable content-addressable network. In ACM SIGCOMM 2001 Conf. on Applications, Technologies, Architectures, and Protocols for Computer Communication, pages 161-172, 2001.

S. Ratnasamy, M. Handley, R. Karp, and S. Shenker. Topologically-aware overlay construction and server selection. In The 21st Annual Joint Conference of the IEEE Computer and Communications Societies, 2002.

A. Rowstron and P. Druschel. Pastry: Scalable, decentralized object location, and routing for large-scale peer-to-peer systems. In Middleware 2001, IFIP/ACM International Conference on Distributed Systems Platforms, pages 329-350, 2001.

I. Stoica, R. Morris, D. Liben-Nowell, D. R. Karger, M. F. Kaashoek, F. Dabek, and H. Balakrishnan. Chord: a scalable peer-to-peer lookup protocol for internet applications. In ACM SIGCOMM 2001 Conf. on Applications, Technologies, Architectures, and Protocols for Computer Communication, pages 149-160, 2001.

B. Y. Zhao, J. Kubiatowicz, and A. D. Joseph. Tapestry: An infrastructure for fault-tolerant wide-area location and routing. Technical report, University of California, Berkeley, 2001.

N. Harvey, M. B. Jones, S. Saroiu, M. Theimer, and A. Wolman. Skipnet: A scalable overlay network with practical locality properties. In USENIX Symposium on Internet Technologies and Systems, 2003.

H. V. Jagadish, B. C. Ooi, and Q. H. Vu. Baton: A balanced tree structure for peer-to-peer networks. In Proc. 31th Int. Conf. on Very Large Data Bases, 2005.

H. V. Jagadish, B.C. Ooi, Q.H. Vu, and A.Y. Zhou R. Zhang. VBI-tree: A peer-to-peer framework for supporting multi-dimensional indexing schemes. In Proc. Int. Conf. on Data Engineering, 2006.

D. Malkhi, M. Naor, and D. Ratajczak. Viceroy: a scalable and dynamic emulation of the butterfly. In Proc. ACM SIGACT-SIGOPS 21st Symp. on the Principles of Dist. Comp., pages 183-192, 2002.

G. S. Manku, M. Bawa, and P. Raghavan. Symphony: distributed hashing in a small world. In USENIX Symposium on Internet Technologies and Systems, 2003.

P. Maymounkov and D. Mazi?eres. Kademlia: A peer-to-peer information system based on the xor metric. In Peer-to-Peer Systems, First International Workshop, IPTPS 2002, pages 53-65, 2002.

A. Bharambe, M. Agrawal, and S. Seshan. Mercury: Supporting scalable multi-attribute range queries. In ACM SIGCOMM 2004 Conf. on Applications, Technologies, Architectures, and Protocols for Computer Communication, 2004.

M. Datar. Butterflies and peer-to-peer networks. In ESA 2002, 10th Annual European Symposium, pages 310-322, 2002.

A. Datta, S. Girdzijauskas, and K. Aberer. On De Bruijn routing in distributed hash tables: there and back again. In Fourth IEEE International Conference on Peer-to-Peer Computing, 2004.

K. Aberer. P-Grid: A self-organizing access structure for P2P information systems. In Sixth International Conference on Cooperative Information Systems, 2001.

A. Crainiceanu, P. Linga, J. Gehrke, and J. Shanmugasundaram. Querying peer-to-peer networks using p-trees. In Proc. 7th Int. Workshop on the World Wide Web and Databases (WebDB), pages 25-30,2004.

 

Hierarchical DHT

L. G. Erice, E. Biersack, P. Felber, K. W. Ross, and G. U. Keller. Hierarchical peer-to-peer systems. In Euro-Par 2003. Parallel Processing, 9th International Euro-Par Conference, pages 1230-1239, 2003.

P. Ganesan, P. K. Gummadi, and H. Garcia-Molina. Canon in g major: Designing dhts with hierarchical structure. In Proc. 24rd Int. Conf. on Distributed Computing Systems, pages 263-272, 2004.

 

XML over P2P

E. Pitoura, S. Abiteboul, D. Pfoser, G. Samaras, and M. Vazirgiannis. Dbglobe: a service-oriented p2p system for global computing. In ACM SIGMOD Record, 2003.

G. Koloniari and E. Pitoura. Content-based routing of path queries in peer-to-peer systems. In Advances in Database Technology | EDBT'04, pages 29-47, 2004.

G. Erice, P. A. Felber, E.W. Biersack, G. Urvoy-Keller, and K.W.Ross. Data indexing in peer-to-peer DHT networks. 2004. Proc. 24rd Int. Conf. on Distributed Computing Systems.

L. Galanis, Y.Wang, S. R. Jeffery, and D. J. DeWitt. Locating data sources in large distributed systems. In Proc. 29th Int. Conf. on Very Large Data Bases, pages 874-885, 2003.

L. Galanis, Y. Wang, S. R. Jeffery, and D. J. DeWitt. Processing queries in a large peer-to-peer system. In Proc. of the 15th Int. Conf. on Advanced Information Systems Engineering, 2003.

C. Sartiani, P. Manghi, G. Ghelli, and G. Conforti. Xpeer: A self-organizing xml p2p database system. In Proc. of the First International Workshop on Peer-to-Peer Computing and Databases, 2003.

G. Skobeltsyn, M. Hauswirth, and K. Aberer. Efficient processing of XPath queries with structured overlay networks. In submitted to the 2005 International Conference on Ontologies, Databases and Applications of SEmantics (ODBASE), 2005.

G. Koloniari, Y. Petrakis, and E. Pitoura. Content-based overlay networks for XML peers based on multi-level bloom filters. In Databases, Information Systems, and Peer-to-Peer Computing, First International Workshop, 2003.

G. Koloniari and E. Pitoura. Peer-to-peer management of XML data: issues and research challenges. ACM SIGMOD Record, 34(2), June 2005.

A. Bonifati, U. Matrangolo, A. Cuzzocrea, and M. Jain. XPath lookup queries in P2P networks. In International Workshop on Web Information and Data Management, 2004.

 

Range Query over P2P

A. Andrzejak and Z. Xu. Scalable, efficient range queries for grid information services. In Peer-to-Peer Computing, 2002.

A. Datta, M. Hauswirth, R. John, R. Schmidt, and K. Aberer. Range queries in trie-structured overlays. Technical Report IC/2004/111, EPFL, 2004.

A. Gupta, D. Agrawal, and A. E. Abbadi. Approximate range selection queries in peer-to-peer systems. In First Biennial Conference on Innovative Data Systems Research, 2003.

S. Ratnasamy, J. M. Hellerstein, and S. Shenker. Range queries over dhts. Technical Report IRB-TR-03-009, Intel Research Berkeley, June 2003.

 

Top-k Query over P2P

W. Balke, W. Nejdl, W. Siberski, and U. Thaden. Progressive distributed top-k retrieval in peer-to-peer networks. In Proc. Int. Conf. on Data Engineering, pages 174-185, 2005.

P. Cao and Z. Wang. Efficient top-k query calculation in distributed networks. In Proc. ACM SIGACT-SIGOPS Symp. on Principles of Dist. Comp., pages 206-215, 2004.

S. Michel, P. Triantafillou, and G. Weikum. Klee: A framework for distributed top-k query algorithms. Technical report, Max-Planck Institute for Computer Science, Germany, and University of Patras, Greece, 2005.

W. Nejdl, W. Siberski, U. Thaden, and W. Balke. Top-k query evaluation for schema-based peer-to-peer networks. In International Semantic Web Conference, pages 137-151, 2004.

 

Multidimensional Query over P2P

P. Ganesan, B. Yang, and H. Garcia-Molina. One torus to rule them all: Multidimensional queries in p2p systems. In Proc. 7th Int. Workshop on the World Wide Web and Databases (WebDB), pages 19-24, 2004.

 

Similarity Search over P2P

O. D. Sahin, F. Emekci, D. Agrawal and A. E. Abbadi. Content-based similarity search over peer-to-peer systems. In Databases, Information Systems, and Peer-to-Peer Computing, Second International Workshop, 2004.

 

Relational Database over P2P

W. Fontijn and P. A. Boncz. Ambientdb: P2p data management middleware for ambient intelligence. In Workshop on Middleware Support for Pervasive Computing (PerWare), 2004.

E. Franconi, G. M. Kuper, A. Lopatenko, and I. Zaihrayeu.The coDB Robust Peer-to-Peer Database System. In the Twelfth Italian Symposium on Advanced Database Systems, 2004

M. Harren, J. M. Hellerstein, R. Huebsch, B. T. Loo, S. Shenker, and I. Stoica. Complex queries in DHT-based peer-to-peer networks. In Peer-to-Peer Systems, First International Workshop, IPTPS, 2002.

R. Huebsch, B. N. Chun, J. M. Hellerstein, B. T. Loo, P. Maniatis, T. Roscoe, S. Shenker, I. Stoica, and A. R. Yumerefendi. The architecture of PIER: an Internet-scale query processor. In CIDR 2005, Second Biennial Conference on Innovative Data Systems Research, pages 28-43, 2005.

B. T. Loo, J. M. Hellerstein, R. Huebsch, S. Shenker, and I. Stoica. Enhancing P2P file-sharing with an Internet-scale query processor. In Proc. 30th Int. Conf. on Very Large Data Bases, pages 432-443, 2004.

W. S. Ng, B. C. Ooi, K. Tan, and A. Zhou. Peerdb: A P2P-based system for distributed data sharing. In Proc. 19th Int. Conf. on Data Engineering, pages 633-644, 2003.

K. Sattler, P. Rosch, E. Buchmann, and K. Bohm. A physical query algebra for DHT-based P2P systems. In the 6th Workshop on Distributed Data and Structures (WDAS 2004), 2004.

K. Sattler, P. Roumlsch, C. Wet, and E. Buchmann. Best effort query processing in DHT-based P2P systems. In 1st IEEE International Workshop on Networking Meets Databases (NetDB), 2005.

I. Tatarinov and A. Halevy. Efficient query reformulation in peer data management systems. In Proc. ACM SIGMOD Int. Conf. on Management of Data, 2004.

P. Triantafillou and T. Pitoura. Towards a unifying framework for complex query processing over structured peer-to-peer data networks. In Databases, Information Systems, and Peer-to-Peer Computing, First International Workshop, pages 169-183, 2003.

 

Load Balancing over P2P

P. Ganesan, M. Bawa, and H. Garcia-Molina. Online balancing of range-partitioned data with applications to peer-to-peer systems. In Proc. 30th Int. Conf. on Very Large Data Bases, pages 444-455, 2004.

D. R. Karger and M. Ruhl. Simple efficient load balancing algorithms for peer-to-peer systems. In Peer-to-Peer Systems, First International Workshop, pages 131-140, 2004.

A. Rao, K. Lakshminarayanan, S. Surana, R. M. Karp, and I. Stoica. Load balancing in structured P2P systems. pages 68-79, 2003.

 

Multicast & Gossip & Flooding over P2P

M. Castro, M. B. Jones, A. Kermarrec, A. I. T. Rowstron, M. Theimer, H. J. Wang, and A. Wolman. An evaluation of scalable application-level multicast built using peer-to-peer overlays. In The 22st Annual Joint Conference of the IEEE Computer and Communications Societies, pages 14-29, 2003.

C. Gkantsidis, M. Mihail, and A. Saberi. Random walks in peer-to-peer networks. In The 23st Annual Joint Conference of the IEEE Computer and Communications Societies, 2004.

M. Jelasity, R. Guerraoui, A. Kermarrec, and M. Steen. The peer sampling service: Experimental evaluation of unstructured gossip-based implementations. In Middleware 2004, IFIP/ACM International Conference on Distributed Systems Platforms, pages 79-98, 2004.

S. Ratnasamy, M. Handley, R. M. Karp, and S. Shenker. Application-level multicast using content addressable networks. In Networked Group Communication, pages 14-29, 2001.

 

Schema Mapping over P2P

M. Arenas, V. Kantere, A. Kementsietsidis, I. Kiringa, R. J. Miller, and J. Mylopoulos. The Hyperion project: from data integration to data coordination. ACM SIGMOD Record, 32(3):53-58, 2003.

A. Halevy, Z. Ives, P. Mork, and I. Tatarinov. Piazza: data management infrastructure for semantic web applications. In Proc. 12th Int. World Wide Web Conference, 2003.

 

Applications & Systems

M. Balazinska, H. Balakrishnan, and D. R. Karger. Ins/twine: A scalable peer-to-peer architecture for intentional resource discovery. In Pervasive Computing, First International Conference, Pervasive 2002.

Frank Dabek. A cooperative file system. Master's thesis, Massachusetts Institute of Technology, 2001.

P. Druschel and A. Rowstron. Past: A large-scale, persistent peer-to-peer storage utility. In HotOS VIII, 2001.

P. Ganesan, Q. Sun, and H. Garcia-Molina. Adlib: A self-tuning index for dynamic p2p systems. In Proc. Int. Conf. on Data Engineering, pages 256-257, 2005.

F. B. Kashani, C. Chen, and C. Shahabi. Wspds: Web services peer-to-peer discovery service. In International Symposium on Web Services and Applications 2004, 2004.

J. Kubiatowicz, D. Bindel, Y. Chen, S. E. Czerwinski, P. R. Eaton, D. Geels, R. Gummadi, S. C. Rhea, H. Weatherspoon, W. Weimer, C. Wells, and B. Y. Zhao. Oceanstore: An architecture for global-scale persistent storage. In Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems, 2000.

A. Rowstron, A. Kermarrec, M. Castro, and P. Druschel. Scribe: The design of a large-scale event notification infrastructure. In Networked Group Communication, Third International COST264 Workshop,NGC 2001, 2001.

H. Zhang, A. Goel, and R. Govindan. Using the small-world model to improve freenet performance. In The 21st Annual Joint Conference of the IEEE Computer and Communications Societies, pages 431-438, 2001.

 

Measurement & Simulators

H. Hsiao and C. King. Modeling and evaluating peer-to-peer storage architectures. In 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002.

J. Harris and D. Deugo. Towards a peer-to-peer simulator. In International Conference on Internet Computing 2004, pages 276-284, 2004.

Q. He, M. Ammar, G. Riley, H. Raj, and R. Fujimoto. Mapping peer behavior to packet-level details: A framework for packet-level simulation of peer-to-peer systems. In 11th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, 2003.

S. Saroiu, P. K. Gummadi, and S. D. Gribble. A measurement study of peer-to-peer file sharing systems. In Proceedings of Multimedia Computing and Networking, 2002.

M. Schlosser, T. Condie, and S. Kamvar. Simulating a file-sharing P2P network. In 1st Workshop on Semantics in Grid and P2P Networks, 2003.

 

Correctness & Availability

P. Linga, A. Crainiceanu, J. Gehrke, and J. Shanmugasundaram. Guaranteeing correctness and availability in P2P range indices. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 323-334, 2005.

 

Extract from Wang Qiang’s Collections, Thanks John!