This third edition of a classic textbook is suitable for teaching at the senior undergraduate and graduate levels. The material concentrates on fundamental theories as well as techniques and algorithms. The advent of the Internet and the World Wide Web and, more recently, the emergence of cloud computing and streaming data applications have forced a renewal of interest in distributed and parallel data management while, at the same time, requiring a rethinking of some of the traditional techniques. This book covers the breadth and depth of this re-emerging field.

The coverage consists of two parts. The first part discusses the fundamental principles of distributed data management and includes distribution design, data integration, distributed query processing and optimization, distributed transaction management, and replication. The second part focuses on more advanced topics and includes discussion of parallel database systems, distributed object management, peer-to-peer data management, web data management, data stream systems, and cloud computing.

New in this Edition:
- New chapters covering database replication, database integration, multidatabase query processing, peer-to-peer data management, and web data management
- Coverage of emerging topics such as data streams and cloud computing
- Extensive revisions and updates based on years of class testing and feedback

Ancillary teaching materials are available.
Cited By
- Teng D, Liang Y, Vo H, Kong J and Wang F (2022). Efficient 3D Spatial Queries for Complex Objects, ACM Transactions on Spatial Algorithms and Systems, 8:2, (1-26), Online publication date: 30-Jun-2022.
- Hellings J and Sadoghi M (2021). ByShard, Proceedings of the VLDB Endowment, 14:11, (2230-2243), Online publication date: 1-Jul-2021.
- Ruan P, Dinh T, Loghin D, Zhang M, Chen G, Lin Q and Ooi B Blockchains vs. Distributed Databases Proceedings of the 2021 International Conference on Management of Data, (1504-1517)
- Nguyen T, Bobineau C, Debusschere V, Giap Q and Hadjsaid N (2020). CPDE: A Methodology for the Transparent Distribution of Centralized Smart Grid Programs, IEEE Transactions on Parallel and Distributed Systems, 32:2, (342-354), Online publication date: 1-Feb-2021.
- Mehenni T Multi-database Mining Proceedings of the 1st International Conference on Intelligent Systems and Pattern Recognition, (39-44)
- Ben-Eliezer O and Yogev E The Adversarial Robustness of Sampling Proceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, (49-62)
- Ezéchiel K, Ojha S and Agarwal R (2020). A New Eager Replication Approach Using a Non-Blocking Protocol Over a Decentralized P2P Architecture, International Journal of Distributed Systems and Technologies, 11:2, (69-100), Online publication date: 1-Apr-2020.
- El-Hindi M, Binnig C, Arasu A, Kossmann D and Ramamurthy R (2019). BlockchainDB, Proceedings of the VLDB Endowment, 12:11, (1597-1609), Online publication date: 1-Jul-2019.
- Alotaibi R, Bursztyn D, Deutsch A, Manolescu I and Zampetakis S Towards Scalable Hybrid Stores Proceedings of the 2019 International Conference on Management of Data, (1660-1677)
- Pacaci A and Özsu M Experimental Analysis of Streaming Algorithms for Graph Partitioning Proceedings of the 2019 International Conference on Management of Data, (1375-1392)
- Michailidou A and Gounaris A A fast solution for bi-objective traffic minimization in geo-distributed data flows Proceedings of the 23rd International Database Applications & Engineering Symposium, (1-10)
- Chen W, Hsiu P and Kuo T Enabling Failure-resilient Intermittently-powered Systems Without Runtime Checkpointing Proceedings of the 56th Annual Design Automation Conference 2019, (1-6)
- Gao Y The First International Workshop on Knowledge Graph Technology and Applications Companion Proceedings of The 2019 World Wide Web Conference, (675-675)
- Vrandecic D and Leskovec J The First International Workshop on Knowledge Graph Technology and Applications Companion Proceedings of The 2019 World Wide Web Conference, (674-674)
- Domingue J, Third A and Ramachandran M The FAIR TRADE Framework for Assessing Decentralised Data Solutions Companion Proceedings of The 2019 World Wide Web Conference, (866-882)
- Acosta M, Berners-Lee T, Dietze S, Dimou A, Domingue J, Ibánez L, Janowicz K, Vidal M and Zaveri A Linked Data on the Web and its Relationship with Distributed Ledgers (LDOW/LDDL) Companion Proceedings of The 2019 World Wide Web Conference, (823-839)
- Cheng F and Yang Z (2019). FastMFDs, The Journal of Supercomputing, 75:5, (2497-2517), Online publication date: 1-May-2019.
- Archer A, Aydin K, Bateni M, Mirrokni V, Schild A, Yang R and Zhuang R (2019). Cache-aware load balancing of data center applications, Proceedings of the VLDB Endowment, 12:6, (709-723), Online publication date: 1-Feb-2019.
- Sebaa A, Nouicer A and Tari A (2019). Impact of technology evolution on the materialised views, International Journal of Business Information Systems, 30:4, (427-462), Online publication date: 1-Jan-2019.
- Honti G, Abonyi J and Natella R (2019). A Review of Semantic Sensor Technologies in Internet of Things Architectures, Complexity, 2019, Online publication date: 1-Jan-2019.
- Tos U, Mokadem R, Hameurlain A, Ayav T and Bora S (2018). Ensuring performance and provider profit through data replication in cloud systems, Cluster Computing, 21:3, (1479-1492), Online publication date: 1-Sep-2018.
- Tabet K, Mokadem R and Laouar M Towards a New Data Replication Strategy in MongoDB Systems Proceedings of the 4th ACM International Conference of Computing for Engineering and Sciences, (1-7)
- Leclercq É and Savonnet M A Tensor Based Data Model for Polystore Proceedings of the 22nd International Database Engineering & Applications Symposium, (110-118)
- Belyaev K, Sun W, Ray I and Ray I (2018). On the design and analysis of protocols for Personal Health Record storage on Personal Data Server devices, Future Generation Computer Systems, 80:C, (467-482), Online publication date: 1-Mar-2018.
- Mirrokni V, Thorup M and Zadimoghaddam M Consistent hashing with bounded loads Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, (587-604)
- Thalij S, Hakkoymaz V and Vitiello A (2018). Multiobjective Glowworm Swarm Optimization-Based Dynamic Replication Algorithm for Real-Time Distributed Databases, Scientific Programming, 2018, Online publication date: 1-Jan-2018.
- Hassen F and Amel G (2017). An efficient synchronous indexing technique for full-text retrieval in distributed databases, Procedia Computer Science, 112:C, (811-821), Online publication date: 1-Sep-2017.
- Vathy-Fogarassy Á and Hugyák T (2017). Uniform data access platform for SQL and NoSQL database systems, Information Systems, 69:C, (93-105), Online publication date: 1-Sep-2017.
- Kosaka Y, Murakami S, Laurent T, Goto K and Toyama M RTA Proceedings of the 21st International Database Engineering & Applications Symposium, (135-142)
- Oguz D, Yin S, Ergenç B, Hameurlain A and Dikenelli O (2017). Extended Adaptive Join Operator with Bind-Bloom Join for Federated SPARQL Queries, International Journal of Data Warehousing and Mining, 13:3, (47-72), Online publication date: 1-Jul-2017.
- Rabl T and Jacobsen H Query Centric Partitioning and Allocation for Partially Replicated Database Systems Proceedings of the 2017 ACM International Conference on Management of Data, (315-330)
- Yadav M and Kumar T (2017). Distributed Query Plan Generation using Cuckoo Search Algorithm, International Journal of Energy Optimization and Engineering, 6:1, (86-100), Online publication date: 1-Jan-2017.
- Harding R, Van Aken D, Pavlo A and Stonebraker M (2017). An evaluation of distributed concurrency control, Proceedings of the VLDB Endowment, 10:5, (553-564), Online publication date: 1-Jan-2017.
- (2017). Decomposing federated queries in presence of replicated fragments, Web Semantics: Science, Services and Agents on the World Wide Web, 42:C, (1-18), Online publication date: 1-Jan-2017.
- Tian Y, Özcan F, Zou T, Goncalves R and Pirahesh H (2016). Building a Hybrid Warehouse, ACM Transactions on Database Systems, 41:4, (1-38), Online publication date: 23-Dec-2016.
- Kolev B, Valduriez P, Bondiombouy C, Jiménez-Peris R, Pau R and Pereira J (2016). CloudMdsQL, Distributed and Parallel Databases, 34:4, (463-503), Online publication date: 1-Dec-2016.
- Psaroudakis I, Scheuer T, May N, Sellami A and Ailamaki A (2016). Adaptive NUMA-aware data placement and task scheduling for analytical workloads in main-memory column-stores, Proceedings of the VLDB Endowment, 10:2, (37-48), Online publication date: 1-Oct-2016.
- Liu J, Pacitti E, Valduriez P, de Oliveira D and Mattoso M (2016). Multi-objective scheduling of Scientific Workflows in multisite clouds, Future Generation Computer Systems, 63:C, (76-95), Online publication date: 1-Oct-2016.
- Carniani E, D'Arenzo D, Lazouski A, Martinelli F and Mori P (2016). Usage Control on Cloud systems, Future Generation Computer Systems, 63:C, (37-55), Online publication date: 1-Oct-2016.
- Kolev B, Bondiombouy C, Valduriez P, Jimenez-Peris R, Pau R and Pereira J The CloudMdsQL Multistore System Proceedings of the 2016 International Conference on Management of Data, (2113-2116)
- Neven F Logical Aspects of Massively Parallel and Distributed Systems Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, (303-314)
- Ameloot T, Geck G, Ketsman B, Neven F and Schwentick T (2016). Data partitioning for single-round multi-join evaluation in massively parallel systems, ACM SIGMOD Record, 45:1, (33-40), Online publication date: 2-Jun-2016.
- Schätzle A, Przyjaciel-Zablocki M, Skilevic S and Lausen G (2016). S2RDF, Proceedings of the VLDB Endowment, 9:10, (804-815), Online publication date: 1-Jun-2016.
- Bondiombouy C, Kolev B, Levchenko O and Valduriez P Multistore Big Data Integration with CloudMdsQL Transactions on Large-Scale Data- and Knowledge-Centered Systems XXVIII - Volume 9940, (48-74)
- Du N, Ye X and Wang J (2016). A semantic-aware data generator for ETL workflows, Concurrency and Computation: Practice & Experience, 28:4, (1016-1040), Online publication date: 25-Mar-2016.
- Verborgh R, Vander Sande M, Hartig O, Van Herwegen J, De Vocht L, De Meester B, Haesendonck G and Colpaert P (2016). Triple Pattern Fragments, Web Semantics: Science, Services and Agents on the World Wide Web, 37:C, (184-206), Online publication date: 1-Mar-2016.
- Mami I, Bellahsene Z and Coletta R A Constraint Optimization Method for Large-Scale Distributed View Selection Transactions on Large-Scale Data- and Knowledge-Centered Systems XXV - Volume 9620, (71-108)
- Sellami M, Hacid M and Gammoudi M Inference Control in Data Integration Systems Proceedings of the Confederated International Conferences on On the Move to Meaningful Internet Systems: OTM 2015 Conferences - Volume 9415, (285-302)
- Sioutas S, Sourla E, Tsichlas K and Zaroliagis C ART+ Revised Selected Papers of the First International Workshop on Algorithmic Aspects of Cloud Computing - Volume 9511, (126-137)
- Bondiombouy C, Kolev B, Levchenko O and Valduriez P Integrating Big Data and Relational Data with a Functional SQL-like Query Language Proceedings, Part I, of the 26th International Conference on Database and Expert Systems Applications - Volume 9261, (170-185)
- Pirk H (2015). ...like Commanding an Anthill, ACM SIGMOD Record, 44:2, (53-58), Online publication date: 12-Aug-2015.
- Salama A, Binnig C, Kraska T and Zamanian E Cost-based Fault-tolerance for Parallel Data Processing Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, (285-297)
- Zaychik Moffitt V, Stoyanovich J, Abiteboul S and Miklau G Collaborative Access Control in WebdamLog Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, (197-211)
- Bailis P, Fekete A, Franklin M, Ghodsi A, Hellerstein J and Stoica I Feral Concurrency Control Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, (1327-1342)
- Vulimiri A, Curino C, Godfrey P, Jungblut T, Karanasos K, Padhye J and Varghese G WANalytics Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, (1087-1092)
- Zamanian E, Binnig C and Salama A Locality-aware Partitioning in Parallel Database Systems Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, (17-30)
- Vulimiri A, Curino C, Godfrey P, Jungblut T, Padhye J and Varghese G Global analytics in the face of bandwidth and regulatory constraints Proceedings of the 12th USENIX Conference on Networked Systems Design and Implementation, (323-336)
- Dokeroglu T, Bayir M and Cosar A (2015). Robust heuristic algorithms for exploiting the common tasks of relational cloud database queries, Applied Soft Computing, 30:C, (72-82), Online publication date: 1-May-2015.
- Zeng Q, Zhao M, Liu P, Yadav P, Calo S and Lobo J (2015). Enforcement of Autonomous Authorizations in Collaborative Distributed Query Evaluation, IEEE Transactions on Knowledge and Data Engineering, 27:4, (979-992), Online publication date: 1-Apr-2015.
- Xu X, Xiong L and Liu J Database Fragmentation with Confidentiality Constraints Proceedings of the 5th ACM Conference on Data and Application Security and Privacy, (263-270)
- Du M, Versteeg S, Schneider J, Han J and Grundy J (2015). Interaction Traces Mining for Efficient System Responses Generation, ACM SIGSOFT Software Engineering Notes, 40:1, (1-8), Online publication date: 6-Feb-2015.
- Braganholo V and Mattoso M (2014). A Survey on XML Fragmentation, ACM SIGMOD Record, 43:3, (24-35), Online publication date: 4-Dec-2014.
- Wang W, Liu Z, Jiang Y, Yuan X and Wei J EasyCache: a transparent in-memory data caching approach for internetware Proceedings of the 6th Asia-Pacific Symposium on Internetware, (35-44)
- Bailis P, Fekete A, Franklin M, Ghodsi A, Hellerstein J and Stoica I (2014). Coordination avoidance in database systems, Proceedings of the VLDB Endowment, 8:3, (185-196), Online publication date: 1-Nov-2014.
- Anagnostopoulos V, Moulos V, Menychtas A, Varvarigou T and Gatzioura A Intelligent Clouds Proceedings of the 18th Panhellenic Conference on Informatics, (1-6)
- Karampaglis Z, Gounaris A and Manolopoulos Y A Bi-objective Cost Model for Database Queries in a Multi-cloud Environment Proceedings of the 6th International Conference on Management of Emergent Digital EcoSystems, (109-116)
- Liu J, Silva V, Pacitti E, Valduriez P and Mattoso M Scientific Workflow Partitioning in Multisite Cloud Revised Selected Papers, Part I, of the Euro-Par 2014 International Workshops on Parallel Processing - Volume 8805, (105-116)
- Amagasa T, Zhang F, Sakuma J and Kitagawa H A scheme for privacy-preserving ontology mapping Proceedings of the 18th International Database Engineering & Applications Symposium, (87-95)
- Binnig C, Salama A and Zamanian E DoomDB Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, (913-916)
- Liu Z and Hacigümüs H Online optimization and fair costing for dynamic data sharing in a cloud data market Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, (1359-1370)
- Galárraga L, Hose K and Schenkel R Partout Proceedings of the 23rd International Conference on World Wide Web, (267-268)
- Pazos R, Vázquez G, Martínez J, Pérez-Ortega J and Martínez-Luna G (2014). Minimizing roundtrip response time in distributed databases with vertical fragmentation, Journal of Computational and Applied Mathematics, 259, (905-913), Online publication date: 1-Mar-2014.
- Li F, Ooi B, Özsu M and Wu S (2014). Distributed data management using MapReduce, ACM Computing Surveys, 46:3, (1-42), Online publication date: 1-Jan-2014.
- Wiese L Taxonomy-Based Fragmentation for Anti-instantiation in Distributed Databases Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing, (363-368)
- Yahyavi A and Kemme B (2013). Peer-to-peer architectures for massively multiplayer online games, ACM Computing Surveys, 46:1, (1-51), Online publication date: 1-Oct-2013.
- Ibáñez L, Skaf-Molli H, Molli P and Corby O (2013). Live linked data, International Journal of Metadata, Semantics and Ontologies, 8:2, (119-133), Online publication date: 1-Sep-2013.
- Scherzinger S, De Almeida E, Ickert F and Del Fabro M On the necessity of model checking NoSQL database schemas when building SaaS applications Proceedings of the 2013 International Workshop on Testing the Cloud, (1-6)
- Feinerer I, Franconi E and Guagliardo P Lossless horizontal decomposition with domain constraints on interpreted attributes Proceedings of the 29th British National conference on Big Data, (77-91)
- Shi X, Lv Y, Shao Y and Cui B bCATE Proceedings of the 14th international conference on Web-Age Information Management, (769-780)
- Noordhof W and Hoffert J A taxonomy of protocol frameworks and gap analysis for adaptive publish/subscribe distributed realtime embedded systems Proceedings of the 51st ACM Southeast Conference, (1-6)
- Zaamout S and Osborn W A strategy for optimizing a multi-site query in a distributed spatial database Proceedings of the 12th international conference on Web and Wireless Geographical Information Systems, (16-24)
- Eltabakh M, Özcan F, Sismanis Y, Haas P, Pirahesh H and Vondrak J Eagle-eyed elephant Proceedings of the 16th International Conference on Extending Database Technology, (89-100)
- Wang Z, Luo T, Xu Y, Cheng F, Zhang X and Wang X A fast indexing algorithm optimization with user behavior pattern Proceedings of the 2012 international conference on Pervasive Computing and the Networked World, (592-605)
- Ramakrishnan S, Swart G and Urmanov A Balancing reducer skew in MapReduce workloads using progressive sampling Proceedings of the Third ACM Symposium on Cloud Computing, (1-14)
- Brinkley J and Detwiler L (2012). A Query Integrator and Manager for the Query Web, Journal of Biomedical Informatics, 45:5, (975-991), Online publication date: 1-Oct-2012.
- Mokhtar H and Adel N Transaction processing using thread-to-metadata Proceedings of the 16th International Database Engineering & Applications Symposium, (230-234)
- Sakka M and Defude B A mediator-based system for distributed semantic provenance management systems Proceedings of the 16th International Database Engineering & Applications Symposium, (193-198)
- Rodríguez L, Li X, Cervantes J and García-Lamont F DYMOND Proceedings of the 16th International Database Engineering & Applications Symposium, (71-80)
- Coletta R, Castanier E, Valduriez P, Frisch C, Ngo D and Bellahsene Z Public data integration with WebSmatch Proceedings of the First International Workshop on Open Data, (5-12)
- Chirigati F, Silva V, Ogasawara E, de Oliveira D, Dias J, Porto F, Valduriez P and Mattoso M Evaluating parameter sweep workflows in high performance computing Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, (1-10)
- Obaid A, Amja A, Mili H and Seguin N Query execution on a mobile database system Proceedings of the 27th Annual ACM Symposium on Applied Computing, (569-571)
- Usbeck K and Beal J An agent framework for agent societies Proceedings of the compilation of the co-located workshops on DSM'11, TMC'11, AGERE! 2011, AOOPES'11, NEAT'11, & VMIL'11, (201-212)
- Brut M, Laborie S, Manzat A and Sèdes F Generic information system architecture for distributed multimedia indexation and management Proceedings of the 15th international conference on Advances in databases and information systems, (347-360)
- Valduriez P Principles of distributed data management in 2020? Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I, (1-11)
- Boukraâ D, Boussaïd O and Bentayeb F Vertical fragmentation of XML data warehouses using frequent path sets Proceedings of the 13th international conference on Data warehousing and knowledge discovery, (196-207)
- Rodriguez L and Li X A vertical partitioning algorithm for distributed multimedia databases Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II, (544-558)
- Brut M, Codreanu D, Dumitrescu S, Manzat A and Sedes F A distributed architecture for flexible multimedia management and retrieval Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II, (249-263)
- Arenas M and Pérez J Querying semantic web data with SPARQL Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, (305-316)
- Bose B and Sane S DTCOT Proceedings of the International Conference and Workshop on Emerging Trends in Technology, (518-523)
Index Terms
- Principles of Distributed Database Systems