Abstract
One of the fundamental principles of the database approach is that a database allows a nonredundant, unified representation of all data managed in an organization. This is achieved only when methodologies are available to support integration across organizational and application boundaries.
Methodologies for database design usually perform the design activity by separately producing several schemas, representing parts of the application, which are subsequently merged. Database schema integration is the activity of integrating the schemas of existing or proposed databases into a global, unified schema.
The aim of the paper is to provide first a unifying framework for the problem of schema integration, then a comparative review of the work done thus far in this area. Such a framework, with the associated analysis of the existing approaches, provides a basis for identifying strengths and weaknesses of individual methodologies, as well as general guidelines for future improvements and extensions.