A GENERIC FRAMEWORK FOR EXTRACTING XML DATA FROM LEGACY DATABASES

Authors

  • PHILIPPE THIRAN Technische Universiteit Eindhoven, The Netherlands University of Namur, Belgium
  • FABRICE ESTIEVENART CETIC, Gosselies, Belgium
  • JEAN-LUC HAINAUT University of Namur, Belgium
  • GEERT-JAN HOUBEN Technische Universiteit Eindhoven, The Netherlands

Keywords:

XML, Legacy Database, Schema Transformation, Data Reverse Engineering

Abstract

This paper describes a generic framework of which semantics-based XML data can be derived from legacy databases. It consists in first recovering the conceptual schema of the database through reverse engineering techniques, and then in converting this schema, or part of it, into XML-compliant data structures. Both steps heavily rely on generic schema transformation techniques, while all the schemas involved in the whole process are expressed in a unique model, named GER. Transformations between schemas are expressed as sequences of reversible transformation primitives. The reversed sequence can be used to automatically translate query, data or updates posed on XML.

 

Downloads

Download data is not yet available.

References

D. Florescu and D.Kossmann (1999), Storing and querying XML data using an RDBMS, IEEE Data

Engineering Bulletin, Vol. 22, pp. 27-34.

M.J. Carey, D. Florescu, Z.G. Ives, Y. Lu, J. Shanmugasundaram, E.J. Shekita and S.N. Subramanian (2000),

XPERANTO: Publishing object-relational data as XML, Proc. of WebDB, pp. 105-110.

M. Fernandez, Y. Kadiyska, D. Suciu, A. Morishima and W.-C. Tan (2002), SilkRoute: A framework for

publishing relational data in XML, ACM Transactions on Database Systems (TODS), Vol. 27.

D. Lee, M. Mani, F. Chiu and W. W. Chu (2002), NeT and CoT: Translating Relational Schemas to XML

Schemas using Semantic Constraints, Proc. of the ACM International Conference on Information and

Knowledge Management.

P. Rodriguez-Gianolli and J. Mylopoulos (2001), A Semantic Approach to XML-based Data Integration, in

Proc. of the ER Conference.

F. Yergeau, J. Cowan, T. Bray, J. Paoli, C. M. Sperberg-McQueen and E. Maler (2004), XML 1.1, W3C

Recommendation, Technical Report, W3C.

J.-L. Hainaut (2005), Transformation-based Database Engineering, Chapter in Transformation of Knowledge,

Information and Data: Theory and Applications, P. van Bommel Editor, IDEA Group.

M.R. Jansen, Thomas H. Moller and T.B. Pedersen (2003), Converting XML DTDs to UML Diagrams for

Conceptual Data Integration, Data and Knowledge Engineering, Vol. 44, pp. 323-346.

R. Conrad, D. Scheffener and J.C. Freytag (2000), XML Conceptual Modeling Using UML, Proc. of ER

Conference, pp. 558-571.

W3C Working Group (2001), XML Schema, W3C Recommendation, Technical Report, W3C.

F. Estiévenart (2004), XML Specific Predicates and Transformations, Technical Paper, CETIC Research

Center, Gosselies.

J.-L. Hainaut (2002), Introduction to Database Reverse Engineering, Technical Report, CS Department,

University of Namur, http://www.info.fundp.ac.be/~dbm/publication/2002/DBRE-2002.pdf (last consult. Aug.

.

J-L. Hainaut, M. Chandelon, C. Tonneau and M. Joris (1993), Contribution to a Theory of Database Reverse

Engineering, in Proc. of the IEEE WCRE Conference, pp. 161-170.

V. Englebert (2002), Voyager 2 Manual, DB-MAIN Series, Institut d’Informatique, University of Namur,

http://www.db-main.be

J.-M. Hick, V. Englebert, J. Henrard, D. Roland and J.-L. Hainaut (2004), The DB-MAIN Database

Engineering CASE Tool (version 7) - Functions Overview, Technical Report, University of Namur,

http://www.db-main.be.

Oracle Corporation (2004), Oracle XML SQL Utility, Oracle Corporation, http://www.oracle.com (last

consult. Feb 2005).

J. Cheng and J. Xu (2000), IBM DB2 XML Extender, Proc. of ICDE.

Ph. Thiran, F. Estiévenart, J-L. Hainaut and G-J. Houben (2004), Exporting Databases in XML - A Conceptual

and Generic Approach, in Proc. of WISM (CAiSE'04).

P. McBrien and A. Poulovassilis (1998), A General Formal Framework for Schema Transformation, Data &

Knowledge Engineering, 28(1), 47-71.

A. Rosenthal and D. Reiner (1988), Theoretically sound transformations for practical database design, Proc.

of Entity-Relationship Approach.

O. Rauh and E. Stickel (1995), Standard Transformations for the Normalization of ER Schemata, Proc. of the

CAiSE•95 Conf., Jyväskylä, Finland, LNCS, Springer-Verlag.

H.A. Proper and T.A. Halpin (1998), Database Schema Transformation & Optimization, Proc. of the 14 th

International Conference on Conceptual Modeling, LNCS, 1021, 191-203, Springer.

J-L. Hainaut, M. Chandelon, C. Tonneau and M. Joris (1993b), Transformational techniques for database

reverse engineering, Proc. of the 12th Int. Conf. on ER Approach, Arlington-Dallas, ER Institute.

L. Dongwon and W. C. Wesley (2000), Comparative Analysis of Six XML Schema Languages,

http://www.cobase.cs.ucla.edu/tech-docs/dongwon/ucla-200008.html.

Downloads

Published

2005-04-27

How to Cite

THIRAN, P., ESTIEVENART, F. ., HAINAUT, J.-L. ., & HOUBEN, G.-J. . (2005). A GENERIC FRAMEWORK FOR EXTRACTING XML DATA FROM LEGACY DATABASES. Journal of Web Engineering, 4(3), 205–223. Retrieved from https://journals.riverpublishers.com/index.php/JWE/article/view/4277

Issue

Section

Articles