Journal of Applied Science and Engineering

Published by Tamkang University Press

Ching-Ming Chao1, Po-Zung Chen2 and Shih-Yang Yang2

1Department of Computer and Information Science, Soochow University, Taipei, Taiwan 100, R.O.C.
2Department of Computer Science and Information Engineering, Tamkang University, Tamsui, Taiwan 251, R.O.C.


 

Received: January 3, 2005
Accepted: June 29, 2005
Publication Date: December 1, 2005

DOI: https://doi.org/10.6180/jase.2005.8.4.06


ABSTRACT


The World Wide Web contains a huge and growing volume of information. A web warehouse is an efficient and effective means of making this information usable, both for individual users and for business organizations, especially for decision-making purposes. Meanwhile, XML has recently become the new standard for representing and exchanging data on the Web. In this paper, we therefore study the XML web warehouse and propose an approach to change detection and warehouse maintenance for it. The paper makes three major contributions. First, we propose an object-oriented data model for XML web pages in the web warehouse, together with a system architecture for change detection and warehouse maintenance. Second, we propose a change detection method based on mobile agent technology that actively detects changes in the data sources of the web warehouse. Third, we propose an incremental and deferred maintenance method for the XML web pages in the web warehouse. We compare our approach experimentally with a rewriting approach to storage and maintenance of the XML web warehouse. The performance evaluation shows that our approach is more efficient than the rewriting approach in terms of both the response time and the storage space of the web warehouse.


Keywords: Web Warehouse, XML, Change Detection, Warehouse Maintenance, Mobile Agent

