Abstract
Interest in XML databases has grown rapidly over the last few years. In this paper, we study the problem of incorporating probabilistic information into XML databases. We propose the Probabilistic Interval XML (PIXML for short) data model, which lets users express probabilistic information within XML markup. We also provide two alternative formal model-theoretic semantics for PIXML data. The first is a “global” semantics that is relatively intuitive but not directly amenable to computation. The second is a “local” semantics that supports efficient computation. We prove several correspondence results between the two semantics. To our knowledge, this is the first formal model-theoretic semantics for probabilistic interval XML. We then provide an operational semantics that can be used to compute answers to queries and that is correct for a large class of probabilistic instances.
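The abstract does not spell out the model's definitions, so the following is only a minimal sketch of what attaching interval probabilities to alternatives can look like. It assumes a simplified setting that is not taken from the paper: a node has a set of mutually exclusive, exhaustive alternatives, each annotated with a probability interval. The names `Interval` and `admits_distribution` are hypothetical. The check relies on a standard fact about interval probabilities: at least one point distribution fits the intervals exactly when the lower bounds sum to at most 1 and the upper bounds sum to at least 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """A closed probability interval [lower, upper] (hypothetical helper type)."""
    lower: float
    upper: float

    def is_well_formed(self) -> bool:
        # A probability interval must satisfy 0 <= lower <= upper <= 1.
        return 0.0 <= self.lower <= self.upper <= 1.0


def admits_distribution(alternatives: dict[str, Interval], eps: float = 1e-9) -> bool:
    """Return True if some probability distribution over the mutually exclusive,
    exhaustive alternatives places each probability inside its interval.

    Assumption (not from the paper): alternatives are exclusive and exhaustive.
    """
    if not alternatives:
        return False
    if not all(iv.is_well_formed() for iv in alternatives.values()):
        return False
    total_lower = sum(iv.lower for iv in alternatives.values())
    total_upper = sum(iv.upper for iv in alternatives.values())
    # Feasible exactly when 1 lies between the two sums.
    return total_lower <= 1.0 + eps and total_upper >= 1.0 - eps


# Illustrative data only: three alternatives for one uncertain element.
consistent = {
    "author=Smith": Interval(0.2, 0.5),
    "author=Jones": Interval(0.3, 0.6),
    "author=unknown": Interval(0.0, 0.2),
}
inconsistent = {
    "author=Smith": Interval(0.6, 0.7),
    "author=Jones": Interval(0.5, 0.8),
}

print(admits_distribution(consistent))
print(admits_distribution(inconsistent))
```

In the first data set the lower bounds sum to 0.5 and the upper bounds to 1.3, so a fitting distribution exists. In the second the lower bounds already sum to 1.1, so none does.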