全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Metadata for Approximate Query Answering Systems

DOI: 10.1155/2012/247592

Full-Text   Cite this paper   Add to My Lib

Abstract:

In business intelligence systems, data warehouse metadata management and representation are getting more and more attention by vendors and designers. The standard language for the data warehouse metadata representation is the Common Warehouse Metamodel. However, business intelligence systems include also approximate query answering systems, since these software tools provide fast responses for decision making on the basis of approximate query processing. Currently, the standard meta-model does not allow to represent the metadata needed by approximate query answering systems. In this paper, we propose an extension of the standard metamodel, in order to define the metadata to be used in online approximate analytical processing. These metadata have been successfully adopted in ADAP, a web-based approximate query answering system that creates and uses statistical data profiles. 1. Introduction The concept of metadata—born in the transactional systems literature—has gained the greatest attention in data warehousing environments, where two classes of metadata are distinguishable: back-room and front-room metadata. The first class aims to describe, among other things, the several heterogeneous data sources, the data transformation that must be performed to feed the data warehouse, and the refresh status of the stored data. The front-room metadata aim to represent the conceptual data model of the data warehouse and they are commonly used by the so-called business intelligence platforms in order to automatically generate the analytical queries. According to [1], the business intelligence can be represented as an information supply chain, to highlight that the information comes from a set of data sources and goes through transformation processes, in order to become useful for decision making. However, this chain is composed of different steps, each of them being both producer and consumer of data and metadata. It is unlikely that a single vendor provides a complete system able to covers all the steps of the chain. For this reason, each step is usually managed by specific software tools, produced by different vendors. Of course, their integration is quite difficult, since each tool aims to improve its own effectiveness and efficiency. It follows that the metadata are commonly stored according to a proprietary format. In order to overcome this limit, a central management of metadata is always a benefit for enterprises that are divided into departmental areas not directly linked among them. In this case, the most effective choice consists of using a central metadata

References

[1]  J. Poole, D. Chang, D. Tolbert, and D. Mellor, Common Warehouse Metamodel, John Wiley & Sons, 2002.
[2]  A. Sen, “Metadata management: past, present and future,” Decision Support Systems, vol. 37, no. 1, pp. 151–173, 2004.
[3]  Object Management Group, “Common Warehouse Metamodel Specification,” vers. 1.1, vol. 1, OMG, Needham, MA, USA, 2003, http://www.omg.org/docs/formal/03-03-02.pdf.
[4]  Object Management Group, “XML Metadata Interchange (XMI) Specification,” vers. 2.0, OMG, Needham, MA USA, 2003, http://www.omg.org/docs/formal/03-05-02.pdf.
[5]  S. Chaudhuri, U. Dayal, and V. Ganti, “Database technology for decision support systems,” Computer, vol. 34, no. 12, pp. 48–55, 2001.
[6]  F. Di Tria, E. Lefons, and F. Tangorra, “Metrics for approximate query engine evaluation,” in Proceedings of the 27th ACM Symposium on Applied Computing (ACM SAC ’12), pp. 885–887, Riva del Garda, Italy, 2012.
[7]  S. Acharya, P. B. Gibbons, V. Poosala, and S. Ramaswamy, “The AQUA approximate query answering system,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 574–576, Philadelphia, PA, USA, 1999.
[8]  A. Gupta, V. Harinarayan, and D. Quaas, “Aggregate-query processing in data warehousing environments,” in Proceedings of the 21th International Conference on Very Large Data Bases (VLDB ’95), pp. 358–369, Zurich, Switzerland, 1995.
[9]  K. Chakrabarti, M. Garofalakis, R. Rastogi, and K. Shim, “Approximate query processing using wavelets,” The International Journal on Very Large Data Bases, vol. 10, no. 2-3, pp. 199–223, 2001.
[10]  Y. Ioannidis and V. Poosala, “Histogram-based approximation of set-valued query answers,” in Proceedings of the 25th International Conference on Very Large Data Bases (VLDB '99), pp. 174–185, Edinburgh, Scotland, 1999.
[11]  P. B. Gibbons and Y. Matias, “New sampling-based summary statistics for improving approximate query answers,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 331–342, Seattle, Wash, USA, 1998.
[12]  E. Lefons, A. Merico, and F. Tangorra, “Analytical profile estimation in database systems,” Information Systems, vol. 20, no. 1, pp. 1–20, 1995.
[13]  J. Spiegel and N. Polyzotis, “TuG synopses for approximate query answering,” ACM Transactions on Database Systems, vol. 34, no. 1, article 3, 2009.
[14]  J. B. Peltzer, A. M. Teredesai, and G. Reinard, “AQUAGP: approximate query answers using genetic programming,” in Proceedings of the 9th European Conference (EuroGP '06), pp. 49–60, Budapest, Hungary, April 2006.
[15]  C. Jermaine, S. Arumugam, A. Pol, and A. Dobra, “Scalable approximate query processing with the DBO engine,” ACM Transactions on Database Systems, vol. 33, no. 4, article 23, 2008.
[16]  Object Management Group, “XML MetaObject Facility Specification,” vers. 1.4, OMG, Needham, MA USA, 2002, http://www.omg.org/docs/formal/02-04-03.pdf.
[17]  C. dell’Aquila, F. Di Tria, E. Lefons, and F. Tangorra, “Data reduction for data analysis,” in New Aspects on Computing Research, C. Cepisca, G. A. Kouzaev, and N. E. Mastorakis, Eds., pp. 204–210, 2008.
[18]  F. Di Tria, E. Lefons, and F. Tangorra, “Hybrid methodology for data warehouse conceptual design by UML schemas,” Information and Software Technology, vol. 54, no. 4, pp. 360–379, 2012.
[19]  F. Di Tria, E. Lefons, F. Tangorra, and M. M. GrHy, “GrHyMM: a graph-oriented hybrid multidimensional model,” in Proceedings of the Advances in Conceptual Modeling. Recent Developments and New Directions, pp. 86–97, Springer, Brussels, Belgium, 2011.
[20]  D. Marco, Building and Managing the Meta Data Repository, Wiley, 2000.
[21]  A. Tannenbaum, Metadata Solutions, Addison-Wesley, 2002.
[22]  E. Bertino and L. Martino, “Object-oriented database management systems: concepts and issues,” Computer, vol. 24, no. 4, pp. 33–47, 1991.
[23]  K. Schlegel and B. Sood, Business Intelligence Platform Capability Matrix, Gartner Research, 2007.
[24]  F. Yan, W. C. Hou, Z. Jiang, C. Luo, and Q. Zhu, “Selectivity estimation of range queries based on data density approximation via cosine series,” Data & Knowledge Engineering, vol. 63, no. 3, pp. 855–878, 2007.
[25]  T. Palpanas, P. Chowdhary, F. Pinel, and G. Mihaila, “Integrated model-driven dashboard development,” Information Systems Frontiers, vol. 9, no. 2-3, pp. 195–208, 2007.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413