Hassani, Hossein and Gheitanchi, Shahin and Yeganegi, Mohammad Reza (2008): On the Application of Data Mining to Official Data. Forthcoming in: Journal of Data Science
Preview |
PDF
MPRA_paper_10070.pdf Download (168kB) | Preview |
Abstract
Retrieving valuable knowledge and statistical patterns from official data has a great potential in supporting strategic policy making. Data Mining (DM) techniques are well-known for providing flexible and efficient analytical tools for data processing. In this paper, we provide an introduction to applications of DM to official statistics and flag the important issues and challenges. Considering recent advancements in software projects for DM, we propose intelligent data control system design and specifications as an example of DM application in official data processing.
Item Type: | MPRA Paper |
---|---|
Original Title: | On the Application of Data Mining to Official Data |
Language: | English |
Keywords: | Data mining, Official data, Intelligent data control system |
Subjects: | C - Mathematical and Quantitative Methods > C6 - Mathematical Methods ; Programming Models ; Mathematical and Simulation Modeling C - Mathematical and Quantitative Methods > C0 - General |
Item ID: | 10070 |
Depositing User: | Hossein Hassani |
Date Deposited: | 17 Aug 2008 12:42 |
Last Modified: | 28 Sep 2019 00:19 |
References: | Adam, N. R. and Wortmann, J. C. (1998). Security-control methods for statistical databases: A comparative study, ACM Computing Surveys, 21(4), pp. 515--556. D'Angiolini, G. (2002). Developing a Metadata Infrastructure for Official data: the ISTAT experience.\\ http://www.di.uniba.it/~malerba/activities/mod02/pdfs/dangiolini.pdf Han, J. and Kamber, M. (2006). Data Mining Concepts and Techniques, Second Edition. Elsevier Inc. Hand, D. J., Mannila, H. and Smyth, P. (2001). Principles of Data Mining. MIT Press. Hand, D. J. (1998a). Data mining: statistics and more?. The American Statistician, 52, pp. 112--119. Hand, D. J. (1998b). Data mining-reaching beyond statistics, Research in Official Statistics, 2, pp. 5--17. Hassani, H. and Haeri Mehrizi, A. (2006a). Data Mining and official statistics, Journal of Statistical Centre of Iran , vol 67, No 4, pp. 21--34. Hassani, H. and Haeri Mehrizi, A. (2006b). Data Mining & Official Data, In Proceeding of the 8th Iranian International Statistics Conference, Shiraz, Iran, pp 61--68. Hassani, H. and Anari, M. (2005). Using Data Mining for Data Quality Improvement. In Proceedings of the 55th session International Statistical Institute (ISI), Sydney, Austeralia. Hipp, J. Güntzer, U. and Grimmer, U. (2001). Data Quality Mining - Making a Virtue of Necessity. Proceedings of the 6th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2001), Santa Barbara, California, pp. 52--57. Hipp, J. G¨untzer, U. and Nakhaeizadeh, G. (2000). Mining association rules: Deriving a superior algorithm by analysing today's approaches. In Proceedings of the 4th European Symposium on Principles of Data Mining and Knowledge Discovery (PKDD '00), pp 159--168, Lyon, France. Mackinnon, M. J. and Glick, N. (1999) Data Mining and Knowledge Discovery in Databases - An Overview, Australian & New Zealand Journal of Statistics , 41 (3), pp. 255-–275. Nanopoulos, Ph. and King, J. (2002). Important Issues on Statistical confidentiality. http://www.di.uniba.it/~malerba/activities/mod02/pdfs/nanopoulos.pdf Pujari, A. K. (2001). Data Mining Techniques, Universities Press, Hyderabad, India. Saporta, G. (2000). Data Mining and Official Statistics. Quinta Conferenza Nationale di Statistica, ISTAT, Roma. http://cedric.cnam.fr/PUBLIS/RC184.pdf Saporta, G. (1998). The Unexploited Mines of Academic and Official Statistics, in Academic and Official Statistics Co-operation, Eurostat, pp. 11--15. Shoshani A. (1982). Statistical databases: characteristics, problems, and some solutions. In Proc. of the international conference on very large data bases (VLDB), pp. 208–-222. Siebes, A. (1996). Data Mining: What it is and how it is done, SEBD, pp. 329--344. Wang, J. (2003). Data Mining: Opportunities and Challenges. Idea Group Publishing. Yeganegi, M. R. Hassani, H. and Haeri Mehrizi A. (2006). Artificial Intelligence and Its Application to Official Statistics , In Proceeding of the 8th Iranian International Statistics Conference, Shiraz, Iran, pp 120--132. |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/10070 |