Association rules mining is a major data mining field that leads to
discovery of associations and correlations
among items in today’s big data environment. The conventional
association rule mining focuses mainly on positive itemsets generated from
frequently occurring itemsets (PFIS). However, there has been a significant
study focused on infrequent itemsets with utilization of negative association
rules to mine interesting frequent itemsets (NFIS) from transactions. In this
work, we propose an efficient backward calculating negative frequent itemset
algorithm namely EBC-NFIS for computing backward supports that can extract both positive and negative
frequent itemsets synchronously from
dataset. EBC-NFIS algorithm is based on popular e-NFIS algorithm that computes supports of negative itemsets from the supports of
positive itemsets. The proposed algorithm makes use of previously computed
supports from memory to minimize the computation time. In addition, association
rules, i.e. positive and negative
association rules (PNARs) are generated from discovered frequent itemsets using
EBC-NFIS algorithm. The efficiency of the proposed algorithm is verified by
several experiments and comparing results with e-NFIS algorithm. The
experimental results confirm that the
proposed algorithm successfully discovers NFIS and PNARs and runs significantly faster than conventional e-NFIS algorithm.
References
[1]
Mahmood, S., Shahbaz, M. and Guergachi, A. (2014) Negative and Positive Association Rules Mining from Text Using Frequent and Infrequent Itemsets. The Scientific World Journal, 2014, Article ID: 973750. https://doi.org/10.1155/2014/973750
[2]
Çokpnar, S. and Gündem, T.I. (2012) Positive and Negative Association Rule Mining on XML Data Streams in Database as a Service Concept. Expert Systems with Applications, 39, 7503-7511. https://doi.org/10.1016/j.eswa.2012.01.128
[3]
Kadir, A.S.A., Bakar, A.A. and Hamdan, A.R. (2011) Frequent Absence and Presence Itemset for Negative Association Rule Mining. International Conference on Intelligent Systems Design and Applications, Cordoba, 22-24 November 2011, 965-970. https://doi.org/10.1109/ISDA.2011.6121783
[4]
Folasire, O., Akinyemi, O. and Owoaje, E. (2014) Perceived Social Support among HIV Positive and HIV Negative People in Ibadan, Nigeria. World Journal of AIDS, 4, 15-26. https://doi.org/10.4236/wja.2014.41003
[5]
Agrawal, R. and Srikant, R. (1994) Fast Algorithms for Mining Association Rules in Large Databases. VLDB’94: Proceedings of the 20th International Conference on Very Large Data Bases, Santiago de Chile, 12-15 September 1994, 487-499.
[6]
Han, J., Pei, J. and Yin, Y. (2000) Mining Frequent Patterns without Candidate Generation. ACM Sigmod Record, 29, 1-12. https://doi.org/10.1145/335191.335372
[7]
Brin, S., Motwani, R. and Silverstein, C. (1997) Beyond Market Baskets: Generalizing Association Rules to Correlations. ACM Sigmod International Conference on Management of Data, Tucson, 13-15 May 1997, 265-276. https://doi.org/10.1145/253262.253327
[8]
Savasere, A., Omiecinski, E. and Navathe, S. (1998) Mining for Strong Negative Associations in a Large Database of Customer Transactions. Proceedings 14th International Conference on Data Engineering, Orlando, 23-27 February 1998, 494-502.
[9]
Wu, X., Zhang, C. and Zhang, S. (2004) Efficient Mining of both Positive and Negative Association Rules. ACM Transactions on Information Systems (TOIS), 22, 381-405. https://doi.org/10.1145/1010614.1010616
[10]
Cornelis, C., Yan, P., Zhang, X. and Chen, G. (2006) Mining Positive and Negative Association Rules from Large Databases. IEEE Conference on Cybernetics and Intelligent Systems, Bangkok, 7-9 June 2006, 1-6. https://doi.org/10.1109/ICCIS.2006.252251
[11]
Dong, X., Sun, F., Han, X. and Hou, R. (2006) Study of Positive and Negative Association Rules Based on Multi-Confidence and Chi-Squared Test. International Conference on Advanced Data Mining and Applications, Xi’an, 14-16 August 2006, 100-109. https://doi.org/10.1007/11811305_10
[12]
Dong, X., Niu, Z., Shi, X., Zhang, X. and Zhu, D. (2007) Mining both Positive and Negative Association Rules from Frequent and Infrequent Itemsets. International Conference on Advanced Data Mining and Applications, Harbin, 6-8 August 2007, 122-133. https://doi.org/10.1007/978-3-540-73871-8_13
[13]
Antonie, M.L. and Zaane, O.R. (2004) Mining Positive and Negative Association Rules: An Approach for Confined Rules. European Conference on Principles of Data Mining and Knowledge Discovery, Pisa, 20-24 September 2004, 27-38. https://doi.org/10.1007/978-3-540-30116-5_6
[14]
Dong, X., Ma, L. and Han, X. (2011) E-NFIS: Efficient Negative Frequent Itemsets Mining Only Based on Positive Ones. IEEE 3rd International Conference on Communication Software and Networks, Xi’an, 27-29 May 2011, 517-519.
[15]
Das, A., Ng, W.K. and Woon, Y.K. (2001) Rapid Association Rule Mining. Proceedings of the 10th International Conference on Information and Knowledge Management, Atlanta, 5-10 October 2001, 474-481.
[16]
Zaki, M.J. (2000) Scalable Algorithms for Association Mining. IEEE Transactions on Knowledge and Data Engineering, 12, 372-390.
[17]
Li, X., Liu, Y., Peng, J. and Wu, Z. (2002) The Extended Association Rules and Atom Association Rules. Journal of Computer Research and Application, 12, 1740-1750.
[18]
Asuncion, A. and Newman, D. (2013) UCI Machine Learning Repository. School of Information and Computer Science, University of California, Irvine, 28. https://archive.ics.uci.edu/ml/index.php