Performance Comparison of Apriori, ECLAT and FP-Growth Algorithm for No Biological Data Genes for Association Rule Learning

Anies Nurfazlin Anuar (1), Shahreen Kasim (2), - Hendrick (3)
(1) Faculty Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, 86400 Parit Raja, Johor, Malaysia
(2) Faculty Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, 86400 Parit Raja, Johor, Malaysia.
(3) Department Electronic Department, National Kaohsiung University of Science and Technology, Taiwan
How to cite (IJASEIT) :
Anuar, A. N., Kasim, S., & Hendrick, .-. (2020). Performance Comparison of Apriori, ECLAT and FP-Growth Algorithm for No Biological Data Genes for Association Rule Learning. International Journal of Advanced Science Computing and Engineering, 2(3). https://doi.org/10.62527/ijasce.2.3.103
This project is carried out to study the performance comparison of Apriori Algorithm, ECLAT Algorithm and FP-Growth Algorithm for no biological data genes. There are many genes with no biological data, but for this project we have chosen 4 types of no biological data genes. No biological data genes are genes that have no specific data about themselves such as location, behaviour and function of the genes. Association Rule Learning is a technique implementing big data in finding frequent item-sets. Frequent item-sets are items that occur frequently in the database. The performance of these three algorithms is compared through time efficiency and the ability to process small and large datasets. After the comparison, we can conclude that FP-Growth algorithm is the fastest algorithm for small data-set and Apriori algorithm and ECLAT algorithm takes lesser time to generate the frequent item-sets compared to FP-Growth algorithm.

Sagiroglu, S., & Sinanc, D. (2013). Big data: A review. 2013 International Conference on Collaboration Technologies and Systems (CTS).

Retrieved from https://doi.org/10.1109/cts.2013.6567202

Rungta, K. (2020). What is BIG DATA? Introduction, Types, Characteristics, Example. Guru99.com; Guru99. Retrieved from https://www.guru99.com/what-is-big-data.html.

Agrawal, R., Imieliński, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM SIGMOD international conference on management of data, Washington, DC (pp. 207–216). New York: ACM.

Lutkevich, B. (2017). association rules. SearchBusinessAnalytics; TechTarget. Retrieved from https://searchbusinessanalytics.techtarget.com/definition/association-rules-in-data-mining

Abhinav Rai (2019, June 4) Association Rule Mining: An Overview and its Applications. UpGrad Blog. Retrieved from https://www.upgrad.com/blog/association-rule-mining-an-overview-and-its-applications/

Home - Dev-C++ Official Website. (2020). Bloodshed.net. Retrieved from https://www.bloodshed.net

Apriori Algorithm in Machine Learning - Javatpoint. (2011). Www.javatpoint.com. Retrieved from https://www.javatpoint.com/apriori-algorithm-in-machine-learning

Frequent Pattern (FP) Growth Algorithm In Data Mining. (2021, May 30). Softwaretestinghelp.com. Retrieved from https://www.softwaretestinghelp.com/fp-growth-algorithm-data-mining/

Gene Ontology Consortium (2020). Annotations. Retrieved from http://current.geneontology.org/products/pages/downloads.html.

PANTHER - Gene List Analysis. (2021). Pantherdb.org. Retrieved from http://pantherdb.org/geneListAnalysis.do

Gayathri.G.S (February,2017). Performance comparison of Apriori, ECLAT and FP-growth algorithm for association rule learning. , International Journal of Computer Science and Mobile Computing. Retrieved from www.ijcsmc.com.