Performance Comparison of Apriori, ECLAT and FP-Growth Algorithm for No Biological Data Genes for Association Rule Learning

Anies Nurfazlin Anuar, Shahreen Kasim, - Hendrick

Abstract


This project is carried out to study the performance comparison of Apriori Algorithm, ECLAT Algorithm and FP-Growth Algorithm for no biological data genes. There are many genes with no biological data, but for this project we have chosen 4 types of no biological data genes. No biological data genes are genes that have no specific data about themselves such as location, behaviour and function of the genes. Association Rule Learning is a technique implementing big data in finding frequent item-sets. Frequent item-sets are items that occur frequently in the database. The performance of these three algorithms is compared through time efficiency and the ability to process small and large datasets. After the comparison, we can conclude that FP-Growth algorithm is the fastest algorithm for small data-set and Apriori algorithm and ECLAT algorithm takes lesser time to generate the frequent item-sets compared to FP-Growth algorithm.

Keywords


Association rule learning; algorithm; frequent item-sets; gene.

References


Sagiroglu, S., & Sinanc, D. (2013). Big data: A review. 2013 International Conference on Collaboration Technologies and Systems (CTS).

Retrieved from https://doi.org/10.1109/cts.2013.6567202

Rungta, K. (2020). What is BIG DATA? Introduction, Types, Characteristics, Example. Guru99.com; Guru99. Retrieved from https://www.guru99.com/what-is-big-data.html.

Agrawal, R., Imieliński, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM SIGMOD international conference on management of data, Washington, DC (pp. 207–216). New York: ACM.

Lutkevich, B. (2017). association rules. SearchBusinessAnalytics; TechTarget. Retrieved from https://searchbusinessanalytics.techtarget.com/definition/association-rules-in-data-mining

Abhinav Rai (2019, June 4) Association Rule Mining: An Overview and its Applications. UpGrad Blog. Retrieved from https://www.upgrad.com/blog/association-rule-mining-an-overview-and-its-applications/

Home - Dev-C++ Official Website. (2020). Bloodshed.net. Retrieved from https://www.bloodshed.net

Apriori Algorithm in Machine Learning - Javatpoint. (2011). Www.javatpoint.com. Retrieved from https://www.javatpoint.com/apriori-algorithm-in-machine-learning

Frequent Pattern (FP) Growth Algorithm In Data Mining. (2021, May 30). Softwaretestinghelp.com. Retrieved from https://www.softwaretestinghelp.com/fp-growth-algorithm-data-mining/

Gene Ontology Consortium (2020). Annotations. Retrieved from http://current.geneontology.org/products/pages/downloads.html.

PANTHER - Gene List Analysis. (2021). Pantherdb.org. Retrieved from http://pantherdb.org/geneListAnalysis.do

Gayathri.G.S (February,2017). Performance comparison of Apriori, ECLAT and FP-growth algorithm for association rule learning. , International Journal of Computer Science and Mobile Computing. Retrieved from www.ijcsmc.com.




DOI: https://doi.org/10.30630/ijasce.2.3.103

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Flag Counter

 

Organized / Collaboration

- Soft Computing and Data Mining Centre, UTHM, Malaysia and Department of Information Technology

- Society of Visual Informatics, Indonesia