An Information Entropy Based to Identify Dominant Species for No Biological Data Genes

Nurul Ain Nazirah (1), Shahreen Kasim (2), Dwiny Meidelfi (3)
(1) Faculty Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, 86400 Parit Raja, Johor, Malaysia
(2) Faculty Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, 86400 Parit Raja, Johor, Malaysia
(3) Department of Information Technology, Politeknik Negeri Padang, Indonesia
Fulltext View | Download
How to cite (IJASEIT) :
Nazirah, N. A., Kasim, S., & Meidelfi, D. (2020). An Information Entropy Based to Identify Dominant Species for No Biological Data Genes. International Journal of Advanced Science Computing and Engineering, 2(3), 86–96. https://doi.org/10.62527/ijasce.2.3.101
This report discusses about the dominant species of no biological data genes. This genes are belong to animal species. Gene is one of a process where the biological data encoded in the gene that instructed by the DNA to convert into a functional product such as protein. In gene classification, with the growth of using gene expression database, there are not enough tools to extract the gene expression from these databases. There exist over 23,000 to 50,000 genes for animal genome. So, this might contribute to data redundancy as problems can happen while handling a huge database. Therefore, to overcome the problem, many approaches to cluster and determine the dominant species have been proposed in the previous literature. For this project, in order to determine the dominant species, the information entropy based method is used. As conclusion, the purpose of this research is to identify the dominant species of no biological genes using entropy method proposed.

Yourgenome (2016). What is gene expression. Retrieved on January 1, 2016, from https://www.yourgenome.org/facts/what-is-gene-expression.

Giancarlo, R., Scaturro, D., & Utro, F. (2016). Valworkbench: An open source java library for cluster validation, with applications to microarray data analysis. Computer methods and programs in biomedicine, 118(2), 207-217.

Vaes, E., Khan, M., & Mombaerts, P. (2016). Statistical analysis of differential gene expression relative to a fold change threshold on NanoString data of mouse odorant receptor genes. BMC bioinformatics, 15(1), 39.

Gene Ontology Consortium (2020). Annotations. Retrieved on October 9, 2020, from http://current.geneontology.org/products/pages/downloads.html.

Gene Ontology Consortium (2020). Annotations. Retrieved on October 9, 2020, from http://current.geneontology.org/products/pages/downloads.html.

Gene Ontology Consortium (2020). Annotations. Retrieved on October 9, 2020, from http://current.geneontology.org/products/pages/downloads.html.

PANTHER Classification System (2021), Panther Gene Lists. Retrieved on December 18, 2020 from http://pantherdb.org/.

Soltanian AR, Rabiei N, Bahreini F. Feature Selection in Microarray Data Using Entropy Information. In: Husi H, editor. Computational Biology [Internet]. Brisbane (AU): Codon Publications; 2019 Nov 21. Chapter 10. Available from: https://www.ncbi.nlm.nih.gov/books/NBK550347/ doi: 10.15586/computationalbiology.2019.ch10.

Soltanian AR, Rabiei N, Bahreini F. Feature Selection in Microarray Data Using Entropy Information. In: Husi H, editor. Computational Biology [Internet]. Brisbane (AU): Codon Publications; 2019 Nov 21. Chapter 10. Available from: https://www.ncbi.nlm.nih.gov/books/NBK550347/ doi: 10.15586/computationalbiology.2019.ch10.

Soltanian AR, Rabiei N, Bahreini F. Feature Selection in Microarray Data Using Entropy Information. In: Husi H, editor. Computational Biology [Internet]. Brisbane (AU): Codon Publications; 2019 Nov 21. Chapter 10. Available from: https://www.ncbi.nlm.nih.gov/books/NBK550347/ doi: 10.15586/computationalbiology.2019.ch10.