Please use this identifier to cite or link to this item:
Author(s): Md. Sarwar Kamal
Munesh Chandra Trivedi
Jannat Binta Alam
Nilanjan Dey
Amira S. Ashour
Fuqian Shi
João Manuel R. S. Tavares
Title: Big DNA Datasets Analysis under Push down Automata
Issue Date: 2018-08
Abstract: Consensus is a significant part that supports the identification of unknown information about animals, plants and insects around the globe. It represents a small part of Deoxyribonucleic acid (DNA) known as the DNA segment that carries all the information for investigation and verification. However, excessive datasets are the major challenges to mine the accurate meaning of the experiments. The datasets are increasing exponentially in ever seconds. In the present article, a memory saving consensus finding approach is organized. The principal component analysis (PCA) and independent component (ICA) are used to pre-process the training datasets. A comparison is carried out between these approaches with the Apriori algorithm. Furthermore, the push down automat (PDA) is applied for superior memory utilization. It iteratively frees the memory for storing targeted consensus by removing all the datasets that are not matched with the consensus. Afterward, the Apriori algorithm selects the desired consensus from limited values that are stored by the PDA. Finally, the Gauss-Seidel method is used to verify the consensus mathematically.
Subject: Ciências da Saúde, Ciências médicas e da saúde
Health sciences, Medical and Health sciences
Scientific areas: Ciências médicas e da saúde
Medical and Health sciences
Document Type: Artigo em Revista Científica Internacional
Rights: openAccess
Appears in Collections:FEUP - Artigo em Revista Científica Internacional

Files in This Item:
File Description SizeFormat 
288183.pdfPaper draft567.99 kBAdobe PDFThumbnail

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.