model was found to be capable of achieving 23.5
percent faster search performance than DMLA
(Zhang et al, 2018), 16.5 percent faster search
performance than AQ PMS (Yu et al, 2019) and 18.2
percent faster search performance than PSO (Reddy
et al,2010) under various test sequence sizes. This
performance was also evaluated in terms of search
delay. This is a result of the combination of Map
Reduce and GA with various distance metrics,
which helps to enhance search performance in
various scenarios. These improvements enable the
proposed model to be deployed for numerous real-
time QPMS application scenarios. In future, the
proposed model must be validated on larger datasets,
and can be improved via use of Deep Learning
Models like Q-Learning, Autoencoders, and other
Convolutional Neural Networks (CNNs), which will
assist in further improving its scalability. This
performance can be further improved via use of
Gated Recurrent Units (GRUs), Generative
Adversarial Networks (GANs), along with
bioinspired computing models which will allow the
model to be continuously optimized for different
Motif Search based use cases.
ACKNOWLEDGEMENTS
The authors are thankful to the Director, Department
of Computer Science and Engineering, Visvesvaraya
National Institute of Technology (VNIT), Nagpur
for providing necessary facilities for this work.
REFERENCES
R. Semwal, I. Aier, U. Raj and P. K. Varadwaj, "Pr[m]:
An Algorithm for Protein Motif Discovery," in
IEEE/ACM Transactions on Computational Biology
and Bioinformatics, vol. 19, no. 1, pp. 585-592, 1 Jan.-
Feb. 2022, doi: 10.1109/TCBB.2020.2999262.
P. Xiao, X. Cai and S. Rajasekaran, "EMS3: An Improved
Algorithm for Finding Edit-Distance Based Motifs," in
IEEE/ACM Transactions on Computational Biology
and Bioinformatics, vol. 18, no. 1, pp. 27-37, 1 Jan.-
Feb. 2021, doi: 10.1109/TCBB.2020.3024222.
Q. Yu and X. Zhang, "A New Efficient Algorithm for
Quorum Planted Motif Search on Large DNA
Datasets," in IEEE Access, vol. 7, pp. 129617-129626,
2019, doi: 10.1109/ACCESS.2019.2940115.
T. Li, X. Zhang, F. Luo, F. -X. Wu and J. Wang,
"MultiMotifMaker: A Multi-Thread Tool for
Identifying DNA Methylation Motifs from Pacbio
Reads," in IEEE/ACM Transactions on Computational
Biology and Bioinformatics, vol. 17, no. 1, pp. 220-
225, 1 Jan.-Feb. 2020, doi: 10.1109/TCBB.2018.2
861399.
H. Zhang, L. Zhu and D. -S. Huang, "DiscMLA: An
Efficient Discriminative Motif Learning Algorithm
over High-Throughput Datasets," in IEEE/ACM
Transactions on Computational Biology and
Bioinformatics, vol. 15, no. 6, pp. 1810-1820, 1 Nov.-
Dec. 2018, doi: 10.1109/TCBB.2016.2561930.
H. Zhao, X. Xu, Y. Song, D. L. Lee, Z. Chen and H. Gao,
"Ranking Users in Social Networks with Motif-Based
PageRank," in IEEE Transactions on Knowledge and
Data Engineering, vol. 33, no. 5, pp. 2179-2192, 1
May 2021, doi: 10.1109/TKDE.2019.2953264.
X. Sun, Y. Tan, Q. Wu, B. Chen and C. Shen, "TM-Miner:
TFS-Based Algorithm for Mining Temporal Motifs in
Large Temporal Network," in IEEE Access, vol. 7, pp.
49778-49789, 2019, doi: 10.1109/ACCESS.2019.
2911181.
Z. Xing and S. Tu, "A Graph Neural Network Assisted
Monte Carlo Tree Search Approach to Traveling
Salesman Problem," in IEEE Access, vol. 8, pp.
108418-108428, 2020, doi: 10.1109/ACCESS.
2020.3000236.
S. Yu, F. Xia, Y. Sun, T. Tang, X. Yan and I. Lee,
"Detecting Outlier Patterns With Query-Based
Artificially Generated Searching Conditions," in IEEE
Transactions on Computational Social Systems, vol. 8,
no. 1, pp. 134-147, Feb. 2021, doi: 10.1109/
TCSS.2020.2977958.
M. U. Chaudhry and J. -H. Lee, "Feature Selection for
High Dimensional Data Using Monte Carlo Tree
Search," in IEEE Access, vol. 6, pp. 76036-76048,
2018, doi: 10.1109/ACCESS.2018.2883537.
Nicolae, M., Rajasekaran, S. qPMS9: An Efficient
Algorithm for Quorum Planted Motif Search. Sci Rep
5, 7813(2015). https://doi.org/10.1038/srep07813
Yu, Qiang & Zhang, Xiao. (2019). A New Efficient
Algorithm for Quorum Planted Motif Search on Large
DNA Datasets. IEEE Access. PP. 1-1. 10.1109/
ACCESS.2019.2940115.
Shrimankar, D.D. High performance computing
approachfor DNA motif discovery. CSIT 7, 295β297
(2019). https://doi.org/10.1007/s40012-019-00235-w
Peng Xiao, Xingyu Cai, and Sanguthevar Rajasekaran.
2021. EMS3: An Improved Algorithm for Finding Edit-
Distance Based Motifs. IEEE/ACM Trans. Comput.
Biol. Bioinformatics 18, 1 (Jan.-Feb. 2021)27,
37.https://doi.org/10.1109/TCBB.2020.3024222
Schultz, C.J., Wu, Y. & Baumann, U. A targeted
bioinformatics approach identifies highly variable cell
surface proteins that are unique to Glomeromycotina.
Mycorrhiza 32, 45β66(2022). https://doi.org/10.1007/
s00572-021-01066-x
J. Merlin and H. Dinh,"Poster: Randomized algorithms for
planted Motif Search," in 2013 IEEE 3rd International
Conference on Computational Advances in Bio and
medical Sciences (ICCABS), Las Vegas, NV, USA,
2012 pp. 1. doi: 10.1109/ICCABS.2012.6182654
Yu, Qiang & Huo, Hongwei & Vitter, Jeffrey & Huan, Jun
& Nekrich, Yakov. (2015). An Efficient Exact