Rule Pruning algorithm is an approach combining an 
extended  function  set  and  a  frequency  counting 
mechanism  for  ranking  production  rules.  Together, 
with the choice of extended function set and pruning 
algorithm,  AutoGE  achieved  significantly  better 
genome  length  in  13  out  of  14  problems, with  the 
(balanced) arity-based grammar structure. Significant 
improvement  in  approximation  performance  for  13 
problems and generalization performance for 8 out of 
14  problems  is  observed  with  balanced  arity-based 
grammar.  We  therefore  conclude  that  arity-based 
grammar structure (simple or balanced), as opposed 
to commonly used mixed arity grammar, would yield 
better  results  not  only  in  terms  of  shorter  genome 
lengths but minimized errors for symbolic regression 
problems resulting in enhanced accuracy. 
6.1  Future Work 
An immediate extension to the current work is to trial 
symbolic  problems  with  real-world  data,  and  by 
exploring  other  problem  domains  for  instance 
program  synthesis,  and  Boolean  logic.  The  PRP 
algorithm  performance  can  be  further  enhanced  by 
investigating other search mechanisms, for example 
particle  swarm  optimization  or  ant  colony 
optimization.  We  aim  to  extend  AutoGE’s  suite  of 
algorithms and to make it more robust by exploring 
approaches like grammar-based EDAs. 
ACKNOWLEDGEMENTS 
This work was supported with the financial support 
of the Science Foundation Ireland grant 13/RC/2094. 
REFERENCES 
Bleuler, S., Bader, J., and Zitzler, E. (2008). Reducing Bloat 
in GP with Multiple Objectives.  In Knowles Corne D., 
Deb K., C. D., editor, Multiobjective Problem Solving 
from Nature, pages 177–200.  Springer 
Fagan  D.,  Murphy  E.  (2018)  Mapping  in  Grammatical 
Evolution.  In:  Ryan  C.,  O'Neill  M.,  Collins  J.  (eds) 
Handbook of Grammatical Evolution. Springer, Cham. 
Keijzer,  M.  (2003)  ‘Improving  symbolic  regression  with 
interval arithmetic and linear scaling’, Lecture Notes in 
Computer Science, 2610, pp. 70–82. Springer Berlin 
Koza,  J.  R.  (1993)  Genetic Programming: On the 
Programming of Computers by Means of Natural 
Selection. MIT Press. 
Kshirsagar,  M.,  Jachak,  R.,  Chaudhari,  P.  and  Ryan,  C. 
(2020),  GEMO:  Grammatical  Evolution  Memory 
Optimization  System,  In  Proceedings of the 12th 
International Joint Conference on Computational 
Intelligence (IJCCI), pages 184-191 
McDermott J. et.  al. (2012). Genetic  programming needs 
better benchmarks. In Proceedings of the 14th annual 
conference on Genetic and evolutionary computation 
(GECCO '12). pages 791–798. 
Murphy, A., Ali, M., Dias, D., Amaral, J., Naredo, E. and 
Ryan C.  (2020), Grammar-based Fuzzy Pattern Trees 
for Classification Problems., In Proceedings of the 12th 
International Joint Conference on Computational 
Intelligence (IJCCI), pages 71-80 
Nicolau,  M.  et  al.  (2015)  ‘Guidelines  for  defining 
benchmark  problems  in  Genetic  Programming’,  In 
Proceedings of IEEE Congress on Evolutionary 
Computation (CEC’15), pp. 1152–1159 
Nicolau  M.,  Agapitos  A.  (2018)  Understanding 
Grammatical Evolution: Grammar Design. In: Ryan C., 
O'Neill M., Collins J. (eds) Handbook of Grammatical 
Evolution. Springer, Cham. 
Nicolau, M.  and Agapitos, A.  (2020) ‘Choosing function 
sets  with  better  generalisation  performance  for 
symbolic  regression  models’,  Genetic Programming 
and Evolvable Machines. Springer US 
O'Neill, M., Ryan, C., Keijzer, M. et al. (2003) ‘Crossover 
in Grammatical Evolution’, Genetic Programming and 
Evolvable Machines 4, 67–93. 
Ryan  C.,  Collins  J.,  Neill  M.O.  (1998)  Grammatical 
evolution: Evolving programs for an arbitrary language. 
In: Banzhaf W., Poli R., Schoenauer M., Fogarty T.C. 
(eds)  Genetic  Programming.  EuroGP  1998.  Lecture 
Notes in Computer Science, vol 1391. Springer, Berlin 
Ryan C., O’Neill M., Collins J. (2018) Introduction to 20 
Years of Grammatical Evolution. In: Ryan C., O'Neill 
M.,  Collins  J.  (eds)  Handbook of Grammatical 
Evolution. Springer, Cham. 
Uy, N. Q. et. al. (2013) ‘Guiding Function Set Selection in 
Genetic  Programming  based  on  Fitness  Landscape 
Analysis’,  in  Companion Publication of the 2013 
Genetic and Evolutionary Computation Conference 
(GECCO’13), pp. 149–150. 
Vladislavleva, E. J., Smits, G. F. and den Hertog, D. (2009) 
‘Order  of  nonlinearity  as  a  complexity  measure  for 
models  generated  by  symbolic  regression  via  pareto 
genetic  programming’,  IEEE Transactions on 
Evolutionary Computation, 13(2), pp. 333–349. 
Wang,  G.  and  Soule,  T.  (2004)  ‘How  to  Choose 
Appropriate Function Sets for Genetic Programming’, 
Lecture Notes in Computer Science, 3003, pp. 198–207.