- Perla Juárez-Smith1,
- Leonardo Trujillo ORCID:orcid.org/0000-0003-1812-57361,
- Mario García-Valdez1,
- Francisco Fernández de Vega2 &
- …
- Francisco Chávez2
396Accesses
Abstract
This work presents a unique genetic programming (GP) approach that integrates a numerical local search method and a bloat-control mechanism to address some of the main issues with traditional GP. The former provides a directed search operator to work in conjunction with standard syntax operators that perform more exploration in design space, while the latter controls code growth by maintaining program diversity through speciation. The system can produce highly parsimonious solutions, thus reducing the cost of performing the local optimization process. The proposal is extensively evaluated using real-world problems from diverse domains, and the behavior of the search is analyzed from several different perspectives, including how species evolve, the effect of the local search process and the interpretability of the results. Results show that the proposed approach compares favorably with a standard approach, and that the hybrid algorithm can be used as a viable alternative for solving real-world symbolic regression problems.
This is a preview of subscription content,log in via an institution to check access.
Access this article
Subscribe and save
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime
Buy Now
Price includes VAT (Japan)
Instant access to the full article PDF.














Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
We must state that the termsspecies andspeciation are used in a technical sense, following their use in NEAT, and are not meant to imply that they closely resemble their biological namesakes.
Source code:https://github.com/saarahy/NGP-LS.
Basically an aggregate of the size of all the individuals evaluated by a search.
References
M. Affenzeller, S.M. Winkler, B. Burlacu, G. Kronberger, M. Kommenda, S. Wagner, Dynamic observation of genotypic and phenotypic diversity for different symbolic regression GP variants, inProceedings of the Genetic and Evolutionary Computation Conference Companion, GECCO ’17 (ACM, New York, 2017), pp. 1553–1558
N. Agarwal, B. Bullins, E. Hazan, Second-order stochastic optimization for machine learning in linear time. J. Mach. Learn. Res.18(1), 4148–4187 (2017)
S. Angra, S. Ahuja, Machine learning and its applications: a review, in2017 International Conference on Big Data Analytics and Computational Intelligence (ICBDAC) (2017), pp. 57–60
D. Applegate, B. Mayfield, An analysis of exchanging fitness cases with population size in symbolic regression genetic programming with respect to the computational model. in2013 IEEE Congress on Evolutionary Computation (2013), pp. 3111–3116
R.M.A. Azad, C. Ryan, A simple approach to lifetime learning in genetic programming-based symbolic regression. Evol. Comput.22(2), 287–317 (2014)
S. Bleuler, J. Bader, E. Zitzler,Reducing Bloat in GP with Multiple Objectives (Springer, Berlin, 2008), pp. 177–200
R.H. Byrd, R.B. Schnabel, G.A. Shultz, A trust region algorithm for nonlinearly constrained optimization. SIAM J. Numer. Anal.24(5), 1152–1170 (1987)
M. Castelli, L. Trujillo, L. Vanneschi, A. Popovič, Prediction of energy performance of residential buildings: a genetic programming approach. Energy Build.102, 67–74 (2015)
S. Chand, M. Wagner, Evolutionary many-objective optimization: a quick-start guide. Surv. Oper. Res. Manag. Sci.20(2), 35–42 (2015)
X. Chen, Y.-S. Ong, M.-H. Lim, K.C. Tan, A multi-facet survey on memetic computation. IEEE Trans. Evol. Comput.15(5), 591–607 (2011)
J.-S. Chou, C.-F. Tsai, Concrete compressive strength analysis using a combined classification and regression technique. Autom. Constr.24, 52–60 (2012)
V.V. de Melo, W. Banzhaf, Improving the prediction of material properties of concrete using kaizen programming with simulated annealing. Neurocomputing246, 25–44 (2017)
S. Dignum, R. Poli, Operator equalisation and bloat free GP, inGenetic Programming: 11th European Conference, EuroGP 2008 (Springer, Berlin, 2008), pp. 110–121
J. Enríquez-Zárate, L. Trujillo, S. de Lara, M. Castelli, E. Z-Flores, L. Muñoz, A. Popovič, Automatic modeling of a gas turbine using genetic programming: an experimental study. Appl. Soft Comput.50, 212–222 (2017)
O.F. Ertuĝrul, A novel type of activation function in artificial neural networks: trained activation function. Neural Netw.99, 148–157 (2018)
F.-A. Fortin, F.-M. De Rainville, M.-A. Gardner, M. Parizeau, C. Gagné, DEAP: evolutionary algorithms made easy. J. Mach. Learn. Res.13, 2171–2175 (2012)
J.E. Hernández-Beltran, V.H. Díaz-Ramirez, L. Trujillo, P. Legrand, Design of estimators for restoration of images degraded by haze using genetic programming. Swarm Evol. Comput.44, 49–63 (2019)
T.K. Ho, Random decision forests, inProceedings of 3rd International Conference on Document Analysis and Recognition, vol. 1 (1995), pp. 278–282
P. Juárez-Smith, L. Trujillo, Integrating local search within neat-GP, inProceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion, GECCO ’16 Companion (ACM, New York, 2016), pp. 993–996
S.S. Kim, K.C. Kwak, Development of quantum-based adaptive neuro-fuzzy networks. IEEE Trans. Syst. Man Cybern. B (Cyberne.)40(1), 91–100 (2010)
M. Kommenda, G. Kronberger, S.M. Winkler, M. Affenzeller, S. Wagner, Effects of constant optimization by nonlinear least squares minimization in symbolic regression, inGenetic and Evolutionary Computation Conference, GECCO ’13, Amsterdam, The Netherlands, July 6–10, 2013, Companion Material Proceedings (2013), pp. 1121–1128
J.R. Koza,Genetic Programming: On the Programming of Computers by Means of Natural Selection (MIT Press, Cambridge, 1992)
J.R. Koza, Human-competitive results produced by genetic programming. Genet. Program. Evolvable Mach.11(3–4), 251–284 (2010)
W.B. Langdon, P. Riccardo,Foundations of Genetic Programming (Springer, Berlin, 2002)
D. Medernach, J. Fitzgerald, R.M.A. Azad, C. Ryan, A new wave: a dynamic approach to genetic programming, inProceedings of the Genetic and Evolutionary Computation Conference 2016, GECCO ’16 (ACM, New York, 2016), pp. 757–764
A. Moraglio, K. Krawiec, C.G. Johnson, Geometric semantic genetic programming, inProceedings of the 12th International Conference on Parallel Problem Solving from Nature—Volume Part I (Springer, Berlin, 2012), pp. 21–31
G. Olague, L. Trujillo, Evolutionary-computer-assisted design of image operators that detect interest points using genetic programming. Image Vis. Comput.29(7), 484–498 (2011)
I. Ortigosa, R. López, J. Garcia, A neural networks approach to residuary resistance of sailing yachts prediction, inProceedings of the International Conference on Marine Engineering MARINE (2007), p. 250
R. Poli, W.B. Langdon, S. Dignum, On the limiting distribution of program sizes in tree-based genetic programming, inGenetic Programming (Springer, Berlin, 2007), pp. 193–204
R. Poli, W.B. Langdon, N.F. McPhee,A Field Guide to Genetic Programming (Lulu Enterprises, Morrisville, 2008)
J.R. Quinlan, Combining instance-based and model-based learning, inMachine Learning, Proceedings of the Tenth International Conference, University of Massachusetts, Amherst, MA, USA, June 27–29, 1993 (1993), pp. 236–243
S.S. Roy, R. Roy, V.E. Balas, Estimating heating load in buildings using multivariate adaptive regression splines, extreme learning machine, a hybrid model of MARS and ELM. Renew. Sustain. Energy Rev.82, 4256–4268 (2018)
Y.-H. Shao, C.-H. Zhang, Z.-M. Yang, L. Jing, N.-Y. Deng, An\(\epsilon \)-twin support vector machine for regression. Neural Comput. Appl.23(1), 175–185 (2013)
S. Silva, Reassembling operator equalisation: a secret revealed, inProceedings of the 13th Annual Conference on Genetic and Evolutionary Computation, GECCO ’11 (ACM, New York, 2011), pp. 1395–1402
S. Silva, E. Costa, Dynamic limits for bloat control in genetic programming and a review of past and current bloat theories. Genet. Program. Evolvable Mach.10(2), 141–179 (2009)
D. Sorensen, Newton’s method with a model trust region modification. SIAM J. Numer. Anal.16, 409–426 (1982)
K.O. Stanley, R. Miikkulainen, Evolving neural networks through augmenting topologies. Evol. Comput.10(2), 99–127 (2002)
L. Trujillo, P. Legrand, G. Olague, J. LéVy-VéHel, Evolving estimators of the pointwise Hölder exponent with genetic programming. Inf. Sci.209, 61–79 (2012)
L. Trujillo, L. Muñoz, E. Galván-López, S. Silva, Neat genetic programming: controlling bloat naturally. Inf. Sci.333, 21–43 (2016)
A. Tsanas, A. Xifara, Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools. Energy Build.49, 560–567 (2012)
E.J. Vladislavleva, G.F. Smits, D. den Hertog, Order of nonlinearity as a complexity measure for models generated by symbolic regression via pareto genetic programming. IEEE Trans. Evol. Comput.13(2), 333–349 (2009)
I.-C. Yeh, Modeling of strength of high-performance concrete using artificial neural networks. Cem. Concr. Res.28(12), 1797–1808 (1998)
E. Z-Flores, L. Trujillo, O. Schütze, P. Legrand, A local search approach to genetic programming for binary classification, inProceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation, GECCO ’15 (ACM, New York, 2015), pp. 1151–1158
E. Z-Flores, L. Trujillo, O. Schütze, P. Legrand,EVOLVE—A Bridge Between Probability, Set Oriented Numerics, and Evolutionary Computation V. Chapter, Evaluating the Effects of Local Search in Genetic Programming (Springer, New York, 2014)
E. Z-Flores, M. Abatal, A. Bassam, L. Trujillo, P. Juárez-Smith, Y.E. Hamzaoui, Modeling the adsorption of phenols and nitrophenols by activated carbon using genetic programming. J. Clean. Prod.161, 860–870 (2017)
Acknowledgements
This work was funded by CONACYT (Mexico) project no. FC-2015-2/944Aprendizaje evolutivo a gran escala, and TecNM (Mexico) project no. 6826-18-p. Second author was supported by CONACYT doctoral scholarship 332554. The authors would like to thank Spanish Ministry of Economy, Industry and Competitiveness and European Regional Development Fund (FEDER) under projects TIN2014-56494-C4-4-P (Ephemec) and TIN2017-85727-C4-4-P (DeepBio); Junta de Extremadura Project IB16035 Regional Government of Extremadura, Consejeria of Economy and Infrastructure, FEDER.
Author information
Authors and Affiliations
Tecnológico Nacional de México/Instituto Tecnológico de Tijuana, Tijuana, BC, C.P. 22430, Mexico
Perla Juárez-Smith, Leonardo Trujillo & Mario García-Valdez
Universidad de Extremadura, Badajoz, Spain
Francisco Fernández de Vega & Francisco Chávez
- Perla Juárez-Smith
You can also search for this author inPubMed Google Scholar
- Leonardo Trujillo
You can also search for this author inPubMed Google Scholar
- Mario García-Valdez
You can also search for this author inPubMed Google Scholar
- Francisco Fernández de Vega
You can also search for this author inPubMed Google Scholar
- Francisco Chávez
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toLeonardo Trujillo.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Juárez-Smith, P., Trujillo, L., García-Valdez, M.et al. Local search in speciation-based bloat control for genetic programming.Genet Program Evolvable Mach20, 351–384 (2019). https://doi.org/10.1007/s10710-019-09351-7
Received:
Revised:
Published:
Issue Date:
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative