ABSTRACT
We explore the ability of Evolution Strategies (ESs) to statistically learn the local landscape. Specifically, we consider ESs operating only with isotropic Gaussian mutations near the optimum and investigate the covariance matrix when constructed out of selected individuals by truncation. Unlike previous studies, we do not assume a Derandomization adaptation scheme, nor do we use Information Geometric Optimization in our proofs. We prove that the statistically constructed covariance matrix over such selected decision vectors has the same eigenvectors as the Hessian matrix. We further prove that when the population size is increased, the covariance becomes proportional to the inverse of the Hessian. We also devise and corroborate an analytic approximation of this covariance matrix. In the framework we consider, this confirms the classical hypothesis that learning the landscape is an inherent property of standard ESs, and that this capability stems only from the usage of isotropic Gaussian mutations and rank-based selection.
- Y. Akimoto. Analysis of a natural gradient algorithm on monotonic convex-quadratic-composite functions. In Proceedings of the 14th Annual Conference on Genetic and Evolutionary Computation, GECCO '12, pages 1293--1300, New York, NY, USA, 2012. ACM. Google ScholarDigital Library
- S.-I. Amari and H. Nagaoka. Methods of Information Geometry, volume 191 of Translations of Mathematical Monographs. American Mathematical Society, 2000.Google Scholar
- T. B\"ack, C. Foussette, and P. Krause. Contemporary Evolution Strategies. Natural Computing Series. Springer-Verlag Berlin Heidelberg, 2013.Google Scholar
- H.-G. Beyer. Convergence analysis of evolutionary algorithms that are based on the paradigm of information geometry. Evolutionary Computation, 22(4):679--709, Dec. 2014. Google ScholarDigital Library
- E. Castillo, A. S. Hadi, N. Balakrishnan, and J. M. Sarabia. Extreme Value and Related Models with Applications in Engineering and Science. John Wiley and Sons, 2004.Google Scholar
- P. Embrechts, C. Klüppelberg, and T. Mikosch. Modelling Extremal Events for Insurance and Finance. Springer-Verlag, 1997. Google ScholarCross Ref
- A. H. Feiveson and F. C. Delaney. The distribution and properties of a weighted sum of chi squares. Technical report, National Aeronautics and Space Administration, 1968.Google Scholar
- R. Fisher and L. Tippett. Limiting forms of the frequency distribution of the largest or smallest member of a sample. Proc. Cambridge Philos. Soc., 24:180--190, 1928. Google ScholarCross Ref
- S. S. Gupta. Order Statistics from the Gamma Distribution. Technometrics, 2, May 1960. Google ScholarCross Ref
- N. Hansen and A. Ostermeier. Completely Derandomized Self-Adaptation in Evolution Strategies. Evolutionary Computation, 9(2):159--195, 2001. Google ScholarDigital Library
- P. Moschopoulos and W. Canada. The distribution function of a linear combination of chi-squares. Computers & Mathematics with Applications, 10(4):383--386, 1984. Google ScholarCross Ref
- Y. Ollivier, L. Arnold, A. Auger, and N. Hansen. Information-Geometric Optimization Algorithms: A Unifying Picture via Invariance Principles. arXiv:1106.3708v3, 2014.Google Scholar
- G. Rudolph. On Correlated Mutations in Evolution Strategies. In Parallel Problem Solving from Nature - PPSN II, pages 105--114, Amsterdam, 1992. Elsevier.Google Scholar
- G. Rudolph. Convergence rates of evolutionary algorithms for a class of convex objective functions. Control and Cybernetics, 26(3), 1997.Google Scholar
- G. Rudolph. Handbook of Natural Computing: Theory, Experiments, and Applications, chapter Evolutionary Strategies, pages 673--698. Springer-Verlag, Berlin-Heidelberg, Germany, 2012. Google ScholarCross Ref
- O. M. Shir, J. Roslund, D. Whitley, and H. Rabitz. Efficient retrieval of landscape hessian: Forced optimal covariance adaptive learning. Physical Review E, 89:063306, Jun 2014. Google ScholarCross Ref
- D. Wierstra, T. Schaul, T. Glasmachers, Y. Sun, J. Peters, and J. Schmidhuber. Natural evolution strategies. J. Mach. Learn. Res., 15(1):949--980, Jan. 2014.Google ScholarDigital Library
- A. Zhigljavsky and A.vZilinskas. Stochastic Global Optimization. Springer Optimization and Its Applications. Springer US, 2007.Google Scholar
Index Terms
- On the Statistical Learning Ability of Evolution Strategies
Recommendations
On the Capacity of Evolution Strategies to Statistically Learn the Landscape
GECCO '16 Companion: Proceedings of the 2016 on Genetic and Evolutionary Computation Conference CompanionWe investigate the covariance matrix when constructed by Evolution Strategies (ESs) operating with the selection operator alone. We model continuous generation of candidate solutions about quadratic basins of attraction, with deterministic selection of ...
Niching in evolution strategies
GECCO '05: Proceedings of the 7th annual conference on Genetic and evolutionary computationEAs have the tendency to converge quickly into a single solution. Niching methods, the extension of EAs to address this issue, have been investigated up to date mainly within the field of Genetic Algorithms (GAs). In our study we investigate the basis ...
The effect of crossover on evolution ability of population
GEC '09: Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary ComputationCurrently most of the analysis about the running mechanism of GA focuses on the convergence problem while few focus on population characteristics after the single generation descendiblity. This paper presents the concept of evolution ability of ...
Comments