research-article

On the Statistical Learning Ability of Evolution Strategies

Authors:
Ofer M. Shir

Tel-Hai College, and The Galilee Research Institute - Migal, Upper Galilee, Israel

Tel-Hai College, and The Galilee Research Institute - Migal, Upper Galilee, Israel
View Profile

,
Amir Yehudayoff

Technion - Israel Institute of Technology, Haifa, Israel

Technion - Israel Institute of Technology, Haifa, Israel
View Profile

FOGA '17: Proceedings of the 14th ACM/SIGEVO Conference on Foundations of Genetic AlgorithmsJanuary 2017Pages 127–138https://doi.org/10.1145/3040718.3040722

Published:12 January 2017Publication History

FOGA '17: Proceedings of the 14th ACM/SIGEVO Conference on Foundations of Genetic Algorithms

Pages 127–138

ABSTRACT

We explore the ability of Evolution Strategies (ESs) to statistically learn the local landscape. Specifically, we consider ESs operating only with isotropic Gaussian mutations near the optimum and investigate the covariance matrix when constructed out of selected individuals by truncation. Unlike previous studies, we do not assume a Derandomization adaptation scheme, nor do we use Information Geometric Optimization in our proofs. We prove that the statistically constructed covariance matrix over such selected decision vectors has the same eigenvectors as the Hessian matrix. We further prove that when the population size is increased, the covariance becomes proportional to the inverse of the Hessian. We also devise and corroborate an analytic approximation of this covariance matrix. In the framework we consider, this confirms the classical hypothesis that learning the landscape is an inherent property of standard ESs, and that this capability stems only from the usage of isotropic Gaussian mutations and rank-based selection.

References

Y. Akimoto. Analysis of a natural gradient algorithm on monotonic convex-quadratic-composite functions. In Proceedings of the 14th Annual Conference on Genetic and Evolutionary Computation, GECCO '12, pages 1293--1300, New York, NY, USA, 2012. ACM. Google ScholarDigital Library
S.-I. Amari and H. Nagaoka. Methods of Information Geometry, volume 191 of Translations of Mathematical Monographs. American Mathematical Society, 2000.Google Scholar
T. B\"ack, C. Foussette, and P. Krause. Contemporary Evolution Strategies. Natural Computing Series. Springer-Verlag Berlin Heidelberg, 2013.Google Scholar
H.-G. Beyer. Convergence analysis of evolutionary algorithms that are based on the paradigm of information geometry. Evolutionary Computation, 22(4):679--709, Dec. 2014. Google ScholarDigital Library
E. Castillo, A. S. Hadi, N. Balakrishnan, and J. M. Sarabia. Extreme Value and Related Models with Applications in Engineering and Science. John Wiley and Sons, 2004.Google Scholar
P. Embrechts, C. Klüppelberg, and T. Mikosch. Modelling Extremal Events for Insurance and Finance. Springer-Verlag, 1997. Google ScholarCross Ref
A. H. Feiveson and F. C. Delaney. The distribution and properties of a weighted sum of chi squares. Technical report, National Aeronautics and Space Administration, 1968.Google Scholar
R. Fisher and L. Tippett. Limiting forms of the frequency distribution of the largest or smallest member of a sample. Proc. Cambridge Philos. Soc., 24:180--190, 1928. Google ScholarCross Ref
S. S. Gupta. Order Statistics from the Gamma Distribution. Technometrics, 2, May 1960. Google ScholarCross Ref
N. Hansen and A. Ostermeier. Completely Derandomized Self-Adaptation in Evolution Strategies. Evolutionary Computation, 9(2):159--195, 2001. Google ScholarDigital Library
P. Moschopoulos and W. Canada. The distribution function of a linear combination of chi-squares. Computers & Mathematics with Applications, 10(4):383--386, 1984. Google ScholarCross Ref
Y. Ollivier, L. Arnold, A. Auger, and N. Hansen. Information-Geometric Optimization Algorithms: A Unifying Picture via Invariance Principles. arXiv:1106.3708v3, 2014.Google Scholar
G. Rudolph. On Correlated Mutations in Evolution Strategies. In Parallel Problem Solving from Nature - PPSN II, pages 105--114, Amsterdam, 1992. Elsevier.Google Scholar
G. Rudolph. Convergence rates of evolutionary algorithms for a class of convex objective functions. Control and Cybernetics, 26(3), 1997.Google Scholar
G. Rudolph. Handbook of Natural Computing: Theory, Experiments, and Applications, chapter Evolutionary Strategies, pages 673--698. Springer-Verlag, Berlin-Heidelberg, Germany, 2012. Google ScholarCross Ref
O. M. Shir, J. Roslund, D. Whitley, and H. Rabitz. Efficient retrieval of landscape hessian: Forced optimal covariance adaptive learning. Physical Review E, 89:063306, Jun 2014. Google ScholarCross Ref
D. Wierstra, T. Schaul, T. Glasmachers, Y. Sun, J. Peters, and J. Schmidhuber. Natural evolution strategies. J. Mach. Learn. Res., 15(1):949--980, Jan. 2014.Google ScholarDigital Library
A. Zhigljavsky and A.vZilinskas. Stochastic Global Optimization. Springer Optimization and Its Applications. Springer US, 2007.Google Scholar

Index Terms

On the Statistical Learning Ability of Evolution Strategies

Recommendations

On the Capacity of Evolution Strategies to Statistically Learn the Landscape
GECCO '16 Companion: Proceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion

We investigate the covariance matrix when constructed by Evolution Strategies (ESs) operating with the selection operator alone. We model continuous generation of candidate solutions about quadratic basins of attraction, with deterministic selection of ...
Read More
Niching in evolution strategies
GECCO '05: Proceedings of the 7th annual conference on Genetic and evolutionary computation

EAs have the tendency to converge quickly into a single solution. Niching methods, the extension of EAs to address this issue, have been investigated up to date mainly within the field of Genetic Algorithms (GAs). In our study we investigate the basis ...
Read More
The effect of crossover on evolution ability of population
GEC '09: Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation

Currently most of the analysis about the running mechanism of GA focuses on the convergence problem while few focus on population characteristics after the single generation descendiblity. This paper presents the concept of evolution ability of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
FOGA '17: Proceedings of the 14th ACM/SIGEVO Conference on Foundations of Genetic Algorithms
January 2017
170 pages
ISBN:9781450346511
DOI:10.1145/3040718
Program Chairs:
Christian Igel
University of Copenhagen, Denmark
,
Dirk Sudholt
University of Sheffield, United Kingdom
,
Carsten Witt
Technical University of Denmark, Denmark
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 January 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
covariance
hessian
limit distributions of order statistics
statistical landscape learning
theory of evolution strategies
Qualifiers
- research-article
Conference

Acceptance Rates
FOGA '17 Paper Acceptance Rate13of23submissions,57%Overall Acceptance Rate72of131submissions,55%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 135
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

On the Statistical Learning Ability of Evolution Strategies

FOGA '17: Proceedings of the 14th ACM/SIGEVO Conference on Foundations of Genetic Algorithms

ABSTRACT

References

Cited By

Index Terms

Recommendations

On the Capacity of Evolution Strategies to Statistically Learn the Landscape

Niching in evolution strategies

The effect of crossover on evolution ability of population

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

On the Statistical Learning Ability of Evolution Strategies

FOGA '17: Proceedings of the 14th ACM/SIGEVO Conference on Foundations of Genetic Algorithms

ABSTRACT

References

Cited By

Index Terms

Recommendations

On the Capacity of Evolution Strategies to Statistically Learn the Landscape

Niching in evolution strategies

The effect of crossover on evolution ability of population

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media