skip to main content
article

Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization

Published:01 January 2005Publication History
Skip Abstract Section

Abstract

We develop in this article, four adaptive three-timescale stochastic approximation algorithms for simulation optimization that estimate both the gradient and Hessian of average cost at each update epoch. These algorithms use four, three, two, and one simulation(s), respectively, and update the values of the decision variable and Hessian matrix components simultaneously, with estimates based on the simultaneous perturbation methodology. Our algorithms use coupled stochastic recursions that proceed using three different timescales or step-size schedules. We present a detailed convergence analysis of the algorithms and show numerical experiments using all the developed algorithms on a two-node network of M/G/1 queues with feedback for a 50-dimensional parameter vector. We provide comparisons of the performance of these algorithms with two recently developed two-timescale steepest descent simultaneous perturbation analogs that use randomized and deterministic perturbation sequences, respectively. We also present experiments to explore the sensitivity of the algorithms to their associated parameters. The algorithms that use four and three simulations, respectively, perform significantly better than the rest of the algorithms.

References

  1. Andradóttir, S. 1996. Optimization of the transient and steady-state behavior of discrete event systems. Manag. Sci. 42, 5, 717--737. Google ScholarGoogle Scholar
  2. Bertsekas, D. P. 1999. Nonlinear Programming. Athena Scientific, Belmont.Google ScholarGoogle Scholar
  3. Bertsekas, D. P. and Tsitsiklis, J. N. 1989. Parallel and Distributed Computation. Prentice Hall, New Jersey. Google ScholarGoogle Scholar
  4. Bhatnagar, S. 1997. Multiscale Stochastic Approximation Algorithms with Applications to ABR Service in ATM Networks. Ph. D. thesis, Department of Electrical Engineering, Indian Institute of Science, Bangalore, India.Google ScholarGoogle Scholar
  5. Bhatnagar, S. and Borkar, V. S. 1997. Multiscale stochastic approximation for parametric optimization of hidden Markov models. Prob. Eng. and Info. Sci. 11, 509--522.Google ScholarGoogle Scholar
  6. Bhatnagar, S. and Borkar, V. S. 1998. A two time scale stochastic approximation scheme for simulation based parametric optimization. Prob. Eng. and Info. Sci. 12, 519--531.Google ScholarGoogle Scholar
  7. Bhatnagar, S. and Borkar, V. S. 2003. Multiscale chaotic SPSA and smoothed functional algorithms for simulation optimization. Simulation 79, 10, 568--580.Google ScholarGoogle Scholar
  8. Bhatnagar, S., Fu, M. C., Marcus, S. I., and Bhatnagar, S. 2001a. Two timescale algorithms for simulation optimization of hidden Markov models. IIE Trans. 33, 3, 245--258.Google ScholarGoogle Scholar
  9. Bhatnagar, S., Fu, M. C., Marcus, S. I., and Fard, P. J. 2001b. Optimal structured feedback policies for ABR flow control using two-timescale SPSA. IEEE/ACM Trans. Network. 9, 4, 479--491. Google ScholarGoogle Scholar
  10. Bhatnagar, S., Fu, M. C., Marcus, S. I., and Wang, I.-J. 2003. Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences. ACM Trans. Modell. Comput. Simul. 13, 2, 180--209. Google ScholarGoogle Scholar
  11. Brandiere, O. 1998. Some pathological traps for stochastic approximation. SIAM J. Contr. Optim. 36, 1293--1314. Google ScholarGoogle Scholar
  12. Chen, H. F., Duncan, T. E., and Pasik-Duncan, B. 1999. A Kiefer-Wolfowitz algorithm with randomized differences. IEEE Trans. Auto. Cont. 44, 3, 442--453.Google ScholarGoogle Scholar
  13. Chong, E. K. P. and Ramadge, P. J. 1993. Optimization of queues using an infinitesimal perturbation analysis-based stochastic algorithm with general update times. SIAM J. Cont. Optim. 31, 3, 698--732. Google ScholarGoogle Scholar
  14. Chong, E. K. P. and Ramadge, P. J. 1994. Stochastic optimization of regenerative systems using infinitesimal perturbation analysis. IEEE Trans. Auto. Cont. 39, 7, 1400--1410.Google ScholarGoogle Scholar
  15. Dippon, J. and Renz, J. 1997. Weighted means in stochastic approximation of minima. SIAM J. Contr. Optim. 35, 1811--1827. Google ScholarGoogle Scholar
  16. Fabian, V. 1971. Stochastic approximation. In Optimizing Methods in Statistics J. J. Rustagi, Ed. Academic Press, New York, NY, 439--470.Google ScholarGoogle Scholar
  17. Fu, M. C. 1990. Convergence of a stochastic approximation algorithm for the GI/G/1 queue using infinitesimal perturbation analysis. J. Optim. Theo. Appl. 65, 149--160. Google ScholarGoogle Scholar
  18. Fu, M. C. and Hill, S. D. 1997. Optimization of discrete event systems via simultaneous perturbation stochastic approximation. IIE Trans. 29, 3, 233--243.Google ScholarGoogle Scholar
  19. Gelfand, S. B. and Mitter, S. K. 1991. Recursive stochastic algorithms for global optimization in Rd*. SIAM J. Cont. Optim. 29, 5, 999--1018. Google ScholarGoogle Scholar
  20. Hirsch, M. W. 1989. Convergent activation dynamics in continuous time networks. Neural Networks 2, 331--349. Google ScholarGoogle Scholar
  21. Ho, Y. C. and Cao, X. R. 1991. Perturbation Analysis of Discrete Event Dynamical Systems. Kluwer, Boston, MA.Google ScholarGoogle Scholar
  22. Kiefer, E. and Wolfowitz, J. 1952. Stochastic estimation of the maximum of a regression function. Ann. Math. Statist. 23, 462--466.Google ScholarGoogle Scholar
  23. Kleinman, N. L., Spall, J. C., and Naiman, D. Q. 1999. Simulation-based optimization with stochastic approximation using common random numbers. Manag. Sci. 45, 1570--1578. Google ScholarGoogle Scholar
  24. Kushner, H. J. and Clark, D. S. 1978. Stochastic Approximation Methods for Constrained and Unconstrained Systems. Springer Verlag, New York, NY.Google ScholarGoogle Scholar
  25. Kushner, H. J. and Yin, G. G. 1997. Stochastic Approximation Algorithms and Applications. Springer Verlag, New York, NY.Google ScholarGoogle Scholar
  26. Lasalle, J. P. and Lefschetz, S. 1961. Stability by Liapunov's Direct Method with Applications. Academic Press, New York, NY.Google ScholarGoogle Scholar
  27. L'Ecuyer, P. and Glynn, P. W. 1994. Stochastic optimization by simulation: convergence proofs for the GI/G/1 queue in steady-state. Manag. Sci. 40, 11, 1562--1578. Google ScholarGoogle Scholar
  28. Luman, R. R. 2000. Upgrading complex systems of systems: a CAIV methodology for warfare area requirements allocation. Military Operations Research 5, 2, 53--75.Google ScholarGoogle Scholar
  29. Pemantle, R. 1990. Nonconvergence to unstable points in urn models and stochastic approximations. Annals of Prob. 18, 698--712.Google ScholarGoogle Scholar
  30. Polyak, B. T. and Juditsky, A. B. 1992. Acceleration of stochastic approximation by averaging. SIAM J. Control Optim. 30, 4, 838--855. Google ScholarGoogle Scholar
  31. Robbins, H. and Monro, S. 1951. A stochastic approximation method. Ann. Math. Statist. 22, 400--407.Google ScholarGoogle Scholar
  32. Ruppert, D. 1985. A Newton-Raphson version of the multivariate Robbins-Monro procedure. Annals Statist. 13, 236--245.Google ScholarGoogle Scholar
  33. Spall, J. C. 1992. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation. IEEE Trans. Auto. Cont. 37, 3, 332--341.Google ScholarGoogle Scholar
  34. Spall, J. C. 1997. A one-measurement form of simultaneous perturbation stochastic approximation. Automatica 33, 109--112. Google ScholarGoogle Scholar
  35. Spall, J. C. 2000. Adaptive stochastic approximation by the simultaneous perturbation method. IEEE Trans. Autom. Contr. 45, 1839--1853.Google ScholarGoogle Scholar
  36. Zhu, X. and Spall, J. C. 2002. A modified second-order SPSA optimization algorithm for finite samples. Int. J. Adapt. Contr. Sign. Proce. 16, 397--409.Google ScholarGoogle Scholar

Index Terms

  1. Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader