Dynamic Programming
March 2003
Publisher:
  • Dover Publications, Inc.
  • 31 E. Second St. Mineola, NY
  • United States
ISBN: 978-0-486-42809-3
Published: 01 March 2003
Pages: 366
Abstract

From the Publisher:

An introduction to the mathematical theory of multistage decision processes, this text takes a functional equation approach to the discovery of optimum policies. Written by a leading developer of such policies, it presents a series of methods, uniqueness and existence theorems, and examples for solving the relevant equations. The text examines existence and uniqueness theorems, the optimal inventory equation, bottleneck problems in multistage production processes, a new formalism in the calculus of variations, strategies behind multistage games, and Markovian decision processes. Each chapter concludes with a problem set that Eric V. Denardo of Yale University, in his informative new introduction, calls a rich lode of applications and research topics. 1957 edition. 37 figures.

Cited By

  1. Little M, He X and Kayas U (2024). Polymorphic dynamic programming by algebraic shortcut fusion, Formal Aspects of Computing, 0:0
  2. Sundram S, Tariq M and Kjolstad F (2024). Compiling Recurrences over Dense and Sparse Arrays, Proceedings of the ACM on Programming Languages, 8:OOPSLA1, (250-275), Online publication date: 29-Apr-2024.
  3. Gao N, Wang D, Zhao M and Hu L (2024). Model-free intelligent critic design with error analysis for neural tracking control, Neurocomputing, 572:C, Online publication date: 1-Mar-2024.
  4. Jabón J, Corbera S, Álvarez R and Barea R (2024). Aerodynamic shape optimization using graph variational autoencoders and genetic algorithms, Structural and Multidisciplinary Optimization, 67:3, Online publication date: 1-Mar-2024.
  5. Hammar K and Stadler R (2024). Learning Near-Optimal Intrusion Responses Against Dynamic Attackers, IEEE Transactions on Network and Service Management, 21:1, (1158-1177), Online publication date: 1-Feb-2024.
  6. Li K and Li Y (2024). Fuzzy Adaptive Optimization Prescribed Performance Control for Nonlinear Vehicle Platoon, IEEE Transactions on Fuzzy Systems, 32:2, (360-372), Online publication date: 1-Feb-2024.
  7. Gao Y, Zhang T, Zhu C, Yang S, Schonfeld P, Zou K, Zhang J, Zhu Y, Wang P and He Q (2024). Biobjective optimization for railway alignment fine‐grained designs with parallel existing railways, Computer-Aided Civil and Infrastructure Engineering, 39:3, (438-457), Online publication date: 26-Jan-2024.
  8. Chen L and Hao F (2024). Optimal tracking control for unknown nonlinear systems with uncertain input saturation, Neurocomputing, 564:C, Online publication date: 7-Jan-2024.
  9. Nguyen N, Tran M and Chandra R (2024). Sequential reversible jump MCMC for dynamic Bayesian neural networks, Neurocomputing, 564:C, Online publication date: 7-Jan-2024.
  10. Prabhavalkar R, Hori T, Sainath T, Schlüter R and Watanabe S (2024). End-to-End Speech Recognition: A Survey, IEEE/ACM Transactions on Audio, Speech and Language Processing, 32, (325-351), Online publication date: 1-Jan-2024.
  11. Teng S (2023). “Intelligent Heuristics Are the Future of Computing”, ACM Transactions on Intelligent Systems and Technology, 14:6, (1-39), Online publication date: 31-Dec-2024.
  12. Slavin O and Arlazarov V (2023). Algorithms of the Tiger and CuneiForm Optical Character Recognition Software, Pattern Recognition and Image Analysis, 33:4, (669-684), Online publication date: 1-Dec-2023.
  13. Zhang Y, Chadli M and Xiang Z (2023). Prescribed-Time Formation Control for a Class of Multiagent Systems via Fuzzy Reinforcement Learning, IEEE Transactions on Fuzzy Systems, 31:12, (4195-4204), Online publication date: 1-Dec-2023.
  14. Riboli M, Jaccard M, Silvestri M, Aimi A and Malara C (2023). Collision-free and smooth motion planning of dual-arm Cartesian robot based on B-spline representation, Robotics and Autonomous Systems, 170:C, Online publication date: 1-Dec-2023.
  15. Cespi R, Di Gennaro S, Castillo-Toledo B, Romero-Aragon J and Ramírez-Mendoza R (2023). Neural Network Inverse Optimal Control of Ground Vehicles, Neural Processing Letters, 55:8, (10287-10313), Online publication date: 1-Dec-2023.
  16. Saglam B and Kozat S (2023). Deep intrinsically motivated exploration in continuous control, Machine Learning, 112:12, (4959-4993), Online publication date: 1-Dec-2023.
  17. Shakya A, Pillai G and Chakrabarty S (2023). Reinforcement learning algorithms, Expert Systems with Applications: An International Journal, 231:C, Online publication date: 30-Nov-2023.
  18. Jornet M (2023). On the random fractional Bateman equations, Applied Mathematics and Computation, 457:C, Online publication date: 15-Nov-2023.
  19. Heinold A, Meisel F and Ulmer M (2022). Primal-Dual Value Function Approximation for Stochastic Dynamic Intermodal Transportation with Eco-Labels, Transportation Science, 57:6, (1452-1472), Online publication date: 1-Nov-2023.
  20. Pataro I, Cunha R, Gil J, Guzmán J, Berenguel M and Lemos J (2023). Optimal model-free adaptive control based on reinforcement Q-Learning for solar thermal collector fields, Engineering Applications of Artificial Intelligence, 126:PA, Online publication date: 1-Nov-2023.
  21. Qasem O, Gao W and Vamvoudakis K (2023). Adaptive optimal control of continuous-time nonlinear affine systems via hybrid iteration, Automatica (Journal of IFAC), 157:C, Online publication date: 1-Nov-2023.
  22. Josz C (2023). Global convergence of the gradient method for functions definable in o-minimal structures, Mathematical Programming: Series A and B, 202:1-2, (355-383), Online publication date: 1-Nov-2023.
  23. Zhao M, Wang D, Qiao J, Ha M and Ren J (2023). Advanced value iteration for discrete-time intelligent critic control: A survey, Artificial Intelligence Review, 56:10, (12315-12346), Online publication date: 1-Oct-2023.
  24. Khan I and Aftab M (2023). Adaptive fuzzy dynamic programming (AFDP) technique for linear programming problems lps with fuzzy constraints, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 27:19, (13931-13949), Online publication date: 1-Oct-2023.
  25. Wu W, Eamen L, Dandy G, Razavi S, Kuczera G and Maier H (2023). Beyond engineering, Environmental Modelling & Software, 167:C, Online publication date: 1-Sep-2023.
  26. Dai B, Krishnamurthy P, Papanicolaou A and Khorrami F (2023). State constrained stochastic optimal control for continuous and hybrid dynamical systems using DFBSDE, Automatica (Journal of IFAC), 155:C, Online publication date: 1-Sep-2023.
  27. Grage K, Jansen K and Ohnesorge F Improved Algorithms for Monotone Moldable Job Scheduling Using Compression and Convolution Euro-Par 2023: Parallel Processing, (503-517)
  28. Bertram J, Zambreno J and Wei P (2023). Efficient Unmanned Aerial Systems Navigation With Collision Avoidance in Dense Urban Environments, IEEE Transactions on Intelligent Transportation Systems, 24:8, (8163-8173), Online publication date: 1-Aug-2023.
  29. Kůrková V and Sanguineti M (2023). Approximation of classifiers by deep perceptron networks, Neural Networks, 165:C, (654-661), Online publication date: 1-Aug-2023.
  30. Millán-Arias C, Fernandes B and Cruz F (2023). Proxemic behavior in navigation tasks using reinforcement learning, Neural Computing and Applications, 35:23, (16723-16738), Online publication date: 1-Aug-2023.
  31. Sleeman W, Kapoor R and Ghosh P (2022). Multimodal Classification: Current Landscape, Taxonomy and Future Directions, ACM Computing Surveys, 55:7, (1-31), Online publication date: 31-Jul-2023.
  32. Wu Y, Zhou X, Chowdhury S and Wang D Differentially private episodic reinforcement learning with heavy-tailed rewards Proceedings of the 40th International Conference on Machine Learning, (37880-37918)
  33. Ren J, Zhou Y, Jin J, Lyu L and Yan D Dimension-independent certified neural network watermarks via mollifier smoothing Proceedings of the 40th International Conference on Machine Learning, (28976-29008)
  34. Fayyazi M, Abdoos M, Phan D, Golafrouz M, Jalili M, Jazar R, Langari R and Khayyam H (2023). Real-time self-adaptive Q-learning controller for energy management of conventional autonomous vehicles, Expert Systems with Applications: An International Journal, 222:C, Online publication date: 15-Jul-2023.
  35. Klößner T, Torralba Á, Steinmetz M and Sievers S A theory of merge-and-shrink for stochastic shortest path problems Proceedings of the Thirty-Third International Conference on Automated Planning and Scheduling, (203-211)
  36. Zhao T, Zhou S, Sun Y and Niu Z (2023). A Predictive Frame Transmission Scheme for Cloud Gaming in Mobile Edge Cloudlet Systems, IEEE Transactions on Mobile Computing, 22:7, (3774-3789), Online publication date: 1-Jul-2023.
  37. Peng Z, Ji H, Zou C, Kuang Y, Cheng H, Shi K and Ghosh B (2023). Optimal H∞ tracking control of nonlinear systems with zero-equilibrium-free via novel adaptive critic designs, Neural Networks, 164:C, (105-114), Online publication date: 1-Jul-2023.
  38. Lei Y, Ye D, Shen S, Sui Y, Zhu T and Zhou W (2022). New challenges in reinforcement learning: a survey of security and privacy, Artificial Intelligence Review, 56:7, (7195-7236), Online publication date: 1-Jul-2023.
  39. Li M, Wang D, Zhao M and Qiao J (2023). Event-triggered constrained neural critic control of nonlinear continuous-time multiplayer nonzero-sum games, Information Sciences: an International Journal, 631:C, (412-428), Online publication date: 1-Jun-2023.
  40. Charpentier A, Élie R and Remlinger C (2023). Reinforcement Learning in Economics and Finance, Computational Economics, 62:1, (425-462), Online publication date: 1-Jun-2023.
  41. Candon K, Chen J, Kim Y, Hsu Z, Tsoi N and Vázquez M Nonverbal Human Signals Can Help Autonomous Agents Infer Human Preferences for Their Behavior Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, (307-316)
  42. Théate T, Wehenkel A, Bolland A, Louppe G and Ernst D (2023). Distributional reinforcement learning with unconstrained monotonic neural networks, Neurocomputing, 534:C, (199-219), Online publication date: 14-May-2023.
  43. Goby N, Brandt T and Neumann D (2023). Deep reinforcement learning with combinatorial actions spaces, Computers and Industrial Engineering, 179:C, Online publication date: 1-May-2023.
  44. Lin M, Zhao B and Liu D (2023). Policy gradient adaptive dynamic programming for nonlinear discrete-time zero-sum games with unknown dynamics, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 27:9, (5781-5795), Online publication date: 1-May-2023.
  45. Cohen-Hillel T, Panchamgam K and Perakis G (2023). High-Low Promotion Policies for Peak-End Demand Models, Management Science, 69:4, (2016-2050), Online publication date: 1-Apr-2023.
  46. Deng Q, Kang Q, Zhang L, Zhou M and An J (2023). Objective Space-Based Population Generation to Accelerate Evolutionary Algorithms for Large-Scale Many-Objective Optimization, IEEE Transactions on Evolutionary Computation, 27:2, (326-340), Online publication date: 1-Apr-2023.
  47. Laurière M, Song J and Tang Q (2023). Policy Iteration Method for Time-Dependent Mean Field Games Systems with Non-separable Hamiltonians, Applied Mathematics and Optimization, 87:2, Online publication date: 1-Apr-2023.
  48. Zhang Z, Zou Y, Lai J and Xu Q M2DQN: A Robust Method for Accelerating Deep Q-learning Network Proceedings of the 2023 15th International Conference on Machine Learning and Computing, (116-120)
  49. Xiang L, Dudziak Ł, Abdelfattah M, Chau T, Lane N and Wen H Zero-cost operation scoring in differentiable architecture search Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, (10453-10463)
  50. Schmitt S, Shawe-Taylor J and van Hasselt H Exploration via epistemic value estimation Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, (9742-9751)
  51. Antonopoulos A, Pagourtzis A, Petsalakis S and Vasilakis M (2023). Faster algorithms for k-subset sum and variations, Journal of Combinatorial Optimization, 45:1, Online publication date: 1-Jan-2023.
  52. Nass D, Belousov B and Peters J Entropic Risk Measure in Policy Search 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (1101-1106)
  53. Dewanto V and Gallagher M Examining Average and Discounted Reward Optimality Criteria in Reinforcement Learning AI 2022: Advances in Artificial Intelligence, (800-813)
  54. Wilbaut C, Todosijević R, Hanafi S and Fréville A (2022). Variable neighborhood search for the discounted {0-1} knapsack problem, Applied Soft Computing, 131:C, Online publication date: 1-Dec-2022.
  55. Friedrich S, Antes G, Behr S, Binder H, Brannath W, Dumpert F, Ickstadt K, Kestler H, Lederer J, Leitgöb H, Pauly M, Steland A, Wilhelm A and Friede T (2022). Is there a role for statistics in artificial intelligence?, Advances in Data Analysis and Classification, 16:4, (823-846), Online publication date: 1-Dec-2022.
  56. An D, Huong V and Xu H (2022). Differential Stability of Discrete Optimal Control Problems with Possibly Nondifferentiable Costs, Applied Mathematics and Optimization, 86:3, Online publication date: 1-Dec-2022.
  57. Rezaei-Shoshtari S, Zhao R, Panangaden P, Meger D and Precup D Continuous MDP homomorphisms and homomorphic policy gradient Proceedings of the 36th International Conference on Neural Information Processing Systems, (20189-20204)
  58. Asadi K, Fakoor R, Gottesman O, Kim T, Littman M and Smola A Faster deep reinforcement learning with slower online network Proceedings of the 36th International Conference on Neural Information Processing Systems, (19944-19955)
  59. Kim J, Park S and Kim G Constrained GPI for zero-shot transfer in reinforcement learning Proceedings of the 36th International Conference on Neural Information Processing Systems, (4585-4597)
  60. Mutti M, De Santi R, De Bartolomeis P and Restelli M Challenging common assumptions in convex reinforcement learning Proceedings of the 36th International Conference on Neural Information Processing Systems, (4489-4502)
  61. Tesar M, Schwarz F and Gratzfeld P Optimized Driving Profiles with Deep Reinforcement Learning for Driver Assistance Systems in Light Rail Vehicles 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), (673-680)
  62. Boldinov V, Bukhalev V and Skrynnikov A (2022). Game Control of a Random Jump Structure of an Object in Mixed Strategies, Journal of Computer and Systems Sciences International, 61:5, (715-723), Online publication date: 1-Oct-2022.
  63. Gornov A, Anikin A, Zarodnyuk T and Sorokovikov P (2022). Modification of the Confidence Bar Algorithm Based on Approximations of the Main Diagonal of the Hessian Matrix for Solving Optimal Control Problems, Automation and Remote Control, 83:10, (1590-1599), Online publication date: 1-Oct-2022.
  64. Bure V and Parilina E (2022). A Multiple Access Game with Incomplete Information, Automation and Remote Control, 83:9, (1467-1475), Online publication date: 1-Sep-2022.
  65. Bertram J, Wei P and Zambreno J (2022). A Fast Markov Decision Process-Based Algorithm for Collision Avoidance in Urban Air Mobility, IEEE Transactions on Intelligent Transportation Systems, 23:9, (15420-15433), Online publication date: 1-Sep-2022.
  66. Nidzwetzki J and Güting R (2022). BBoxDB streams: scalable processing of multi-dimensional data streams, Distributed and Parallel Databases, 40:2-3, (559-625), Online publication date: 1-Sep-2022.
  67. Kumakshev S and Shmatkov A (2022). Optimal Fuel Consumption Trajectories of a Civil Supersonic Aircraft, Journal of Computer and Systems Sciences International, 61:4, (664-676), Online publication date: 1-Aug-2022.
  68. Li K and Li Y (2022). Fuzzy Adaptive Optimal Consensus Fault-Tolerant Control for Stochastic Nonlinear Multiagent Systems, IEEE Transactions on Fuzzy Systems, 30:8, (2870-2885), Online publication date: 1-Aug-2022.
  69. Sassano M, Mylvaganam T and Astolfi A (2022). On the analysis of open-loop Nash equilibria admitting a feedback synthesis in nonlinear differential games, Automatica (Journal of IFAC), 142:C, Online publication date: 1-Aug-2022.
  70. Shende S, Gillman A, Buskohl P and Vemaganti K (2022). Systematic cost analysis of gradient- and anisotropy-enhanced Bayesian design optimization, Structural and Multidisciplinary Optimization, 65:8, Online publication date: 1-Aug-2022.
  71. Chen S, Bolufé-Röhler A, Montgomery J, Zhang W and Hendtlass T Using Average-Fitness Based Selection to Combat the Curse of Dimensionality 2022 IEEE Congress on Evolutionary Computation (CEC), (1-8)
  72. Pichler A, Liu R and Shapiro A (2022). Risk-Averse Stochastic Programming, Operations Research, 70:4, (2439-2455), Online publication date: 1-Jul-2022.
  73. Liu Z, Khojandi A, Li X, Mohammed A, Davis R and Kamaleswaran R (2022). A Machine Learning–Enabled Partially Observable Markov Decision Process Framework for Early Sepsis Prediction, INFORMS Journal on Computing, 34:4, (2039-2057), Online publication date: 1-Jul-2022.
  74. Chu K, Lam A and Li V (2022). Traffic Signal Control Using End-to-End Off-Policy Deep Reinforcement Learning, IEEE Transactions on Intelligent Transportation Systems, 23:7, (7184-7195), Online publication date: 1-Jul-2022.
  75. Hamednia A, Sharma N, Murgovski N and Fredriksson J (2022). Computationally Efficient Algorithm for Eco-Driving Over Long Look-Ahead Horizons, IEEE Transactions on Intelligent Transportation Systems, 23:7, (6556-6570), Online publication date: 1-Jul-2022.
  76. Amor R, Colomer A, Monteagudo C and Naranjo V (2022). A deep embedded refined clustering approach for breast cancer distinction based on DNA methylation, Neural Computing and Applications, 34:13, (10243-10255), Online publication date: 1-Jul-2022.
  77. Hooker J Stochastic Decision Diagrams Integration of Constraint Programming, Artificial Intelligence, and Operations Research, (138-154)
  78. Alonistiotis G, Antonopoulos A, Melissinos N, Pagourtzis A, Petsalakis S and Vasilakis M Approximating Subset Sum Ratio via Subset Sum Computations Combinatorial Algorithms, (73-85)
  79. Sabate-Vidales M and Šiška D The Case for Variable Fees in Constant Product Markets: An Agent Based Simulation Financial Cryptography and Data Security. FC 2022 International Workshops, (225-237)
  80. Shen F, Wang X, Li H and Yin X (2022). Adaptive output‐feedback control for a class of nonlinear systems based on optimized backstepping technique, International Journal of Adaptive Control and Signal Processing, 36:5, (1077-1097), Online publication date: 4-May-2022.
  81. Feldman E and Sakhartov A (2022). Resource Redeployment and Divestiture as Strategic Alternatives, Organization Science, 33:3, (926-945), Online publication date: 1-May-2022.
  82. Strub M and Gammell J (2022). Adaptively Informed Trees (AIT*) and Effort Informed Trees (EIT*), International Journal of Robotics Research, 41:4, (390-417), Online publication date: 1-Apr-2022.
  83. Ramírez J, Yu W and Perrusquía A (2022). Model-free reinforcement learning from expert demonstrations: a survey, Artificial Intelligence Review, 55:4, (3213-3241), Online publication date: 1-Apr-2022.
  84. Kautz H (2022). The third AI summer, AI Magazine, 43:1, (105-125), Online publication date: 31-Mar-2022.
  85. Cook W Computing in Combinatorial Optimization Computing and Software Science, (27-47)
  86. Bargagli Stoffi F, Cevolani G and Gnecco G (2022). Simple Models in Complex Worlds: Occam’s Razor and Statistical Learning Theory, Minds and Machines, 32:1, (13-42), Online publication date: 1-Mar-2022.
  87. Anderlucci L, Fortunato F and Montanari A (2022). High-Dimensional Clustering via Random Projections, Journal of Classification, 39:1, (191-216), Online publication date: 1-Mar-2022.
  88. Wojciechowski P, Subramani K, Velasquez A and Williamson M On the Approximability of Path and Cycle Problems in Arc-Dependent Networks Algorithms and Discrete Applied Mathematics, (292-304)
  89. Aradi S (2022). Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles, IEEE Transactions on Intelligent Transportation Systems, 23:2, (740-759), Online publication date: 1-Feb-2022.
  90. Abboud A, Bringmann K, Hermelin D and Shabtay D (2022). SETH-based Lower Bounds for Subset Sum and Bicriteria Path, ACM Transactions on Algorithms, 18:1, (1-22), Online publication date: 31-Jan-2022.
  91. Zhang Y and Amin N (2022). Reasoning about “reasoning about reasoning”: semantics and contextual equivalence for probabilistic programs with nested queries and recursion, Proceedings of the ACM on Programming Languages, 6:POPL, (1-28), Online publication date: 16-Jan-2022.
  92. Jones M, Djahel S and Welsh K MQTPP – Towards Multiple Q-Table based Path Planning in UAV Environments 2022 IEEE 19th Annual Consumer Communications & Networking Conference (CCNC), (457-460)
  93. Khutortsev V (2022). Trajectory Control of the Observation Process of a Mobile Digital Direction Finder in the Topology of a Road Network, Journal of Computer and Systems Sciences International, 61:1, (123-133), Online publication date: 1-Jan-2022.
  94. Bramlage L and Cortese A (2022). Generalized attention-weighted reinforcement learning, Neural Networks, 145:C, (10-21), Online publication date: 1-Jan-2022.
  95. Siddig M and Song Y (2022). Adaptive partition-based SDDP algorithms for multistage stochastic linear programming with fixed recourse, Computational Optimization and Applications, 81:1, (201-250), Online publication date: 1-Jan-2022.
  96. Stefansson E and Johansson K Computing Complexity-aware Plans Using Kolmogorov Complexity 2021 60th IEEE Conference on Decision and Control (CDC), (3420-3427)
  97. Ibragimov D, Novozhilin N and Portseva E (2021). On Sufficient Optimality Conditions for a Guaranteed Control in the Speed Problem for a Linear Time-Varying Discrete-Time System with Bounded Control, Automation and Remote Control, 82:12, (2076-2096), Online publication date: 1-Dec-2021.
  98. Kerimkulov B, Šiška D and Szpruch L (2021). A Modified MSA for Stochastic Control Problems, Applied Mathematics and Optimization, 84:3, (3417-3436), Online publication date: 1-Dec-2021.
  99. Siebenborn M and Wagner J (2021). A multigrid preconditioner for tensor product spline smoothing, Computational Statistics, 36:4, (2379-2411), Online publication date: 1-Dec-2021.
  100. Korneenko V (2021). An Efficient Algorithm of Dead-End Controls for Solving Combinatorial Optimization Problems, Automation and Remote Control, 82:10, (1692-1705), Online publication date: 1-Oct-2021.
  101. Chu H, Guo L, Yan Y, Gao B and Chen H (2021). Self-Learning Optimal Cruise Control Based on Individual Car-Following Style, IEEE Transactions on Intelligent Transportation Systems, 22:10, (6622-6633), Online publication date: 1-Oct-2021.
  102. Villanueva M, Jones C and Houska B (2021). Towards global optimal control via Koopman lifts, Automatica (Journal of IFAC), 132:C, Online publication date: 1-Oct-2021.
  103. Kůrková V and Sanguineti M (2021). Correlations of random classifiers on large data sets, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 25:19, (12641-12648), Online publication date: 1-Oct-2021.
  104. Browning J, Kornreich M, Chow A, Pawar J, Zhang L, Herzog R and Odry B Uncertainty Aware Deep Reinforcement Learning for Anatomical Landmark Detection in Medical Images Medical Image Computing and Computer Assisted Intervention – MICCAI 2021, (636-644)
  105. Hamednia A, Razi M, Murgovski N and Fredriksson J Electric Vehicle Eco-driving under Wind Uncertainty 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), (3502-3508)
  106. Magnanti T (2021). Optimization, Management Science, 67:9, (5349-5363), Online publication date: 1-Sep-2021.
  107. Bortakovsky A and Uryupin I (2021). Optimization of Switchable Systems’ Trajectories, Journal of Computer and Systems Sciences International, 60:5, (701-718), Online publication date: 1-Sep-2021.
  108. Yeganeh-Khaksar A, Ansari M, Safari S, Yari-Karin S and Ejlali A (2021). Ring-DVFS: Reliability-Aware Reinforcement Learning-Based DVFS for Real-Time Embedded Systems, IEEE Embedded Systems Letters, 13:3, (146-149), Online publication date: 1-Sep-2021.
  109. Beck C, Becker S, Grohs P, Jaafari N and Jentzen A (2021). Solving the Kolmogorov PDE by Means of Deep Learning, Journal of Scientific Computing, 88:3, Online publication date: 1-Sep-2021.
  110. Antonopoulos A, Pagourtzis A, Petsalakis S and Vasilakis M Faster Algorithms for k-Subset Sum and Variations Frontiers of Algorithmics, (37-52)
  111. Liberti L, Sager S and Wiegele A (2021). Preface, Mathematical Programming: Series A and B, 188:2, (411-419), Online publication date: 1-Aug-2021.
  112. Kishikawa D and Arai S (2021). Estimation of personal driving style via deep inverse reinforcement learning, Artificial Life and Robotics, 26:3, (338-346), Online publication date: 1-Aug-2021.
  113. Scampicchio A, Aravkin A and Pillonetto G (2021). Stable and robust LQR design via scenario approach, Automatica (Journal of IFAC), 129:C, Online publication date: 1-Jul-2021.
  114. Jouvin N, Bouveyron C and Latouche P (2021). A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering, Statistics and Computing, 31:4, Online publication date: 1-Jul-2021.
  115. Abadi M and Plotkin G Smart choices and the selection monad Proceedings of the 36th Annual ACM/IEEE Symposium on Logic in Computer Science, (1-14)
  116. Fouché E, Mazankiewicz A, Kalinke F and Böhm K (2021). A framework for dependency estimation in heterogeneous data streams, Distributed and Parallel Databases, 39:2, (415-444), Online publication date: 1-Jun-2021.
  117. Olsen T, Tumlin A, Stiffler N and O’Kane J A Visibility Roadmap Sampling Approach for a Multi-Robot Visibility-Based Pursuit-Evasion Problem 2021 IEEE International Conference on Robotics and Automation (ICRA), (7957-7964)
  118. Girdhar Y, Rivkin D, Wu D, Jenkin M, Liu X and Dudek G Optimizing Cellular Networks via Continuously Moving Base Stations on Road Networks 2021 IEEE International Conference on Robotics and Automation (ICRA), (4020-4025)
  119. Sleiman J, Farshidian F and Hutter M Constraint Handling in Continuous-Time DDP-Based Model Predictive Control 2021 IEEE International Conference on Robotics and Automation (ICRA), (8209-8215)
  120. Goycoolea M, Lamas P, Pagnoncelli B and Piazza A (2021). Lane’s Algorithm Revisited, Management Science, 67:5, (3087-3103), Online publication date: 1-May-2021.
  121. Goncharenko V, Zheltov S, Knyaz V, Lebedev G, Mikhaylin D and Tsareva O (2021). Intelligent System for Planning Group Actions of Unmanned Aircraft in Observing Mobile Objects on the Ground in the Specified Area, Journal of Computer and Systems Sciences International, 60:3, (379-395), Online publication date: 1-May-2021.
  122. Smirnov S (2021). A Guaranteed Deterministic Approach to Superhedging: Financial Market Model, Trading Constraints, and the Bellman–Isaacs Equations, Automation and Remote Control, 82:4, (722-743), Online publication date: 1-Apr-2021.
  123. Bacalhau E, Casacio L and de Azevedo A (2021). New hybrid genetic algorithms to solve dynamic berth allocation problem, Expert Systems with Applications: An International Journal, 167:C, Online publication date: 1-Apr-2021.
  124. Nasution A, Murakami Y and Ishida T (2021). Plan Optimization to Bilingual Dictionary Induction for Low-resource Language Families, ACM Transactions on Asian and Low-Resource Language Information Processing, 20:2, (1-28), Online publication date: 31-Mar-2021.
  125. Turri V, Besselink B and Johansson K Gear management for fuel-efficient heavy-duty vehicle platooning 2016 IEEE 55th Conference on Decision and Control (CDC), (1687-1694)
  126. Raković S, Levine W and Açıkmeşe B Continuously generalized model predictive control 2016 IEEE 55th Conference on Decision and Control (CDC), (616-621)
  127. Wang G, Fang Z, Xie X, Wang S, Sun H, Zhang F, Liu Y and Zhang D (2020). Pricing-aware Real-time Charging Scheduling and Charging Station Expansion for Large-scale Electric Buses, ACM Transactions on Intelligent Systems and Technology, 12:1, (1-26), Online publication date: 28-Feb-2021.
  128. Yegorov I and Dower P (2021). Perspectives on Characteristics Based Curse-of-Dimensionality-Free Numerical Approaches for Solving Hamilton–Jacobi Equations, Applied Mathematics and Optimization, 83:1, (1-49), Online publication date: 1-Feb-2021.
  129. Dowson O and Kapelevich L (2021). SDDP.jl, INFORMS Journal on Computing, 33:1, (27-33), Online publication date: 1-Jan-2021.
  130. Asif A, Savas E, AlSalman H, Arshad M, Gumaei A, Rehman A and Kayes A (2021). Fixed Point of Rational Contractions and Its Application for Secure Dynamic Routing in Wireless Sensor Networks, Security and Communication Networks, 2021, Online publication date: 1-Jan-2021.
  131. Kesan J and Zhang L (2020). Analysis of Cyber Incident Categories Based on Losses, ACM Transactions on Management Information Systems, 11:4, (1-28), Online publication date: 31-Dec-2021.
  132. Scampicchio A and Pillonetto G A convex approach to robust LQR 2020 59th IEEE Conference on Decision and Control (CDC), (3705-3710)
  133. Fujimoto S, Meger D and Precup D An equivalence between loss functions and non-uniform sampling in experience replay Proceedings of the 34th International Conference on Neural Information Processing Systems, (14219-14230)
  134. van der Pol E, Worrall D, van Hoof H, Oliehoek F and Welling M MDP homomorphic networks Proceedings of the 34th International Conference on Neural Information Processing Systems, (4199-4210)
  135. Neu G and Pike-Burke C A unifying view of optimism in episodic reinforcement learning Proceedings of the 34th International Conference on Neural Information Processing Systems, (1392-1403)
  136. Lausser L, Szekely R and Kestler H (2020). Chained correlations for feature selection, Advances in Data Analysis and Classification, 14:4, (871-884), Online publication date: 1-Dec-2020.
  137. Legoll F, Lelièvre T, Myerscough K and Samaey G (2020). Parareal computation of stochastic differential equations with time-scale separation: a numerical convergence study, Computing and Visualization in Science, 23:1-4, Online publication date: 1-Dec-2020.
  138. Bortakovskii A (2020). Separation Theorem for Average Optimal Control for Hybrid Systems of Variable Dimension, Automation and Remote Control, 81:11, (1974-1993), Online publication date: 1-Nov-2020.
  139. Bringmann K, Gawrychowski P, Mozes S and Weimann O (2020). Tree Edit Distance Cannot be Computed in Strongly Subcubic Time (Unless APSP Can), ACM Transactions on Algorithms, 16:4, (1-22), Online publication date: 31-Oct-2020.
  140. Yoon M, Yoon W, Seo M, Ryu S and Lee J Air conditioner component optimum operation point search through a deep reinforcement learning algorithm 2020 20th International Conference on Control, Automation and Systems (ICCAS), (365-372)
  141. Kuinchtner D, Meneguzzi F and Sales A A Tensor-Based Markov Decision Process Representation Advances in Soft Computing, (313-324)
  142. Amman H and Tucci M (2020). How Active is Active Learning: Value Function Method Versus an Approximation Method, Computational Economics, 56:3, (675-693), Online publication date: 1-Oct-2020.
  143. Feely C, Caulfield B, Lawlor A and Smyth B Providing Explainable Race-Time Predictions and Training Plan Recommendations to Marathon Runners Fourteenth ACM Conference on Recommender Systems, (539-544)
  144. Hamednia A, Murgovski N and Fredriksson J Time Optimal and Eco-driving Mission Planning under Traffic Constraints 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), (1-7)
  145. Andronov A, Dalinger I and Santalova D Problem of Overbooking for a Case of a Random Environment Existence Distributed Computer and Communication Networks, (393-405)
  146. Tsushima K, Trong B, Glück R and Hu Z An Efficient Composition of Bidirectional Programs by Memoization and Lazy Update Functional and Logic Programming, (159-178)
  147. Glüge S, Amirian M, Flumini D and Stadelmann T How (Not) to Measure Bias in Face Recognition Networks Artificial Neural Networks in Pattern Recognition, (125-137)
  148. Petrosian O, Tikhomirov D, Kuchkarov I and Gao H (2020). About One Differential Game Model with Dynamic Updating, Automation and Remote Control, 81:9, (1733-1750), Online publication date: 1-Sep-2020.
  149. Carpentier P, Chancelier J, De Lara M and Pacaud F (2020). Mixed Spatial and Temporal Decompositions for Large-Scale Multistage Stochastic Optimization Problems, Journal of Optimization Theory and Applications, 186:3, (985-1005), Online publication date: 1-Sep-2020.
  150. Bouveret G and Picarelli A (2020). A Level-Set Approach for Stochastic Optimal Control Problems Under Controlled-Loss Constraints, Journal of Optimization Theory and Applications, 186:3, (779-805), Online publication date: 1-Sep-2020.
  151. ACM
    Zhao X, Zheng X, Yang X, Liu X and Tang J Jointly Learning to Recommend and Advertise Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (3319-3327)
  152. Magirou E, Vassalos P and Barakitis N (2020). A policy iteration algorithm for the American put option and free boundary control problems, Journal of Computational and Applied Mathematics, 373:C, Online publication date: 1-Aug-2020.
  153. Xie Y, Dibangoye J and Buffet O Optimally solving two-agent decentralized POMDPs under one-sided information sharing Proceedings of the 37th International Conference on Machine Learning, (10473-10482)
  154. Pavse B, Durugkar I, Hanna J and Stone P Reducing sampling error in batch temporal difference learning Proceedings of the 37th International Conference on Machine Learning, (7543-7552)
  155. Fedus W, Ramachandran P, Agarwal R, Bengio Y, Larochelle H, Rowland M and Dabney W Revisiting fundamentals of experience replay Proceedings of the 37th International Conference on Machine Learning, (3061-3071)
  156. Agarwal R, Schuurmans D and Norouzi M An optimistic perspective on offline reinforcement learning Proceedings of the 37th International Conference on Machine Learning, (104-114)
  157. Vincze D, Tóth A and Niitsuma M Antecedent Redundancy Exploitation in Fuzzy Rule Interpolation-based Reinforcement Learning 2020 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM), (1316-1321)
  158. Boldinov V, Bukhalev V and Skrynnikov A (2020). Game-Theoretic Control of the Object’s Random Jump Structure in the Class of Pure Strategies, Journal of Computer and Systems Sciences International, 59:4, (494-503), Online publication date: 1-Jul-2020.
  159. ACM
    Stevens C and Bagheri H Reducing run-time adaptation space via analysis of possible utility bounds Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, (1522-1534)
  160. ACM
    Karmelita M and Pawlak T CMA-ES for one-class constraint synthesis Proceedings of the 2020 Genetic and Evolutionary Computation Conference, (859-867)
  161. ACM
    Bringmann K and Nakos V Top-𝑘-convolution and the quest for near-linear output-sensitive subset sum Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, (982-995)
  162. Wojciechowski P, Williamson M and Subramani K On Finding Shortest Paths in Arc-Dependent Networks Combinatorial Optimization, (249-260)
  163. Mohanan M and Salgaonkar A (2020). Probabilistic Approach to Robot Motion Planning in Dynamic Environments, SN Computer Science, 1:3, Online publication date: 1-May-2020.
  164. Zhang Y, Zhao B and Liu D (2020). Deterministic policy gradient adaptive dynamic programming for model-free optimal control, Neurocomputing, 387:C, (40-50), Online publication date: 28-Apr-2020.
  165. Shehu Y and Harper R Improved Fault Localization using Transfer Learning and Language Modeling NOMS 2020 - 2020 IEEE/IFIP Network Operations and Management Symposium, (1-6)
  166. Gammell J, Barfoot T and Srinivasa S (2020). Batch Informed Trees (BIT*), International Journal of Robotics Research, 39:5, (543-567), Online publication date: 1-Apr-2020.
  167. ACM
    Willett N, Shin H, Jin Z, Li W and Finkelstein A Pose2Pose Proceedings of the 25th International Conference on Intelligent User Interfaces, (88-99)
  168. Rădulescu R, Mannion P, Roijers D and Nowé A (2019). Multi-objective multi-agent decision making: a utility-based analysis and survey, Autonomous Agents and Multi-Agent Systems, 34:1, Online publication date: 9-Mar-2020.
  169. Bian T, Wolpert D and Jiang Z (2020). Model-Free Robust Optimal Feedback Mechanisms of Biological Motor Control, Neural Computation, 32:3, (562-595), Online publication date: 1-Mar-2020.
  170. Garí Y, Monge D, Mateos C and García Garino C (2019). Learning budget assignment policies for autoscaling scientific workflows in the cloud, Cluster Computing, 23:1, (87-105), Online publication date: 1-Mar-2020.
  171. Wei H, Chen C and Li S (2019). A Unified Approach Through Image Space Analysis to Robustness in Uncertain Optimization Problems, Journal of Optimization Theory and Applications, 184:2, (466-493), Online publication date: 1-Feb-2020.
  172. Salas-Molina F, Rodriguez-Aguilar J and Pla-Santamaria D (2019). A stochastic goal programming model to derive stable cash management policies, Journal of Global Optimization, 76:2, (333-346), Online publication date: 1-Feb-2020.
  173. Larkin E, Privalov A and Bogomolov A (2020). Discrete Approach to Simulating Synchronized Relay Races, Automatic Documentation and Mathematical Linguistics, 54:1, (43-51), Online publication date: 1-Jan-2020.
  174. Fahad L, Tahir S, Shahzad W, Hassan M, Alquhayz H, Hassan R and Acacio Sanchez M (2020). Ant Colony Optimization-Based Streaming Feature Selection, Scientific Programming, 2020, Online publication date: 1-Jan-2020.
  175. Shah A, Ganesan R, Jajodia S, Samarati P and Cam H (2019). Adaptive Alert Management for Balancing Optimal Performance among Distributed CSOCs using Reinforcement Learning, IEEE Transactions on Parallel and Distributed Systems, 31:1, (16-33), Online publication date: 1-Jan-2020.
  176. Niranjana R, Kumar V and Sheen S (2019). Darknet Traffic Analysis and Classification Using Numerical AGM and Mean Shift Clustering Algorithm, SN Computer Science, 1:1, Online publication date: 1-Jan-2020.
  177. Yang R, Sun X and Narasimhan K A generalized algorithm for multi-objective reinforcement learning and policy adaptation Proceedings of the 33rd International Conference on Neural Information Processing Systems, (14636-14647)
  178. Penedones H, Riquelme C, Vincent D, Maennel H, Mann T, Barreto A, Gelly S and Neu G Adaptive temporal-difference learning for policy evaluation with per-state uncertainty estimates Proceedings of the 33rd International Conference on Neural Information Processing Systems, (11895-11905)
  179. Leibfried F, Pascual-Díaz S and Grau-Moya J A unified bellman optimality principle combining reward maximization and empowerment Proceedings of the 33rd International Conference on Neural Information Processing Systems, (7869-7880)
  180. Yang D, Zhao L, Lin Z, Qin T, Bian J and Liu T Fully parameterized quantile function for distributional reinforcement learning Proceedings of the 33rd International Conference on Neural Information Processing Systems, (6193-6202)
  181. Bellemare M, Dabney W, Dadashi R, Taiga A, Castro P, Roux N, Schuurmans D, Lattimore T and Lyle C A geometric perspective on optimal representations for reinforcement learning Proceedings of the 33rd International Conference on Neural Information Processing Systems, (4358-4369)
  182. Nachum O, Chow Y, Dai B and Li L DualDICE Proceedings of the 33rd International Conference on Neural Information Processing Systems, (2318-2328)
  183. Guo Z, Li J and Ramesh R (2019). Optimal Management of Virtual Infrastructures Under Flexible Cloud Service Agreements, Information Systems Research, 30:4, (1424-1446), Online publication date: 1-Dec-2019.
  184. Barham R, Sharieh A and Sleit A (2020). A meta-heuristic framework based on clustering and preprocessed datasets for solving the link prediction problem, Journal of Information Science, 45:6, (794-817), Online publication date: 1-Dec-2019.
  185. Eskandarian A (2019). Scanning the Issue, IEEE Transactions on Intelligent Transportation Systems, 20:12, (4257-4261), Online publication date: 1-Dec-2019.
  186. Wang S, Ahmed N and Yeap T (2019). Optimum Management of Urban Traffic Flow Based on a Stochastic Dynamic Model, IEEE Transactions on Intelligent Transportation Systems, 20:12, (4377-4389), Online publication date: 1-Dec-2019.
  187. ACM
    Huang Z, Liu Q, Zhai C, Yin Y, Chen E, Gao W and Hu G Exploring Multi-Objective Exercise Recommendations in Online Education Systems Proceedings of the 28th ACM International Conference on Information and Knowledge Management, (1261-1270)
  188. ACM
    Jin J, Zhou M, Zhang W, Li M, Guo Z, Qin Z, Jiao Y, Tang X, Wang C, Wang J, Wu G and Ye J CoRide Proceedings of the 28th ACM International Conference on Information and Knowledge Management, (1983-1992)
  189. Ning J and Sobel M (2019). Easy Affine Markov Decision Processes, Operations Research, 67:6, (1719-1737), Online publication date: 1-Nov-2019.
  190. Fokkink R, Lidbetter T and Végh L (2019). On Submodular Search and Machine Scheduling, Mathematics of Operations Research, 44:4, (1431-1449), Online publication date: 1-Nov-2019.
  191. Anand K and Goyal M (2019). Ethics, Bounded Rationality, and IP Sharing in IT Outsourcing, Management Science, 65:11, (5252-5267), Online publication date: 1-Nov-2019.
  192. ACM
    Dinakarrao S, Joseph A, Haridass A, Shafique M, Henkel J and Homayoun H (2019). Application and Thermal-reliability-aware Reinforcement Learning Based Multi-core Power Management, ACM Journal on Emerging Technologies in Computing Systems, 15:4, (1-19), Online publication date: 31-Oct-2019.
  193. Reyes A, Ibargüengoytia P and Santamaría G SPI: A Software Tool for Planning Under Uncertainty Based on Learning Factored MDPs Advances in Soft Computing, (475-485)
  194. Luthmann L, Göttmann H and Lochau M Compositional Liveness-Preserving Conformance Testing of Timed I/O Automata Formal Aspects of Component Software, (147-169)
  195. Hu J and Guo W Flexibility Analysis in Waste-to-Energy Systems based on Decision Rules and Gene Expression Programming 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), (988-993)
  196. Köbis E, Köbis M and Qin X (2019). Nonlinear Separation Approach to Inverse Variational Inequalities in Real Linear Spaces, Journal of Optimization Theory and Applications, 183:1, (105-121), Online publication date: 1-Oct-2019.
  197. Bulinskaya E Discrete-Time Insurance Models. Optimization of Their Performance by Reinsurance and Bank Loans Distributed Computer and Communication Networks, (342-353)
  198. Chen Z and Zhang Z Deep Recurrent Policy Networks for Planning Under Partial Observability Artificial Neural Networks and Machine Learning – ICANN 2019: Theoretical Neural Computation, (598-610)
  199. Leurent E and Maillard O Practical Open-Loop Optimistic Planning Machine Learning and Knowledge Discovery in Databases, (69-85)
  200. Kim N and Browne R (2019). Subspace clustering for the finite mixture of generalized hyperbolic distributions, Advances in Data Analysis and Classification, 13:3, (641-661), Online publication date: 1-Sep-2019.
  201. Malinovsky Y (2019). Sterrett Procedure for the Generalized Group Testing Problem, Methodology and Computing in Applied Probability, 21:3, (829-840), Online publication date: 1-Sep-2019.
  202. Hartisch M and Lorenz U Mastering Uncertainty: Towards Robust Multistage Optimization with Decision Dependent Uncertainty PRICAI 2019: Trends in Artificial Intelligence, (446-458)
  203. Hartisch M and Lorenz U A Novel Application for Game Tree Search - Exploiting Pruning Mechanisms for Quantified Integer Programs Advances in Computer Games, (66-78)
  204. Kishimoto A, Botea A and Marinescu R Depth-first memory-limited AND/OR search and unsolvability in cyclic search spaces Proceedings of the 28th International Joint Conference on Artificial Intelligence, (1280-1288)
  205. ACM
    Choi Y, Park S and Cha H Optimizing Energy Efficiency of Browsers in Energy-Aware Scheduling-enabled Mobile Devices The 25th Annual International Conference on Mobile Computing and Networking, (1-16)
  206. ACM
    Povéda G, Regnier-Coudert O, Teichteil-Königsbuch F, Dupont G, Arnold A, Guerra J and Picard M Evolutionary approaches to dynamic earth observation satellites mission planning under uncertainty Proceedings of the Genetic and Evolutionary Computation Conference, (1302-1310)
  207. ACM
    Knauf F and Bruns R A peek into the swarm Proceedings of the Genetic and Evolutionary Computation Conference, (30-38)
  208. Li F, Jiang Q, Quan W, Song R and Li Y Manipulation Skill Acquisition for Robotic Assembly using Deep Reinforcement Learning 2019 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM), (13-18)
  209. Hornung R, Chen N and van der Smagt P Early integration for movement modeling in latent spaces The Handbook of Multimodal-Multisensor Interfaces, (305-345)
  210. Bortakovskii A and Uryupin I (2019). Minimization of the Number of Switchings between Optimal Continuous-Discrete Controlled Processes, Journal of Computer and Systems Sciences International, 58:4, (528-544), Online publication date: 1-Jul-2019.
  211. Hendrix E, Rocha A and García I On Trajectory Optimization of an Electric Vehicle Computational Science and Its Applications – ICCSA 2019, (249-260)
  212. Abouheaf M and Gueaieb W Model-Free Adaptive Control Approach Using Integral Reinforcement Learning 2019 IEEE International Symposium on Robotic and Sensors Environments (ROSE), (1-7)
  213. Abouheaf M, Mailhot N and Gueaieb W An Online Reinforcement Learning Wing-Tracking Mechanism for Flexible Wing Aircraft 2019 IEEE International Symposium on Robotic and Sensors Environments (ROSE), (1-7)
  214. Kel’manov A and Khandeev V Exact Linear-Time Algorithm for Parameterized K-Means Problem with Optimized Number of Clusters in the 1D Case Numerical Computations: Theory and Algorithms, (394-399)
  215. Trosten D and Sharma P Unsupervised Feature Extraction – A CNN-Based Approach Image Analysis, (197-208)
  216. Zema N, Quadri D, Martin S and Shrit O Formation control of a mono-operated UAV fleet through ad-hoc communications: a Q-learning approach 2019 16th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON), (1-6)
  217. Tingstad Jacobsen S, Gustafsson A, Vu N, Madhusudhana S, Hamednia A, Sharma N and Murgovski N Predictive cruise control behind a stationary or slow moving object 2019 IEEE Intelligent Vehicles Symposium (IV), (2099-2105)
  218. Bouton M, Nakhaei A, Fujimura K and Kochenderfer M Safe Reinforcement Learning with Scene Decomposition for Navigating Complex Urban Environments 2019 IEEE Intelligent Vehicles Symposium (IV), (1469-1476)
  219. ACM
    Zhou H, Khatri S, Hu J and Liu F A Memory-Efficient Markov Decision Process Computation Framework Using BDD-based Sampling Representation Proceedings of the 56th Annual Design Automation Conference 2019, (1-6)
  220. Kel’manov A and Khandeev V On Polynomial Solvability of One Quadratic Euclidean Clustering Problem on a Line Learning and Intelligent Optimization, (46-52)
  221. Abouheaf M and Gueaieb W Multi-Agent Synchronization Using Online Model-Free Action Dependent Dual Heuristic Dynamic Programming Approach 2019 International Conference on Robotics and Automation (ICRA), (2195-2201)
  222. Wray K and Zilberstein S Policy Networks Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, (2270-2272)
  223. Rockefeller G, Mannion P and Tumer K Curriculum Learning for Tightly Coupled Multiagent Systems Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, (2174-2176)
  224. Pineda L and Zilberstein S (2019). Probabilistic planning with reduced models, Journal of Artificial Intelligence Research, 65:1, (271-306), Online publication date: 1-May-2019.
  225. Singh C, Baishnab K and Anandini C (2019). Analysis and optimization of noises of an analog circuit via PSO algorithms, Microsystem Technologies, 25:5, (1793-1807), Online publication date: 1-May-2019.
  226. ACM
    Brown D, Japa A and Shi Y An Attempt at Improving Density-based Clustering Algorithms Proceedings of the 2019 ACM Southeast Conference, (172-175)
  227. ACM
    Govindaiah S and Petty M Applying Reinforcement Learning to Plan Manufacturing Material Handling Part 2 Proceedings of the 2019 ACM Southeast Conference, (16-23)
  228. Ibragimov D (2019). On the Optimal Speed Problem for the Class of Linear Autonomous Infinite-Dimensional Discrete-Time Systems with Bounded Control and Degenerate Operator, Automation and Remote Control, 80:3, (393-412), Online publication date: 1-Mar-2019.
  229. Han Z, Tan H, Wang R, Chen G, Li Y and Lau F (2019). Energy-Efficient Dynamic Virtual Machine Management in Data Centers, IEEE/ACM Transactions on Networking, 27:1, (344-360), Online publication date: 1-Feb-2019.
  230. Ghosh S, Subramanian E, Bhat S, Gujar S and Paruchuri P VidyutVanika Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (914-921)
  231. Eifler R, Fickert M, Hoffmann J and Ruml W Refining abstraction heuristics during real-time planning Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (7578-7585)
  232. Zhang S and Yao H QUOTA Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (5797-5804)
  233. Gelada C and Bellemare M Off-policy deep reinforcement learning by bootstrapping the covariate shift Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (3647-3655)
  234. Gao Y, Zhao L, Wu L, Ye Y, Xiong H and Yang C Incomplete label multi-task deep learning for spatio-temporal event subtype forecasting Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (3638-3646)
  235. Mitchell A, Ruml W, Spaniol F, Hoffmann J and Petrik M Real-time planning as decision-making under uncertainty Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (2338-2345)
  236. Ryzhov I, Mes M, Powell W and van den Berg G (2019). Bayesian Exploration for Approximate Dynamic Programming, Operations Research, 67:1, (198-214), Online publication date: 1-Jan-2019.
  237. Bensoussan A and Chevalier-Roignant B (2018). Sequential Capacity Expansion Options, Operations Research, 67:1, (33-57), Online publication date: 1-Jan-2019.
  238. Goncharenko V, Lebedev G and Mikhailin D (2019). Online Two-Dimensional Route Planning for a Group of Unmanned Aerial Vehicles, Journal of Computer and Systems Sciences International, 58:1, (147-158), Online publication date: 1-Jan-2019.
  239. Bortakovskii A and Nemychenkov G (2019). Optimal in the Mean Control of Deterministic Switchable Systems Given Discrete Inexact Measurements, Journal of Computer and Systems Sciences International, 58:1, (50-74), Online publication date: 1-Jan-2019.
  240. Wang J, Gou L, Shen H and Yang H (2018). DQNViz: A Visual Analytics Approach to Understand Deep Q-Networks, IEEE Transactions on Visualization and Computer Graphics, 25:1, (288-298), Online publication date: 1-Jan-2019.
  241. Mardanov M and Malik S (2019). Necessary First- and Second-Order Optimality Conditions in Discrete Systems with a Delay in Control, Journal of Dynamical and Control Systems, 25:1, (29-43), Online publication date: 1-Jan-2019.
  242. Breschi V, Bemporad A, Piga D and Boyd S Prediction error methods in learning jump ARMAX models 2018 IEEE Conference on Decision and Control (CDC), (2247-2252)
  243. Buccafusca L and Beck C Maximizing Power in Wind Turbine Arrays with Variable Wind Dynamics 2018 IEEE Conference on Decision and Control (CDC), (2667-2672)
  244. Yegorov I, Dower P and Grüne L Global extension of local control Lyapunov functions via exit-time optimal control 2018 IEEE Conference on Decision and Control (CDC), (874-879)
  245. Pang B, Bian T and Jiang Z Data-driven Finite-horizon Optimal Control for Linear Time-varying Discrete-time Systems 2018 IEEE Conference on Decision and Control (CDC), (861-866)
  246. Baumann D, Trimpe S, Zhu J and Martius G Deep Reinforcement Learning for Event-Triggered Control 2018 IEEE Conference on Decision and Control (CDC), (943-950)
  247. Sahoo A and Narayanan V Event-based Near Optimal Sampling and Tracking Control of Nonlinear Systems 2018 IEEE Conference on Decision and Control (CDC), (55-60)
  248. Scarciotti G and Mylvaganam T Approximate Infinite-Horizon Optimal Control for Stochastic Systems 2018 IEEE Conference on Decision and Control (CDC), (3294-3298)
  249. Ueda R Searching Behavior of a Simple Manipulator Only with Sense of Touch Generated by Probabilistic Flow Control 2018 IEEE International Conference on Robotics and Biomimetics (ROBIO), (594-599)
  250. Waldock A, Greatwood C, Salama F and Richardson T (2018). Learning to Perform a Perched Landing on the Ground Using Deep Reinforcement Learning, Journal of Intelligent and Robotic Systems, 92:3-4, (685-704), Online publication date: 1-Dec-2018.
  251. Li J and Yang L (2018). Set-Valued Systems with Infinite-Dimensional Image and Applications, Journal of Optimization Theory and Applications, 179:3, (868-895), Online publication date: 1-Dec-2018.
  252. Pulgar F, Charte F, Rivera A and del Jesus M A First Approach to Face Dimensionality Reduction Through Denoising Autoencoders Intelligent Data Engineering and Automated Learning – IDEAL 2018, (439-447)
  253. Guldstrand Larsen K and Legay A Statistical Model Checking the 2018 Edition! Leveraging Applications of Formal Methods, Verification and Validation. Verification, (261-270)
  254. Montanaro U, Fallah S, Dianati M, Oxtoby D, Mizutani T and Mouzakitis A Cloud-Assisted Distributed Control System Architecture for Platooning 2018 21st International Conference on Intelligent Transportation Systems (ITSC), (1258-1265)
  255. Liu X, Montgomery A and Srinivasan K (2018). Analyzing Bank Overdraft Fees with Big Data, Marketing Science, 37:6, (855-882), Online publication date: 1-Nov-2018.
  256. ACM
    Jiaoman D, Lei L and Xiang L Travel planning problem considering site selection and itinerary making Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems, (29-36)
  257. Sun X, Wu C, Chen L and Lin J (2018). Using Inter-Block Synchronization to Improve the Knapsack Problem on GPUs, International Journal of Grid and High Performance Computing, 10:4, (83-98), Online publication date: 1-Oct-2018.
  258. Doan V, Fujimoto H, Koseki T, Yasuda T, Kishi H and Fujita T (2018). Allocation of Wireless Power Transfer System From Viewpoint of Optimal Control Problem for Autonomous Driving Electric Vehicles, IEEE Transactions on Intelligent Transportation Systems, 19:10, (3255-3270), Online publication date: 1-Oct-2018.
  259. Wen G, Chen C, Feng J and Zhou N (2018). Optimized Multi-Agent Formation Control Based on an Identifier–Actor–Critic Reinforcement Learning Algorithm, IEEE Transactions on Fuzzy Systems, 26:5, (2719-2731), Online publication date: 1-Oct-2018.
  260. Ogunmolu O, Gans N and Summers T Minimax Iterative Dynamic Game: Application to Nonlinear Robot Control Tasks 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (6919-6925)
  261. Arneberg J, Tal E and Karaman S Guidance Laws for Partially-Observable Interception Based on Linear Covariance Analysis 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (4185-4191)
  262. ACM
    Shah A, Ganesan R, Jajodia S and Cam H (2018). Dynamic Optimization of the Level of Operational Effectiveness of a CSOC Under Adverse Conditions, ACM Transactions on Intelligent Systems and Technology, 9:5, (1-20), Online publication date: 30-Sep-2018.
  263. Song J, Yoon G, Cho H and Yoon S (2018). Structure preserving dimensionality reduction for visual object recognition, Multimedia Tools and Applications, 77:18, (23529-23545), Online publication date: 1-Sep-2018.
  264. Legros B (2018). M/G/1 queue with event-dependent arrival rates, Queueing Systems: Theory and Applications, 89:3-4, (269-301), Online publication date: 1-Aug-2018.
  265. ACM
    Ding Y, Liu W, Bian J, Zhang D and Liu T Investor-Imitator Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (1310-1319)
  266. ACM
    Zhao X, Zhang L, Ding Z, Xia L, Tang J and Yin D Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (1040-1048)
  267. Bernstein A, Burnaev E and Kachan O Reinforcement Learning for Computer Vision and Robot Navigation Machine Learning and Data Mining in Pattern Recognition, (258-272)
  268. ACM
    Chen A, Ren Z, Yang Y, Liang Y and Pang B A historical interdependency based differential grouping algorithm for large scale global optimization Proceedings of the Genetic and Evolutionary Computation Conference Companion, (1711-1715)
  269. Kel’manov A, Mikhailova L and Romanchenko S On a Problem of Summing Elements Chosen from the Family of Finite Numerical Sequences Analysis of Images, Social Networks and Texts, (305-317)
  270. ACM
    Chen A, Zhang Y, Ren Z, Yang Y, Liang Y and Pang B A global information based adaptive threshold for grouping large scale optimization problems Proceedings of the Genetic and Evolutionary Computation Conference, (833-840)
  271. ACM
    Pawlak T Performance improvements for evolutionary strategy-based one-class constraint synthesis Proceedings of the Genetic and Evolutionary Computation Conference, (873-880)
  272. ACM
    Sroka D and Pawlak T One-class constraint acquisition with local search Proceedings of the Genetic and Evolutionary Computation Conference, (363-370)
  273. Bortakovskii A (2018). Synthesis of Optimal Control-Systems with a Change of the Models of Motion, Journal of Computer and Systems Sciences International, 57:4, (543-560), Online publication date: 1-Jul-2018.
  274. Candelieri A, Perego R and Archetti F Intelligent Pump Scheduling Optimization in Water Distribution Networks Learning and Intelligent Optimization, (352-369)
  275. Szuster M Dual-Heuristic Dynamic Programming in the Three-Wheeled Mobile Transport Robot Control Artificial Intelligence and Soft Computing, (763-776)
  276. Li S, Xu Y, You M and Zhu S (2018). Constrained Extremum Problems and Image Space Analysis - Part I, Journal of Optimization Theory and Applications, 177:3, (609-636), Online publication date: 1-Jun-2018.
  277. Chen J, Köbis E, Köbis M and Yao J (2018). Image Space Analysis for Constrained Inverse Vector Variational Inequalities via Multiobjective Optimization, Journal of Optimization Theory and Applications, 177:3, (816-834), Online publication date: 1-Jun-2018.
  278. Benosman M (2018). Model‐based vs data‐driven adaptive control, International Journal of Adaptive Control and Signal Processing, 32:5, (753-776), Online publication date: 9-May-2018.
  279. Basu A and Ghosh M (2018). Nonzero-Sum Risk-Sensitive Stochastic Games on a Countable State Space, Mathematics of Operations Research, 43:2, (516-532), Online publication date: 1-May-2018.
  280. Prilutskii M (2018). Optimal Management of Two-Stage Stochastic Production Systems, Automation and Remote Control, 79:5, (830-840), Online publication date: 1-May-2018.
  281. Zhang J, Sun J, Zhang R, Zhang Y and Hu X Privacy-Preserving Social Media Data Outsourcing IEEE INFOCOM 2018 - IEEE Conference on Computer Communications, (1106-1114)
  282. Niu S, Chen S, Guo H, Targonski C, Smith M and Kovačević J Generalized value iteration networks Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (6246-6253)
  283. Maliah S and Shani G MDP-based cost sensitive classification using decision trees Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (3746-3753)
  284. Harutyunyan A, Vrancx P, Bacon P, Precup D and Nowe A Learning with options that terminate off-policy Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (3173-3182)
  285. Dabney W, Rowland M, Bellemare M and Munos R Distributional reinforcement learning with quantile regression Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (2892-2901)
  286. Brafman R, De Giacomo G and Patrizi F LTLf/LDLf non-Markovian rewards Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (1771-1778)
  287. Warnell G, Waytowich N, Lawhern V and Stone P Deep TAMER Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (1545-1553)
  288. Yang G, Dong X, Luo J and Zhang S (2018). Session search modeling by partially observable Markov decision process, Information Retrieval, 21:1, (56-80), Online publication date: 1-Feb-2018.
  289. Li S, Ge Y, Shi Y and Selişteanu D (2018). Enhanced Oil Recovery for ASP Flooding Based on Biorthogonal Spatial-Temporal Wiener Modeling and Iterative Dynamic Programming, Complexity, 2018, Online publication date: 1-Jan-2018.
  290. Petrosyan L, Sedakov A, Sun H and Xu G (2017). Convergence of strong time-consistent payment schemes in dynamic games, Applied Mathematics and Computation, 315:C, (96-112), Online publication date: 15-Dec-2017.
  291. Ma Z, Li Z and Giua A Computation of admissible marking sets in weighted state machines by dynamic programming 2017 IEEE 56th Annual Conference on Decision and Control (CDC), (4847-4852)
  292. Botros A and Rodrigues L State feedback optimal solution for the ECON mode velocity for a cruising turbojet 2017 IEEE 56th Annual Conference on Decision and Control (CDC), (3908-3913)
  293. ACM
    Thalheim J, Rodrigues A, Akkus I, Bhatotia P, Chen R, Viswanath B, Jiao L and Fetzer C Sieve Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference, (14-27)
  294. Leeuwen D and Núñez Queija R (2017). Optimal dispatching in a tandem queue, Queueing Systems: Theory and Applications, 87:3-4, (269-291), Online publication date: 1-Dec-2017.
  295. Qin C, Liu X, Liu G, Wang J and Zhang D Finite Horizon Optimal Tracking Control for Nonlinear Discrete-Time Switched Systems Neural Information Processing, (801-810)
  296. Zhu L, Song R, Xie Y and Li J Adaptive Dynamic Programming for Direct Current Servo Motor Neural Information Processing, (731-740)
  297. Bortakovskii A and Nemychenkov G (2017). Suboptimal control of bunches of trajectories of discrete deterministic automaton time-invariant systems, Journal of Computer and Systems Sciences International, 56:6, (914-929), Online publication date: 1-Nov-2017.
  298. Wei W, Li H and Leus R (2017). Test sequencing for sequential system diagnosis with precedence constraints and imperfect tests, Decision Support Systems, 103:C, (104-116), Online publication date: 1-Nov-2017.
  299. Soldera J, Dodson K and Scharcanski J Face recognition based on geodesic distance approximations between multivariate normal distributions 2017 IEEE International Conference on Imaging Systems and Techniques (IST), (1-6)
  300. Gul O Asymptotically optimal scheduling for energy harvesting wireless sensor networks 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), (1-7)
  301. Kibzun A and Ignatov A (2017). On the existence of optimal strategies in the control problem for a stochastic discrete time system with respect to the probability criterion, Automation and Remote Control, 78:10, (1845-1856), Online publication date: 1-Oct-2017.
  302. Ibragimov D and Sirotin A (2017). On the problem of operation speed for the class of linear infinite-dimensional discrete-time systems with bounded control, Automation and Remote Control, 78:10, (1731-1756), Online publication date: 1-Oct-2017.
  303. Chen Q, Zhang M and Xue B (2017). Feature Selection to Improve Generalization of Genetic Programming for High-Dimensional Symbolic Regression, IEEE Transactions on Evolutionary Computation, 21:5, (792-806), Online publication date: 1-Oct-2017.
  304. Plancher B, Manchester Z and Kuindersma S Constrained unscented dynamic programming 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (5674-5680)
  305. Bacher C and Raidl G Refining Partial Invalidations for Indexed Algebraic Dynamic Programming Machine Learning, Optimization, and Big Data, (562-573)
  306. Zhang Z, Pan Z and Kochenderfer M Weighted double Q-learning Proceedings of the 26th International Joint Conference on Artificial Intelligence, (3455-3461)
  307. Lukina A Resilient control and safety for multi-agent cyber-physical systems Proceedings of the 26th International Joint Conference on Artificial Intelligence, (5187-5188)
  308. Tamar A, Wu Y, Thomas G, Levine S and Abbeel P Value iteration networks Proceedings of the 26th International Joint Conference on Artificial Intelligence, (4949-4953)
  309. Wayllace C, Hou P and Yeoh W New metrics and algorithms for stochastic goal recognition design problems Proceedings of the 26th International Joint Conference on Artificial Intelligence, (4455-4462)
  310. Silver D, van Hasselt H, Hessel M, Schaul T, Guez A, Harley T, Dulac-Arnold G, Reichert D, Rabinowitz N, Barreto A and Degris T The predictron Proceedings of the 34th International Conference on Machine Learning - Volume 70, (3191-3199)
  311. Bellemare M, Dabney W and Munos R A Distributional Perspective on Reinforcement Learning Proceedings of the 34th International Conference on Machine Learning - Volume 70, (449-458)
  312. Bai A, Russell S and Chen X Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway RoboCup 2017: Robot World Cup XXI, (190-203)
  313. Leska M, Aschemann H, Melzer M and Meinert M (2017). Comparative Calculation of the Fuel–Optimal Operating Strategy for Diesel Hybrid Railway Vehicles, International Journal of Applied Mathematics and Computer Science, 27:2, (323-336), Online publication date: 27-Jun-2017.
  314. McGregor S, Buckingham H, Dietterich T, Houtman R, Montgomery C and Metoyer R (2017). Interactive visualization for testing Markov Decision Processes, Journal of Visual Languages and Computing, 39:C, (93-106), Online publication date: 1-Apr-2017.
  315. Hakami M and Kleijn W Machine learning based non-intrusive quality estimation with an augmented feature set 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), (5105-5109)
  316. Kostenko V (2017). Combinatorial optimization algorithms combining greedy strategies with a limited search procedure, Journal of Computer and Systems Sciences International, 56:2, (218-226), Online publication date: 1-Mar-2017.
  317. Wei W, Balabdaoui F and Held L (2017). Calibration tests for multivariate Gaussian forecasts, Journal of Multivariate Analysis, 154:C, (216-233), Online publication date: 1-Feb-2017.
  318. Zhang H, Sanin C, Szczerbicki E, Zhu M, Nguyen N, Núñez M and Trawiński B (2017). Towards neural knowledge DNA, Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, 32:2, (1575-1584), Online publication date: 1-Jan-2017.
  319. Petrosian O and Barabanov A (2017). Looking Forward Approach in Cooperative Differential Games with Uncertain Stochastic Dynamics, Journal of Optimization Theory and Applications, 172:1, (328-347), Online publication date: 1-Jan-2017.
  320. ACM
    Itzhaky S, Singh R, Solar-Lezama A, Yessenov K, Lu Y, Leiserson C and Chowdhury R (2016). Deriving divide-and-conquer dynamic programming algorithms using solver-aided transformations, ACM SIGPLAN Notices, 51:10, (145-164), Online publication date: 5-Dec-2016.
  321. Fan Q and Yang G (2016). Nearly optimal sliding mode fault-tolerant control for affine nonlinear systems with state constraints, Neurocomputing, 216:C, (78-88), Online publication date: 5-Dec-2016.
  322. Bertsimas D and Mišić V (2016). Decomposable Markov Decision Processes, Operations Research, 64:6, (1537-1555), Online publication date: 1-Dec-2016.
  323. ACM
    Chowdhury R, Ganapathi P, Tithi J, Bachmeier C, Kuszmaul B, Leiserson C, Solar-Lezama A and Tang Y (2016). AUTOGEN, ACM SIGPLAN Notices, 51:8, (1-12), Online publication date: 9-Nov-2016.
  324. ACM
    Kaleeswaran S, Santhiar A, Kanade A and Gulwani S Semi-supervised verified feedback generation Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, (739-750)
  325. ACM
    Itzhaky S, Singh R, Solar-Lezama A, Yessenov K, Lu Y, Leiserson C and Chowdhury R Deriving divide-and-conquer dynamic programming algorithms using solver-aided transformations Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, (145-164)
  326. Harutyunyan A, Bellemare M, Stepleton T and Munos R Q(λ) with Off-Policy Corrections Algorithmic Learning Theory, (305-320)
  327. ACM
    Ganesan R, Jajodia S, Shah A and Cam H (2016). Dynamic Scheduling of Cybersecurity Analysts for Minimizing Risk Using Reinforcement Learning, ACM Transactions on Intelligent Systems and Technology, 8:1, (1-21), Online publication date: 3-Oct-2016.
  328. ACM
    Wu B and Wang Y Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search Proceedings of the 24th ACM international conference on Multimedia, (352-356)
  329. Shapiro A and Ugurlu K (2016). Decomposability and time consistency of risk averse multistage programs, Operations Research Letters, 44:5, (663-665), Online publication date: 1-Sep-2016.
  330. Mallick S, Kar R, Ghoshal S and Mandal D (2016). Optimal sizing and design of CMOS analogue amplifier circuits using craziness-based particle swarm optimization, International Journal of Numerical Modelling: Electronic Networks, Devices and Fields, 29:5, (943-966), Online publication date: 1-Sep-2016.
  331. He J, Cai L, Cheng P and Pan J (2016). Delay Minimization for Data Dissemination in Large-Scale VANETs with Buses and Taxis, IEEE Transactions on Mobile Computing, 15:8, (1939-1950), Online publication date: 1-Aug-2016.
  332. Jansen K and Kraft S (2016). An Improved Approximation Scheme for Variable-Sized Bin Packing, Theory of Computing Systems, 59:2, (262-322), Online publication date: 1-Aug-2016.
  333. ACM
    Gál A, Jang J, Limaye N, Mahajan M and Sreenivasaiah K (2016). Space-Efficient Approximations for Subset Sum, ACM Transactions on Computation Theory, 8:4, (1-28), Online publication date: 26-Jul-2016.
  334. Kalyanakrishnan S, Mall U and Goyal R Batch-switching policy iteration Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, (3147-3153)
  335. Chedjou J and Kyamakya K (2016). Benchmarking a recurrent neural network based efficient shortest path problem (SPP) solver concept under difficult dynamic parameter settings conditions, Neurocomputing, 196:C, (175-209), Online publication date: 5-Jul-2016.
  336. Hu X, Wang M, Leeson M, Di Paolo E and Liu H (2016). Deterministic agent-based path optimization by mimicking the spreading of ripples, Evolutionary Computation, 24:2, (319-346), Online publication date: 1-Jun-2016.
  337. (2016). Nonlinear control of a boost converter using a robust regression based reinforcement learning algorithm, Engineering Applications of Artificial Intelligence, 52:C, (1-9), Online publication date: 1-Jun-2016.
  338. Jaderyan M and Khotanlou H (2016). Virulence Optimization Algorithm, Applied Soft Computing, 43:C, (596-618), Online publication date: 1-Jun-2016.
  339. Albrecht S, Crandall J and Ramamoorthy S (2016). Belief and truth in hypothesised behaviours, Artificial Intelligence, 235:C, (63-94), Online publication date: 1-Jun-2016.
  340. Song J, Gao Y, Wang H and An B Measuring the Distance Between Finite Markov Decision Processes Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, (468-476)
  341. ACM
    Chang J, Kittur A and Hahn N Alloy Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, (3180-3191)
  342. Nadendla V and Varshney P (2016). Design of Binary Quantizers for Distributed Detection Under Secrecy Constraints, IEEE Transactions on Signal Processing, 64:10, (2636-2648), Online publication date: 1-May-2016.
  343. Charalampous K and Gasteratos A (2016). On-line deep learning method for action recognition, Pattern Analysis & Applications, 19:2, (337-354), Online publication date: 1-May-2016.
  344. Murzabekov Z (2016). The Synthesis of the Proportional-Differential Regulators for the Systems with Fixed Ends of Trajectories Under Two-Sided Constraints on Control Values, Asian Journal of Control, 18:2, (494-501), Online publication date: 1-Mar-2016.
  345. ACM
    Chowdhury R, Ganapathi P, Tithi J, Bachmeier C, Kuszmaul B, Leiserson C, Solar-Lezama A and Tang Y AUTOGEN Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (1-12)
  346. Lever G, Shawe-Taylor J, Stafford R and Szepesvári C Compressed conditional mean embeddings for model-based reinforcement learning Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, (1779-1787)
  347. Huang S, Zhang J, Wang L and Hua X (2016). Social Friend Recommendation Based on Multiple Network Correlation, IEEE Transactions on Multimedia, 18:2, (287-299), Online publication date: 1-Feb-2016.
  348. Semasinghe P and Hossain E (2016). Downlink Power Control in Self-Organizing Dense Small Cells Underlaying Macrocells, IEEE Transactions on Mobile Computing, 15:2, (350-363), Online publication date: 1-Feb-2016.
  349. Koval M, Pollard N and Srinivasa S (2016). Pre- and post-contact policy decomposition for planar contact manipulation under uncertainty, International Journal of Robotics Research, 35:1-3, (244-264), Online publication date: 1-Jan-2016.
  350. ACM
    Hammer M, Dunfield J, Headley K, Labich N, Foster J, Hicks M and Van Horn D (2015). Incremental computation with names, ACM SIGPLAN Notices, 50:10, (748-766), Online publication date: 18-Dec-2015.
  351. ACM
    Tang Y, You R, Kan H, Tithi J, Ganapathi P and Chowdhury R (2015). Cache-oblivious wavefront: improving parallelism of recursive dynamic programming algorithms without losing cache-efficiency, ACM SIGPLAN Notices, 50:8, (205-214), Online publication date: 18-Dec-2015.
  352. Bulinskaya E, Gusak J and Muromskaya A (2015). Discrete-time Insurance Model with Capital Injections and Reinsurance, Methodology and Computing in Applied Probability, 17:4, (899-914), Online publication date: 1-Dec-2015.
  353. Pauli S, Gantner R, Arbenz P and Adelmann A (2015). Multilevel Monte Carlo for the Feynman–Kac formula for the Laplace equation, BIT, 55:4, (1125-1143), Online publication date: 1-Dec-2015.
  354. ACM
    Gencer A, Bindel D, Sirer E and van Renesse R Configuring Distributed Computations Using Response Surfaces Proceedings of the 16th Annual Middleware Conference, (235-246)
  355. ACM
    Hammer M, Dunfield J, Headley K, Labich N, Foster J, Hicks M and Van Horn D Incremental computation with names Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, (748-766)
  356. ACM
    Bender M, Berry J, Hammond S, Moore B, Moseley B and Phillips C k-Means Clustering on Two-Level Memory Systems Proceedings of the 2015 International Symposium on Memory Systems, (197-205)
  357. Post I and Ye Y (2015). The Simplex Method is Strongly Polynomial for Deterministic Markov Decision Processes, Mathematics of Operations Research, 40:4, (859-868), Online publication date: 1-Oct-2015.
  358. Li Y, Tee K, Yan R, Chan W, Wu Y and Limbu D Adaptive optimal control for coordination in physical human-robot interaction 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (20-25)
  359. ACM
    Sloan M and Wang J Dynamic Information Retrieval Proceedings of the 2015 International Conference on The Theory of Information Retrieval, (61-70)
  360. ACM
    Bai A, Wu F and Chen X (2015). Online Planning for Large Markov Decision Processes with Hierarchical Decomposition, ACM Transactions on Intelligent Systems and Technology, 6:4, (1-28), Online publication date: 13-Aug-2015.
  361. ACM
    Spirin N, Kuznetsov M, Kiseleva J, Spirin Y and Izhutov P Relevance-aware Filtering of Tuples Sorted by an Attribute Value via Direct Optimization of Search Quality Metrics Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, (979-982)
  362. Wei ShangGuan, Xi-Hui Yan, Bai-Gen Cai and Jian Wang (2015). Multiobjective Optimization for Train Speed Trajectory in CTCS High-Speed Railway With Hybrid Evolutionary Algorithm, IEEE Transactions on Intelligent Transportation Systems, 16:4, (2215-2225), Online publication date: 1-Aug-2015.
  363. Dujardin Y, Dietterich T and Chadès I α-min Proceedings of the 24th International Conference on Artificial Intelligence, (2582-2588)
  364. Dobrovidov A, Kulida E and Rudko I (2015). Path optimization for a moving object in an anisotropic environment using the probabilistic criterion in the passive sonar mode, Automation and Remote Control, 76:7, (1271-1281), Online publication date: 1-Jul-2015.
  365. Abu Alsheikh M, Dinh Thai Hoang, Niyato D, Hwee-Pink Tan and Shaowei Lin (2015). Markov Decision Processes With Applications in Wireless Sensor Networks: A Survey, IEEE Communications Surveys & Tutorials, 17:3, (1239-1267), Online publication date: 1-Jul-2015.
  366. ACM
    Marinchev I and Agre G On speeding up the implementation of nearest neighbour search and classification Proceedings of the 16th International Conference on Computer Systems and Technologies, (207-213)
  367. Thomos N, Kurdoglu E, Frossard P and van der Schaar M (2015). Adaptive Prioritized Random Linear Coding and Scheduling for Layered Data Delivery From Multiple Servers, IEEE Transactions on Multimedia, 17:6, (893-906), Online publication date: 1-Jun-2015.
  368. Jae Kyu Suhr and Ho Gi Jung (2015). Dense Stereo-Based Robust Vertical Road Profile Estimation Using Hough Transform and Dynamic Programming, IEEE Transactions on Intelligent Transportation Systems, 16:3, (1528-1536), Online publication date: 1-Jun-2015.
  369. ACM
    Yang H, Guan D and Zhang S (2015). The Query Change Model, ACM Transactions on Information Systems, 33:4, (1-33), Online publication date: 15-May-2015.
  370. Yehoshua R, Agmon N and Kaminka G Frontier-Based RTDP Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, (861-869)
  371. Fernández C, Manyà F, Mateu C and Sole-Mauri F (2015). Approximate dynamic programming for automated vacuum waste collection systems, Environmental Modelling & Software, 67:C, (128-137), Online publication date: 1-May-2015.
  372. Yang C and Kumar M (2015). An information guided framework for simulated annealing, Journal of Global Optimization, 62:1, (131-154), Online publication date: 1-May-2015.
  373. ACM
    Didona D, Quaglia F, Romano P and Torre E Enhancing Performance Prediction Robustness by Combining Analytical Modeling and Machine Learning Proceedings of the 6th ACM/SPEC International Conference on Performance Engineering, (145-156)
  374. ACM
    Tang Y, You R, Kan H, Tithi J, Ganapathi P and Chowdhury R Cache-oblivious wavefront: improving parallelism of recursive dynamic programming algorithms without losing cache-efficiency Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (205-214)
  375. Taleghan M, Dietterich T, Crowley M, Hall K and Albers H (2015). PAC optimal MDP planning with application to invasive species management, The Journal of Machine Learning Research, 16:1, (3877-3903), Online publication date: 1-Jan-2015.
  376. Xu M, Liu G and Guan J (2015). Towards a secure medium access control protocol for cluster-based underwater wireless sensor networks, International Journal of Distributed Sensor Networks, 2015, (40-40), Online publication date: 1-Jan-2015.
  377. Haivoronskyy O, Ermoliev Y, Knopov P and Norkin V (2015). Mathematical Modeling of Distributed Catastrophic and Terrorist Risks, Cybernetics and Systems Analysis, 51:1, (85-95), Online publication date: 1-Jan-2015.
  378. ACM
    Juneja A, Rana B and Agrawal R A Novel Approach for Computer Aided Diagnosis of Schizophrenia using Auditory Oddball Functional MRI Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing, (1-6)
  379. ACM
    Maleki S, Musuvathi M and Mytkowicz T (2014). Parallelizing dynamic programming through rank convergence, ACM SIGPLAN Notices, 49:8, (219-232), Online publication date: 26-Nov-2014.
  380. ACM
    Chen L, Shen H and Sapra K Distributed Autonomous Virtual Resource Management in Datacenters Using Finite-Markov Decision Process Proceedings of the ACM Symposium on Cloud Computing, (1-13)
  381. Thiam P, Kessler V and Schwenker F A Reinforcement Learning Algorithm to Train a Tetris Playing Agent Proceedings of the 6th IAPR TC 3 International Workshop on Artificial Neural Networks in Pattern Recognition - Volume 8774, (165-170)
  382. Sun L, Dong H, Hussain F, Hussain O and Chang E (2014). Cloud service selection, Journal of Network and Computer Applications, 45:C, (134-150), Online publication date: 1-Oct-2014.
  383. ACM
    Guo J, Zulkoski E, Olaechea R, Rayside D, Czarnecki K, Apel S and Atlee J Scaling exact multi-objective combinatorial optimization by parallelization Proceedings of the 29th ACM/IEEE International Conference on Automated Software Engineering, (409-420)
  384. Powell W (2014). Energy and Uncertainty, AI Magazine, 35:3, (8-21), Online publication date: 1-Sep-2014.
  385. ACM
    Huang T, Huang M, Nguyen Q and Zhao L A Space-Filling Multidimensional Visualization (SFMDVis) for Exploratory Data Analysis Proceedings of the 7th International Symposium on Visual Information Communication and Interaction, (19-28)
  386. Petrosyan L and Sedakov A (2014). Multistage network games with perfect information, Automation and Remote Control, 75:8, (1532-1540), Online publication date: 1-Aug-2014.
  387. Kinathil S, Sanner S and Penna N Closed-form solutions to a subclass of continuous stochastic games via symbolic dynamic programming Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, (390-399)
  388. Nicolescu R, Gimel'farb G, Morris J, Delmas P and Gong R (2014). Regularising Ill-posed Discrete Optimisation, Fundamenta Informaticae, 131:3-4, (465-483), Online publication date: 1-Jul-2014.
  389. Kogan D, Kuimova A and Fedosenko Y (2014). The problems of servicing of the binary object flow in system with refillable storage component, Automation and Remote Control, 75:7, (1257-1266), Online publication date: 1-Jul-2014.
  390. ACM
    Wan L, Li K, Liu J and Li K A Novel CPU-GPU Cooperative Implementation of A Parallel Two-List Algorithm for the Subset-Sum Problem Proceedings of Programming Models and Applications on Multicores and Manycores, (70-79)
  392. ACM
    Maleki S, Musuvathi M and Mytkowicz T Parallelizing dynamic programming through rank convergence Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming, (219-232)
  393. Khan I (2015). A comparative study of EAG and PBIL on large-scale global optimization problems, Applied Computational Intelligence and Soft Computing, 2014, (19-19), Online publication date: 1-Jan-2014.
  394. Hyytiä E and Aalto S Round-robin routing policy Proceedings of the 7th International Conference on Performance Evaluation Methodologies and Tools, (69-78)
  395. ACM
    Al-Dujaily R, Dahir N, Mak T, Xia F and Yakovlev A (2013). Dynamic programming-based runtime thermal management (DPRTM), ACM Transactions on Design Automation of Electronic Systems, 19:1, (1-27), Online publication date: 1-Dec-2013.
  396. ACM
    Hofri M (2013). Optimal selection and sorting via dynamic programming, ACM Journal of Experimental Algorithmics, 18, (2.1-2.14), Online publication date: 1-Dec-2013.
  397. Denny Fu K, Nakamura Y, Yamamoto T and Ishiguro H (2013). Analysis of Motor Synergies Utilization for Optimal Movement Generation for a Human-like Robotic Arm, International Journal of Automation and Computing, 10:6, (515-524), Online publication date: 1-Dec-2013.
  398. Adinetz A, Kraus J, Meinke J and Pleiter D GPUMAFIA Proceedings of the 19th international conference on Parallel Processing, (838-849)
  399. ACM
    Jin X, Sloan M and Wang J Interactive exploratory search for multi page search results Proceedings of the 22nd international conference on World Wide Web, (655-666)
  400. Kaya H and Gündüz-Öğüdücü Ş (2013). SAGA, Information Sciences: an International Journal, 228, (113-130), Online publication date: 1-Apr-2013.
  401. ACM
    Kannan B, Meneguzzi F, Dias M and Sycara K Predictive indoor navigation using commercial smart-phones Proceedings of the 28th Annual ACM Symposium on Applied Computing, (519-525)
  402. Polimeni A and Vitetta A (2013). Optimising Waiting at Nodes in Time-Dependent Networks, Journal of Optimization Theory and Applications, 156:3, (805-818), Online publication date: 1-Mar-2013.
  403. Gaggero M, Gnecco G and Sanguineti M (2013). Dynamic Programming and Value-Function Approximation in Sequential Decision Problems, Journal of Optimization Theory and Applications, 156:2, (380-416), Online publication date: 1-Feb-2013.
  404. Valdés F, Iglesias R, Espinosa F and Rodríguez M (2013). Effect of a risk factor in convoy merging manoeuvres considering uncertainty in travelling times, Applied Soft Computing, 13:1, (247-258), Online publication date: 1-Jan-2013.
  405. Loxton R, Lin Q and Teo K (2012). A stochastic fleet composition problem, Computers and Operations Research, 39:12, (3177-3184), Online publication date: 1-Dec-2012.
  406. Tuyls K and Weiss G (2012). Multiagent Learning, AI Magazine, 33:3, (41-52), Online publication date: 1-Sep-2012.
  407. ACM
    Chen Y, Dunfield J and Acar U (2012). Type-directed automatic incrementalization, ACM SIGPLAN Notices, 47:6, (299-310), Online publication date: 6-Aug-2012.
  408. Kibzun A and Khromova O (2012). Choosing optimal road trajectory with random work cost in different areas, Automation and Remote Control, 73:7, (1181-1194), Online publication date: 1-Jul-2012.
  409. ACM
    Chen Y, Dunfield J and Acar U Type-directed automatic incrementalization Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation, (299-310)
  410. ACM
    Inzinger C, Satzger B, Hummer W, Leitner P and Dustdar S Non-intrusive policy optimization for dependable and adaptive service-oriented systems Proceedings of the 27th Annual ACM Symposium on Applied Computing, (504-510)
  411. Halman N, Orlin J and Simchi-Levi D (2012). Approximating the Nonlinear Newsvendor and Single-Item Stochastic Lot-Sizing Problems When Data Is Given by an Oracle, Operations Research, 60:2, (429-446), Online publication date: 1-Mar-2012.
  412. Couëtoux A, Doghmen H and Teytaud O Improving the Exploration in Upper Confidence Trees Revised Selected Papers of the 6th International Conference on Learning and Intelligent Optimization - Volume 7219, (366-371)
  413. Dodson T, Mattei N and Goldsmith J A Natural Language Argumentation Interface for Explanation Generation in Markov Decision Processes Algorithmic Decision Theory, (42-55)
  414. Valdes F, Iglesias R, Espinosa F, Rodríguez M, Quintia P and Santos C Robot routing approaches for convoy merging maneuvers Proceedings of the 12th Annual conference on Towards autonomous robotic systems, (241-252)
  415. ACM
    Friedmann O, Hansen T and Zwick U Subexponential lower bounds for randomized pivoting rules for the simplex algorithm Proceedings of the forty-third annual ACM symposium on Theory of computing, (283-292)
  416. ACM
    Tsourakakis C, Peng R, Tsiarli M, Miller G and Schwartz R (2011). Approximation algorithms for speeding up dynamic programming and denoising aCGH data, ACM Journal of Experimental Algorithmics, 16, (1.1-1.27), Online publication date: 1-May-2011.
  417. Kamiyama N, Kawahara R, Mori T, Harada S and Hasegawa H (2011). Optimally designing caches to reduce P2P traffic, Computer Communications, 34:7, (883-897), Online publication date: 1-May-2011.
  418. Fiosins M, Fiosina J, Müller J and Görmer J Reconciling strategic and tactical decision making in agent-oriented simulation of vehicles in urban traffic Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques, (144-151)
  419. ACM
    Li L and Kim D Least-cost path estimation in wireless ad hoc sensor networks using Petri nets Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, (1-6)
  420. Krasovskii N and Kotel'nikova A (2011). Stochastic control in a determinate differential pursuit-evasion game, Automation and Remote Control, 72:2, (305-322), Online publication date: 1-Feb-2011.
  421. Morihata A (2011). A Short Cut to Optimal Sequences, New Generation Computing, 29:1, (31-59), Online publication date: 1-Jan-2011.
  422. Hasselt H Double Q-learning Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2, (2613-2621)
  423. ACM
    Muckell J, Hwang J, Lawson C and Ravi S Algorithms for compressing GPS trajectory data Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, (402-405)
  424. ACM
    Zhang B, Li Q, Chao H, Chen B, Ofek E and Xu Y Annotating and navigating tourist videos Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, (260-269)
  425. Bontoux B, Artigues C and Feillet D (2010). A Memetic Algorithm with a large neighborhood crossover operator for the Generalized Traveling Salesman Problem, Computers and Operations Research, 37:11, (1844-1852), Online publication date: 1-Nov-2010.
  426. Geist R, Jones Z and Westall J Predicting disk scheduling performance with virtual machines Proceedings of the 2010 IFIP WG 6.3/7.3 international conference on Performance Evaluation of Computer and Communication Systems: milestones and future challenges, (61-72)
  427. Tang L, Yang Y and Liu J (2010). An efficient optimal solution to the coil sequencing problem in electro-galvanizing line, Computers and Operations Research, 37:10, (1780-1796), Online publication date: 1-Oct-2010.
  428. Cervellera C, Macciò D and Muselli M (2010). Functional Optimization Through Semilocal Approximate Minimization, Operations Research, 58:5, (1491-1504), Online publication date: 1-Sep-2010.
  429. Jung H and Pedram M (2010). Supervised learning based power management for multicore processors, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 29:9, (1395-1408), Online publication date: 1-Sep-2010.
  430. Gnecco G and Sanguineti M (2010). Suboptimal Solutions to Dynamic Optimization Problems via Approximations of the Policy Functions, Journal of Optimization Theory and Applications, 146:3, (764-794), Online publication date: 1-Sep-2010.
  431. ACM
    Lee S and Popović Z Learning behavior styles with inverse reinforcement learning ACM SIGGRAPH 2010 papers, (1-7)
  432. ACM
    Lee S and Popović Z (2010). Learning behavior styles with inverse reinforcement learning, ACM Transactions on Graphics, 29:4, (1-7), Online publication date: 26-Jul-2010.
  433. Kim Y, Schmid T and Srivastava M Design and implementation of a robust sensor data fusion system for unknown signals Proceedings of the 6th IEEE international conference on Distributed Computing in Sensor Systems, (77-91)
  434. ACM
    Gupta P, Kahng A, Kasibhatla A and Sharma P Eyecharts Proceedings of the 47th Design Automation Conference, (597-602)
  435. ACM
    Acar U, Blelloch G, Ley-Wild R, Tangwongsan K and Turkoglu D (2010). Traceable data types for self-adjusting computation, ACM SIGPLAN Notices, 45:6, (483-496), Online publication date: 12-Jun-2010.
  436. ACM
    Acar U, Blelloch G, Ley-Wild R, Tangwongsan K and Turkoglu D Traceable data types for self-adjusting computation Proceedings of the 31st ACM SIGPLAN Conference on Programming Language Design and Implementation, (483-496)
  437. Beaudry E, Kabanza F and Michaud F Planning for concurrent action executions under action duration uncertainty using dynamically generated Bayesian networks Proceedings of the Twentieth International Conference on Automated Planning and Scheduling, (10-17)
  438. Marecki J and Varakantham P Risk-sensitive planning in partially observable environments Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1, (1357-1368)
  439. Sanner S, Uther W and Delgado K Approximate dynamic programming with affine ADDs Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1, (1349-1356)
  440. Sturtevant N, Bulitko V and Björnsson Y On learning in agent-centered search Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1, (333-340)
  441. Feria E Latency-information theory Proceedings of the 33rd IEEE conference on Sarnoff, (343-350)
  442. ACM
    Wang Y, Krishnamachari B, Zhao Q and Annavaram M Markov-optimal sensing policy for user state estimation in mobile devices Proceedings of the 9th ACM/IEEE International Conference on Information Processing in Sensor Networks, (268-278)
  443. Akuiyibo E and Boyd S (2010). Adaptive modulation with smoothed flow utility, EURASIP Journal on Wireless Communications and Networking, 2010, (1-9), Online publication date: 1-Apr-2010.
  444. ACM
    Baffa A and Ciarlini A Modeling POMDPs for generating and simulating stock investment policies Proceedings of the 2010 ACM Symposium on Applied Computing, (2394-2399)
  445. ACM
    Madani O, Thorup M and Zwick U (2010). Discounted deterministic Markov decision processes and discounted all-pairs shortest paths, ACM Transactions on Algorithms, 6:2, (1-25), Online publication date: 1-Mar-2010.
  446. Leng J and Hong T (2010). Mining Outliers in Correlated Subspaces for High Dimensional Data Sets, Fundamenta Informaticae, 98:1, (71-86), Online publication date: 1-Jan-2010.
  447. Torgasin S and Zimmermann K (2010). Algorithm for thermodynamically based prediction of DNA/DNA cross-hybridisation, International Journal of Bioinformatics Research and Applications, 6:1, (82-97), Online publication date: 1-Jan-2010.
  448. Engelbrecht H and du Preez J (2010). Efficient backward decoding of high-order hidden Markov models, Pattern Recognition, 43:1, (99-112), Online publication date: 1-Jan-2010.
  449. ACM
    Lee Y, Lee S and Popović Z Compact character controllers ACM SIGGRAPH Asia 2009 papers, (1-8)
  450. ACM
    Lee Y, Lee S and Popović Z (2009). Compact character controllers, ACM Transactions on Graphics, 28:5, (1-8), Online publication date: 1-Dec-2009.
  451. Song C, Ye J, Liu D and Kang Q (2009). Generalized receding horizon control of fuzzy systems based on numerical optimization algorithm, IEEE Transactions on Fuzzy Systems, 17:6, (1336-1352), Online publication date: 1-Dec-2009.
  452. Bagnara R, Hill P and Zaffanella E (2009). Weakly-relational shapes for numeric abstractions, Formal Methods in System Design, 35:3, (279-323), Online publication date: 1-Dec-2009.
  453. Khalil M, Moustafa M and Abbas H Enhanced DTW based on-line signature verification Proceedings of the 16th IEEE international conference on Image processing, (2685-2688)
  454. ACM
    Cai C, Hengst B, Ye G, Huang E, Wang Y, Aydos C and Geers G On the performance of adaptive traffic signal control Proceedings of the Second International Workshop on Computational Transportation Science, (37-42)
  455. Bagnara R, Hill P and Zaffanella E (2009). Applications of polyhedral computations to the analysis and verification of hardware and software systems, Theoretical Computer Science, 410:46, (4672-4691), Online publication date: 1-Nov-2009.
  456. Bouveyron C and Girard S (2009). Robust supervised classification with mixture models, Pattern Recognition, 42:11, (2649-2658), Online publication date: 1-Nov-2009.
  457. Merat K, Salarieh H and Alasty A (2009). Implementation of dynamic programming for chaos control in discrete systems, Journal of Computational and Applied Mathematics, 233:2, (531-544), Online publication date: 1-Nov-2009.
  458. ACM
    Tomibayashi Y, Takegawa Y, Terada T and Tsukamoto M Wearable DJ system Proceedings of the International Conference on Advances in Computer Entertainment Technology, (132-139)
  459. Dai P and Goldsmith J Finding Best k Policies Algorithmic Decision Theory, (144-155)
  460. van Otterlo M (2009). Intensional dynamic programming. A Rosetta stone for structured dynamic programming, Journal of Algorithms, 64:4, (169-191), Online publication date: 1-Oct-2009.
  461. Cardoen B, Demeulemeester E and Beliën J (2009). Sequencing surgical cases in a day-care environment, Computers and Operations Research, 36:9, (2660-2669), Online publication date: 1-Sep-2009.
  462. ACM
    Sui X and Leung H A q-learning based adaptive bidding strategy in combinatorial auctions Proceedings of the 11th International Conference on Electronic Commerce, (186-194)
  463. Hawkins B and Giraud-Carrier C Ranking search results for translated content Proceedings of the 10th IEEE international conference on Information Reuse & Integration, (242-245)
  464. Zhang Q, Sun G and Xu Y Parallel Algorithms for Solving Markov Decision Process Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing, (466-477)
  465. ACM
    da Silva M, Durand F and Popović J Linear Bellman combination for control of character animation ACM SIGGRAPH 2009 papers, (1-10)
  466. ACM
    da Silva M, Durand F and Popović J (2009). Linear Bellman combination for control of character animation, ACM Transactions on Graphics, 28:3, (1-10), Online publication date: 27-Jul-2009.
  467. Hardwick J and Stout Q (2009). Algorithms for response adaptive sampling designs, WIREs Computational Statistics, 1:1, (118-122), Online publication date: 13-Jul-2009.
  468. Said Y and Wegman E (2009). Roadmap for Optimization, WIREs Computational Statistics, 1:1, (3-17), Online publication date: 13-Jul-2009.
  469. Kolobov A, Mausam and Weld D ReTrASE Proceedings of the 21st International Joint Conference on Artificial Intelligence, (1746-1753)
  470. Dai P, Mausam and Weld D Domain-independent, automatic partitioning for probabilistic planning Proceedings of the 21st International Joint Conference on Artificial Intelligence, (1677-1683)
  471. ACM
    Doerr B, Eremeev A, Horoba C, Neumann F and Theile M Evolutionary algorithms and dynamic programming Proceedings of the 11th Annual conference on Genetic and evolutionary computation, (771-778)
  472. ACM
    Koppejan R and Whiteson S Neuroevolutionary reinforcement learning for generalized helicopter control Proceedings of the 11th Annual conference on Genetic and evolutionary computation, (145-152)
  473. Kodell R, Pearce B, Baek S, Moon H, Ahn H, Young J and Chen J (2009). A model-free ensemble method for class prediction with application to biomedical decision making, Artificial Intelligence in Medicine, 46:3, (267-276), Online publication date: 1-Jul-2009.
  474. Riedmiller M, Gabel T, Hafner R and Lange S (2009). Reinforcement learning for robot soccer, Autonomous Robots, 27:1, (55-73), Online publication date: 1-Jul-2009.
  475. ACM
    Ge T, Zdonik S and Madden S Top-k queries on uncertain data Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, (375-388)
  476. Kasai D, Yamasaki T and Aizawa K Retrieval of time-varying mesh and motion capture data using 2D video queries based on silhouette shape descriptors Proceedings of the 2009 IEEE international conference on Multimedia and Expo, (854-857)
  477. ACM
    Dundar M, Hirleman E, Bhunia A, Robinson J and Rajwa B Learning with a non-exhaustive training dataset Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, (279-288)
  478. Abrardo A, Detti P and Moretti M Message passing resource allocation for the uplink of multicarrier systems Proceedings of the 2009 IEEE international conference on Communications, (3909-3914)
  479. Dönitz C, Vasile I, Onder C and Guzzella L Dynamic programming for hybrid pneumatic vehicles Proceedings of the 2009 conference on American Control Conference, (3956-3963)
  480. Basin M, Shi P and Calderon-Alvarez D Central suboptimal H∞ control design for nonlinear polynomial systems Proceedings of the 2009 conference on American Control Conference, (3101-3105)
  481. Gillella P and Sun Z Modeling and control design of a camless valve actuation system Proceedings of the 2009 conference on American Control Conference, (2696-2701)
  482. Song X, Zulkefli M, Sun Z and Miao H Modeling, analysis, and optimal design of the automotive transmission ball capsule system Proceedings of the 2009 conference on American Control Conference, (1379-1384)
  483. Basin M and Calderon-Alvarez D Optimal controller for uncertain stochastic polynomial systems with deterministic disturbances Proceedings of the 2009 conference on American Control Conference, (778-783)
  484. Van Den Broeck L, Diehl M and Swevers J Performant design of an input shaping prefilter via embedded optimization Proceedings of the 2009 conference on American Control Conference, (166-171)
  485. Bonarini A, Lazaric A, Montrone F and Restelli M (2009). Reinforcement distribution in fuzzy Q-learning, Fuzzy Sets and Systems, 160:10, (1420-1443), Online publication date: 15-May-2009.
  486. Eidenberger R, Grundmann T and Zoellner R Probabilistic action planning for active scene modeling in continuous high-dimensional domains Proceedings of the 2009 IEEE international conference on Robotics and Automation, (2639-2644)
  487. Seow K A dynamic programming approach to multi-level supervision Proceedings of the 2009 IEEE international conference on Robotics and Automation, (87-92)
  488. Dibangoye J, Mouaddib A and Chaib-draa B Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1, (569-576)
  489. Istepanian R, Philip N and Martini M (2009). Medical QoS provision based on reinforcement learning in ultrasound streaming over 3.5G wireless systems, IEEE Journal on Selected Areas in Communications, 27:4, (566-574), Online publication date: 1-May-2009.
  490. Borri A, Benedetto M and Benedetto M Hybrid Modelling, Power Management and Stabilization of Cognitive Radio Networks Proceedings of the 12th International Conference on Hybrid Systems: Computation and Control, (76-89)
  491. Kainen P, Kůrková V and Sanguineti M On tractability of neural-network approximation Proceedings of the 9th international conference on Adaptive and natural computing algorithms, (11-21)
  492. Kainen P, Kůrková V and Sanguineti M On Tractability of Neural-Network Approximation Proceedings of the 2009 conference on Adaptive and Natural Computing Algorithms - Volume 5495, (11-21)
  493. Deisenroth M, Rasmussen C and Peters J (2009). Gaussian process dynamic programming, Neurocomputing, 72:7-9, (1508-1524), Online publication date: 1-Mar-2009.
  494. Dekhtyar A, Goldsmith J, Goldstein B, Mathias K and Isenhour C (2009). Planning for success, International Journal of Approximate Reasoning, 50:3, (416-428), Online publication date: 1-Mar-2009.
  495. ACM
    Shani G, Meek C, Paek T, Thiesson B and Venolia G Searching large indexes on tiny devices Proceedings of the 14th international conference on Intelligent user interfaces, (257-266)
  496. Bugaev Y and Chikunov S (2009). Generalization of the dynamic programming scheme, Automation and Remote Control, 70:2, (253-262), Online publication date: 1-Feb-2009.
  497. Baglietto M, Sanguineti M and Zoppoli R (2009). The extended Ritz method for functional optimization, Optimization Methods & Software, 24:1, (15-43), Online publication date: 1-Feb-2009.
  498. Sitarz S (2009). Ant algorithms and simulated annealing for multicriteria dynamic programming, Computers and Operations Research, 36:2, (433-441), Online publication date: 1-Feb-2009.
  499. Madani O, Thorup M and Zwick U Discounted deterministic Markov decision processes and discounted all-pairs shortest paths Proceedings of the twentieth annual ACM-SIAM symposium on Discrete algorithms, (958-967)
  500. Kang Q, Wang L and Wu Q (2009). Swarm-based approximate dynamic optimization process for discrete particle swarm optimization system, International Journal of Bio-Inspired Computation, 1:1/2, (61-70), Online publication date: 1-Jan-2009.
  501. Su X and Khoshgoftaar T (2009). A survey of collaborative filtering techniques, Advances in Artificial Intelligence, 2009, (2-2), Online publication date: 1-Jan-2009.
  502. Yamasaki T and Aizawa K (2008). Motion segmentation for time-varying mesh sequences based on spherical registration, EURASIP Journal on Advances in Signal Processing, 2009, (1-9), Online publication date: 1-Jan-2009.
  503. Min R and Cheng H (2009). Effective image retrieval using dominant color descriptor and fuzzy support vector machine, Pattern Recognition, 42:1, (147-157), Online publication date: 1-Jan-2009.
  504. Chaharsooghi S, Heydari J and Zegordi S (2008). A reinforcement learning model for supply chain ordering management, Decision Support Systems, 45:4, (949-959), Online publication date: 1-Nov-2008.
  505. Grompone Von Gioi R, Jakubowicz J, Morel J and Randall G (2008). On Straight Line Segment Detection, Journal of Mathematical Imaging and Vision, 32:3, (313-347), Online publication date: 1-Nov-2008.
  506. ACM
    Dehuri S and Cho S A novel particle swarm optimization for multiple campaigns assignment problem Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology, (317-324)
  507. ACM
    Ley-Wild R, Fluet M and Acar U (2008). Compiling self-adjusting programs with continuations, ACM SIGPLAN Notices, 43:9, (321-334), Online publication date: 27-Sep-2008.
  508. ACM
    Morihata A, Matsuzaki K and Takeichi M (2008). Write it recursively, ACM SIGPLAN Notices, 43:9, (169-178), Online publication date: 27-Sep-2008.
  509. ACM
    Ley-Wild R, Fluet M and Acar U Compiling self-adjusting programs with continuations Proceedings of the 13th ACM SIGPLAN international conference on Functional programming, (321-334)
  510. ACM
    Morihata A, Matsuzaki K and Takeichi M Write it recursively Proceedings of the 13th ACM SIGPLAN international conference on Functional programming, (169-178)
  511. Hill N and Eslambolchilar P Seam carving for enhancing image usability on mobiles Proceedings of the 22nd British HCI Group Annual Conference on People and Computers: Culture, Creativity, Interaction - Volume 2, (131-134)
  512. Fontes D and Fontes F (2008). Optimal reorganization of agent formations, WSEAS Transactions on Systems and Control, 3:9, (789-798), Online publication date: 1-Sep-2008.
  513. Lin F, Lai C and Hong J (2008). Minimize presentation lag by sequencing media objects for auto-assembled presentations from digital libraries, Data & Knowledge Engineering, 66:3, (382-401), Online publication date: 1-Sep-2008.
  514. Dai P, Mausam and Weld D Partitioned external-memory value iteration Proceedings of the 23rd national conference on Artificial intelligence - Volume 2, (898-904)
  515. ACM
    Häckel S, Fischer M, Zechel D and Teich T A multi-objective ant colony approach for pareto-optimization using dynamic programming Proceedings of the 10th annual conference on Genetic and evolutionary computation, (33-40)
  516. Zhao L and Safonova A Achieving good connectivity in motion graphs Proceedings of the 2008 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, (127-136)
  517. ACM
    Barrett L and Narayanan S Learning all optimal policies with multiple criteria Proceedings of the 25th international conference on Machine learning, (41-47)
  518. Rachelson E, Quesnel G, Garcia F and Fabiani P A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence, (583-587)
  519. Taylor M, Kuhlmann G and Stone P Transfer Learning and Intelligence Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference, (326-337)
  520. Pankov S A computational approximation to the AIXI model Proceedings of the 2008 conference on Artificial General Intelligence 2008: Proceedings of the First AGI Conference, (256-267)
  521. ACM
    Hammer M and Acar U Memory management for self-adjusting computation Proceedings of the 7th international symposium on Memory management, (51-60)
  522. Bagnara R, Hill P and Zaffanella E (2008). The Parma Polyhedra Library, Science of Computer Programming, 72:1-2, (3-21), Online publication date: 1-Jun-2008.
  523. ACM
    Liu C and Wu J Routing in a cyclic mobispace Proceedings of the 9th ACM international symposium on Mobile ad hoc networking and computing, (351-360)
  524. Acar U and Ley-Wild R Self-adjusting computation with Delta ML Proceedings of the 6th international conference on Advanced functional programming, (1-38)
  525. Okamura H and Dohi T Analysis of a software system with rejuvenation, restoration and checkpointing Proceedings of the 5th international conference on Service availability, (110-128)
  526. Dai P, Strehl A and Goldsmith J Expediting RL by using graphical structures Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3, (1325-1328)
  527. Teacy W, Chalkiadakis G, Rogers A and Jennings N Sequential decision making with untrustworthy service providers Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2, (755-762)
  528. Jamroga W A temporal logic for Markov chains Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2, (697-704)
  529. Sirbiladze G and Sikharulidze A Bellman's optimality principle in the weakly structurable dynamic systems Proceedings of the 9th WSEAS International Conference on Fuzzy Systems, (33-41)
  530. Ross S, Pineau J, Paquet S and Chaib-draa B (2008). Online planning algorithms for POMDPs, Journal of Artificial Intelligence Research, 32:1, (663-704), Online publication date: 1-May-2008.
  531. ACM
    Jung H and Pedram M Resilient dynamic power management under uncertainty Proceedings of the conference on Design, automation and test in Europe, (224-229)
  532. da Motta Salles Barreto A and Anderson C (2008). Restricted gradient-descent algorithm for value-function approximation in reinforcement learning, Artificial Intelligence, 172:4-5, (454-482), Online publication date: 1-Mar-2008.
  533. Courtemanche F, Najjar M, Paccoud B and Mayers A Assisting elders via dynamic multi-tasks planning Proceedings of the 1st international conference on Ambient media and systems, (1-8)
  534. Di Tomaso E and Baldwin J (2008). An approach to hybrid probabilistic models, International Journal of Approximate Reasoning, 47:2, (202-218), Online publication date: 1-Feb-2008.
  535. ACM
    Acar U, Ahmed A and Blume M (2008). Imperative self-adjusting computation, ACM SIGPLAN Notices, 43:1, (309-322), Online publication date: 14-Jan-2008.
  536. ACM
    Acar U, Ahmed A and Blume M Imperative self-adjusting computation Proceedings of the 35th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages, (309-322)
  537. ACM
    Puchinger J and Stuckey P Automating branch-and-bound for dynamic programs Proceedings of the 2008 ACM SIGPLAN symposium on Partial evaluation and semantics-based program manipulation, (81-89)
  538. Kersting K, De Raedt L, Gutmann B, Karwath A and Landwehr N Relational sequence learning Probabilistic inductive logic programming, (28-55)
  539. Wang C, Joshi S and Khardon R (2008). First order decision diagrams for relational MDPs, Journal of Artificial Intelligence Research, 31:1, (431-472), Online publication date: 1-Jan-2008.
  540. Ci S, Wang H and Wu D (2008). A theoretical framework for quality-aware cross-layer optimized wireless multimedia communications, Advances in Multimedia, 2008:2, (1-10), Online publication date: 1-Jan-2008.
  541. Corchado J, Bajo J, de Paz Y and Tapia D (2008). Intelligent environment for monitoring Alzheimer patients, agent technology for health care, Decision Support Systems, 44:2, (382-396), Online publication date: 1-Jan-2008.
  542. Sasazaki K, Saga S, Maeda J and Suzuki Y (2008). Vector quantization of images with variable block size, Applied Soft Computing, 8:1, (634-645), Online publication date: 1-Jan-2008.
  543. Ghavamzadeh M and Mahadevan S (2007). Hierarchical Average Reward Reinforcement Learning, The Journal of Machine Learning Research, 8, (2629-2669), Online publication date: 1-Dec-2007.
  544. Radicioni D and Lombardo V (2007). A Constraint-based Approach for Annotating Music Scores with Gestural Information, Constraints, 12:4, (405-428), Online publication date: 1-Dec-2007.
  545. Leng J, Jain L and Fyfe C Convergence Analysis on Approximate Reinforcement Learning Knowledge Science, Engineering and Management, (85-91)
  546. ACM
    Troncoso-Pastoriza J, Katzenbeisser S and Celik M Privacy preserving error resilient DNA searching through oblivious automata Proceedings of the 14th ACM conference on Computer and communications security, (519-528)
  547. ACM
    Mutlu B, Krause A, Forlizzi J, Guestrin C and Hodgins J Robust, low-cost, non-intrusive sensing and recognition of seated postures Proceedings of the 20th annual ACM symposium on User interface software and technology, (149-158)
  548. Kotel'nikova A and Krasovskii N (2007). On correctness of probabilistic stabilization, Automation and Remote Control, 68:10, (1826-1843), Online publication date: 1-Oct-2007.
  549. Sanner S and Boutilier C Approximate solution techniques for factored first-order MDPs Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, (288-295)
  550. Sigaud O and Wilson S (2007). Learning classifier systems: a survey, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 11:11, (1065-1078), Online publication date: 1-Sep-2007.
  551. ACM
    McCann J and Pollard N Responsive characters from motion fragments ACM SIGGRAPH 2007 papers, (6-es)
  552. ACM
    McCann J and Pollard N (2007). Responsive characters from motion fragments, ACM Transactions on Graphics, 26:3, (6-es), Online publication date: 29-Jul-2007.
  553. Bazzani A, Giorgini B, Rambaldi S, Brambilla M and Cattelani L Walking between free will and determinism Proceedings of the 2007 Summer Computer Simulation Conference, (1043-1050)
  554. ACM
    Hadjam F, Moraga C and Benmohamed M Cluster-based evolutionary design of digital circuits using an improved multi-expression programming Proceedings of the 9th annual conference companion on Genetic and evolutionary computation, (2475-2482)
  555. Hoey J and Little J (2007). Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes, IEEE Transactions on Pattern Analysis and Machine Intelligence, 29:7, (1118-1132), Online publication date: 1-Jul-2007.
  556. Fukasawa R and Goycoolea M On the Exact Separation of Mixed Integer Knapsack Cuts Proceedings of the 12th international conference on Integer Programming and Combinatorial Optimization, (225-239)
  557. Hori M, Kanbara M and Yokoya N Novel stereoscopic view generation by image-based rendering coordinated with depth information Proceedings of the 15th Scandinavian conference on Image analysis, (193-202)
  558. Tyugu E Algorithms and Architectures of Artificial Intelligence Proceedings of the 2007 conference on Algorithms and Architectures of Artificial Intelligence, (1-171)
  559. Chen G, Low C and Yang Z Extremal search of decision policies for scalable distributed applications Proceedings of the 2nd international conference on Scalable information systems, (1-8)
  560. Vidács A and Virtamo J Minimum transmission energy trajectories for a linear pursuit problem Proceedings of the 1st EuroFGI international conference on Network control and optimization, (286-295)
  561. Ćalić J and Campbell N (2007). Compact visualisation of video summaries, EURASIP Journal on Advances in Signal Processing, 2007:2, (17-17), Online publication date: 1-Jun-2007.
  562. Itoh H and Nakamura K (2007). Partially observable Markov decision processes with imprecise parameters, Artificial Intelligence, 171:8-9, (453-490), Online publication date: 1-Jun-2007.
  563. ACM
    Dolgov D, James M and Samples M Combinatorial resource scheduling for multiagent MDPs Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems, (1-8)
  564. Müller S and Supatgiat C (2007). A quantitative optimization model for dynamic risk-based compliance management, IBM Journal of Research and Development, 51:3, (295-307), Online publication date: 1-May-2007.
  565. Jung H and Pedram M Dynamic power management under uncertain information Proceedings of the conference on Design, automation and test in Europe, (1060-1065)
  566. Lazarev A (2007). Graphic approach to combinatorial optimization, Automation and Remote Control, 68:4, (583-592), Online publication date: 1-Apr-2007.
  567. Lanotte R, Maggiolo-Schettini A and Troina A (2007). Parametric probabilistic transition systems for system design and analysis, Formal Aspects of Computing, 19:1, (93-109), Online publication date: 1-Mar-2007.
  568. Guo X (2007). Continuous-Time Markov Decision Processes with Discounted Rewards, Mathematics of Operations Research, 32:1, (73-87), Online publication date: 1-Feb-2007.
  569. Pham T (2007). Spectral distortion measures for biological sequence comparisons and database searching, Pattern Recognition, 40:2, (516-529), Online publication date: 1-Feb-2007.
  570. Zhu X and Wu X Mining complex patterns across sequences with gap requirements Proceedings of the 20th international joint conference on Artificial intelligence, (2934-2940)
  571. Trevizan F, Cozman F and De Barros L Planning under risk and Knightian uncertainty Proceedings of the 20th international joint conference on Artificial intelligence, (2023-2028)
  572. Dai P and Goldsmith J Topological value iteration algorithm for Markov decision processes Proceedings of the 20th international joint conference on Artificial intelligence, (1860-1865)
  573. Jodogne S and Piater J (2007). Closed-loop learning of visual control policies, Journal of Artificial Intelligence Research, 28:1, (349-391), Online publication date: 1-Jan-2007.
  574. Yamasaki T and Aizawa K (2007). Motion segmentation and retrieval for 3D video based on modified shape distribution, EURASIP Journal on Advances in Signal Processing, 2007:1, (211-211), Online publication date: 1-Jan-2007.
  575. Malik A and Choi T (2007). Consideration of illumination effects and optimization of window size for accurate calculation of depth map for 3D shape recovery, Pattern Recognition, 40:1, (154-170), Online publication date: 1-Jan-2007.
  576. ACM
    Bordeaux L, Hamadi Y and Zhang L (2006). Propositional Satisfiability and Constraint Programming, ACM Computing Surveys, 38:4, (12-es), Online publication date: 25-Dec-2006.
  577. Porta J, Vlassis N, Spaan M and Poupart P (2006). Point-Based Value Iteration for Continuous POMDPs, The Journal of Machine Learning Research, 7, (2329-2367), Online publication date: 1-Dec-2006.
  578. ACM
    Calic J and Campbell N Optimising video summaries for mobile devices using visual attention modelling Proceedings of the 2nd international conference on Mobile multimedia communications, (1-5)
  579. Muldoon C, O'Hare G and O'Grady M Managing resources in constrained environments with autonomous agents Proceedings of the 7th international conference on Engineering societies in the agents world VII, (320-339)
  580. Hölldobler S, Karabaev E and Skvortsova O (2006). FLUCAP, Journal of Artificial Intelligence Research, 27:1, (419-439), Online publication date: 1-Sep-2006.
  581. Pineau J, Gordon G and Thrun S (2006). Anytime point-based approximations for large POMDPs, Journal of Artificial Intelligence Research, 27:1, (335-380), Online publication date: 1-Sep-2006.
  582. Kveton B, Hauskrecht M and Guestrin C (2006). Solving factored MDPs with hybrid state and action variables, Journal of Artificial Intelligence Research, 27:1, (153-201), Online publication date: 1-Sep-2006.
  583. Steffen P and Giegerich R (2006). Table design in dynamic programming, Information and Computation, 204:9, (1325-1345), Online publication date: 1-Sep-2006.
  584. Ahmed N, Lu X and Barbosa L (2006). An efficient parallel optimization algorithm for the Token Bucket control mechanism, Computer Communications, 29:12, (2281-2293), Online publication date: 1-Aug-2006.
  585. ACM
    Lee C and Lasenby J 3D human motion compression using wavelet decomposition ACM SIGGRAPH 2006 Research posters, (104-es)
  586. Li H, Liao X and Carin L Incremental least squares policy iteration for POMDPs Proceedings of the 21st national conference on Artificial intelligence - Volume 2, (1167-1172)
  587. Kveton B and Hauskrecht M Learning basis functions in hybrid domains Proceedings of the 21st national conference on Artificial intelligence - Volume 2, (1161-1166)
  588. Guérin J, Marcotte P and Savard G (2006). An optimal adaptive algorithm for the approximation of concave functions, Mathematical Programming: Series A and B, 107:3, (357-366), Online publication date: 1-Jul-2006.
  589. ACM
    Ager M, Danvy O and Rohde H (2006). Fast partial evaluation of pattern matching in strings, ACM Transactions on Programming Languages and Systems, 28:4, (696-714), Online publication date: 1-Jul-2006.
  590. Vetrov D and Kropotov D (2006). Application of probability filter to signal processing problems, Pattern Recognition and Image Analysis, 16:3, (478-485), Online publication date: 1-Jul-2006.
  591. Ke J (2006). Optimal NT policies for M/G/1 system with a startup and unreliable server, Computers and Industrial Engineering, 50:3, (248-262), Online publication date: 1-Jul-2006.
  592. ACM
    Acar U, Blelloch G, Blume M and Tangwongsan K An experimental analysis of self-adjusting computation Proceedings of the 27th ACM SIGPLAN Conference on Programming Language Design and Implementation, (96-107)
  593. ACM
    Acar U, Blelloch G, Blume M and Tangwongsan K (2006). An experimental analysis of self-adjusting computation, ACM SIGPLAN Notices, 41:6, (96-107), Online publication date: 11-Jun-2006.
  594. Desharnais J, Laviolette F, Moturu K and Zhioua S Trace equivalence characterization through reinforcement learning Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence, (371-382)
  595. Bonet B and Geffner H Learning depth-first search Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling, (142-151)
  596. Kveton B and Hauskrecht M Solving factored MDPs with exponential-family transition models Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling, (114-120)
  597. Galand L and Perny P Search for Compromise Solutions in Multiobjective State Space Graphs Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy, (93-97)
  598. Jan R, Lin C and Chern M (2006). An optimization model for Web content adaptation, Computer Networks: The International Journal of Computer and Telecommunications Networking, 50:7, (953-965), Online publication date: 15-May-2006.
  599. Pollack J (2006). Mindless Intelligence, IEEE Intelligent Systems, 21:3, (50-56), Online publication date: 1-May-2006.
  600. Lui S, Horner A and Ayers L (2006). MIDI to SP-MIDI Transcoding Using Phrase Stealing, IEEE MultiMedia, 13:2, (52-59), Online publication date: 1-Apr-2006.
  601. Kim Y and Moon B (2006). Multicampaign Assignment Problem, IEEE Transactions on Knowledge and Data Engineering, 18:3, (405-414), Online publication date: 1-Mar-2006.
  602. Liu Z and Kang S (2006). Properties of Solutions for Certain Functional Equations Arising in Dynamic Programming, Journal of Global Optimization, 34:2, (273-292), Online publication date: 1-Feb-2006.
  603. Chowdhury R and Ramachandran V Cache-oblivious dynamic programming Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithms, (591-600)
  604. Haslum P (2006). Improving heuristics through relaxed search, Journal of Artificial Intelligence Research, 25:1, (233-267), Online publication date: 1-Jan-2006.
  605. Fern A, Yoon S and Givan R (2006). Approximate policy iteration with a policy language bias, Journal of Artificial Intelligence Research, 25:1, (75-118), Online publication date: 1-Jan-2006.
  606. Falelakis M, Diou C and Delopoulos A (2006). Semantic identification, EURASIP Journal on Advances in Signal Processing, 2006, (183-183), Online publication date: 1-Jan-2006.
  607. Ishibashi B and Boutaba R (2005). Topology and mobility considerations in mobile ad hoc networks, Ad Hoc Networks, 3:6, (762-776), Online publication date: 1-Nov-2005.
  608. ACM
    Rotem D, Stockinger K and Wu K Optimizing candidate check costs for bitmap indices Proceedings of the 14th ACM international conference on Information and knowledge management, (648-655)
  609. Girdziušas R and Laaksonen J Optimal stopping and constraints for diffusion models of signals with discontinuities Proceedings of the 16th European conference on Machine Learning, (576-583)
  610. Kang H (2005). G-wire, Pattern Recognition Letters, 26:13, (2042-2051), Online publication date: 1-Oct-2005.
  611. Son H, Kim H, Kim H and Chong K Feasibility of the circularly connected analog CNN cell array-based viterbi decoder Proceedings of the 8th international conference on Parallel Computing Technologies, (151-158)
  612. Dormido Canto S, de Madrid A and Bencomo S (2005). Parallel Dynamic Programming on Clusters of Workstations, IEEE Transactions on Parallel and Distributed Systems, 16:9, (785-798), Online publication date: 1-Sep-2005.
  613. ACM
    Jodogne S and Piater J Interactive learning of mappings from visual percepts to actions Proceedings of the 22nd international conference on Machine learning, (393-400)
  614. ACM
    Sun J, Yuan L, Jia J and Shum H Image completion with structure propagation ACM SIGGRAPH 2005 Papers, (861-868)
  615. ACM
    Li Y, Sun J and Shum H Video object cut and paste ACM SIGGRAPH 2005 Papers, (595-600)
  616. Sanner S and McAllester D Affine algebraic decision diagrams (AADDs) and their application to structured probabilistic inference Proceedings of the 19th international joint conference on Artificial intelligence, (1384-1390)
  617. Kveton B and Hauskrecht M An MCMC approach to solving hybrid factored MDPs Proceedings of the 19th international joint conference on Artificial intelligence, (1346-1351)
  618. Dolgov D and Durfee E Stationary deterministic policies for constrained MDPs with multiple rewards, costs, and discount factors Proceedings of the 19th international joint conference on Artificial intelligence, (1326-1331)
  619. Wilson N Decision diagrams for the computation of semiring valuations Proceedings of the 19th international joint conference on Artificial intelligence, (331-336)
  620. Montani S, Terenziani P and Bottrighi A Exploiting decision theory for supporting therapy selection in computerized clinical guidelines Proceedings of the 10th conference on Artificial Intelligence in Medicine, (136-140)
  621. Spaan M and Vlassis N (2005). Perseus, Journal of Artificial Intelligence Research, 24:1, (195-220), Online publication date: 1-Jul-2005.
  622. ACM
    Sun J, Yuan L, Jia J and Shum H (2005). Image completion with structure propagation, ACM Transactions on Graphics, 24:3, (861-868), Online publication date: 1-Jul-2005.
  623. ACM
    Li Y, Sun J and Shum H (2005). Video object cut and paste, ACM Transactions on Graphics, 24:3, (595-600), Online publication date: 1-Jul-2005.
  624. Kun Z, Heng W and Feng-Yu L (2005). Distributed multicast routing for delay and delay variation-bounded Steiner tree using simulated annealing, Computer Communications, 28:11, (1356-1370), Online publication date: 1-Jul-2005.
  625. Pieraccini R and Lubensky D Spoken language communication with machines Proceedings of the 18th international conference on Innovations in Applied Artificial Intelligence, (6-15)
  626. ACM
    LeFevre K, DeWitt D and Ramakrishnan R Incognito Proceedings of the 2005 ACM SIGMOD international conference on Management of data, (49-60)
  627. de Cooman G and Troffaes M (2005). Dynamic programming for deterministic discrete-time systems with uncertain gain, International Journal of Approximate Reasoning, 39:2-3, (257-278), Online publication date: 1-Jun-2005.
  628. Kersting K An Inductive Logic Programming Approach to Statistical Relational Learning Proceedings of the 2005 conference on An Inductive Logic Programming Approach to Statistical Relational Learning, (1-228)
  629. Wörgötter F and Porr B (2005). Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms, Neural Computation, 17:2, (245-319), Online publication date: 1-Feb-2005.
  630. Zhang W and Zhang N (2005). Restricted value iteration, Journal of Artificial Intelligence Research, 23:1, (123-165), Online publication date: 1-Jan-2005.
  631. Porta J and Celaya E (2005). Reinforcement learning for agents with many sensors and actuators acting in categorizable environments, Journal of Artificial Intelligence Research, 23:1, (79-122), Online publication date: 1-Jan-2005.
  632. Chugh A and Hybinette M Towards adaptive caching for parallel and discrete event simulation Proceedings of the 36th conference on Winter simulation, (336-344)
  633. Sallans B and Hinton G (2004). Reinforcement Learning with Factored States and Actions, The Journal of Machine Learning Research, 5, (1063-1088), Online publication date: 1-Dec-2004.
  634. Wagner H and Whitin T (2004). Dynamic Version of the Economic Lot Size Model, Management Science, 50:12 Supplement, (1770-1774), Online publication date: 1-Dec-2004.
  635. ACM
    van den Broek P and Noppen J (2004). Comparison of two approaches to dynamic programming, ACM SIGACT News, 35:4, (111-116), Online publication date: 1-Dec-2004.
  636. Minoux M (2004). Polynomial approximation schemes and exact algorithms for optimum curve segmentation problems, Discrete Applied Mathematics, 144:1-2, (158-172), Online publication date: 1-Nov-2004.
  637. ACM
    Angiolini F, Menichelli F, Ferrero A, Benini L and Olivieri M A post-compiler approach to scratchpad mapping of code Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems, (259-267)
  638. Guestrin C, Hauskrecht M and Kveton B Solving factored MDPs with continuous and discrete variables Proceedings of the 20th conference on Uncertainty in artificial intelligence, (235-242)
  639. Greenwald A and Boyan J Bidding under uncertainty Proceedings of the 20th conference on Uncertainty in artificial intelligence, (209-216)
  640. ACM
    Kersting K, Otterlo M and De Raedt L Bellman goes relational Proceedings of the twenty-first international conference on Machine learning
  641. Koenig S, Likhachev M, Liu Y and Furcy D (2004). Incremental heuristic search in AI, AI Magazine, 25:2, (99-112), Online publication date: 1-Jun-2004.
  642. Koenig S, Likhachev M, Liu Y and Furcy D (2004). Incremental Heuristic Search in AI, AI Magazine, 25:2, (99-112), Online publication date: 1-Jun-2004.
  643. Wei J (2004). Markov Edit Distance, IEEE Transactions on Pattern Analysis and Machine Intelligence, 26:3, (311-321), Online publication date: 1-Mar-2004.
  644. Liberatore P (2004). On polynomial sized MDP succinct policies, Journal of Artificial Intelligence Research, 21:1, (551-577), Online publication date: 1-Jan-2004.
  645. Falcão A, Stolfi J and de Alencar Lotufo R (2004). The Image Foresting Transform, IEEE Transactions on Pattern Analysis and Machine Intelligence, 26:1, (19-29), Online publication date: 1-Jan-2004.
  646. ACM
    Marinov D and O'Callahan R (2003). Object equality profiling, ACM SIGPLAN Notices, 38:11, (313-325), Online publication date: 26-Nov-2003.
  647. ACM
    Angiolini F, Benini L and Caprara A Polynomial-time algorithm for on-chip scratchpad memory partitioning Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, (318-326)
  648. ACM
    Marinov D and O'Callahan R Object equality profiling Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, (313-325)
  649. Larsen K, Larsson F, Pettersson P and Yi W (2003). Compact Data Structures and State-Space Reduction for Model-Checking Real-Time Systems, Real-Time Systems, 25:2-3, (255-275), Online publication date: 1-Sep-2003.
  650. Christofides S, Christofides A and Christofides N (2003). The design of corporate tax structures, Mathematical Programming: Series A and B, 98:1-3, (493-510), Online publication date: 1-Sep-2003.
  651. Bonet B and Geffner H Faster heuristic search algorithms for planning with uncertainty and full feedback Proceedings of the 18th international joint conference on Artificial intelligence, (1233-1238)
  652. Luiz da Silva E and Finardi E (2003). Parallel Processing Applied to the Planning of Hydrothermal Systems, IEEE Transactions on Parallel and Distributed Systems, 14:8, (721-729), Online publication date: 1-Aug-2003.
  653. Šter B and Dobnikar A (2003). Adaptive Radial Basis Decomposition by Learning Vector Quantization, Neural Processing Letters, 18:1, (17-27), Online publication date: 1-Aug-2003.
  654. ACM
    Liu Y, Goodwin R and Koenig S Risk-averse auction agents Proceedings of the second international joint conference on Autonomous agents and multiagent systems, (353-360)
  655. Price B and Boutilier C (2003). Accelerating reinforcement learning through implicit imitation, Journal of Artificial Intelligence Research, 19:1, (569-629), Online publication date: 1-Jul-2003.
  656. Guestrin C, Koller D, Parr R and Venkataraman S (2003). Efficient solution algorithms for factored MDPs, Journal of Artificial Intelligence Research, 19:1, (399-468), Online publication date: 1-Jul-2003.
  657. Knopov P and Maryanovich T (2003). On Some Actual Problems of Estimating Risk in Complex Systems under Insufficient Information, Cybernetics and Systems Analysis, 39:4, (576-585), Online publication date: 1-Jul-2003.
  658. Datta A, Choudhary A, Bittner M and Dougherty E (2003). External Control in Markovian Genetic Regulatory Networks, Machine Learning, 52:1-2, (169-191), Online publication date: 1-Jul-2003.
  659. Majercik S and Littman M (2003). Contingent planning under uncertainty via stochastic satisfiability, Artificial Intelligence, 147:1-2, (119-162), Online publication date: 1-Jul-2003.
  660. Givan R, Dean T and Greig M (2003). Equivalence notions and model minimization in Markov decision processes, Artificial Intelligence, 147:1-2, (163-223), Online publication date: 1-Jul-2003.
  661. Liu Y and Stoller S (2003). Dynamic Programming via Static Incrementalization, Higher-Order and Symbolic Computation, 16:1-2, (37-62), Online publication date: 1-Mar-2003.
  662. ACM
    Acar U, Blelloch G and Harper R (2003). Selective memoization, ACM SIGPLAN Notices, 38:1, (14-25), Online publication date: 15-Jan-2003.
  663. ACM
    Acar U, Blelloch G and Harper R Selective memoization Proceedings of the 30th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, (14-25)
  664. Fleischer L and Sethuraman J Approximately optimal control of fluid networks Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms, (56-65)
  665. Cai M, Deng X and Wang L (2003). Approximate sequencing for variable length tasks, Theoretical Computer Science, 290:3, (2037-2044), Online publication date: 3-Jan-2003.
  666. Lee I, Lau H and Wai L An experimental evaluation of reinforcement learning for gain scheduling Design and application of hybrid intelligent systems, (351-360)
  667. Achir N and Pujolle G Multi-object video rate control Network control and engineering for Qos, security and mobility II, (191-202)
  668. Sebastian T, Klein P and Kimia B (2003). On Aligning Curves, IEEE Transactions on Pattern Analysis and Machine Intelligence, 25:1, (116-125), Online publication date: 1-Jan-2003.
  669. Wallace R (2002). A family of restricted subadditive recursions, Discrete Applied Mathematics, 124:1-3, (127-139), Online publication date: 15-Dec-2002.
  670. ACM
    Grigoras R, Charvillat V and Douze M Optimizing hypervideo navigation using a Markov decision process approach Proceedings of the tenth ACM international conference on Multimedia, (39-48)
  671. Liu C, Koga M and Fujisawa H (2002). Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading, IEEE Transactions on Pattern Analysis and Machine Intelligence, 24:11, (1425-1437), Online publication date: 1-Nov-2002.
  672. Ormoneit D and Sen Ś (2002). Kernel-Based Reinforcement Learning, Machine Learning, 49:2-3, (161-178), Online publication date: 1-Nov-2002.
  673. Senkul S and Polat F (2002). Learning intelligent behavior in a non-stationary and partially observable environment, Artificial Intelligence Review, 18:2, (97-115), Online publication date: 1-Oct-2002.
  674. Smith A (2002). Applications of the self-organising map to reinforcement learning, Neural Networks, 15:8-9, (1107-1124), Online publication date: 1-Oct-2002.
  675. ACM
    Zhang H and Hou J A scheduling algorithm for transporting variable rate coded voice in bluetooth networks Proceedings of the 5th ACM international workshop on Wireless mobile multimedia, (25-32)
  676. Schwind M and Wendt O Dynamic Pricing of Information Products Based on Reinforcement Learning Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence, (51-66)
  677. Ho Y and Liu R (2002). A Novel Routing Protocol for Supporting QoS for Ad Hoc Mobile Wireless Networks, Wireless Personal Communications: An International Journal, 22:3, (359-385), Online publication date: 1-Sep-2002.
  678. Geffner H Perspectives on artificial intelligence planning Eighteenth national conference on Artificial intelligence, (1013-1023)
  679. Liberatore P The size of MDP factored policies Eighteenth national conference on Artificial intelligence, (267-272)
  680. Andre D and Russell S State abstraction for programmable reinforcement learning agents Eighteenth national conference on Artificial intelligence, (119-125)
  681. ACM
    Hanna H and Mouaddib A Task selection problem under uncertainty as decision-making Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3, (1303-1308)
  682. ACM
    Yokoo M and Suzuki K Secure multi-agent dynamic programming based on homomorphic encryption and its application to combinatorial auctions Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1, (112-119)
  683. Hao J and Li X (2002). Word spotting based on a posterior measure of keyword confidence, Journal of Computer Science and Technology, 17:4, (491-497), Online publication date: 1-Jul-2002.
  684. Peters J and van der Smagt P (2002). Searching a Scalable Approach to Cerebellar Based Control, Applied Intelligence, 17:1, (11-33), Online publication date: 5-Jun-2002.
  685. Tsybakov B (2002). Optimum Discarding in a Bufferless System, Queueing Systems: Theory and Applications, 41:1/2, (165-197), Online publication date: 1-Jun-2002.
  686. Gabasov R, Kirillova F and Balashevich N (2002). Synthesis of Optimal Closed Systems, Cybernetics and Systems Analysis, 38:3, (396-411), Online publication date: 1-May-2002.
  687. Kostenko V (2002). The Problem of Schedule Construction in the Joint Design of Hardware and Software, Programming and Computer Software, 28:3, (162-173), Online publication date: 1-May-2002.
  688. Muromtsev D, Muromtsev Y and Orlova L (2002). Combined Design of Energy-Efficient Control of Multistage Processes, Automation and Remote Control, 63:3, (502-510), Online publication date: 15-Mar-2002.
  689. Gabasov R, Dmitruk N and Kirillova F (2002). Optimization of the Multidimensional Control Systems with Parallelepiped Constraints, Automation and Remote Control, 63:3, (345-366), Online publication date: 15-Mar-2002.
  690. Suzuki K and Yokoo M Secure combinatorial auctions by dynamic programming with polynomial secret sharing Proceedings of the 6th international conference on Financial cryptography, (44-56)
  691. ACM
    Levitin A and Papalaskari M (2002). Using puzzles in teaching algorithms, ACM SIGCSE Bulletin, 34:1, (292-296), Online publication date: 1-Mar-2002.
  692. ACM
    Levitin A and Papalaskari M Using puzzles in teaching algorithms Proceedings of the 33rd SIGCSE technical symposium on Computer science education, (292-296)
  693. Borkar V and Meyn S (2002). Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost, Mathematics of Operations Research, 27:1, (192-209), Online publication date: 1-Feb-2002.
  694. Belker T, Beetz M and Cremers A (2002). Learning of plan execution policies for indoor navigation, AI Communications, 15:1, (3-16), Online publication date: 1-Jan-2002.
  695. Jouffe L Reinforcement learning for fuzzy agents New learning paradigms in soft computing, (181-230)
  696. Xianglong Y, Yuncheng F, Tao L and Fei W Decision making using simulation Proceedings of the 33rd conference on Winter simulation, (905-912)
  697. Deveaux L, Paraschiv C and Latourrette M (2001). Bargaining on an Internet Agent-based Market, Electronic Commerce Research, 1:4, (371-401), Online publication date: 1-Oct-2001.
  698. Lin I and Kung S (2001). Extraction of Video Objects via Surface Optimization and Voronoi Order, Journal of VLSI Signal Processing Systems, 29:1-2, (23-39), Online publication date: 1-Aug-2001.
  699. Boutilier C Planning and programming with first-order markov decision processes Proceedings of the 8th conference on Theoretical aspects of rationality and knowledge, (99-110)
  700. Leloup B and Deveaux L (2001). Dynamic Pricing on the Internet, Electronic Commerce Research, 1:3, (265-276), Online publication date: 1-Jul-2001.
  701. Geusebroek J, Smeulders A and Geerts H (2001). A Minimum Cost Approach for Segmenting Networks of Lines, International Journal of Computer Vision, 43:2, (99-111), Online publication date: 1-Jul-2001.
  702. Gabasov R, Kirillova F and Ruzhitskaya E (2001). The Classical Regulation Problem, Automation and Remote Control, 62:6, (875-885), Online publication date: 19-Jun-2001.
  703. ACM
    Choi S and Liu J A dynamic mechanism for time-constrained trading Proceedings of the fifth international conference on Autonomous agents, (568-575)
  704. Bonet B and Geffner H (2001). Planning and Control in Artificial Intelligence, Applied Intelligence, 14:3, (237-252), Online publication date: 9-May-2001.
  705. Liu J, Maluf D and Desmarais M (2001). A New Uncertainty Measure for Belief Networks with Applications to Optimal Evidential Inferencing, IEEE Transactions on Knowledge and Data Engineering, 13:3, (416-425), Online publication date: 1-May-2001.
  706. ACM
    Hemaspaandra L (2001). SIGACT news complexity theory column 31, ACM SIGACT News, 32:1, (21-31), Online publication date: 1-Mar-2001.
  707. ACM
    Dooly D, Goldman S and Scott S (2001). On-line analysis of the TCP acknowledgment delay problem, Journal of the ACM, 48:2, (243-273), Online publication date: 1-Mar-2001.
  708. ACM
    Laroche P, Boniface Y and Schott R A new decomposition technique for solving Markov decision processes Proceedings of the 2001 ACM symposium on Applied computing, (12-16)
  709. Thrun S (2000). Probabilistic Algorithms in Robotics, AI Magazine, 21:4, (93-109), Online publication date: 1-Dec-2000.
  710. ACM
    Ward D, Blackwell A and MacKay D Dasher—a data entry interface using continuous gestures and language models Proceedings of the 13th annual ACM symposium on User interface software and technology, (129-137)
  711. Ibragimov A (2000). On the Existence and Uniqueness of Equilibrium Situations in Markovian Games with Discounting, Cybernetics and Systems Analysis, 36:6, (925-935), Online publication date: 1-Nov-2000.
  712. Munos R (2000). A Study of Reinforcement Learning in the Continuous Case by the Means of Viscosity Solutions, Machine Learning, 40:3, (265-299), Online publication date: 1-Sep-2000.
  713. ACM
    Aguilera M and Strom R Efficient atomic broadcast using deterministic merge Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing, (209-218)
  714. McCallum A, Nigam K, Rennie J and Seymore K (2000). Automating the Construction of Internet Portals with Machine Learning, Information Retrieval, 3:2, (127-163), Online publication date: 1-Jul-2000.
  715. ACM
    Wolpert D, Kirshner S, Merz C and Tumer K Adaptivity in agent-based routing for data networks Proceedings of the fourth international conference on Autonomous agents, (396-403)
  716. Tambe M and Zhang W (2000). Towards Flexible Teamwork in Persistent Teams, Autonomous Agents and Multi-Agent Systems, 3:2, (159-183), Online publication date: 1-Jun-2000.
  717. ACM
    Orlin J, Schulz A and Sengupta S ε-optimization schemes and L-bit precision (extended abstract) Proceedings of the thirty-second annual ACM symposium on Theory of computing, (565-572)
  718. Singh S, Jaakkola T, Littman M and Szepesvári C (2000). Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms, Machine Learning, 38:3, (287-308), Online publication date: 1-Mar-2000.
  719. Bobick A and Intille S (1999). Large Occlusion Stereo, International Journal of Computer Vision, 33:3, (181-200), Online publication date: 3-Sep-1999.
  720. ACM
    Provost F, Jensen D and Oates T Efficient progressive sampling Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, (23-32)
  721. Hauskrecht M, Pandurangan G and Upfal E Computing near optimal strategies for stochastic investment planning problems Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2, (1310-1315)
  722. Hoey J, St-Aubin R, Hu A and Boutilier C SPUDD Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence, (279-288)
  723. ACM
    Mansour Y Reinforcement learning and mistake bounded algorithms Proceedings of the twelfth annual conference on Computational learning theory, (183-192)
  724. ACM
    Salek A, Lou J and Pedram M MERLIN Proceedings of the 36th annual ACM/IEEE Design Automation Conference, (472-478)
  725. ACM
    Cong J, Fang J and Khoo K VIA design rule consideration in multi-layer maze routing algorithms Proceedings of the 1999 international symposium on Physical design, (214-220)
  726. ACM
    Levitin A Do we teach the right algorithm design techniques? The proceedings of the thirtieth SIGCSE technical symposium on Computer science education, (179-183)
  727. Melamed I (1999). Bitext maps and alignment via pattern recognition, Computational Linguistics, 25:1, (107-130), Online publication date: 1-Mar-1999.
  728. ACM
    Levitin A (1999). Do we teach the right algorithm design techniques?, ACM SIGCSE Bulletin, 31:1, (179-183), Online publication date: 1-Mar-1999.
  729. Greco S (1999). Dynamic Programming in Datalog with Aggregates, IEEE Transactions on Knowledge and Data Engineering, 11:2, (265-283), Online publication date: 1-Mar-1999.
  730. Boutilier C Knowledge representation for stochastic decision processes Artificial intelligence today, (111-152)
  731. Asada M, Suzuki S, Takahashi Y, Uchibe E, Nakamura M, Mishima C, Ishizuka H and Kato T (1998). Trackies, AI Magazine, 19:3, (71-78), Online publication date: 1-Sep-1998.
  732. Clarke F, Hiriart-Urruty J and Ledyaev Y (1998). On Global Optimality Conditions for Nonlinear Optimal Control Problems, Journal of Global Optimization, 13:2, (109-122), Online publication date: 1-Sep-1998.
  733. Walker M, Fromer J and Narayanan S Learning optimal dialogue strategies Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 2, (1345-1351)
  734. Hauskrecht M, Meuleau N, Kaelbling L, Dean T and Boutilier C Hierarchical solution of Markov decision processes using macro-actions Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, (220-229)
  735. Boutilier C, Brafman R and Geib C Structured reachability analysis for Markov decision processes Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, (24-32)
  736. ACM
    Heckbert P Color image quantization for frame buffer display Seminal graphics: pioneering efforts that shaped the field, (335-345)
  737. Kalmár Z, Szepesvári C and Lőrincz A (1998). Module-Based Reinforcement Learning, Autonomous Robots, 5:3-4, (273-295), Online publication date: 1-Jul-1998.
  738. Fuentes O and Nelson R (1998). Learning Dextrous Manipulation Skills for Multifingered Robot Hands Using the Evolution Strategy, Autonomous Robots, 5:3-4, (395-405), Online publication date: 1-Jul-1998.
  739. ACM
    Washington R Markov tracking for agent coordination Proceedings of the second international conference on Autonomous agents, (70-77)
  740. Kalmár Z, Szepesvári C and Lőrincz A (1998). Module-Based Reinforcement Learning, Machine Learning, 31:1-3, (55-85), Online publication date: 1-Apr-1998.
  741. Fuentes O and Nelson R (1998). Learning Dextrous Manipulation Skills for Multifingered Robot Hands Using the Evolution Strategy, Machine Learning, 31:1-3, (223-237), Online publication date: 1-Apr-1998.
  742. Dietterich T (1997). Machine‐Learning Research, AI Magazine, 18:4, (97-136), Online publication date: 1-Dec-1997.
  743. Dietterich T and Flann N (1997). Explanation-Based Learning and Reinforcement Learning, Machine Learning, 28:2-3, (169-210), Online publication date: 1-Sep-1997.
  744. Boutilier C Correlated action effects in decision theoretic regression Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence, (30-37)
  745. ACM
    Lakos J Technology retargeting for IC layout Proceedings of the 34th annual Design Automation Conference, (460-465)
  746. Wang C, Gao C and Shi Z (1997). An Algorithm for Continuous Type Optimal Location Problem, Computational Optimization and Applications, 7:2, (239-253), Online publication date: 1-Mar-1997.
  747. ACM
    Barbuceanu M and Fox M Integrating communicative action, conversations and decision theory to coordinate agents Proceedings of the first international conference on Autonomous agents, (49-58)
  748. Park Y and Cho H (1997). Task Oriented Optimum Positioning of a Mobile Manipulator Base in a Cluttered Environment, Journal of Intelligent and Robotic Systems, 18:2, (147-168), Online publication date: 1-Feb-1997.
  749. Atkeson C, Moore A and Schaal S (1997). Locally Weighted Learning for Control, Artificial Intelligence Review, 11:1-5, (75-113), Online publication date: 1-Feb-1997.
  750. Chand S, Moskowitz H, Novak A, Rekhi I and Sorger G (1996). Capacity Allocation for Dynamic Process Improvement with Quality and Demand Considerations, Operations Research, 44:6, (964-975), Online publication date: 1-Dec-1996.
  751. Bacchus F, Boutilier C and Grove A Rewarding behaviors Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2, (1160-1167)
  752. Boutilier C Planning, learning and coordination in multiagent decision processes Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge, (195-210)
  753. Bertsekas D, Guerriero F and Musmanno R (1996). Parallel asynchronous label-correcting methods for shortest paths, Journal of Optimization Theory and Applications, 88:2, (297-320), Online publication date: 1-Feb-1996.
  754. Parvin B, Peng C, Johnston W and Maestre F (1995). Tracking of Tubular Molecules for Scientific Applications, IEEE Transactions on Pattern Analysis and Machine Intelligence, 17:8, (800-805), Online publication date: 1-Aug-1995.
  755. ACM
    Kimbrough S (1995). APL, dynamic programming, and the optimal control of electromagnetic brake retarders, ACM SIGAPL APL Quote Quad, 25:4, (98-108), Online publication date: 8-Jun-1995.
  756. ACM
    Bernecky R (1995). The role of dynamic programming & control structures in performance, ACM SIGAPL APL Quote Quad, 25:4, (11-19), Online publication date: 8-Jun-1995.
  757. ACM
    Kimbrough S APL, dynamic programming, and the optimal control of electromagnetic brake retarders Proceedings of the international conference on Applied programming languages, (98-108)
  758. ACM
    Bernecky R The role of dynamic programming & control structures in performance Proceedings of the international conference on Applied programming languages, (11-19)
  759. Chou A, Cooperstock J, El-Yaniv R, Klugerman M and Leighton T The statistical adversary allows optimal money-making trading strategies Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms, (467-476)
  760. ACM
    Deering S (1995). Multicast routing in internetworks and extended LANs, ACM SIGCOMM Computer Communication Review, 25:1, (88-101), Online publication date: 11-Jan-1995.
  761. Barto A, Bradtke S and Singh S (1995). Learning to act using real-time dynamic programming, Artificial Intelligence, 72:1-2, (81-138), Online publication date: 1-Jan-1995.
  762. Cassandra A, Kaelbling L and Littman M Acting optimally in partially observable stochastic domains Proceedings of the Twelfth AAAI National Conference on Artificial Intelligence, (1023-1028)
  763. Merialdo B (1994). Tagging English text with a probabilistic model, Computational Linguistics, 20:2, (155-171), Online publication date: 1-Jun-1994.
  764. Sun M (1993). Revised simplex algorithm for finite Markov decision processes, Journal of Optimization Theory and Applications, 79:2, (405-413), Online publication date: 1-Nov-1993.
  765. Chen S Aligning sentences in bilingual corpora using lexical information Proceedings of the 31st annual meeting on Association for Computational Linguistics, (9-16)
  766. ACM
    Harrison R and Glass C Dynamic programming in a pure functional language Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: states of the art and practice, (179-186)
  767. Bird R and de Moor O (1993). List partitions, Formal Aspects of Computing, 5:1, (61-78), Online publication date: 1-Jan-1993.
  768. Debili F and Sammouda E Aligning sentences in bilingual texts Proceedings of the 14th conference on Computational linguistics - Volume 2, (517-524)
  769. ACM
    Sniedovich M and Findlay S (1992). Jogging with APL along the shortest path, ACM SIGAPL APL Quote Quad, 23:1, (221-227), Online publication date: 15-Jul-1992.
  770. ACM
    Sniedovich M and Findlay S Jogging with APL along the shortest path Proceedings of the international conference on APL, (221-227)
  771. ACM
    Smith B and Katz J The range scheduling aid Proceedings of the 3rd international conference on Industrial and engineering applications of artificial intelligence and expert systems - Volume 1, (275-280)
  772. ACM
    Desnoyer J, Dessoude O and Zavidovique B A stochastic approach to sensor fusion and perception control Proceedings of the 3rd international conference on Industrial and engineering applications of artificial intelligence and expert systems - Volume 1, (169-174)
  773. Payton D (1990). Internalized plans, Robotics and Autonomous Systems, 6:1-2, (89-103), Online publication date: 1-Jun-1990.
  774. ACM
    Marcellin M and Fischer T (1990). Generalized predictive TCQ of speech, Communications of the ACM, 33:1, (11-19), Online publication date: 3-Jan-1990.
  775. ACM
    Deering S (1988). Multicast routing in internetworks and extended LANs, ACM SIGCOMM Computer Communication Review, 18:4, (55-64), Online publication date: 1-Aug-1988.
  776. ACM
    Deering S Multicast routing in internetworks and extended LANs Symposium proceedings on Communications architectures and protocols, (55-64)
  777. ACM
    Veronis J Correction of phonographic errors in natural language interfaces Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval, (101-115)
  778. Marlowe T A least cost partition algorithm Proceedings of 1986 ACM Fall joint computer conference, (637-647)
  779. Keil J (1985). Decomposing a Polygon into Simpler Components, SIAM Journal on Computing, 14:4, (799-817), Online publication date: 1-Nov-1985.
  780. Nguyen D (1985). An Analysis of Optimal Advertising Under Uncertainty, Management Science, 31:5, (622-633), Online publication date: 1-May-1985.
  781. Ciancutti M (1985). On Discrete Search for a Multiple Number of Objects, SIAM Journal on Algebraic and Discrete Methods, 6:2, (335-340), Online publication date: 1-Apr-1985.
  782. Helman P and Rosenthal A (1985). A Comprehensive Model of Dynamic Programming, SIAM Journal on Algebraic and Discrete Methods, 6:2, (319-334), Online publication date: 1-Apr-1985.
  783. Newell A (1984). Introduction to the COMTEX Microfiche Edition of Reports on Artificial Intelligence from Carnegie‐Mellon University, AI Magazine, 5:3, (35-39), Online publication date: 1-Sep-1984.
  784. ACM
    Kumar V Integrating knowledge in problem solving search procedures Proceedings of the 1984 annual conference of the ACM on The fifth generation challenge, (5-10)
  785. Anderson E, Nash P and Perold A (1983). Some Properties of a Class of Continuous Linear Programs, SIAM Journal on Control and Optimization, 21:5, (758-765), Online publication date: 1-Sep-1983.
  786. Weste N, Burr D and Ackland B (1983). Dynamic Time Warp Pattern Matching Using an Integrated Multiprocessing Array, IEEE Transactions on Computers, 32:8, (731-744), Online publication date: 1-Aug-1983.
  787. ACM
    Plass M and Stone M Curve-fitting with piecewise parametric cubics Proceedings of the 10th annual conference on Computer graphics and interactive techniques, (229-239)
  788. ACM
    Plass M and Stone M (1983). Curve-fitting with piecewise parametric cubics, ACM SIGGRAPH Computer Graphics, 17:3, (229-239), Online publication date: 1-Jul-1983.
  789. Chuquillanqui S Internal connection problem in large optimized PLAs Proceedings of the 20th Design Automation Conference, (795-802)
  790. ACM
    Blake R (1982). Optimal control of thrashing, ACM SIGMETRICS Performance Evaluation Review, 11:4, (1-10), Online publication date: 1-Dec-1982.
  791. Rodé G (1982). Asymptotic Properties of a Finite State Continuous Time Markov Decision Process, SIAM Journal on Control and Optimization, 20:6, (884-892), Online publication date: 1-Nov-1982.
  792. ACM
    Blake R Optimal control of thrashing Proceedings of the 1982 ACM SIGMETRICS conference on Measurement and modeling of computer systems, (1-10)
  793. ACM
    Heckbert P Color image quantization for frame buffer display Proceedings of the 9th annual conference on Computer graphics and interactive techniques, (297-307)
  794. ACM
    Heckbert P (1982). Color image quantization for frame buffer display, ACM SIGGRAPH Computer Graphics, 16:3, (297-307), Online publication date: 1-Jul-1982.
  795. ACM
    Sniedovich M Use of APL in operations research: an interactive dynamic programming model Proceedings of the international conference on APL, (291-297)
  796. ACM
    Sniedovich M (1981). Use of APL in operations research: an interactive dynamic programming model, ACM SIGAPL APL Quote Quad, 12:1, (291-297), Online publication date: 1-Sep-1981.
  797. ACM
    Price C The assignment of computational tasks among processors in a distributed system Proceedings of the May 4-7, 1981, national computer conference, (291-296)
  798. Tarjan R (1978). Complexity of Combinatorial Algorithms, SIAM Review, 20:3, (457-491), Online publication date: 1-Jul-1978.
  799. ACM
    Giles J and Hoff G Techniques for the development of a multi-objective optimization model Proceedings of the 16th annual Southeast regional conference, (64-67)
  800. ACM
    Lew A (1978). Optimal conversion of extended-entry decision tables with general cost criteria, Communications of the ACM, 21:4, (269-279), Online publication date: 1-Apr-1978.
  801. Dao T Design and implementation of a non-binary code for byte-organized memory with binary and quaternary logics Proceedings of the eighth international symposium on Multiple-valued logic, (55-64)
  802. Jamshidi M and Heidari M (1977). Brief paper, Automatica (Journal of IFAC), 13:3, (287-293), Online publication date: 1-May-1977.
  803. Nashed M (1977). An Algorithmic Approach to Nonlinear Analysis and Optimization (Edward J. Beltrami); The Approximate Minimization of Functionals (James W. Daniel); Approximate Methods in Optimization Problems (Vladimir F. Demyanov and Aleksandr M. Rubinov); Computational Methods in Optimization, SIAM Review, 19:2, (341-358), Online publication date: 1-Apr-1977.
  804. Whittemore A and Saunders S (1977). Optimal Inventory Under Stochastic Demand with Two Supply Options, SIAM Journal on Applied Mathematics, 32:2, (293-305), Online publication date: 1-Mar-1977.
  805. Dudnik E Uncertainty and optimization in the design of building subsystems Proceedings of the 14th Design Automation Conference, (239-243)
  806. ACM
    Misra J A principle of algorithm design on limited problem domain Proceedings of the 13th Design Automation Conference, (479-483)
  807. Basar T (1976). On the uniqueness of the Nash solution in Linear-Quadratic differential Games, International Journal of Game Theory, 5:2-3, (65-90), Online publication date: 1-Jun-1976.
  808. Davis M (1976). The Separation Principle in Stochastic Control via Girsanov Solutions, SIAM Journal on Control and Optimization, 14:1, (176-188), Online publication date: 1-Jan-1976.
  809. Jones R (1975). Comparison Theorems for Matrix Riccati Equations, SIAM Journal on Applied Mathematics, 29:1, (77-90), Online publication date: 1-Jul-1975.
  810. Ghandour E (1974). Initial Value Problem for Boundary Values of a Green’s Function, SIAM Journal on Applied Mathematics, 27:4, (649-655), Online publication date: 1-Dec-1974.
  811. Shiveley M and Salley E The optimal routing of vehicles using the GASP II simulator Proceedings of the 7th conference on Winter simulation - Volume 2, (763-764)
  812. ACM
    Lyon G (1974). Syntax-directed least-errors analysis for context-free languages, Communications of the ACM, 17:1, (3-14), Online publication date: 1-Jan-1974.
  813. ACM
    Lew A Memory allocation in paging systems Proceedings of the ACM annual conference, (232-235)
  814. Ritch P (1973). Discrete optimal control with multiple constraints I: Constraint separation and transformation technique, Automatica (Journal of IFAC), 9:4, (415-429), Online publication date: 1-Jul-1973.
  815. Polak E (1973). An Historical Survey of Computational Methods in Optimal Control, SIAM Review, 15:2, (553-584), Online publication date: 1-Apr-1973.
  816. Saeks R (1973). State in Hilbert Space, SIAM Review, 15:2, (283-308), Online publication date: 1-Apr-1973.
  817. Garey M (1972). Optimal Binary Identification Procedures, SIAM Journal on Applied Mathematics, 23:2, (173-186), Online publication date: 1-Sep-1972.
  818. ACM
    Chang C Dynamic programming as applied to feature subset selection in a pattern recognition system Proceedings of the ACM annual conference - Volume 1, (94-103)
  819. ACM
    Slagle J and Lee R (1971). Application of game tree searching techniques to sequential pattern recognition, Communications of the ACM, 14:2, (103-110), Online publication date: 1-Feb-1971.
  820. Grinold R (1970). Symmetric Duality for Continuous Linear Programs, SIAM Journal on Applied Mathematics, 18:1, (84-97), Online publication date: 1-Jan-1970.
  821. Lu S, Liu H and Li C Manifold Regularized Stacked Autoencoder for Feature Learning 2015 IEEE International Conference on Systems, Man, and Cybernetics, (2950-2955)
  822. Mouhagir H, Talj R, Cherfaoui V, Guillemard F and Aioun F A Markov Decision Process-based approach for trajectory planning with clothoid tentacles 2016 IEEE Intelligent Vehicles Symposium (IV), (1254-1259)
  823. Gritschneder F, Hatzelmann P, Thom M, Kunz F and Dietmayer K Adaptive learning based on guided exploration for decision making at roundabouts 2016 IEEE Intelligent Vehicles Symposium (IV), (433-440)
  824. Sun Y, Uysal-Biyikoglu E, Yates R, Koksal C and Shroff N Update or wait: How to keep your data fresh IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications, (1-9)
  825. Han Z, Tan H, Chen G, Wang R, Chen Y and Lau F Dynamic virtual machine management via approximate Markov decision process IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications, (1-9)
  826. Cao L, Hu B, Dong X, Xiong G, Zhu F, Shen Z, Shen D and Liu Y Two intersections traffic signal control method based on ADHDP 2016 IEEE International Conference on Vehicular Electronics and Safety (ICVES), (1-5)
  827. Välimäki T and Ritala R Optimizing gaze direction in a visual navigation task 2016 IEEE International Conference on Robotics and Automation (ICRA), (1427-1432)
  828. Liu L and Michael N An MDP-based approximation method for goal constrained multi-MAV planning under action uncertainty 2016 IEEE International Conference on Robotics and Automation (ICRA), (56-62)
  829. Hahn J and Zoubir A Risk-sensitive decision making via constrained expected returns 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), (2569-2573)
  830. Liu C, Atkeson C, Feng S and Xinjilefu X Full-body motion planning and control for the car egress task of the DARPA robotics challenge 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids), (527-532)
  831. Zhang X, Wang J and Poor H Reinforcement Learning Based QoS-Provisioning over Energy-Harvesting 5G Wireless Ad-Hoc Networks 2019 IEEE Global Communications Conference (GLOBECOM), (1-6)
  832. Omidvar M, Kazimipour B, Li X and Yao X CBCC3 — A contribution-based cooperative co-evolutionary algorithm with improved exploration/exploitation balance 2016 IEEE Congress on Evolutionary Computation (CEC), (3541-3548)
  833. Malla N, Shrestha D, Ni Z and Tonkoski R Supplementary control for virtual synchronous machine based on adaptive dynamic programming 2016 IEEE Congress on Evolutionary Computation (CEC), (1998-2005)
  834. Suryan V, Sinha A, Malo P and Deb K Handling inverse optimal control problems using evolutionary bilevel optimization 2016 IEEE Congress on Evolutionary Computation (CEC), (1893-1900)
  835. Hu Z, Chen P, Zhu M and Liu P Reinforcement Learning for Adaptive Cyber Defense Against Zero-Day Attacks Adversarial and Uncertain Reasoning for Adaptive Cyber Defense, (54-93)
  836. Miehling E, Rasouli M and Teneketzis D Control-Theoretic Approaches to Cyber-Security Adversarial and Uncertain Reasoning for Adaptive Cyber Defense, (12-28)
  837. ACM
    Kernighan B Optimal segmentation points for programs Proceedings of the second symposium on Operating systems principles, (47-53)
  838. Bellman R (1969). On “Colonel Blotto” and Analogous Games, SIAM Review, 11:1, (66-68), Online publication date: 1-Jan-1969.
  839. Shapiro J (1968). Shortest Route Methods for Finite State Space Deterministic Dynamic Programming Problems, SIAM Journal on Applied Mathematics, 16:6, (1232-1250), Online publication date: 1-Nov-1968.
  840. Hanson M (1968). Duality for a Class of Infinite Programming Problems, SIAM Journal on Applied Mathematics, 16:2, (318-323), Online publication date: 1-Mar-1968.
  841. Pruzan P and Munch-Andersen B (1967). On the Application of Dynamic Programming-Type Algorithms to Antenna Design, SIAM Journal on Applied Mathematics, 15:5, (1113-1129), Online publication date: 1-Sep-1967.
  842. Brogan W (1967). Theory and application of optimal control for distributed parameter systems-I, Automatica (Journal of IFAC), 4:3, (107-120), Online publication date: 1-Aug-1967.
  843. Denardo E (1967). Contraction Mappings in the Theory Underlying Dynamic Programming, SIAM Review, 9:2, (165-177), Online publication date: 1-Apr-1967.
  844. Boylan E (1966). Existence and Uniqueness Theorems for the Optimal Inventory Equation, SIAM Journal on Applied Mathematics, 14:5, (961-969), Online publication date: 1-Sep-1966.
  845. Wong E (1964). A Linear Search Problem, SIAM Review, 6:2, (168-174), Online publication date: 1-Apr-1964.
  846. ACM
    Estrin G and Fuller R Some applications for content-addressable memories Proceedings of the November 12-14, 1963, fall joint computer conference, (495-508)
  847. ACM
    Gluss B (1962). Further remarks on line segment curve-fitting using dynamic programming, Communications of the ACM, 5:8, (441-443), Online publication date: 1-Aug-1962.
  848. ACM
    Mugele R A nonlinear digital optimizing program for process control systems Proceedings of the May 1-3, 1962, spring joint computer conference, (15-32)
  849. ACM
    Kelley J (1961). Techniques for storage allocation algorithms, Communications of the ACM, 4:10, (449-454), Online publication date: 1-Oct-1961.
  850. ACM
    Bellman R (1961). On the approximation of curves by line segments using dynamic programming, Communications of the ACM, 4:6, (284), Online publication date: 1-Jun-1961.
  851. ACM
    Kalman R and Koepcke R The role of digital computers in the dynamic optimization of chemical reactions Papers presented at the March 3-5, 1959, western joint computer conference, (107-116)
Contributors
  • University of Southern California
