Online decision problems with large strategy sets

January 2005

Author:
Robert David Kleinberg
Massachusetts Institute of Technology
,
Adviser:
F. Thomson Leighton
Massachusetts Institute of Technology

Publisher:

Massachusetts Institute of Technology
201 Vassar Street, W59-200 Cambridge, MA
United States

Order Number:AAI0808681

Pages:

Purchase on ProQuest

Bibliometrics

Abstract

In an online decision problem, an algorithm performs a sequence of trials, each of which involves selecting one element from a fixed set of alternatives (the "strategy set") whose costs vary over time. After T trials, the combined cost of the algorithm's choices is compared with that of the single strategy whose combined cost is minimum. Their difference is called regret, and one seeks algorithms which are efficient in that their regret is sublinear in T and polynomial in the problem size. We study an important class of online decision problems called generalized multi-armed bandit problems. In the past such problems have found applications in areas as diverse as statistics, computer science, economic theory, and medical decision-making. Most existing algorithms were efficient only in the case of a small (i.e. polynomial-sized) strategy set. We extend the theory by supplying non-trivial algorithms and lower bounds for cases in which the strategy set is much larger (exponential or infinite) and the cost function class is structured, e.g. by constraining the cost functions to be linear or convex. As applications, we consider adaptive routing in networks, adaptive pricing in electronic markets, and collaborative decision-making by untrusting peers in a dynamic environment. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

Cited By

Contributors

Frank Thomson Leighton
Massachusetts Institute of Technology
- Publication Years1981 - 2007
- Publication counts63
- Citation count1,652
- Available for Download12
- Downloads (cumulative)4,702
- Downloads (12 months)585
- Downloads (6 weeks)84
- Average Downloads per Article392
- Average Citation per Article26
View Full Profile
Robert David Kleinberg
Cornell University
- Publication Years2003 - 2023
- Publication counts129
- Citation count3,630
- Available for Download91
- Downloads (cumulative)39,475
- Downloads (12 months)3,748
- Downloads (6 weeks)590
- Average Downloads per Article434
- Average Citation per Article28
View Full Profile

Index Terms

Recommendations

Complete problems, creative sets and isomorphism conjectures
Read More
Efficient algorithms for online decision problems
Special issue: Learning theory 2003

In an online decision problem, one makes a sequence of decisions without knowledge of the future. Each period, one pays a cost based on the decision and observed state. We give a simple approach for doing nearly as well as the best single decision, ...
Read More
P-Selective Sets and Reducing Search to Decision vs Self-Reducibility

We distinguish self-reducibility of a languageLwith the question of whether search reduces to decision forL. Results include: (i) If NE E, then there exists a setLin NP P such that search reduces to decision forL, search doesnotnonadaptively reduce to ...
Read More

Comments

Browse Theses

Sections

Cited By

Index Terms

Complete problems, creative sets and isomorphism conjectures

Efficient algorithms for online decision problems

P-Selective Sets and Reducing Search to Decision vs Self-Reducibility

Sections

Cited By

Save to Binder

Index Terms

Recommendations

Complete problems, creative sets and isomorphism conjectures

Efficient algorithms for online decision problems

P-Selective Sets and Reducing Search to Decision vs Self-Reducibility