Regret Analysis Of Stochastic And Nonstochastic . . .
| Use attributes for filter ! | |
| Google books | books.google.com |
|---|---|
| Originally published | 2012 |
| Authors | Nicolò Cesa-Bianchi |
| Sébastien Bubeck | |
| Date of Reg. | |
| Date of Upd. | |
| ID | 2024502 |
About Regret Analysis Of Stochastic And Nonstochastic . . .
A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maximize the total payoff obtained in a sequence of allocations. . . .