Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems

Sébastien Bubeck, Nicolò Cesa-Bianchi

Paperback

Free shipping!

Ships in 1-2 weeks

90,99 €

incl. VAT

A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maximize the total payoff obtained in a sequence of allocations. The name bandit refers to the colloquial term for a slot machine (a "one-armed bandit" in American slang). In a casino, a sequential allocation problem is obtained when the player is facing many slot machines at once (a "multi-armed bandit"), and must repeatedly choose where to insert the nex...

Other customers were also interested in

Differential Equations and Asymptotic …

85,99 €
Weihai Zhang, Lihua …
Stochastic H2/H∞ Control

62,99 €
Radi Petrov …
Deterministic and Stochastic Approaches …

181,99 €
Mark Lawrence …
Stochastic Operations Research

103,99 €
Radi Petrov …
Deterministic and Stochastic Approaches …

242,99 €
Juraj Koščák, Rudolf …
Stochastic Weight Update in Neural …

32,99 €
Stochastic Optimization

101,99 €
Stochastic Control

114,99 €
Manu Joseph, Jeffrey …
Modern Time Series Forecasting with …

58,99 €
Dieter Melkebeek …
A Survey of Lower Bounds for …

76,99 €