In probability theory and machine learning, the multi-armed bandit problem is named from imagining a gambler at a row of slot machines, who has to decide which machines to play, how many times to play each machine and in which order to play them, and whether to continue with the current machine or try a different machine.
30-Day Activity
Event Timeline
1 events · 90 days