07 10 UCB Optimistic Initialization

Name: 07 10 UCB Optimistic Initialization
Uploaded: Oct 5, 2020
Duration: 2954 s
Description: The Upper Confidence Bound (UCB) multi-armed bandit algorithm is a statistically principled way to balance exploration and exploitation when making decisions under uncertainty. In this video, I explain and implement UCB. Notebook: https://colab.research.google.com/drive/1egLv7viZQXfqynh6bPli5V_pnwBQ-SQG?usp=sharing
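
For readers who want a concrete picture of the exploration/exploitation balance the description mentions, here is a minimal sketch of the standard UCB1 rule: pick the arm that maximizes its sample mean plus a bonus of sqrt(c * ln t / n). It is not taken from the video or the linked notebook. The function names `ucb1_select` and `run_ucb1`, the Bernoulli arms, the arm probabilities, and the constant `c=2.0` are all assumptions made for illustration.

```python
import math
import random


def ucb1_select(counts, values, t, c=2.0):
    """Return the index of the arm with the highest upper confidence bound."""
    # Play every arm once first so each count is nonzero.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    # Score = sample mean + exploration bonus that shrinks as an arm is pulled.
    scores = [
        values[arm] + math.sqrt(c * math.log(t) / counts[arm])
        for arm in range(len(counts))
    ]
    return max(range(len(scores)), key=scores.__getitem__)


def run_ucb1(arm_probs, steps, seed=0):
    """Simulate UCB1 on hypothetical Bernoulli arms; return counts and means."""
    rng = random.Random(seed)
    counts = [0] * len(arm_probs)
    values = [0.0] * len(arm_probs)
    for t in range(1, steps + 1):
        arm = ucb1_select(counts, values, t)
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the sample mean for the pulled arm.
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values


# Illustrative arm probabilities (assumed, not from the video).
counts, values = run_ucb1([0.2, 0.5, 0.8], steps=2000)
print(counts, values)
```

Arms that have been pulled rarely keep a large bonus, so they are still tried occasionally, while most pulls should go to the arm with the best estimated mean.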

Channel: Pie Labs (1.45K subscribers)

580 views
