UCB algorithm
ConceptMentioned in 1 video
Upper Confidence Bound, a strategy used in multi-armed bandit problems to balance exploration (trying new options) and exploitation (choosing the best known option).
Upper Confidence Bound, a strategy used in multi-armed bandit problems to balance exploration (trying new options) and exploitation (choosing the best known option).