7th International Conference on Computer and Knowledge Engineering (ICCKE 2017), 2017-10-26

Title: Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems

Authors: Kambiz Shojaee Ghandeshtani, Habib Rajabi Mashhadi

Access to full-text not allowed by authors

Citation: BibTeX | EndNote

Abstract

The multi-arm bandit (MAB) problem is a well-known decision-making problem in which the gambler (operator) seeks the highest-value, best choice among arms with different reward distributions. In recent years, many effective strategies have been proposed for the MAB problem. These strategies mainly try to improve exploitation and efficiency while also maximizing reliability by effectively exploring the search space. Hence, a suitable tradeoff between exploration and exploitation is necessary for achieving the highest expected reward. This paper studies the effect of the assigned initial values as a deciding factor in that tradeoff. In particular, the initial value is examined as an implicit exploration agent in greedy-based selection strategies. Five methods are simulated, compared, and evaluated on a common example case. Contrary to common belief, the results show that greedy-based selection can achieve high efficiency and exploration capability when properly initialized.
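To make the idea of the initial value acting as an implicit exploration agent concrete, the following is a minimal sketch and not the authors' implementation. It assumes Bernoulli arms with hypothetical success probabilities, a constant step size of 0.1 for the value update, and ties broken in favor of the lowest arm index. The function name `run_greedy` and all parameter values are illustrative and do not come from the paper.

```python
import random


def run_greedy(arm_probs, initial_value, steps, step_size=0.1, seed=0):
    """Purely greedy selection on a Bernoulli bandit (illustrative sketch).

    Returns (pull counts per arm, total reward collected).
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    # Every arm starts from the same initial value estimate.
    estimates = [float(initial_value)] * n_arms
    counts = [0] * n_arms
    total_reward = 0.0
    for _ in range(steps):
        # Greedy choice: highest current estimate, lowest index wins ties.
        arm = max(range(n_arms), key=lambda a: estimates[a])
        # Bernoulli reward: 1 with the arm's (hypothetical) probability, else 0.
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        # Constant step-size update (an assumption of this sketch).
        estimates[arm] += step_size * (reward - estimates[arm])
        counts[arm] += 1
        total_reward += reward
    return counts, total_reward


if __name__ == "__main__":
    probs = [0.2, 0.5, 0.8]  # hypothetical arm success probabilities
    for q0 in (0.0, 5.0):
        counts, total = run_greedy(probs, initial_value=q0, steps=1000)
        print(f"initial value {q0}: pulls per arm {counts}, total reward {total:.0f}")
```

In this sketch, an initial value of 0.0 keeps the greedy rule on the first arm forever, because rewards are never negative and ties go to the lowest index. An initial value above the maximum reward (5.0 here) makes every pulled arm look worse than the untried ones, so each arm is tried at least once before the estimates settle.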

Keywords

Reinforcement learning; multi-arm bandit problem; greedy selection

@inproceedings{paperid:1067222,
author = {Shojaee Ghandeshtani, Kambiz and Rajabi Mashhadi, Habib},
title = {Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems},
booktitle = {7th International Conference on Computer and Knowledge Engineering (ICCKE 2017)},
year = {2017},
location = {Mashhad, Iran},
keywords = {Reinforcement learning; multi-arm bandit problem; greedy selection},
}


%0 Conference Proceedings
%T Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems
%A Shojaee Ghandeshtani, Kambiz
%A Rajabi Mashhadi, Habib
%J 7th International Conference on Computer and Knowledge Engineering (ICCKE 2017)
%D 2017
