Reference: nareyek-03-reinforcement

Reference Nareyek, A. 2003.
Choosing Search Heuristics by Non-Stationary Reinforcement Learning.
In Resende, M. G. C., and de Sousa, J. P. (eds.), Metaheuristics: Computer Decision-Making, Kluwer Academic Publishers, 523-544.

Search decisions are often made using heuristic methods because real-world applications can rarely be tackled without any heuristics. In many cases, multiple heuristics can potentially be chosen, and it is not clear a priori which would perform best. In this article, we propose a procedure that learns, during the search process, how to select promising heuristics. The learning is based on weight adaptation and can even switch between different heuristics during search. Different variants of the approach are evaluated within a constraint-programming environment.

Keywords: Non-Stationary Reinforcement Learning; Optimization; Local Search; Constraint Programming

Download Rules and Regulations     [PDF; 379 Kb]   [zipped PDF; 344 Kb]     [download viewer]