Exploration Strategies for Model-based Learning in Multi-agent Systems. David Carmel, Shaul Markovitch. 1999. JAAMAS.
A selective macro-learning algorithm and its application to the N × N sliding-tile puzzle. Lev Finkelstein, Shaul Markovitch. 1998. JAIR.