Issue756

Title: Unintended behavior change for reward progress in lazy search
Priority: feature
Status: chatting
Superseder:
Nosy List: jendrik, malte, mkatz, silvan
Assigned To:
Keywords:
Optional summary:

Created on 2017-12-20.12:21:57 by mkatz, last changed by mkatz.

Messages
msg6759 Author: mkatz Date: 2017-12-20.12:27:40
Last before issue77 revision: 874838d3625e
After issue77 revision: bcb6cef0e11a
msg6758 Author: malte Date: 2017-12-20.12:27:32
...or perhaps it's also worth testing cg and cea (both as single heuristics in
lazy search, with preferred operators, of course), as we expect a behaviour
change that can be dramatic, but only for certain heuristics. Or at least that's
what Michael saw in his experiments.
msg6757 Author: malte Date: 2017-12-20.12:26:34
Someone want to set up an experiment for this? The code change to test this is
simple, only a few lines. Michael or I can give pointers to the necessary
changes if it helps.

This is only relevant for configurations involving lazy search and preferred
operators, so I suggest we test lazy search with the FF heuristic and preferred
operators, lama-first, and lama (separately).
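The configurations proposed in this thread could be collected roughly as follows. This is only a sketch: the configuration names, the helper function and the exact command-line strings are assumptions based on the Fast Downward option syntax of the time (`--heuristic` predefinitions, `lazy_greedy`, and the `lama-first` and `seq-sat-lama-2011` aliases), and they should be checked against the revisions being compared. The cg and cea entries come from the follow-up message.

```python
def single_heuristic_config(heuristic):
    """Lazy greedy search with one heuristic that also supplies preferred operators.

    The option strings are an assumption about the planner's command-line
    syntax; adjust them to the revision under test.
    """
    return [
        "--heuristic", f"h={heuristic}()",
        "--search", "lazy_greedy([h], preferred=[h])",
    ]


# Hypothetical configuration names mapped to planner arguments.
CONFIGS = {
    "lazy-ff": single_heuristic_config("ff"),
    "lazy-cg": single_heuristic_config("cg"),
    "lazy-cea": single_heuristic_config("cea"),
    "lama-first": ["--alias", "lama-first"],
    "lama": ["--alias", "seq-sat-lama-2011"],
}

# Revisions named in msg6759: last before and first after the issue77 merge.
REVISIONS = ["874838d3625e", "bcb6cef0e11a"]

for name, args in CONFIGS.items():
    print(name, " ".join(args))
```

Each configuration would be run on both revisions, so that any difference can be attributed to the changed handling of the initial state.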
msg6756 Author: silvan Date: 2017-12-20.12:23:08
Moved summary provided by Michael to Change Note.
msg6755 Author: silvan Date: 2017-12-20.12:22:51
The introduction of EvaluationContext in issue77 changed the behavior of lazy
search.
Before: reward_progress() was invoked every time a new best heuristic value was
found, including for the initial state.
After: reward_progress() is not invoked for the initial state; otherwise the
behavior is the same.

It is not clear which behavior is better, but the change seems to be
unintended. It is caused by the separate handling of the progress check for the
initial state in the newer version.
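
The difference can be illustrated with a small toy model. This is not the Fast Downward code: `ToyOpenList`, `count_rewards` and the `reward_initial_state` flag are hypothetical names, and the search is reduced to a sequence of heuristic values in evaluation order, with the initial state first.

```python
class ToyOpenList:
    """Hypothetical stand-in for an open list that counts progress rewards."""

    def __init__(self):
        self.rewards = 0

    def reward_progress(self):
        self.rewards += 1


def count_rewards(h_values, reward_initial_state):
    """Count reward_progress() calls for a sequence of heuristic values.

    h_values[0] belongs to the initial state. With reward_initial_state=True
    the model mimics the behavior before issue77, with False the behavior
    after it.
    """
    open_list = ToyOpenList()
    best_h = None
    for index, h in enumerate(h_values):
        if best_h is None or h < best_h:
            best_h = h  # new best heuristic value found
            is_initial_state = index == 0
            if reward_initial_state or not is_initial_state:
                open_list.reward_progress()
    return open_list.rewards


h_values = [10, 10, 8, 9, 5]
print(count_rewards(h_values, reward_initial_state=True))   # before issue77
print(count_rewards(h_values, reward_initial_state=False))  # after issue77
```

In this model the two variants always differ by exactly one call, the one for the initial state. Whether that single early boost of preferred operators matters in practice is what the experiments are meant to show.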
History
Date User Action Args
2017-12-20 12:27:40 mkatz set messages: + msg6759
2017-12-20 12:27:32 malte set messages: + msg6758
2017-12-20 12:26:34 malte set messages: + msg6757
2017-12-20 12:23:39 jendrik set nosy: + jendrik
2017-12-20 12:23:08 silvan set messages: + msg6756
2017-12-20 12:22:51 silvan set status: unread -> chatting
nosy: + malte, mkatz, silvan
messages: + msg6755
summary: An introduction of EvaluationContext in issue77 has caused a change in the behavior of the lazy search. Before: reward_progress() would be invoked every time there was a new best heuristic value found, including the initial state. After: reward_progress() is not invoked for the initial state, and otherwise the same. It is not clear which behavior is better, but the change does seem to be unintended and is due to a separate handling of the progress check for the initial state in the newer version. ->
2017-12-20 12:21:57 mkatz create