Created on 2011-01-07.14:23:00 by malte, last changed by clemens.
File name |
Uploaded |
Type |
Edit |
Remove |
all.groups
|
malte,
2011-01-07.18:30:39
|
application/octet-stream |
|
|
domain.pddl
|
malte,
2011-01-07.16:40:32
|
application/octet-stream |
|
|
output
|
malte,
2011-01-07.18:30:19
|
application/octet-stream |
|
|
problem.pddl
|
malte,
2011-01-07.16:40:41
|
application/octet-stream |
|
|
unnamed
|
erez,
2011-01-07.20:00:03
|
text/html |
|
|
unnamed
|
silvia,
2011-01-10.10:35:03
|
text/html |
|
|
msg11257 (view) |
Author: clemens |
Date: 2023-08-02.17:03:34 |
|
We believe the purpose of this issue is served with the integration of issue1036. I hereby mark this issue as resolved. There is however one part of the discussion that might still be relevant for future reference: We would like to investigate in which cases it makes sense to generate a reasonable ordering, as touched upon in msg1150. I've opened issue1104 for this purpos and mentioned the connection to this issue.
|
msg2378 (view) |
Author: malte |
Date: 2012-11-29.17:12:30 |
|
I've been checking on some of our planner configurations and was surprised that
we don't use reasonable orders in BJOLP. I think we discussed a while ago that
and how reasonable orders can be used for admissible heuristics, but never acted
on it.
Adding this comment and changing the title of the issue as a reminder to look
into this later.
When we look into this, we need to revisit the way that the orderings are
handled by the heuristics.
|
msg1643 (view) |
Author: erez |
Date: 2011-08-15.12:15:04 |
|
In the code we don't make a distinction between reached and accepted, we only
keep track of something we call reached, but is really the accepted landmarks.
The code that keeps track of this is still more or less the same as in the
original LAMA (at least, the logic is the same) - once a landmark becomes true,
we check if all of its parents are also reached (according to the same
definition), and if so, add it to reached.
You can have a look at this in LandmarkStatusManager::update_reached_lms(...)
So if I understand Silvia correctly, that means the planner still has this
problem.
|
msg1642 (view) |
Author: silvia |
Date: 2011-08-15.12:15:02 |
|
Sure, we can point out that it should have been done differently.
|
msg1641 (view) |
Author: malte |
Date: 2011-08-15.12:09:07 |
|
> If you're going to write an errata for the paper, note that it's not
> enough to fix the definition in the text... what would also be
> required is to correct the planner code so that it implements the new
> definition and then rerun all the experiments. I don't think we want
> to do this :)
Surely if it's a technical error it should be pointed out even if we don't want
to rerun the experiments?
> If we want to fix this in the planner, we'll have to know for each
> state which landmarks have been true at least once on the path to this
> state. Do we already keep track of this information (e.g. to
> distinguish between using first achievers and all achievers for a
> landmark in the cost partitioning for the admissible LM count
> heuristic), or is this something that would need to be added?
As far as I can tell, we only keep track of the reached landmarks, but not of
the accepted landmarks. Erez knows more about this and can probably usefully
comment on the code -- I don't think I can.
|
msg1640 (view) |
Author: silvia |
Date: 2011-08-15.12:05:02 |
|
Addressing the "theory" question: we wrote in the AAAI paper that "an
accepted landmark is required again if it is not true in s and it is
the greedy-necessary predecessor of some landmark which *is not
accepted*." This should rather be "...and it is the greedy-necessary
predecessor of some landmark which *has never been true*". By using
the former rather than the latter definition in our heuristic, we're
treating all greedy-necessary orderings as if they were necessary
orderings. The result of that is that the heuristic may sometimes
regard a landmark as required again even though we're not actually
sure it's really required again.
If you're going to write an errata for the paper, note that it's not
enough to fix the definition in the text... what would also be
required is to correct the planner code so that it implements the new
definition and then rerun all the experiments. I don't think we want
to do this :)
If we want to fix this in the planner, we'll have to know for each
state which landmarks have been true at least once on the path to this
state. Do we already keep track of this information (e.g. to
distinguish between using first achievers and all achievers for a
landmark in the cost partitioning for the admissible LM count
heuristic), or is this something that would need to be added?
|
msg1584 (view) |
Author: malte |
Date: 2011-08-12.17:40:53 |
|
Wow, it's been 7 months since the last update to this issue.
I think it would be good if we could wrap our head around this conceptually for
the journal paper, and decide if anything else needs to be done from an
implementation perspective.
Can someone who understands the landmark code better than me tell me which of
the concerns raised in msg1125 and msg1150 are now resolved and which are still
valid?
Regarding implementation choices, we should write down somewhere in the code
what the tradeoffs are (how to define reasonable orders w.r.t. simultaneous
achievement, and the two possibilities for implementing the sufficient
criterion, as well as which one we implemented and that we haven't tested the
other one). Basically, it should contain enough information to allows us to
close this issue without losing significant information.
Silvia mentions that the theory of this was wrong, too:
I noticed that this is already wrong in the definitions of the AAAI-08 and
JAIR papers.
Silvia, can you elaborate what exactly is wrong where, and how to fix it?
Regarding the AAAI paper, I'd like to add an erratum to my webpage, and we
should add a TODO into the journal paper that notes that this is wrong elsewhere
and why, and in which cases it can cause trouble (maybe quoting the example in
this issue).
|
msg1160 (view) |
Author: malte |
Date: 2011-01-12.23:32:52 |
|
Addendum: Results here:
http://www.informatik.uni-freiburg.de/~helmert/exp-lama-combined-eval-d-abs.html
http://www.informatik.uni-freiburg.de/~helmert/exp-lama-combined-eval-p-abs.html
The new config is the one called WORK-WORK-WORK-lama. You can see it's slightly
better than the "noreas" configs. It has lower coverage than the configs that
ignore costs, but that's to be expected.
|
msg1159 (view) |
Author: malte |
Date: 2011-01-12.23:11:56 |
|
OK, experiments are completed and solve (a few) more problems than the
corresponding LAMA version without reasonable orders, so this looks good.
Also, I cannot reproduce the failures of the pure LM-count heuristic on simple
Woodworking instances any more.
I'm demoting the priority of this and changing the title to reflect what still
needs to be done (after the IPC).
The queue is empty right now, so I'll start two more experiments with planner
versions that treat costs slightly differently, but I think we can stop messing
with the code for the IPC (also since there's very little time left and it'd be
good to spend that on more pressing things).
|
msg1152 (view) |
Author: malte |
Date: 2011-01-11.21:49:50 |
|
Experiments with the changed definition are running.
|
msg1151 (view) |
Author: malte |
Date: 2011-01-11.21:33:49 |
|
I've incorporated Silvia's suggestion of commenting out case 3. of
"inconsistent" and pushed the change. However, I have not closed the branch yet
since I think we should also explore the alternative solutions for this (maybe
not before the IPC though -- I have very very few cycles to spare at the moment).
Just checking whether any op has a AND b as effects sounds like a good solution
to me, but we'd have to change the code to make that test efficient.
A natural solution would be to include that info in the causal graph, which
already contains something like that info anyway. We'd have to change it so that
existence of edges of particular types (precond->eff, eff->eff) can be
efficiently queried and that the different causes for edges in the causal graph
can be distinguished.
|
msg1150 (view) |
Author: malte |
Date: 2011-01-11.21:30:06 |
|
Quoting from an email by Silvia:
Comment out or remove lines 628--640 in the landmarks_graph.cc file. (That is
case 3. in the interferes() function.)
Explanation:
As described in the LAMA paper on page 20, a reasonable order is added between A
and B if
(i) B is needed at the same time or after A, and
(ii) achieving B before A would mean having to make it false again in order to
achieve A.
We want to have that cases where A and B are both first achieved at the same
time no longer induce a reasonable ordering. So, we change (ii) to say
(ii modified) A and B are not achieved simultaneously and achieving B before A
would mean having to make it false again in order to achieve A.
At the moment, (ii) is approximated to hold if
(I) A and B cannot both be true at the time when A is first achieved, or
(II) B is false one timestep before A is first achieved
The solution now, I think, is to scratch (II). If A and B become true together
for the first time, then (II) can hold and (at the moment) thus justify a
reasonable order. By scratching (II) we avoid this.
Of course, if we do this we may also lose some reasonable orders that we'd
actually like to have (i.e., where A and B do not become true together). But I
can't think of a way of fixing our problem without potentially losing some
desirable orderings as well.
An alternative would be to test for each reasonable order currently generated by
the code whether an operator exists that can make A and B true at the same time,
and if that's the case, not generate the order.
The first solution would lose desirable orders if there exists a landmark X that
is ordered greedy-necessarily before A and is inconsistent with B. The second
solution would lose desirable orders if there's an operator that achieves both A
and B.
Do you have an intuition in which of the two cases we'd lose more desired
orders? I'm leaning towards the first solution because I'd think that case
happens less frequently, but we could experiment with both if you think it's
worthwhile.
|
msg1128 (view) |
Author: malte |
Date: 2011-01-10.11:46:58 |
|
> I see everyone's online now.
> Should we have a skype chat?
We did. :-) The resolution was for Silvia to look into adapting the computation
of reasonable orders so that it creates strict orders, not non-strict ones.
If that turns out to be difficult, a good alternative would be to allow
accepting A and B in the same state if the only order that connects them is a
reasonable order.
Silvia will send me a patch once this is ready. Actually, the best thing would
be to attach it to this issue. I think you can do this with hg diff, hg export,
or hg bundle. The last one is probably strongest in including all hg-relevant
metadata such as log messages and author names. If you use diff or export,
please use the --git option which enables a more useful patch format.
|
msg1127 (view) |
Author: erez |
Date: 2011-01-10.10:55:16 |
|
I see everyone's online now.
Should we have a skype chat?
|
msg1126 (view) |
Author: malte |
Date: 2011-01-10.10:51:25 |
|
Thanks for the analysis! Which concrete solution do you advocate for the IPC?
It'd be good to have this implemented within the next one or two days.
|
msg1125 (view) |
Author: silvia |
Date: 2011-01-10.10:35:03 |
|
I'm not sure if we should generally accept both parent and child landmarks
at the same time if they become true simultaneously. While this would be ok
for reasonable orderings, it is not so for natural, greedy-neccessary or
necessary orderings. Those state that A must be true *strictly before* B in
any valid plan, not at the same time, and we'd lose accuracy if we did not
enforce this knowledge during the search. For example, there might be an
operator that achieves A and B together but leads to a dead-end area of the
search space, whereas in order to reach the goal, we must achieve A strictly
before B. In this case, there exists a strictly-before ordering (natural or
greedy-necessary or necessary) which could be detected. (I'm not sure our
current procedures would find this ordering, certainly not RHW. But at least
in theory this is an issue.)
I'd agree with Malte that conceptually, the alternative approach is better:
not generate any reasonable orderings A-->B in the case where they could
possible be achieved together (i.e., if there exists an operator that can
achieve A and B simultaneously).
I have just realised a further problem of th RHW procedure, and that
concerns the introduction of natural orderings in the post-processing step.
Here, we generate an ordering A-->B if B is not part of the restricted
relaxed planning graph that never adds A. But this includes the case where A
and B could become true simultaneously. So in this case as well, we should
not generate such an ordering if the two could possibly become true at the
same time.
One thing that is furthermore really bad in Malte's example is that LM4 is
considered required again. It is ordered greedy-necessarily before LM1,
which means it must be true one time step before LM1 becomes true *for the
first time*. But once LM1 has been true, we cannot infer that LM4 must be
true again before LM1 becomes true the second time. I noticed that this is
already wrong in the definitions of the AAAI-08 and JAIR papers. It should
be
(a) LM4 is required again if its ordering before LM1 is *necessary*, not
just greedy-necessary
(b) if the ordering is just greedy-necessary, then LM4 is required again
only if LM1 has never been true (rather than: LM1 has not been accepted yet)
We don't have necessary orderings in the code, so (a) is not an issue, but
should be kept in mind. We should fix (b). But if we implement the solution
regarding the reasonable orderings, then every landmark will be accepted the
first time it becomes true (right?), which automatically solves (b).
Otherwise, we'd have to keep track of which landmarks have already been
true, in addition to which landmarks have already been accepted.
|
msg1123 (view) |
Author: erez |
Date: 2011-01-09.09:37:47 |
|
I pushed a branch issue202, which contains the change that a landmark is also
achieved if it's parents are achieved simultaneously.
This solves the small problem Malte created, but I have not run any further
performance tests.
Malte - if you can, please try it more thuroughly.
|
msg1122 (view) |
Author: erez |
Date: 2011-01-08.10:20:34 |
|
This is the issue that Jorg raised (I think about a year ago), and we actually
had a slide about it in our tutorial:
---------------------------------------------------------------------
The definition of reasonable orderings allows an ordering A ->r B
if A and B become true simultaneously.
Two solutions:
1. Accept a landmark if it has been made true at the same time as its
predecessor (Buffet and Hoffmann, 2010)
2. Modify the definition of reasonable orderings to disallow such
orderings
-----------------------------------------------------------------------
I think we can do (1) easily enough.
|
msg1121 (view) |
Author: malte |
Date: 2011-01-08.09:27:31 |
|
> The part that is intuitively wrong here is the way that the reached set
> is defined, not the dead-end detection, so it's worth thinking about
> whether that can be fixed in any way.
A simple solution would be to allow reaching A and B simultaneously even if A
-->r B. I faintly remember that Silvia discussed precisely this at some point,
although I don't know if it was in the LAMA paper or just in our discussions
while it was written.
A better solution might be to not generate reasonable orderings between A and B
in cases where they can be simultaneously made true in the first place.
In both cases, we need to think about whether this fixes the general problem or
just this particular case of it. I remember that Silvia explained how LAMA
sometimes returned a nonzero heuristic value for the goal; this probably
resolves some interesting cases of this, but probably not all of them.
|
msg1117 (view) |
Author: malte |
Date: 2011-01-07.20:20:03 |
|
On 07.01.2011 20:00, Erez Karpas wrote:
>
> Erez Karpas <batman@gmail.com> added the comment:
>
> I still need to go over this some more, but one immediate thing would be to
> cancel the dead end detection of lm count.
> When used with FF, this should not matter (I think).
> I'll look into this tomorrow.
Note the word I put in parentheses: :-)
So far for the analysis. Now the big question is: how to fix this?
Any (principled) ideas?
I don't think this is terribly urgent, so I suggest we think about it a
bit more before implementing something.
The part that is intuitively wrong here is the way that the reached set
is defined, not the dead-end detection, so it's worth thinking about
whether that can be fixed in any way.
|
msg1116 (view) |
Author: erez |
Date: 2011-01-07.20:00:03 |
|
I still need to go over this some more, but one immediate thing would be to
cancel the dead end detection of lm count.
When used with FF, this should not matter (I think).
I'll look into this tomorrow.
On Jan 7, 2011 8:16 PM, "Malte Helmert" <downward.issues@googlemail.com>
wrote:
>
> Malte Helmert <helmert@informatik.uni-freiburg.de> added the comment:
>
> OK, I see what happens here, but I need your help in deciding how to
> fix this, since I don't understand the ideas behind everything
> properly here.
>
> Sorry this is long, but please both of you read it and try to follow
> it closely.
>
> Let me first explain the example problem. In the SAS+ encoding (file
> "output"), it has three state variables, all of which are binary.
> (There's no need to look at the PDDL -- it probably is more confusing
> than helpful since the chosen SAS+ encoding is not immediately obvious
> from it.) Let's call the variables v0, v1, v2 and write a state like
> 100, 011 etc., giving the three variable values in sequence.
>
> The initial state is 111. The goal states are all states of the form
> **0, i.e., where v2=0.
>
> There are three operators:
>
> op1 has precondition v0=1 and assigns v0 and v1 to 0.
> Symbolically: op1: 1** => 00*
>
> op2 has precondition v0=0 and assigns v1 to 0.
> Symbolically: op2: 0** => *0*
> (It turns out this operator is completely useless, but somehow I need
> it to make this example fail. This may either be due to a subtlety in
> the landmarks generation or it is somehow translator-related)
>
> op3 has preconditions v0=0 and v1=0 and assigns v2 to 0.
> Symbolically: op3: 00* => **0.
>
> The (reachable part of the) state space of the problem looks like
> this:
>
> 111 ==op1=> 001 ==op3=> 000
>
> where 111 is the initial state and 000 is the only reachable goal
> state.
>
> Additionally, there are self-loops in state 001 with op2 and in state
> 000 with op2 and op3.
>
> As you can tell from the graph, all facts in this problem are
> landmarks (they all must be set to 1 initially, and they all must be 0
> in the only reachable goal state), and the planner does detect this.
> So there are exactly 6 landmarks, all of them fact landmarks.
>
> In my planner run, the landmark IDs are associated with the different
> facts as follows:
>
> LM0: v2=0 (the goal)
> LM1: v0=0
> LM2: v1=0
> LM3: v2=1
> LM4: v0=1
> LM5: v1=1
>
> The landmark graph looks as follows (ASCII art; view in fixed-width
> font):
>
>
> LM5 ---> LM2 -------> LM0 <--- LM3
> ^ \ ^
> | \ |
> | \ |
> | -----> LM1
> | ^
> | |
> | |
> LM4------------
>
> All orderings are greedy-necessary except for the ordering from LM2 to
> LM1, which is reasonable.
>
> The greedy-necessary orderings all make sense. I don't know the LM
> stuff well enough off-hand to understand how exactly it generates
> those, but they do satisfy the formal definitions.
>
> In particular, LM1 (v0=0) and LM2 (v1=0) are gn-ordered before LM0
> (v2=0) because the only operator that achieves LM2 has LM1 and LM2 in
> its precondition.
>
> LM3 (v2=1) before LM0 is obvious anyway because LM3 is initially true,
> LM0 is initially false, and LM3 the negation of LM0, hence must hold
> when LM0 is first made true.
>
> Finally, LM4 (v0=1) before LM1 (v0=0) and LM2 (v1=0) makes sense
> because LM4 is initially true and it is the precondition of the only
> operator which is initially applicable (op1), and LM1 and LM2 are the
> effects of this operator.
>
> This leaves the reasonable order from LM2 (v1=0) to LM1 (v0=0). I
> don't understand reasonable orders well enough to understand why that
> is there, but one thing that the LAMA paper observes is that if two
> facts are always first added simultaneously, then the definition
> postulates reasonable ordering between them in both directions. This
> is the case here: indeed, v0 and v1 always have the same value in all
> reachable states. Hence, maybe the planner discovers the ordering LM1
> => LM2 and also LM2 => LM1 and then arbitrarily keeps only LM2 => LM1
> in its cycle-breaking code.
>
> Now for what happens during search. I've added some tracing output,
> with which the relevant part of the code looks like this
> {{explanations embedded in double braces}}:
>
> =========================================================================
> Discovered 6 landmarks, of which 0 are disjunctive and 0 are conjunctive
> 7 edges
> Initializing landmarks count heuristic...
> 3 initial landmarks, 1 goal landmarks
> Conducting lazy best first search, (real) bound = 2147483647
>
> {{Now it dumps the first state to evaluate, which is the initial
> state.}}
>
> var0: 1
> var1: 1
> var2: 1
>
> {{Now it dumps which of LM0, ..., LM5 are considered reached in this
> state.}}
>
> not reached: 0
> not reached: 1
> not reached: 2
> reached: 3
> reached: 4
> reached: 5
> {{This makes sense: LM3, LM4, LM5 are initially true; the others are
> not.}}
>
> dead_end: 0
> {{This state, the initial state, is not a dead end. Correct.}}
>
> Best heuristic value: 3 [g=0, 1 evaluated, 0 expanded, t=0s]
> {{This h value also makes sense for the initial state. There are three
> unreached landmarks (LM0, LM1, LM2), and the problem is unit cost.
> We don't do cost partitioning.}}
>
> {{We now dump the next state to evaluate.}}
> var0: 0
> var1: 0
> var2: 1
> {{This makes sense: the state is 001, the only successor of 111.
> We now dump the reached landmarks for this state.}}
> not reached: 0
> not reached: 1
> reached: 2
> reached: 3
> reached: 4
> reached: 5
> {{OK, this output above is the first thing that puzzles me a bit. Of
> course, LM3, LM4 and LM5 are still reached, and additionally LM2 is
> now reached, which also makes sense. But why is LM1 not reached?
> From my weak understanding of how the orderings work, I guess this
> is because it has a landmark ordered before it, LM2, which was not
> reached in the parent state, and hence LM1 cannot be considered
> reached. However, LM2 was reached *simultaneously* with LM1.
> That reminds me that Silvia and I had long discussions about the
> matter of simultaneous achievement of landmarks in the context of
> the LAMA paper, and maybe this is where we see these theoretical
> considerations in action.
>
> Let's look at what we can infer from the given reached set.}}
> required again (lost children): 4
> {{OK, given the above, this may make sense. There is a
> greedy-necessary ordering from LM4 to LM1, and LM4 is not true in
> the current state. Since we have not accepted LM1, LM1 is also
> considered not reached. Therefore, LM4 is considered "required
> again". Makes sense -- the problem here is that LM1 should really
> have been accepted.}}
> OOPS2!
> LM 4: v0=1 [0 parents, 2 children, min_cost = 1, shared_cost = 0, 0
> forward_orders, 0 first_achievers, 0 possible_achievers]
> dead_end: 1
> {{Now the dead-end detection kicks in: we have a required-again
> landmark LM4. However, LM4 doesn't have any achievers at all. There
> is a dead-end check for such situations, which triggers here.}}
> Completely explored state space -- no solution!
> =========================================================================
>
> So far for the analysis. Now the big question is: how to fix this?
> Any (principled) ideas?
>
> _______________________________________________________
> Fast Downward issue tracker <downward.issues@gmail.com>
> <http://issues.fast-downward.org/issue202>
> _______________________________________________________
|
msg1115 (view) |
Author: malte |
Date: 2011-01-07.19:16:25 |
|
OK, I see what happens here, but I need your help in deciding how to
fix this, since I don't understand the ideas behind everything
properly here.
Sorry this is long, but please both of you read it and try to follow
it closely.
Let me first explain the example problem. In the SAS+ encoding (file
"output"), it has three state variables, all of which are binary.
(There's no need to look at the PDDL -- it probably is more confusing
than helpful since the chosen SAS+ encoding is not immediately obvious
from it.) Let's call the variables v0, v1, v2 and write a state like
100, 011 etc., giving the three variable values in sequence.
The initial state is 111. The goal states are all states of the form
**0, i.e., where v2=0.
There are three operators:
op1 has precondition v0=1 and assigns v0 and v1 to 0.
Symbolically: op1: 1** => 00*
op2 has precondition v0=0 and assigns v1 to 0.
Symbolically: op2: 0** => *0*
(It turns out this operator is completely useless, but somehow I need
it to make this example fail. This may either be due to a subtlety in
the landmarks generation or it is somehow translator-related)
op3 has preconditions v0=0 and v1=0 and assigns v2 to 0.
Symbolically: op3: 00* => **0.
The (reachable part of the) state space of the problem looks like
this:
111 ==op1=> 001 ==op3=> 000
where 111 is the initial state and 000 is the only reachable goal
state.
Additionally, there are self-loops in state 001 with op2 and in state
000 with op2 and op3.
As you can tell from the graph, all facts in this problem are
landmarks (they all must be set to 1 initially, and they all must be 0
in the only reachable goal state), and the planner does detect this.
So there are exactly 6 landmarks, all of them fact landmarks.
In my planner run, the landmark IDs are associated with the different
facts as follows:
LM0: v2=0 (the goal)
LM1: v0=0
LM2: v1=0
LM3: v2=1
LM4: v0=1
LM5: v1=1
The landmark graph looks as follows (ASCII art; view in fixed-width
font):
LM5 ---> LM2 -------> LM0 <--- LM3
^ \ ^
| \ |
| \ |
| -----> LM1
| ^
| |
| |
LM4------------
All orderings are greedy-necessary except for the ordering from LM2 to
LM1, which is reasonable.
The greedy-necessary orderings all make sense. I don't know the LM
stuff well enough off-hand to understand how exactly it generates
those, but they do satisfy the formal definitions.
In particular, LM1 (v0=0) and LM2 (v1=0) are gn-ordered before LM0
(v2=0) because the only operator that achieves LM2 has LM1 and LM2 in
its precondition.
LM3 (v2=1) before LM0 is obvious anyway because LM3 is initially true,
LM0 is initially false, and LM3 the negation of LM0, hence must hold
when LM0 is first made true.
Finally, LM4 (v0=1) before LM1 (v0=0) and LM2 (v1=0) makes sense
because LM4 is initially true and it is the precondition of the only
operator which is initially applicable (op1), and LM1 and LM2 are the
effects of this operator.
This leaves the reasonable order from LM2 (v1=0) to LM1 (v0=0). I
don't understand reasonable orders well enough to understand why that
is there, but one thing that the LAMA paper observes is that if two
facts are always first added simultaneously, then the definition
postulates reasonable ordering between them in both directions. This
is the case here: indeed, v0 and v1 always have the same value in all
reachable states. Hence, maybe the planner discovers the ordering LM1
=> LM2 and also LM2 => LM1 and then arbitrarily keeps only LM2 => LM1
in its cycle-breaking code.
Now for what happens during search. I've added some tracing output,
with which the relevant part of the code looks like this
{{explanations embedded in double braces}}:
=========================================================================
Discovered 6 landmarks, of which 0 are disjunctive and 0 are conjunctive
7 edges
Initializing landmarks count heuristic...
3 initial landmarks, 1 goal landmarks
Conducting lazy best first search, (real) bound = 2147483647
{{Now it dumps the first state to evaluate, which is the initial
state.}}
var0: 1
var1: 1
var2: 1
{{Now it dumps which of LM0, ..., LM5 are considered reached in this
state.}}
not reached: 0
not reached: 1
not reached: 2
reached: 3
reached: 4
reached: 5
{{This makes sense: LM3, LM4, LM5 are initially true; the others are
not.}}
dead_end: 0
{{This state, the initial state, is not a dead end. Correct.}}
Best heuristic value: 3 [g=0, 1 evaluated, 0 expanded, t=0s]
{{This h value also makes sense for the initial state. There are three
unreached landmarks (LM0, LM1, LM2), and the problem is unit cost.
We don't do cost partitioning.}}
{{We now dump the next state to evaluate.}}
var0: 0
var1: 0
var2: 1
{{This makes sense: the state is 001, the only successor of 111.
We now dump the reached landmarks for this state.}}
not reached: 0
not reached: 1
reached: 2
reached: 3
reached: 4
reached: 5
{{OK, this output above is the first thing that puzzles me a bit. Of
course, LM3, LM4 and LM5 are still reached, and additionally LM2 is
now reached, which also makes sense. But why is LM1 not reached?
From my weak understanding of how the orderings work, I guess this
is because it has a landmark ordered before it, LM2, which was not
reached in the parent state, and hence LM1 cannot be considered
reached. However, LM2 was reached *simultaneously* with LM1.
That reminds me that Silvia and I had long discussions about the
matter of simultaneous achievement of landmarks in the context of
the LAMA paper, and maybe this is where we see these theoretical
considerations in action.
Let's look at what we can infer from the given reached set.}}
required again (lost children): 4
{{OK, given the above, this may make sense. There is a
greedy-necessary ordering from LM4 to LM1, and LM4 is not true in
the current state. Since we have not accepted LM1, LM1 is also
considered not reached. Therefore, LM4 is considered "required
again". Makes sense -- the problem here is that LM1 should really
have been accepted.}}
OOPS2!
LM 4: v0=1 [0 parents, 2 children, min_cost = 1, shared_cost = 0, 0
forward_orders, 0 first_achievers, 0 possible_achievers]
dead_end: 1
{{Now the dead-end detection kicks in: we have a required-again
landmark LM4. However, LM4 doesn't have any achievers at all. There
is a dead-end check for such situations, which triggers here.}}
Completely explored state space -- no solution!
=========================================================================
So far for the analysis. Now the big question is: how to fix this?
Any (principled) ideas?
|
msg1114 (view) |
Author: malte |
Date: 2011-01-07.18:30:39 |
|
I'm attaching output and all.groups so that you can precisely follow the
explanation of what goes wrong here.
|
msg1110 (view) |
Author: malte |
Date: 2011-01-07.16:40:32 |
|
OK, I found something that looks like a minimal failing example: three state
variables, three operators, three reachable states. There's a solution of length
2, but --search 'lazy_greedy(lmcount(lm_rhw(reasonable_orders=true)))' fails.
Problem attached.
|
msg1109 (view) |
Author: malte |
Date: 2011-01-07.15:12:26 |
|
OK, this is already present in the very first trunk revision that includes the
landmark code (r118) if we enable reasonable orders there by changing the code.
|
msg1108 (view) |
Author: malte |
Date: 2011-01-07.15:08:11 |
|
OK, this bug is ancient. The earliest version where I can reliably reproduce the
failure is r144, from October 20th 2009 (!) I don't think it was actually
introduced in this issue; what this issue did was enable reasonable orders which
were disabled before. So the true cause must be even older.
|
msg1107 (view) |
Author: malte |
Date: 2011-01-07.14:23:00 |
|
The LM-count heuristic fails to solve Woodworking instance #01, even though it
is solvable. For example, it is solved when disabling reasonable orders.
Transcript:
~/tmp/downward/ipc/src/search$ make debug && X=01 &&
../translate/translate.py
../../benchmarks/woodworking-sat08-strips/p$X.pddl &&
../preprocess/preprocess < output.sas && ./downward-1-debug --search
'lazy_greedy(lmcount(lm_rhw(reasonable_orders=true)))' < output
[...]
Simplifying transitions... done!
Initializing Exploration...
Reading invariants from file...
done
Generating landmarks using the RPG/SAS+ approach
approx. reasonable orders
approx. obedient reasonable orders
Removed 0 reasonable or obedient reasonable orders
Landmarks generation time: 0.00308396s
Discovered 23 landmarks, of which 0 are disjunctive and 0 are conjunctive
39 edges
Initializing landmarks count heuristic...
13 initial landmarks, 7 goal landmarks
Conducting lazy best first search, (real) bound = 2147483647
Best heuristic value: 130 [g=0, 1 evaluated, 0 expanded, t=0s]
Best heuristic value: 100 [g=30, 2 evaluated, 1 expanded, t=0s]
Best heuristic value: 90 [g=50, 4 evaluated, 3 expanded, t=0s]
Best heuristic value: 80 [g=60, 6 evaluated, 5 expanded, t=0s]
Completely explored state space -- no solution!
Actual search time: 0s
Initial state h value: 130.
Expanded 242 state(s).
Reopened 0 state(s).
Evaluated 484 state(s).
Evaluations: 484
Generated 1892 state(s).
Search time: 0s
Total time: 0s
Peak memory: 4348 KB
|
|
Date |
User |
Action |
Args |
2023-08-02 17:03:34 | clemens | set | status: in-progress -> resolved nosy:
+ clemens messages:
+ msg11257 |
2012-11-29 17:12:30 | malte | set | messages:
+ msg2378 title: explore alternative definitions for reasonable orderings -> explore the use of reasonable orders in the admissible heuristics and explore alternative definitions |
2011-08-15 12:15:05 | erez | set | messages:
+ msg1643 |
2011-08-15 12:15:02 | silvia | set | messages:
+ msg1642 |
2011-08-15 12:09:07 | malte | set | messages:
+ msg1641 |
2011-08-15 12:05:02 | silvia | set | messages:
+ msg1640 |
2011-08-12 17:40:53 | malte | set | messages:
+ msg1584 |
2011-01-12 23:32:52 | malte | set | messages:
+ msg1160 |
2011-01-12 23:11:56 | malte | set | priority: urgent -> feature messages:
+ msg1159 title: LM-count heuristic fails on solvable problems -> explore alternative definitions for reasonable orderings |
2011-01-11 21:49:51 | malte | set | messages:
+ msg1152 |
2011-01-11 21:33:50 | malte | set | messages:
+ msg1151 |
2011-01-11 21:30:06 | malte | set | messages:
+ msg1150 |
2011-01-10 11:46:58 | malte | set | assignedto: malte -> silvia messages:
+ msg1128 |
2011-01-10 10:55:16 | erez | set | messages:
+ msg1127 |
2011-01-10 10:51:26 | malte | set | messages:
+ msg1126 |
2011-01-10 10:35:04 | silvia | set | files:
+ unnamed messages:
+ msg1125 |
2011-01-09 09:37:47 | erez | set | messages:
+ msg1123 |
2011-01-08 10:20:34 | erez | set | messages:
+ msg1122 |
2011-01-08 09:27:32 | malte | set | messages:
+ msg1121 |
2011-01-07 20:20:03 | malte | set | messages:
+ msg1117 |
2011-01-07 20:00:03 | erez | set | files:
+ unnamed messages:
+ msg1116 |
2011-01-07 19:16:25 | malte | set | messages:
+ msg1115 |
2011-01-07 18:30:39 | malte | set | files:
+ all.groups messages:
+ msg1114 |
2011-01-07 18:30:19 | malte | set | files:
+ output |
2011-01-07 16:40:41 | malte | set | files:
+ problem.pddl |
2011-01-07 16:40:32 | malte | set | files:
+ domain.pddl messages:
+ msg1110 |
2011-01-07 15:12:26 | malte | set | messages:
+ msg1109 |
2011-01-07 15:08:11 | malte | set | messages:
+ msg1108 |
2011-01-07 14:23:00 | malte | create | |
|