Issue482

Title Patterns generated by PatternGenerationEdelkamp are not sorted
Priority bug Status resolved
Superseder Nosy List florian, malte, silvan
Assigned To florian Keywords
Optional summary

Created on 2014-09-28.21:41:34 by florian, last changed by florian.

Messages
msg3602 (view) Author: florian Date: 2014-09-29.15:59:59
Merged.
msg3601 (view) Author: malte Date: 2014-09-29.15:53:14
Yes, please merge. Thanks!
msg3600 (view) Author: florian Date: 2014-09-29.15:50:01
Ok, I inserted the std::sort. Experiments in debug mode show that the assertion
no longer fails, and experiments in release mode show that the change did no
significant damage:

http://ai.cs.unibas.ch/_tmp_files/pommeren/issue482-base-issue482-v1-compare-debug.html#unexplained-errors

http://ai.cs.unibas.ch/_tmp_files/pommeren/issue482-base-issue482-v1-compare-release.html

Should I merge?
msg3598 (view) Author: malte Date: 2014-09-29.11:15:20
This is not time-critical, so if std::sort is fastest, that would be my
preferred solution.
msg3596 (view) Author: florian Date: 2014-09-29.10:22:53
I can think of several ways to get a sorted vector in this function and have
little intuition which would be best.

1) std::sort would result in the simplest code.
2) erase-remove idiom (filtering out the elements from the vector that are not
in the hash_set) should have amortized linear runtime but more complicated code.
I'm not sure if it would pay off on the smallish patterns.
3) We could copy from the hash_set to a std::set and then to the vector.
4) We could use a std::set instead of the hash_set throughout the whole method.
msg3591 (view) Author: malte Date: 2014-09-28.21:43:04
I think it's a good idea to keep the assertion. Sorting patterns is cheap, and
normalized data is often useful, e.g. for debugging. Sorting patterns also helps
finding duplicates.
msg3590 (view) Author: florian Date: 2014-09-28.21:41:34
The pattern database heuristic asserts that its pattern is sorted. I don't know
if this assumption is used somewhere currently. However,
PatternGenerationEdelkamp creates ZeroOnePDBsHeuristics with pattern collections
of unsorted patterns, e.g. in airport/p16.
I had a look at the code and it looks like patterns are sorted up until the call
to remove_irrelevant_variables. This method uses a hash set to filter some
variables from the pattern and iterating over this set returns the variables
without order.

We could either keep the variables sorted here or change the assertion if
unsorted patterns are acceptable.
History
Date User Action Args
2014-09-29 15:59:59floriansetstatus: chatting -> resolved
messages: + msg3602
2014-09-29 15:53:14maltesetmessages: + msg3601
2014-09-29 15:50:01floriansetmessages: + msg3600
2014-09-29 11:57:46silvansetnosy: + silvan
2014-09-29 11:15:20maltesetmessages: + msg3598
2014-09-29 10:22:54floriansetassignedto: florian
messages: + msg3596
2014-09-28 21:43:04maltesetstatus: unread -> chatting
messages: + msg3591
2014-09-28 21:41:34floriancreate