Get word from complement lazy determinization #420

Adda0 · 2024-07-10T06:05:42Z

This PR implements a method to get an arbitrary word from a complement of a language lazily, as requested by #415. That is, we start computing the complement by determinizing the existing NFA while making it complete. We immediately stop when we encounter a macrostate which is not final. Then we can return the access word for this macrostate as an arbitrary word from the complement.

The PR utilizes the overall algorithm of determinization, but does not run nfa::determinize() directly, as there are too many changes to extract any common operations without significantly overengineering the function.
The PR implements the callback function for nfa::determinize() discussed in #415, but since the proper handling of all edge cases proved to be more complex, the callback function is not being used in Nfa::get_word_from_complement() in the end. The callback might yet come in handy for lazily getting an arbitrary word from the language difference, which will be implemented in a future PR.

This PR implements the first requested operation from #415.

…tates

…elation

jurajsic · 2024-07-11T12:49:00Z

src/nfa/operations.cc

@@ -249,7 +249,7 @@ namespace {
            }

            // add moves of S to the sync ex iterator
-            // TODO: shouldn't we also reset first?
+            synchronized_iterator.reset();


What happened here? Is this needed?

Even after looking at it previously, I still cannot explain how it managed to work without the reset. One should intuitively clear the iterator for every macrostate. clear() is a fast O(1) operation, so I added it as this is what I would do if I implemented the function again. I can take a look once more and probably try to debug it to make sure.

Aha, I have done some more digging and I got it now. It works thanks to a lucky coincidence. Since we have been up until now always using the SynchronizedIterator (called advancer further for clarity) only to fully iterate over the whole vectors iterated over, the advancer remains clear after the while loops while(iterator.advance()). That holds only because at the end of every advance(), the iterators kept in the advancer are popped from the advancer when they iterated all the way to their respective end iterators. If we were to stop the iteration earlier for any macrostate, the iterators would remain in the advancer and the next macrostate would only add new iterators to these iterators from the previous macrostate. The advancer would iterate over all of them (the remaining ones and the new one which one actually want to iterate over) together, causing the advance() to not work correctly.

As a rule of thumb, we should therefore always clear our advancers to make sure that this happenstance cannot occur. Henceforth, I consider this change to add clear() everywhere a valid one.

src/nfa/operations.cc

jurajsic

Looks good, can't say I fully understand it, but we will test it by including it in noodler.

Adda0 · 2024-07-12T05:42:13Z

We will see how it works. I used iterators to the subset map, as it is 10 times faster on even the smallest NFAs, and presumably more on larger ones. Whether it is enough remains to be seen.

fix: Make Nfa::get_words() a const method

77d4352

Adda0 force-pushed the get_word_from_complement_lazy_determinization branch from 9e4b886 to 1400462 Compare July 11, 2024 06:52

Adda0 added 5 commits July 11, 2024 09:08

feat: Lazily get an arbitrary word from a complement of an NFA

5c026ef

feat: Add callback to determinize() to handle discovery of new macros…

fbe9f18

…tates

fix: Reset synchronized iterator after each iteration

d670d51

feat: Allow passing alphabet missing some symbols in the transition r…

f8767f2

…elation

perf: Minimize number of copies of state sets

b1dd803

Adda0 force-pushed the get_word_from_complement_lazy_determinization branch from 1400462 to b1dd803 Compare July 11, 2024 07:08

Adda0 requested a review from jurajsic July 11, 2024 07:24

jurajsic reviewed Jul 11, 2024

View reviewed changes

src/nfa/operations.cc Show resolved Hide resolved

jurajsic approved these changes Jul 11, 2024

View reviewed changes

Adda0 merged commit 7b55766 into devel Jul 12, 2024
18 checks passed

Adda0 deleted the get_word_from_complement_lazy_determinization branch July 12, 2024 05:42

This was referenced Jul 12, 2024

fix: Use pointers to key-value pair in unordered map #422

Merged

Language difference #424

Merged

jurajsic mentioned this pull request Jul 21, 2024

Model generation fixing VeriFIT/z3-noodler#156

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get word from complement lazy determinization #420

Get word from complement lazy determinization #420

Adda0 commented Jul 10, 2024 •

edited

Loading

jurajsic Jul 11, 2024

Adda0 Jul 11, 2024

Adda0 Jul 12, 2024

jurajsic left a comment

Adda0 commented Jul 12, 2024

Get word from complement lazy determinization #420

Get word from complement lazy determinization #420

Conversation

Adda0 commented Jul 10, 2024 • edited Loading

jurajsic Jul 11, 2024

Choose a reason for hiding this comment

Adda0 Jul 11, 2024

Choose a reason for hiding this comment

Adda0 Jul 12, 2024

Choose a reason for hiding this comment

jurajsic left a comment

Choose a reason for hiding this comment

Adda0 commented Jul 12, 2024

Adda0 commented Jul 10, 2024 •

edited

Loading