feat: Add support for converting multiple NFAs into one DFA. #7

Louis-He · 2024-12-05T05:39:40Z

Description

Validation performed

LinZhihao-723 · 2024-12-05T23:50:54Z

src/dfa/dfa.rs

+    accept: Vec<State>,
+    states: HashSet<State>,
+    transitions: HashMap<State, HashMap<char, Transition>>, // from_state -> symbol -> to_state
+    dfa_to_accepted_nfa_state_mapping: Option<HashMap<State, Vec<(usize, crate::nfa::nfa::State)>>>, // to determine which NFA gets matched


I'm not sure why cargo fmt doesn't work, but this line exceeds the 100-char limit, right? Can we put the comment in a separate line before this line? (It's hard to navigate long lines without using mouse, lol)

LinZhihao-723 · 2024-12-05T23:54:08Z

src/dfa/dfa.rs

+
+// Helper functions for converting multiple NFAs to a single DFA
+impl DFA {
+    fn epsilon_closure(


Nit: Shouldn't this be a part of NFA?

LinZhihao-723 · 2024-12-05T23:54:51Z

src/dfa/dfa.rs

+use std::process::id;
+
+#[derive(Clone, Debug, Eq, Hash, PartialEq)]
+struct State(String);


Why do we need the state to be a string?

LinZhihao-723 · 2024-12-06T17:14:31Z

src/dfa/dfa.rs

+            );
+    }
+
+    fn simulate(&self, input: &str) -> (Option<HashSet<usize>>, bool) {


We need a different API that takes char by char since the lexer emits chars as tokens, we don't know the size of string in advance.
In this way, we might also need an API to reset the current simulation state.

LinZhihao-723 · 2024-12-06T17:18:12Z

src/dfa/dfa.rs

+    start: State,
+    accept: Vec<State>,
+    states: HashSet<State>,
+    transitions: HashMap<State, HashMap<char, Transition>>, // from_state -> symbol -> to_state


As discussed offline, there's a performance concern in this part; shall we add a TODO to keep track of the issue?

Louis-He added 12 commits October 28, 2024 22:41

init nfa

e23e37f

add skeleton code for NFA and AST -> NFA transitions

689b46d

complete AST to NFA conversion

d846b69

clean up coding format

722ee1e

add dummy tag into the transition in NFA

f4642f1

complete naive NFA to DFA conversion

b874cbc

fix format

e7812d4

add support for multiple NFAs to one DFA

a7e5c80

reformat

e52d180

Merge branch 'main' into dfa

169be54

able to identify which NFA got matched

6863a92

reformat

effcc57

LinZhihao-723 requested changes Dec 6, 2024

View reviewed changes

Louis-He added 2 commits December 7, 2024 19:02

change transition from taking a char to taking an one-hot encoding u128

add4897

add DFA single character simulation skeleton code

9efb249

LinZhihao-723 changed the title ~~Add DFA, able to merge multiple NFAs and identify which NFA got matched~~ feat: Add support for converting multiple NFAs into one DFA. Dec 8, 2024

LinZhihao-723 approved these changes Dec 8, 2024

View reviewed changes

LinZhihao-723 merged commit 8174f5a into Toplogic-Inc:main Dec 8, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add support for converting multiple NFAs into one DFA. #7

feat: Add support for converting multiple NFAs into one DFA. #7

Louis-He commented Dec 5, 2024

LinZhihao-723 Dec 5, 2024

LinZhihao-723 Dec 5, 2024

LinZhihao-723 Dec 5, 2024

LinZhihao-723 Dec 6, 2024

LinZhihao-723 Dec 6, 2024

feat: Add support for converting multiple NFAs into one DFA. #7

feat: Add support for converting multiple NFAs into one DFA. #7

Conversation

Louis-He commented Dec 5, 2024

Description

Validation performed

LinZhihao-723 Dec 5, 2024

Choose a reason for hiding this comment

LinZhihao-723 Dec 5, 2024

Choose a reason for hiding this comment

LinZhihao-723 Dec 5, 2024

Choose a reason for hiding this comment

LinZhihao-723 Dec 6, 2024

Choose a reason for hiding this comment

LinZhihao-723 Dec 6, 2024

Choose a reason for hiding this comment