Complex conflicts handling if we need to peek more than 1 token. #24

DiscreteTom · 2023-10-09T10:41:50Z

E.g. when we parsing javascript:

f(({a, b}) => a + b);

with the following grammar rules:

exp := '(' '{' identifier (',' identifier)* '}' ')' '=>' exp
exp := '(' exp ')'
exp := object
object := '{' (object_entry (',' object_entry)*)? '}'
object_entry := identifier (':' exp)?

when we digest f(({ a we don't know whether the a is an object entry or an arrow function param. We have to peek maybe many tokens to judge that.

Solution for this issue:

Re-parse, see Re-parse for unresolved conflict? #19 . Not recommended.
Optimize grammar rules to prevent this to happen. Introducing more intermediate NT. Bad user experience.
Allow grammar rule to do more than reduce AST nodes. E.g. override parser buffer.

The text was updated successfully, but these errors were encountered:

DiscreteTom · 2023-10-12T10:38:13Z

Maybe we can write an algorithm to check if a conflict can be safely resolved by re-parse.

For the above conflict, it can be safely resolved by re-parse without early accept.

wait for #24

DiscreteTom · 2023-12-30T10:52:56Z

Another idea: subset construction (子集构造法) to turn the NFA into DFA?

DiscreteTom added enhancement New feature or request parser conflict handling labels Oct 9, 2023

DiscreteTom added a commit that referenced this issue Oct 12, 2023

chore: remove simple-ts-parser for now

d1bc0b8

wait for #24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complex conflicts handling if we need to peek more than 1 token. #24

Complex conflicts handling if we need to peek more than 1 token. #24

DiscreteTom commented Oct 9, 2023

DiscreteTom commented Oct 12, 2023

DiscreteTom commented Dec 30, 2023

Complex conflicts handling if we need to peek more than 1 token. #24

Complex conflicts handling if we need to peek more than 1 token. #24

Comments

DiscreteTom commented Oct 9, 2023

DiscreteTom commented Oct 12, 2023

DiscreteTom commented Dec 30, 2023