Improve performance of hidden node parsing #1790

msujew · 2025-01-14T18:43:03Z

I've noticed in a project with a large file with a lot of comments (~20 000 LoC, ~10 000 comments), that parsing started taking a huge performance hit. It seems like our current method of iterating through the CST to add hidden nodes leads to super-linear behavior in the parsing phase.

You can reproduce this by using the test I've added to langium-parser.test.ts and run it on the current main branch. While the parse takes only ~150ms on my machine with the new changes, it takes roughly ~5000ms on main.

This change replaces the existing hidden node placement algorithm with a different one that should perform linearly. Note that the placement of the hidden nodes has not changed with this algorithm. Existing tests (and also a few new ones) verify this behavior.

Some ideas for this new algorithm were taken from #1733.

packages/langium/src/parser/langium-parser.ts

Lotes

I was hesitating with the approval, but actually there is no need for waiting, I got it. If you want an additional approval or more comments, you could reach out for Ben. Or ask me, what I should look at specifically.

Improve performance of hidden node parsing

940fa56

msujew added parser Parser related issue performance Issues related to the runtime performance of Langium labels Jan 14, 2025

Lotes reviewed Jan 15, 2025

View reviewed changes

packages/langium/src/parser/langium-parser.ts Outdated Show resolved Hide resolved

Improve performance even further

faa656e

msujew requested a review from Lotes January 15, 2025 11:20

Lotes reviewed Jan 15, 2025

View reviewed changes

packages/langium/src/parser/langium-parser.ts Outdated Show resolved Hide resolved

Rename api

11d88a1

msujew force-pushed the msujew/hidden-node-performance branch from 3266554 to 11d88a1 Compare January 15, 2025 13:05

Lotes approved these changes Jan 17, 2025

View reviewed changes

msujew merged commit c006777 into main Jan 17, 2025
5 checks passed

msujew deleted the msujew/hidden-node-performance branch January 17, 2025 09:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of hidden node parsing #1790

Improve performance of hidden node parsing #1790

msujew commented Jan 14, 2025 •

edited

Loading

Lotes left a comment

Improve performance of hidden node parsing #1790

Improve performance of hidden node parsing #1790

Conversation

msujew commented Jan 14, 2025 • edited Loading

Lotes left a comment

Choose a reason for hiding this comment

msujew commented Jan 14, 2025 •

edited

Loading