Implement async parsers #1352

msujew · 2024-01-23T11:56:23Z

A follow up on #1306.

Implements the basics to create an async parser based on node worker threads. While in theory this should be faster than the sync parser implementation (due to parallelisation), it might be slower on startup due to the time it takes for all the workers to start up. Once the threads are all up and running, it is indeed faster. The break even point is fairly late though, roughly after parsing ~2 million LoC.

The main benefit of this change is that it allows parser cancellation by simply killing workers. The cancellation happens after a timeout, since it needs to take the time it takes to create a new worker into account. The default is 200ms, but can be arbitrarily changed by adopters.

Most of the change is actually just the new Hydrator service which tries to dehydrate/hydrate AST nodes to be processed by the structured cloning algorithm of workers.

packages/langium/src/parser/async-parser.ts

sailingKieler

Great stuff @msujew.

I just have some minor questions & remarks.
Btw. Would it work the same way (in principle) in the browser with webworkers?

sailingKieler · 2024-01-30T12:31:12Z

packages/langium/src/parser/async-parser.ts

+    async parse<T extends AstNode>(text: string, cancelToken: CancellationToken): Promise<ParseResult<T>> {
+        const worker = await this.acquireParserWorker(cancelToken);
+        const deferred = new Deferred<ParseResult<T>>();
+        deferred.disposables.push(cancelToken.onCancellationRequested(() => {


This code is very dense. Could you please separate this and motivate the attachment of the Disposable returned by the listener registration to deferred?
deferred is supposed to have a finite lifetime, right?
And onCanceled-Listeners won't be called multiple times, right?

I've refactored this a bit, but it still works the same way. This is basically a safeguard: If the cancellation is requested after parsing has finished, we don't want to kill the worker anymore. The cancellation has a way longer lifetime than the deferred promise here. So we have to remove the listener again once were finished with it.

packages/langium/src/parser/async-parser.ts

packages/langium/test/parser/worker-thread-async-parser.test.ts

sailingKieler · 2024-01-30T12:40:24Z

packages/langium/test/parser/worker-thread.js

+        ...result,
+        value: dehydrated
+    });
+});


Can this considered a blue print for worker implementations?
Or should everybody built one on it's own? A remark on that including some dos and dont's would be nice, here.

Yes, everyone should build their own.
Do's: Parse the text and send the dehydrated AST back
Dont's: Anything else.

packages/langium/src/serializer/hydrator.ts

packages/langium/src/node/worker-thread-async-parser.ts

sailingKieler · 2024-01-30T14:49:51Z

One more thing:
I prefer to block merging this PR until #1258 is merged, and to fix the conflicts as part of this PR.
So maybe I was a bit to quick with my approval 😅

msujew added the parser Parser related issue label Jan 23, 2024

msujew force-pushed the msujew/async-parser-impl branch from 3d40610 to 5beae71 Compare January 24, 2024 12:20

Lotes reviewed Jan 29, 2024

View reviewed changes

packages/langium/src/parser/async-parser.ts Show resolved Hide resolved

sailingKieler approved these changes Jan 30, 2024

View reviewed changes

msujew force-pushed the msujew/async-parser-impl branch 2 times, most recently from b626ca3 to 9d8d6ad Compare February 13, 2024 15:57

msujew added 9 commits February 14, 2024 12:55

Implement async parsers

59627be

Improve documentation

3873a89

Improve startup time

f2e3cec

Improve cancellation handling

8f07077

Fix error

f38f604

Add failsafe for parsing

c229861

Revert deferred changes

bc3c205

Address review comments

50d606f

Review comments no. 2

194ec01

msujew force-pushed the msujew/async-parser-impl branch from 9d8d6ad to 194ec01 Compare February 14, 2024 13:24

msujew merged commit e78aeba into main Feb 14, 2024
5 checks passed

msujew deleted the msujew/async-parser-impl branch February 14, 2024 13:28

msujew added this to the v3.0.0 milestone Feb 14, 2024

msujew mentioned this pull request Jun 1, 2024

Any possibility of letting documents building process become synchronous conditionally? #1522

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement async parsers #1352

Implement async parsers #1352

msujew commented Jan 23, 2024

sailingKieler left a comment

sailingKieler Jan 30, 2024

msujew Feb 14, 2024

sailingKieler Jan 30, 2024

msujew Feb 14, 2024 •

edited

Loading

sailingKieler commented Jan 30, 2024 •

edited

Loading

Implement async parsers #1352

Implement async parsers #1352

Conversation

msujew commented Jan 23, 2024

sailingKieler left a comment

Choose a reason for hiding this comment

sailingKieler Jan 30, 2024

Choose a reason for hiding this comment

msujew Feb 14, 2024

Choose a reason for hiding this comment

sailingKieler Jan 30, 2024

Choose a reason for hiding this comment

msujew Feb 14, 2024 • edited Loading

Choose a reason for hiding this comment

sailingKieler commented Jan 30, 2024 • edited Loading

msujew Feb 14, 2024 •

edited

Loading

sailingKieler commented Jan 30, 2024 •

edited

Loading