fix(oidc): make sure we keep track of an ongoing OIDC refresh up to the end #4304

bnjbvr · 2024-11-21T10:04:07Z

There's a lock making sure we're not doing multiple refreshes of an OIDC token at the same time. Unfortunately, this lock could be dropped too early if the task spawned by the inner function was detached.

The lock must be held throughout the entire detached task's lifetime, which this refactoring ensures, by setting the lock's result after calling the inner function.

codecov · 2024-11-21T10:18:38Z

Codecov Report

Attention: Patch coverage is 78.72340% with 10 lines in your changes missing coverage. Please review.

Project coverage is 85.08%. Comparing base (bc70f3c) to head (aa43b2c).
Report is 1 commit behind head on main.

Files with missing lines	Patch %	Lines
crates/matrix-sdk/src/oidc/mod.rs	78.26%	10 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4304      +/-   ##
==========================================
- Coverage   85.09%   85.08%   -0.02%     
==========================================
  Files         274      274              
  Lines       30191    30190       -1     
==========================================
- Hits        25691    25686       -5     
- Misses       4500     4504       +4


poljar · 2024-11-21T12:32:47Z

Hmm I don't understand this:

> if the task spawned by the inner function was detached.

When or how can the inner function detach the spawned task? It doesn't do so as far as I can tell, it awaits the join of the task.

What am I missing?

bnjbvr · 2024-11-21T12:41:11Z

> Hmm I don't understand this:
>
> > if the task spawned by the inner function was detached.
>
> When or how can the inner function detach the spawned task? It doesn't do so as far as I can tell, it awaits the join of the task.
>
> What am I missing?

Oh sorry: there was a spawn() call in the inner function, which now lives in the parent. The spawn() existed so that the token refresh would keep going even if the caller cancelled the network request (this is all called when spawning a network request to the homeserver). Only in that case (the caller cancelling) is the future detached.
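For readers following along, here is a minimal std-only sketch of that behaviour. It is not the SDK's code: a thread stands in for the spawned task, the hypothetical `Never` future stands in for awaiting the join handle, and `refresh` and `work_survives_cancellation` are made-up names. The caller future is polled once by hand and then dropped, which is what cancellation amounts to.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::mpsc;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread;

/// Waker that does nothing; enough to poll a future by hand.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Future that never completes; stands in for "awaiting the join handle".
struct Never;

impl Future for Never {
    type Output = ();

    fn poll(self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<()> {
        Poll::Pending
    }
}

/// Hypothetical refresh: spawns the real work, then waits on it.
async fn refresh(start: mpsc::Receiver<()>, done: mpsc::Sender<&'static str>) {
    // The worker owns both channel ends, so dropping the caller cannot reach it.
    thread::spawn(move || {
        let _ = start.recv(); // wait until we are told to proceed
        let _ = done.send("refreshed");
    });
    Never.await;
}

/// Returns true if the spawned work completes after the caller was cancelled.
pub fn work_survives_cancellation() -> bool {
    let (start_tx, start_rx) = mpsc::channel::<()>();
    let (done_tx, done_rx) = mpsc::channel::<&'static str>();

    let mut caller = Box::pin(refresh(start_rx, done_tx));
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);

    // The first poll runs the body up to the await: the worker is now spawned.
    assert!(caller.as_mut().poll(&mut cx).is_pending());

    // Cancel the caller by dropping its future.
    drop(caller);

    // The detached worker is unaffected and still finishes its job.
    start_tx.send(()).unwrap();
    matches!(done_rx.recv(), Ok("refreshed"))
}
```

Anything the caller future owned is gone after `drop(caller)`, while the worker keeps running, which is why a guard that lives in the caller gets released too early.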

poljar · 2024-11-21T12:51:28Z

You mean, cancelling the parent doesn't cancel the spawned task in the child?

So now you moved the spawn into the parent and this now cancels the spawned task as well?

Or is the whole child/parent interaction unimportant and the reason this now works as expected is that the lock guard has been moved into the spawned task?

bnjbvr · 2024-11-21T12:54:44Z

The spawned task is never cancelled: using async fn lol() { spawn(fut).await } makes sure that fut is executed, even when the call to lol() has been aborted (i.e. it's been spawned and cancelled). This can happen often, especially in an invisible manner over the FFI boundary.

> the reason this now works as expected is that the lock guard has been moved into the spawned task?

Exactly: the spawned task now keeps the mutex guard (as an owned mutex) over the entire lifetime of the spawned task.
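A rough sketch of the difference, under loud assumptions: `RefreshLock` and `OwnedGuard` are hypothetical types built on an `AtomicBool`, not the SDK's mutex, and a thread again stands in for the spawned task. The function compares keeping the guard with the caller against moving it into the task, and reports whether a second refresh could start while the first is still in flight.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::mpsc;
use std::sync::Arc;
use std::thread;

/// Hypothetical "a refresh is in progress" lock.
#[derive(Clone, Default)]
pub struct RefreshLock(Arc<AtomicBool>);

/// Owned guard: it borrows nothing, so it can move into another thread or task.
pub struct OwnedGuard(Arc<AtomicBool>);

impl RefreshLock {
    /// Returns a guard if no refresh is running, `None` otherwise.
    pub fn try_lock_owned(&self) -> Option<OwnedGuard> {
        self.0
            .compare_exchange(false, true, Ordering::SeqCst, Ordering::SeqCst)
            .ok()
            .map(|_| OwnedGuard(Arc::clone(&self.0)))
    }
}

impl Drop for OwnedGuard {
    fn drop(&mut self) {
        // Releasing the guard marks the refresh as finished.
        self.0.store(false, Ordering::SeqCst);
    }
}

/// Starts a detached "refresh", then cancels the caller.
/// Returns whether a second refresh could start while the first was in flight.
pub fn second_refresh_can_start(guard_in_task: bool) -> bool {
    let lock = RefreshLock::default();
    let guard = lock.try_lock_owned().expect("lock is free");
    let (finish_tx, finish_rx) = mpsc::channel::<()>();

    // Decide who owns the guard: the caller (old behaviour) or the task (fix).
    let (caller_guard, task_guard) = if guard_in_task {
        (None, Some(guard))
    } else {
        (Some(guard), None)
    };

    let worker = thread::spawn(move || {
        let _held = task_guard; // released only when the worker ends
        let _ = finish_rx.recv(); // the refresh is in flight until told to finish
    });

    // The caller is cancelled: everything it owned is dropped.
    drop(caller_guard);

    // Could another refresh start right now?
    let raced = lock.try_lock_owned().is_some();

    finish_tx.send(()).unwrap();
    worker.join().unwrap();
    raced
}
```

With the guard left in the caller, `second_refresh_can_start(false)` reports a race; with the guard moved into the task, `second_refresh_can_start(true)` does not.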

poljar · 2024-11-21T13:00:03Z

> Exactly: the spawned task now keeps the mutex guard (as an owned mutex) over the entire lifetime of the spawned task.

Alright, final question then. What does the moving of the spawn from the child to the parent then achieve? Couldn't we just have moved the owned lock guard to be an argument of the child method?

bnjbvr · 2024-11-21T13:06:25Z

It makes the code simpler: there's no spawn hidden in the child inner function, it's the parent who spawns it now. Since it only involved moving some other code that can fail early upwards, that seemed simpler. (Also, moving the guard down would have meant that every ? / early return in the spawned task would now need to set the guard's result as well, making it needlessly verbose.)
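To illustrate the verbosity argument, a small sketch with hypothetical names (`RefreshResult`, `refresh_inner`, `refresh`) and a plain `std::sync::Mutex` holding the last result; the real types in the SDK differ. The inner function is free to return early with `?`, and the parent records the outcome in exactly one place after the call.

```rust
use std::sync::Mutex;

/// Hypothetical result type: a new token on success, a message on failure.
pub type RefreshResult = Result<String, String>;

/// Inner function: may bail out early without touching the guard.
fn refresh_inner(stored_token: Option<&str>) -> RefreshResult {
    let old = stored_token.ok_or_else(|| "no refresh token".to_owned())?;
    if old.is_empty() {
        return Err("empty refresh token".to_owned());
    }
    Ok(format!("{}-refreshed", old))
}

/// Parent: holds the guard and sets its result once, whatever path the inner call took.
pub fn refresh(
    last_result: &Mutex<Option<RefreshResult>>,
    stored_token: Option<&str>,
) -> RefreshResult {
    let mut guard = last_result.lock().unwrap();
    let result = refresh_inner(stored_token);
    *guard = Some(result.clone());
    result
}
```

Had the guard been passed down instead, each of the two early exits in `refresh_inner` would need its own write to the guard.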

poljar

Thanks for the explanation. A regression test would have been nice as well but is probably not easy to write.


bnjbvr · 2024-11-21T17:24:08Z

> A regression test would have been nice as well but is probably not easy to write.

Indeed, this requires super perfect control over the timing of responses / API calls. Likely doable, but agreed the value is 🤷 compared to the code changes.

bnjbvr requested a review from a team as a code owner November 21, 2024 10:04

bnjbvr requested review from poljar and removed request for a team November 21, 2024 10:04

poljar approved these changes Nov 21, 2024


bnjbvr force-pushed the bnjbvr/fix-oidc-race branch from 2ab7ce3 to aa43b2c Compare November 21, 2024 17:21

bnjbvr enabled auto-merge (rebase) November 21, 2024 17:24

bnjbvr merged commit 48fbda8 into main Nov 21, 2024
40 checks passed

bnjbvr deleted the bnjbvr/fix-oidc-race branch November 21, 2024 17:36

ganfra mentioned this pull request Nov 21, 2024

fix : protect some usages of client to avoid crashes element-hq/element-x-android#3886

Merged
