
Make requests concurrently #13

Merged - merged 10 commits into main from concurrently on Oct 5, 2024
Conversation

@xoen (Collaborator) commented Oct 4, 2024

This is related to Issue #7.

Spawn a Tokio task per request instead of making them sequentially.
Details about how the data flows back from the green threads into the format we want are given in the commit message here - hopefully it makes sense.

As mentioned in that commit message, I may experiment with the result unwrapping further to see if I can get to something even simpler/more readable (you should have seen the original version of this 😅)

I've made some tweaks/refactorings. An explanation is given in the individual commit messages.
If you really prefer, I can split the refactoring commits into their own PR and then have a PR only for the concurrency change.

`to_tuple()` was using unwrap instead of handling
the error, e.g. returning an `Err(ApiError)` in this
case.

AFAICT `to_tuple()` is only used on date strings
coming from the API so it should be fine, but it
still makes sense to handle potential parsing errors.

NOTE: The `?` operator implicitly converts the given
value into the return error type via `From`/`Into`.
That's what the `#[from]` attribute is doing:
it's implementing the `From` trait to convert a
`chrono::ParseError` into an `ApiError`.
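A minimal sketch of the idea, assuming the error enum uses the `thiserror` crate (which provides `#[from]`); the variants and the exact `to_tuple()` signature here are illustrative, not the crate's actual code:

```rust
use chrono::{Datelike, NaiveDate};

#[derive(thiserror::Error, Debug)]
pub enum ApiError {
    // `#[from]` generates `impl From<chrono::ParseError> for ApiError`,
    // which is what lets `?` convert the parse error automatically.
    #[error("could not parse date: {0}")]
    Date(#[from] chrono::ParseError),
    #[error("{0}")]
    Error(String),
}

// Hypothetical signature: parse a "YYYY-MM-DD" string into a (year, month, day) tuple.
fn to_tuple(date: &str) -> std::result::Result<(i32, u32, u32), ApiError> {
    // `?` bubbles up the `chrono::ParseError`, converted into `ApiError` via `From`.
    let d = NaiveDate::parse_from_str(date, "%Y-%m-%d")?;
    Ok((d.year(), d.month(), d.day()))
}
```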
Both branches push into the `ranges` `Vec`, using the `?`
operator to bubble up `chrono::ParseError`.
This is now possible because we convert those into `ApiError`.

This is a better name that conveys what it does, and
it also avoids potential confusion with the standard
`parse()` method that uses the `FromStr` trait.

This is a common pattern in crates that return `Result` with a given
error. For example `std::io::Result` is a type alias for
`std::result::Result<T, std::io::Error>`.
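As a sketch, the alias (added in this PR's diff further down) lets signatures drop the repeated error type; the `parse_date()` caller here is hypothetical:

```rust
pub type Result<T> = std::result::Result<T, ApiError>;

// Hypothetical caller: the error type no longer needs to be spelled out.
fn parse_date(s: &str) -> Result<chrono::NaiveDate> {
    // Relies on the `From<chrono::ParseError> for ApiError` impl shown earlier.
    Ok(chrono::NaiveDate::parse_from_str(s, "%Y-%m-%d")?)
}
```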
The logic to make the HTTP request, handle errors, and
deserialise the response was common.

I've factored this out into its own generic function.
This will be more useful in the logic that unwraps the Tokio tasks'
results and gradually peels off the various layers of `Result`.
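A rough sketch of what such a generic helper could look like, assuming `reqwest`/`serde_json` are the HTTP and JSON crates in use; the name `get_json()` and the error mapping are assumptions, not the crate's actual code:

```rust
use serde::de::DeserializeOwned;

// Hypothetical generic helper: fetch a URL, handle errors, deserialise the JSON body.
async fn get_json<T: DeserializeOwned>(url: &str) -> Result<T> {
    let response = reqwest::get(url)
        .await
        .map_err(|e| ApiError::Error(e.to_string()))?;
    let body = response
        .text()
        .await
        .map_err(|e| ApiError::Error(e.to_string()))?;
    serde_json::from_str(&body).map_err(|e| ApiError::Error(e.to_string()))
}
```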
When requesting data for different date ranges, make these requests
concurrently instead of making them sequentially.

Broadly speaking this is what's happening (a code sketch follows below):
- spawn a Tokio task for each of these requests.
  - each task will return a `Result<Vec<IntensityForDate>>`
- wait for them to return using `try_join_all()`
  - this returns a `Result<Vec<Result<Vec<IntensityForDate>>>>`
- if it's an error (e.g. one of the tasks panicked) we return it
  using `?`
- we then have a `Vec<Result<Vec<IntensityForDate>>>`, one element
  for each of the requests' results.
- we "invert" that and turn it into a `Result<Vec<Vec<IntensityForDate>>>`
  - either we return the first error or unwrap the lists of tuples
- we now have a `Vec<Vec<IntensityForDate>>`, one element per response,
  and each element is a `Vec<IntensityForDate>` (the list of tuples
  with date/intensity)
- we finally flatten this into a simpler `Vec<IntensityForDate>`

This is a bit more convoluted than I'd like but I hope it makes sense.

Some of the types in the variable declarations are there mainly
for readability's sake given the levels of nesting, but some of them
are there to help Rust figure out how to `collect()`.

**NOTE**: I'm still not 100% happy with this and I do wonder if the
unwrapping/error handling could be done more elegantly - possibly.
Also, there may be a way of doing this in a single bigger
step; I may experiment with it further to see if it turns out to be
a bit more readable.
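A minimal sketch of the flow above, assuming a placeholder `IntensityForDate` shape, a stand-in `fetch_range()` request function, and the `ApiError`/`Result<T>` alias from this PR's diff being in scope - not the actual implementation:

```rust
use futures::future::try_join_all;

type IntensityForDate = (String, f64); // assumed shape: (date, intensity)

// Placeholder for the real per-range API call.
async fn fetch_range(start: String, end: String) -> Result<Vec<IntensityForDate>> {
    let _ = (start, end);
    Ok(Vec::new())
}

async fn get_intensities(ranges: Vec<(String, String)>) -> Result<Vec<IntensityForDate>> {
    // Spawn a Tokio task per date range.
    let handles: Vec<_> = ranges
        .into_iter()
        .map(|(start, end)| tokio::spawn(fetch_range(start, end)))
        .collect();

    // Wait for all tasks; a `JoinError` (e.g. a panicked task) is bubbled up via `?`.
    let task_results: Vec<Result<Vec<IntensityForDate>>> = try_join_all(handles)
        .await
        .map_err(|e| ApiError::Error(e.to_string()))?;

    // "Invert" `Vec<Result<Vec<_>>>` into `Result<Vec<Vec<_>>>`: first error wins.
    let nested: Vec<Vec<IntensityForDate>> =
        task_results.into_iter().collect::<Result<_>>()?;

    // Flatten one `Vec` per response into a single `Vec<IntensityForDate>`.
    Ok(nested.into_iter().flatten().collect())
}
```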
```diff
@@ -30,6 +30,8 @@ pub enum ApiError {
     Error(String),
 }
 
+pub type Result<T> = std::result::Result<T, ApiError>;
```
@xoen (Collaborator, Author) commented on the diff:

One reason I introduced this was that writing down the types when massaging the return values from the Tokio tasks was getting a bit out of control and error-prone.

@xoen xoen requested a review from jnioche October 4, 2024 18:27
@jnioche (Owner) commented Oct 5, 2024

> Spawn a Tokio task per request instead of making them sequentially.

If the number of requests needed is large, we might flood the API (unlikely though). Maybe later on we could limit the number of threads used.
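For reference, one possible way to cap in-flight requests later, sketched with `futures`' `buffer_unordered` and reusing the hypothetical `fetch_range()` from the sketch above (not part of this PR):

```rust
use futures::{stream, StreamExt};

async fn fetch_all_limited(
    ranges: Vec<(String, String)>,
) -> Vec<Result<Vec<IntensityForDate>>> {
    stream::iter(ranges)
        .map(|(start, end)| fetch_range(start, end))
        .buffer_unordered(4) // at most 4 requests in flight at a time
        .collect::<Vec<_>>() // results arrive in completion order, not input order
        .await
}
```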

> If you really prefer, I can split the refactoring commits into their own PR and then have a PR only for the concurrency change.

Separating them would be extra work for now, but in the future let's aim to have separate PRs to make them easier to review.

@jnioche (Owner) commented Oct 5, 2024

We are now getting "The date range you have specified is greater than 14 days. Please select a smaller date range."
with
`carbonintensity-api -s 2022-01-01 bs7`
but are getting the expected values with the main branch.

Simplified the pipeline so that it:
- unwraps/bubbles up Tokio's `JoinError`
- converts the list of task `Result`s into a single `Result` of tuples
- flattens the tuples (`Vec` of `Vec`s)
@xoen xoen marked this pull request as ready for review October 5, 2024 12:33
This reverts commit 111d09f.

It turns out the logic inside the if/else blocks was **not** the same.
This was a bug :)
@xoen (Collaborator, Author) commented Oct 5, 2024

> We are now getting "The date range you have specified is greater than 14 days. Please select a smaller date range."
> with
> `carbonintensity-api -s 2022-01-01 bs7`
> but are getting the expected values with the main branch.

It's-a me, a bug :)!

I had a look and tried to go back in time slowly, and it turns out that this commit introduced the bug: 111d09f - I've reverted it and it now seems to work.

PS: Wow, it's amazing how much of a time difference there is between making these requests concurrently and sequentially :)

@jnioche jnioche added this to the 0.3.0 milestone Oct 5, 2024
@jnioche jnioche added the enhancement New feature or request label Oct 5, 2024
@jnioche (Owner) commented Oct 5, 2024

implements #7

@jnioche (Owner) commented Oct 5, 2024

> PS: Wow, it's amazing how much of a time difference there is between making these requests concurrently and sequentially :)

Yes! 10s vs 177s.

@jnioche jnioche merged commit 9d67439 into main Oct 5, 2024
1 check passed
@jnioche jnioche deleted the concurrently branch October 5, 2024 16:14
@jnioche (Owner) commented Oct 5, 2024

Fab, thanks @xoen
