You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Succeeding issue of recently closed #546.
I merged both topics into one issue as I think they interfere - e.g. choosing a specific DB insertion method might not work (efficiently) in multiprocessing. If you disagree, feel free to separate them.
Explore faster methods of writing to the database for
sqlite
postgres
Add multiprocessing for
XML parsing
Write to DB (if applicable due to concurrency -> table locks)
Hi @nesnoj! Thanks for creating this issue. I'm testing my approach a bit more (different number of cores & different operating systems) and I'll create the PR. I've been developing and testing on MacOS & Linux and I'll continue with Windows. The approach is quite simple and uses the standard concurrent.futures library with a few options to optimize the access to the database
About writing the data to the postgre database: would you mind tackling this separately? I didn't get the chance to look into how optimize writing yet and the parallelization is database-agnostic right now
About writing the data to the postgre database: would you mind tackling this separately? I didn't get the chance to look into how optimize writing yet and the parallelization is database-agnostic right now
Sure, feel free to create a separate issue if that makes sense to you.
Succeeding issue of recently closed #546.
I merged both topics into one issue as I think they interfere - e.g. choosing a specific DB insertion method might not work (efficiently) in multiprocessing. If you disagree, feel free to separate them.
Notes on DB insertion: #546 (comment)
Notes on parallelization: #546 (comment)
Feel free to amend :)
The text was updated successfully, but these errors were encountered: