Group TripleMaps for Performance #2

Open
dachafra opened this issue Oct 21, 2020 · 6 comments

Comments

@dachafra
Member

issue: running each source row through each TripleMap may be slow because it means "rewinding" the database and processing each row multiple times

suggestion: for each input row, process all TripleMaps that share the same (or an equivalent) source

link: SDM-TIB/SDM-RDFizer#24 (comment)
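
To make the suggestion concrete, here is a minimal sketch of grouping by source so that each source is scanned only once. It is not taken from any existing tool: the `TripleMap` class, `group_by_source`, `materialize`, and the `read_rows` callback are all hypothetical names, and "equivalent source" is reduced to a plain equality check on a source key.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, Iterable, Iterator, List, Tuple

Row = Dict[str, str]
Triple = Tuple[str, str, str]


@dataclass
class TripleMap:
    """Hypothetical, simplified stand-in for a mapping rule."""
    name: str
    source: Hashable  # key identifying the logical source (assumption)
    generate: Callable[[Row], Iterable[Triple]]  # row -> triples


def group_by_source(maps: Iterable[TripleMap]) -> Dict[Hashable, List[TripleMap]]:
    """Bucket the maps so that maps reading the same source end up together."""
    groups: Dict[Hashable, List[TripleMap]] = defaultdict(list)
    for m in maps:
        groups[m.source].append(m)
    return dict(groups)


def materialize(
    maps: Iterable[TripleMap],
    read_rows: Callable[[Hashable], Iterable[Row]],
) -> Iterator[Triple]:
    """Scan each source once and feed every row to all maps in its group."""
    for source, group in group_by_source(maps).items():
        for row in read_rows(source):  # a single pass over this source
            for m in group:
                yield from m.generate(row)
```

With N TripleMaps over the same source, this reads the source once instead of N times. Deciding when two sources are "equivalent" (for example, two queries over the same table) is left to the tool.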

@marioscrock

marioscrock commented Jan 4, 2021

I think this is more of a tool-related optimization. However, I would suggest also considering join conditions when grouping TripleMaps for performance. This type of analysis (considering join "dependencies" among TripleMaps) could also help enable efficient concurrent processing of groups of TripleMaps that do not constrain each other.
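
One possible reading of this idea, sketched under assumptions: treat every join between two TripleMaps as an edge and compute the connected components, so that maps in different components have no join dependency and could be handed to separate workers. The function `independent_groups` and the `(child, parent)` pair representation of joins are hypothetical and not part of any tool discussed here.

```python
from typing import Dict, Iterable, List, Set, Tuple


def independent_groups(
    names: Iterable[str],
    joins: Iterable[Tuple[str, str]],
) -> List[Set[str]]:
    """Partition TripleMap names into groups with no joins across groups.

    `joins` holds (child_map, parent_map) pairs; both names are assumed
    to appear in `names`. Uses a small union-find structure.
    """
    parent: Dict[str, str] = {n: n for n in names}

    def find(x: str) -> str:
        # Follow parent links to the root, halving the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for child, par in joins:
        root_a, root_b = find(child), find(par)
        if root_a != root_b:
            parent[root_a] = root_b  # merge the two components

    groups: Dict[str, Set[str]] = {}
    for n in parent:
        groups.setdefault(find(n), set()).add(n)
    return list(groups.values())
```

Each returned group could then be processed on its own, while maps inside a group stay together so that their join can be evaluated locally. A real tool would likely combine this with grouping by source.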

@dachafra
Member Author

dachafra commented Mar 2, 2022

This issue is out of the scope of the CG, as it is specific to the optimizations of each tool. Closing it.

@dachafra dachafra closed this as completed Mar 2, 2022
@bjdmeest
Member

bjdmeest commented Mar 2, 2022

Discussions like this would be nice for a 'Best Practices' note or smth, do you think we should keep tabs somewhere? eg Yet Another Repo? 😅 (bc I remember implementation considerations at our slack channel too)

@dachafra
Member Author

dachafra commented Mar 2, 2022

I agree that we can create a best-practices repository where we collaboratively post well-known optimizations or techniques to speed up the materialization process. @bjdmeest would you mind creating that repo under the organization?

@bjdmeest
Member

bjdmeest commented Mar 2, 2022

Done at https://github.com/kg-construct/best-practices, with a first issue! :D

@dachafra dachafra transferred this issue from kg-construct/mapping-challenges Mar 2, 2022
@dachafra
Member Author

dachafra commented Mar 2, 2022

Reopening it in the correct repo.

@dachafra dachafra reopened this Mar 2, 2022