-
-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
zimcheck performance improvement #134
Comments
@MiguelRocha Good to me to implement. If the amount of changes is subtantial, please split this ticket. |
Update Brew before using it
@MiguelRocha Do you mind if I work on this in combination with optimizations on the libzim side? |
@veloman-yunkan please feel free to do it. And let me know if you need any help. I have a draft branch that already fixes the first point that I described and I will create a PR after finishing other issues. |
@MiguelRocha Would you be able to move the |
@mgautierfr @MiguelRocha I believe this should not be closed because the following points need to be implemented:
|
Yes, it was automatically closed when I've merged #162. |
There is no obvious and consensual solution to optimize |
zimcheck performance is quite important since for big articles can take several minutes (maybe hours).
There are a few performance improvements that can be done in zimcheck specific code:
getLinks()
getLinks()
performanceisExternalUrl()
The first 2 points are responsible for 18% while the last one takes 20% speedtime.
Also, there maybe some performance gains if we find a better way to validate the URLs. Currently we are calling getArticle() from libzim and this takes around 50% of the time since it needs access the direntCache. (related: openzim/libzim#385 (comment))
The text was updated successfully, but these errors were encountered: