-
-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Handling large repositories #92
Comments
The issue may have something to do with the Github App? Reading through the publishing article again, there is an error related to not having the app on the repository. I did not get that error when I submitted, but looking now I don't see any apps integrated on the repo. When I logged in with github, I didn't get redirected to any authorization request, the page just looped back to the login with github message. I refreshed the page and was logged in. So something in that login step isn't working as expected. Firefox 134.0.2 on Windows. Edit: tried on Edge also, and same result; logging in isn't getting a App permissions request from Github for me. And since the error given wasn't |
As of the latest update, the app is only used for logging into the wiki and no longer needs to be installed on repositories. The issue is going to be somewhere else, but I'm not sure yet. I'll try to register the project locally and see if I get the same error. Looking at the config, everything seems fine. |
I've investigated the issue and it seems like there's multiple factors at play here. The issue(s)The repository you're trying to register looks to be over 2GB large even when shallow-cloned, which just by itself shouldn't have been a problem. However, rather than download speed issues, I found out libgit2 is terribly sluggish at cloning large repositories. In my local test, this took several minutes, with the checkout phase taking significantly longer than fetch itself. Combined with this comes the fact that the form submission is proxied by the frontend instead of being sent directly to the backend. The serverless function has an execution time limit, and after it is reached, the server returns an error, giving the user the aforementioned "Something Went Wrong" page. Possible solutionsThe problem here is we clone all of the repository's history instead of doing a shallow clone so that we can display a "last edit time" stat for docs pages. Even if we sacrificed that in favor of taking up less storage space, we can't determine the size of the docs folder alone before the repo is cloned. If the total repo size is 10G while the docs folder is a mere 10MB in size, even then we wouldn't be able to clone it. Likely solution for slow speeds: To avoid both quickly running out of storage as well as similar cloning issues, I'm considering introducing a reasonable repository size limit for projects. A few hundred MBs should suffice for any project even with large assets. Alternative: Instead of using libgit2, we can use the native git command line program. However, retrieving errors from CLI is much harder and less precise since we're effectively limited to strings and exit codes. With an embedded library, this is much more convenient. Solving the error screen: I've updated the backend to use a shallow clone when first validating the repository, which should speed up the submission process. However, there is still a risk of the function timing out, and that applies to this case as well. To avoid that, we'll need to update the forms to be submitted from the client directly to the server instead of the frontend. WorkaroundIn your case, I can recommend creating a separate repository for hosting your mod's documentation. It may not be the most convenient solution, but given the unusually large size of the repository, our options are limited. One last thing I noticed is that the project ID contains an underscore ( Thanks for reporting this issue btw! |
The vast majority of that size is multi-versioned KDoc, to the tune of 2.2 GB. Which, I thought I had trimmed that size significantly, but seems I wasn't trimming everything I thought I was. That I can fix, but if it is still too big afterwards, it would be a bit of a pickle for me. Having everything in one repo was one of the main attractants to this setup, as I can organize and work on everything in one place at one time. Is last edit time the only reason you are making a clone, shallow or otherwise? I would assume there is a Github API route for seeing details about a repos last commit; perhaps this is the CLI solution you are referring to. Let me re-trim the documentation and see where it ends up first. |
Trimmed most of the fat off the docs, fixed the json issue, still getting the error even with the repo down to 130 MB (barely larger than the repo of Cinderscapes and much smaller than that of Spectrum, for a couple examples from repo forks I have lying around) So the few hundred MB posed in your Possible Solutions seems still too large without making other changes. |
Looks like the project was added succesfully this time https://moddedmc.wiki/en/project/fzzy-config/docs. Does it show up on your dev dashboard? |
Oh, yes I do. Still got the server error when I submitted it though. |
Roughly how long did the submission take? Could it have been going for longer than 15 seconds? In that case it's going to be the timeout error that I still need to fix. |
I wasn't timing it, but it's plausible it took longer than 15 seconds. I definitely see what you mean about the slowness of clones from Example Log
|
Most of the time it's quicker to actually just call the git commands through a shell process than use libgit2, libgit2 is plagued with performance issues - most IDEs have moved away from libgit2 these days (Visual Studio was the last to stop using it in 2020) |
Trying to set up my first wiki on the site, and I'm running into a "Something went Wrong" page.
Link to wiki directory in repo: https://github.com/fzzyhmstrs/fconfig/tree/master/wiki
When I look in my browser console, I see a couple errors related to 2117-21b5f9f9cbf4f2d8.js
I am filling the form out like so:
data:image/s3,"s3://crabby-images/d7f87/d7f87655bf81fecab26c08fb0302dce940f6a2b1" alt="Image"
Upon clicking "Submit", it will think for a while and then redirect to the Something Went Wrong page.
My repository:
data:image/s3,"s3://crabby-images/1cdcc/1cdcca9a262f9ae227d36e86f480adbb7075f8d0" alt="Image"
Branch and wiki folder. The
data:image/s3,"s3://crabby-images/c5771/c57711d4a8a3d2baeaffa9a1a361791a001e525b" alt="Image"
/docs
folder is already used for my separately published KDoc, hence specifying and using/wiki
:The text was updated successfully, but these errors were encountered: