You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What did you expect to happen? What happened instead?
I expect crawls with a custom browser profile to run successfully. Instead, the crawl fails with a cryptic S3 error in the crawler logs because the crawler is looking for the profile at <s3 endpoint>/<oid>/profile/<filename> but the file is actually at <s3 endpoint>/<oid>/<filename>.
Step-by-step reproduction instructions
Create a new browser profile
Attempt to run a crawl with that browser profile
Watch the crawl fail
Additional details
It seems the profile/ path prefix for profiles got dropped at some point in the storage refactoring work (PR #1296). To fix, we'll add the prefix back and move any files that need to be moved in the S3 buckets.
The text was updated successfully, but these errors were encountered:
Fixes#1364
Regression fix for issue introduced in storage refactoring (see issue
for more details).
Changes:
1. Add `profiles/` prefix to profile filename passed in to crawler for
profile creation and written into db
2. Remove hardcoded `profiles/` prefix from crawler YAML
3. Add migration to add `profiles/` prefix to profile filenames that
don't already have it, including updating PROFILE_FILENAME in ConfigMaps
This way between the related storage document and the profile filename,
we have the full path to the object in the database rather than relying
on additional prefixes hardcoded into k8s job YAML files.
Note that this as a follow-up it'll be necessary to manually move any
profiles that had been written into the `<oid>` "directory" in object
storage rather than `<oid>/profiles` to the latter. This should only
affect profiles created very recently in a 1.8.0-beta release.
Browsertrix Cloud Version
v1.8.0-beta.2-3aebf2e
What did you expect to happen? What happened instead?
I expect crawls with a custom browser profile to run successfully. Instead, the crawl fails with a cryptic S3 error in the crawler logs because the crawler is looking for the profile at
<s3 endpoint>/<oid>/profile/<filename>
but the file is actually at<s3 endpoint>/<oid>/<filename>
.Step-by-step reproduction instructions
Additional details
It seems the
profile/
path prefix for profiles got dropped at some point in the storage refactoring work (PR #1296). To fix, we'll add the prefix back and move any files that need to be moved in the S3 buckets.The text was updated successfully, but these errors were encountered: