-
Notifications
You must be signed in to change notification settings - Fork 284
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BUG: Can't preview image and pdf also can't converte to PDF #3816
Comments
Hi, thanks for using Aleph. Please make sure to provide the information specified in the issue template when opening issues in this repository. Based on the error message, I think your |
Which Aleph version are you using? |
@PhamPham92 Hey! I also faced the same issue with UPD: Frankly, it helped only for some particular case/document! I reproduced this bug with other documents, unfortunately... |
Hello, thanks for your answer. I have already tried, but it didn't help. |
I have the same issue while using 3.17.0 In docker-compose.yml I tried to comment out "~:/host" With a dataset of 20k files (2.2 GB), files with text are well indexed, small images often, larger images and pdfs almost never. When I tried to crawl 2 .pngs and 2 .pdfs, 1 pdf was successfully indexed. With a dataset of 6k files (860.9 MB) the success rate for pdfs was about 50 %, but it still failed with larger images. |
Hello, Exactly the same issue for me! |
One of my instances is also getting this bug:
I am using Aleph version 3.17.0 and ingest-file version 3.22.0. This bug did not occur with previous versions (that I know of).
Please let me know, if I can assist finding the bug's fault with my setup. |
could you all enter a shell within a running |
$ docker-compose run --rm ingest-file bash (main✱)
WARN[0000] /opt/aleph/docker-compose.yml: `version` is obsolete
[+] Creating 2/0
✔ Container aleph-redis-1 Running 0.0s
✔ Container aleph-postgres-1 Running 0.0s
root@ingest:/ingestors# echo $FTM_STORE_URI
postgresql://aleph:aleph@postgres/aleph
root@ingest:/ingestors# echo $ALEPH_DATABASE_URI
Thanks, @simonwoerpel. I also echoed |
No, |
It looks like the I was able to fix the issue by adding
to I then had to reingest all documents, but at least it works now. |
ingest-file uses the Tags functionality from servicelayer to cache certain document types during ingest to speed up its operation. Normally the settings file defaults to trying the ALEPH_DATABASE_URI followed by the FTM_STORE_URI, which in most cases should be set to non-default values. But if one were to just follow our prod environment docs blindly and not uncomment these two settings then the TAGS_DATABASE_URI would be unset, causing servicelayer to pick the default (sqlite:///) which breaks now that we have multiple threads writing to the tags database. This change documents how this setting works and encourages one to set it if the above two settings are left to defaults. Relates to #4002, #3816
ingest-file uses the Tags functionality from servicelayer to cache certain document types during ingest to speed up its operation. Normally the settings file defaults to trying the ALEPH_DATABASE_URI followed by the FTM_STORE_URI, which in most cases should be set to non-default values. But if one were to just follow our prod environment docs blindly and not uncomment these two settings then the TAGS_DATABASE_URI would be unset, causing servicelayer to pick the default (sqlite:///) which breaks now that we have multiple threads writing to the tags database. This change documents how this setting works and encourages one to set it if the above two settings are left to defaults. Relates to #4002, #3816
I explained how to fix this in #4002 (comment). Proper documentation fix in #4108. |
Hello,
I have this msg when i want to preview a image or a pdf :



Can someone help me ? thank you.
The text was updated successfully, but these errors were encountered: