Right-to-left and left-to-right orientation of Persian digital copies in the full-text display of IIIF, DFG Viewer and PDF #54
Comments
Examples taken from: https://opendata.uni-halle.de/handle/1981185920/88120
Unfortunately, the problem originates in the data itself, which already contains the letters in reversed order. Still, we have to deal with the PDF text layer.
The word-level representation in the online viewers is out of scope for this tool. @MaidaButtar It seems to me that the characters rendered in the outline used to navigate between sections and chapters (usually displayed on the left side of a PDF viewer, such as the one in the Firefox browser) are properly ordered.
I checked the PDF files, and both known cases now occur: not only is the order of the letters within a word inverted, but so is the order of the words, so the first word ends up at the end of the line. And that is correct: the outline of sections and chapters on the left of the PDF viewer is ordered properly.
@MaidaButtar Can you please try these cases and report their results:
And which PDF reader exactly are you using?
@M3ssman
- Nothing is found, as if there were no match.
Example:
To give an update:
Therefore, I'm afraid this issue is tied to the overall update of PDF generation (Next Version PDF Processing).
The following problems occur with left-to-right and right-to-left orientation in the Persian full-text display of the IIIF Viewer and the DFG Viewer and in the PDF files:
IIIF Viewer: The order of the words is correct, but the letters within each word are reversed. The order of digits is correct (numbers are read from left to right).
DFG Viewer: All line breaks are lost and the order of words is only partly correct; again, the letters within each word are reversed. The order of digits is correct.
PDF: The order of words is reversed at line level, but the letters within each word are not reversed. The order of digits is correct.
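To make the symptoms listed above easier to compare, here is a small sketch that reproduces the IIIF-like and PDF-like effects on a sample line. The sample text and both function names are assumptions made for illustration; nothing here is taken from the viewers or from the PDF generator.

```python
def reverse_letters_per_word(line: str) -> str:
    """IIIF-like symptom: word order kept, letters reversed, digits untouched."""
    return " ".join(w if w.isdigit() else w[::-1] for w in line.split(" "))


def reverse_word_order(line: str) -> str:
    """PDF-like symptom: words reversed at line level, letters untouched."""
    return " ".join(reversed(line.split(" ")))


# Sample line in logical (reading) order; purely illustrative.
logical = "کتاب سال 1398"

iiif_like = reverse_letters_per_word(logical)
pdf_like = reverse_word_order(logical)

# Inspect the tokens as lists: a terminal applies its own bidirectional
# rendering to plain strings, which can hide the difference.
print(iiif_like.split(" "))
print(pdf_like.split(" "))
```

Both functions are their own inverse, so applying the matching one again restores the sample line. That only holds for simple text without combining marks.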