Getting Unicode Block after the pdf conversion #1466
Replies: 4 comments
-
This a MuPDF issue:
|
Beta Was this translation helpful? Give feedback.
-
How can we use the alternate font if some font is not supported? |
Beta Was this translation helpful? Give feedback.
-
Hm, you have to:
There are two script here: repl-fontnames.py and repl-font.py. First run [
{
"oldfont": [
"ArialMT"
],
"newfont": "keep",
"info": "92 glyphs, size 31580, serifed, subset font"
},
{
"oldfont": [
"Ubuntu-Regular"
],
"newfont": "keep",
"info": "261 glyphs, size 22044, serifed, subset font"
}
] Edit this file and replace the words "keep" by fontnames you would like instead - in this case best use "helv": [
{
"oldfont": [
"ArialMT"
],
"newfont": "helv",
"info": "92 glyphs, size 31580, serifed, subset font"
},
{
"oldfont": [
"Ubuntu-Regular"
],
"newfont": "helv",
"info": "261 glyphs, size 22044, serifed, subset font"
}
] Then run
The resulting PDF |
Beta Was this translation helpful? Give feedback.
-
In the next version, I will issue a warning, if |
Beta Was this translation helpful? Give feedback.
-
Hey,
I want to extract the text from the pdf. To extract the text from the pdf I am using the below-mentioned code.
Code Snippet:
What is actually happening?
If I am trying to extract the text from pdf after conversion, the text is not extracted properly. (I gam getting Unicode Block instead of actual text).
NOTE: I have to convert the pdf into a new pdf because of this issue.
For reference, I shared the document in a personal email. [email protected]
Can you please do needful?
Thank you.
System Specification:
Beta Was this translation helpful? Give feedback.
All reactions