Images in table #21

chillyoung4679 · 2024-05-23T10:14:43Z

Hello,

My PDF file contains long tables, and the tables include images. I tried

md_text = pymupdf4llm.to_markdown("input.pdf", write_images=True)

and the result was that the images were extracted but placed below the table.

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text |  |
| Text | Text |  |
| Text | Text |  |

![image1](images1.png)
![image2](images2.png)
![image3](images3.png)

However, I want the images to be inside the table. like:

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text | ![image1](images1.png) |
| Text | Text | ![image2](images2.png) |
| Text | Text | ![image3](images3.png) |

How can I achieve this?

Best

The text was updated successfully, but these errors were encountered:

JorjMcKie · 2024-05-23T10:23:47Z

No, this is currently not supported.
But let me check if there is any chance to tweak the table finder.

rahuldev-2021 · 2024-12-02T12:10:42Z

No, this is currently not supported.
But let me check if there is any chance to tweak the table finder.

Is there any solution for this issue. I am also facing the same issue,how can I resolve this?
I need the output in the below format

Col1	Col2	Image
Text	Text
Text	Text
Text	Text

JorjMcKie added the enhancement New feature or request label May 23, 2024

JorjMcKie added the postponed label Jun 8, 2024

JorjMcKie mentioned this issue Jun 13, 2024

Embedded links inside the table are not extracted #42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Images in table #21

Images in table #21

chillyoung4679 commented May 23, 2024

JorjMcKie commented May 23, 2024

rahuldev-2021 commented Dec 2, 2024 •

edited

Loading

Images in table #21

Images in table #21

Comments

chillyoung4679 commented May 23, 2024

JorjMcKie commented May 23, 2024

rahuldev-2021 commented Dec 2, 2024 • edited Loading

rahuldev-2021 commented Dec 2, 2024 •

edited

Loading