Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Images in table #21

Open
chillyoung4679 opened this issue May 23, 2024 · 2 comments
Open

Images in table #21

chillyoung4679 opened this issue May 23, 2024 · 2 comments
Labels
enhancement New feature or request postponed

Comments

@chillyoung4679
Copy link

Hello,

My PDF file contains long tables, and the tables include images. I tried

md_text = pymupdf4llm.to_markdown("input.pdf", write_images=True)

and the result was that the images were extracted but placed below the table.

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text |  |
| Text | Text |  |
| Text | Text |  |

![image1](images1.png)
![image2](images2.png)
![image3](images3.png)

However, I want the images to be inside the table. like:

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text | ![image1](images1.png) |
| Text | Text | ![image2](images2.png) |
| Text | Text | ![image3](images3.png) |

How can I achieve this?

Best

@JorjMcKie
Copy link
Contributor

No, this is currently not supported.
But let me check if there is any chance to tweak the table finder.

@rahuldev-2021
Copy link

rahuldev-2021 commented Dec 2, 2024

No, this is currently not supported.
But let me check if there is any chance to tweak the table finder.

Is there any solution for this issue. I am also facing the same issue,how can I resolve this?
I need the output in the below format

Col1 Col2 Image
Text Text image1
Text Text image2
Text Text image3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request postponed
Projects
None yet
Development

No branches or pull requests

3 participants