Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LLamaparse can not parse content of scanned pages, if scanned page orientation changes and when there are two pages scanned in one page #471

Open
haniehm opened this issue Oct 31, 2024 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@haniehm
Copy link

haniehm commented Oct 31, 2024

Screenshot 2024-10-31 at 3 51 22 PM
Describe the bug
The quality of the pdf is not bad at all . However, parser is not performing and no content is detected when there is a change in "page orientation", which often happens during scanning documents specially when people start with single page scan and then move to two pages in one.

Advisory_Circular_DM1e_2024.pdf

Files
If possible, please provide the PDF file causing the issue.

Job ID
If you have it, please provide the ID of the job you ran.
You can find it here: https://cloud.llamaindex.ai/parse in the "History" tab.

08edeb5f-6d9d-49b1-883b-d61265a93581

Client:
Please remove untested options:

  • API

Additional context
Add any additional context about the problem here.
What options did you use? Premium mode, multimodal, fast mode, parsing instructions, etc.
Screenshots, code snippets, etc.

@haniehm haniehm added the bug Something isn't working label Oct 31, 2024
@haniehm haniehm changed the title LLamaparse can not parse content of scanned pages, if scan page orientations changes and when there are two pages scanned in one page LLamaparse can not parse content of scanned pages, if scanned page orientation changes and when there are two pages scanned in one page Nov 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants