Incorrect tabular data extraction for handwritten docs when submitted as PDF while correctly extracting accurate data when submitted as an image. #476

UsamaAmjad03 · 2024-11-05T11:53:12Z

Describe the bug
I am submitting a File containing the tabular data with handwritten text in it with model openai gpt-4o, but when being submitted as an image (jpeg, png etc) it gives accurate result but when submitted as a PDF file it give wrong values inside table.

Job ID
4ab24faf-4ef1-4cba-bc04-96f8f1c6de09 (For PDF that gave wrong result)

f4fb4c09-32c3-4a90-bfee-d80e32e81c5b (For image that gave accurate values)

Client:
Please remove untested options:

Python Library
API
Frontend (cloud.llamaindex.ai)

BinaryBrain · 2024-11-08T11:20:58Z

We don't support handwritten text if that's the problem.

UsamaAmjad03 added the bug Something isn't working label Nov 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect tabular data extraction for handwritten docs when submitted as PDF while correctly extracting accurate data when submitted as an image. #476

Incorrect tabular data extraction for handwritten docs when submitted as PDF while correctly extracting accurate data when submitted as an image. #476

UsamaAmjad03 commented Nov 5, 2024

BinaryBrain commented Nov 8, 2024

Incorrect tabular data extraction for handwritten docs when submitted as PDF while correctly extracting accurate data when submitted as an image. #476

Incorrect tabular data extraction for handwritten docs when submitted as PDF while correctly extracting accurate data when submitted as an image. #476

Comments

UsamaAmjad03 commented Nov 5, 2024

BinaryBrain commented Nov 8, 2024