When the knowledge base performs segmentation analysis, pictures in the document are lost #11134
Comments
I found a similar unsolved discussion titled "When processing documents, you can save the images to segments" [1]. This might be related to the issue you're experiencing with images being lost during segmentation analysis.

In the Dify codebase, images in DOCX documents are extracted and processed using the […]. Additionally, there is a test named […].
See #11063. Currently, we do not parse images in our system.
#11063 is about analyzing the content inside pictures. I do not want the picture content parsed; I want each picture to stay intact, in its original position within the surrounding text.

We are building knowledge-base Q&A. When users ask questions, they expect the large model to return not only text but also the pictures related to the answer.

Example: I created a knowledge base from a user manual that contains many descriptions of operation steps, with screenshots to illustrate them. When users query this knowledge base, the large model should return both the text of the operation steps and the relevant screenshots.
Dify version
0.12.1
Cloud or Self Hosted
Cloud
Steps to reproduce
When the knowledge base performs segmentation analysis, pictures in the document are lost
✔️ Expected Behavior
When the knowledge base performs segmentation parsing, the pictures in the document should be attached to the segment context at the same position as in the original text. That way, the pictures can also be retrieved as knowledge during knowledge retrieval.
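To make the expected behavior concrete, here is a minimal sketch of segmentation that keeps pictures in place. It is not Dify's implementation: `TextBlock`, `ImageBlock`, `segment_with_images`, the `max_chars` limit, and the image paths are all hypothetical names chosen for illustration. It assumes the document has already been split into an ordered list of text and image blocks, and that each extracted image has been stored somewhere it can be referenced.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class TextBlock:
    text: str


@dataclass
class ImageBlock:
    url: str  # hypothetical location of the extracted image
    alt: str = "image"


Block = Union[TextBlock, ImageBlock]


def segment_with_images(blocks: List[Block], max_chars: int = 200) -> List[str]:
    """Group ordered blocks into segments, keeping images inline.

    An image is written as a Markdown reference and always stays in the
    segment of the text that precedes it, even if that pushes the segment
    past max_chars. Only text blocks can start a new segment.
    """
    segments: List[str] = []
    current = ""
    for block in blocks:
        if isinstance(block, ImageBlock):
            piece = f"![{block.alt}]({block.url})"
            current = f"{current}\n{piece}" if current else piece
            continue
        piece = block.text.strip()
        if not piece:
            continue
        if current and len(current) + 1 + len(piece) > max_chars:
            # The current segment is full: close it and start a new one.
            segments.append(current)
            current = piece
        else:
            current = f"{current}\n{piece}" if current else piece
    if current:
        segments.append(current)
    return segments
```

With the user manual example, each operation step and its screenshot end up in the same segment, so retrieving the step also retrieves the image reference:

```python
manual = [
    TextBlock("Step 1: Open the settings page."),
    ImageBlock("images/step1.png", "step 1"),
    TextBlock("Step 2: Click Save."),
    ImageBlock("images/step2.png", "step 2"),
]
for segment in segment_with_images(manual, max_chars=40):
    print(segment)
    print("---")
```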
❌ Actual Behavior
When the knowledge base performs segmentation analysis, the pictures in the document are lost and are not stored in the knowledge base.