Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

💡 [REQUEST] - <title>讀取並理解文件PDF #687

Open
aybs2060 opened this issue Dec 6, 2024 · 2 comments
Open

💡 [REQUEST] - <title>讀取並理解文件PDF #687

aybs2060 opened this issue Dec 6, 2024 · 2 comments
Labels
question Further information is requested

Comments

@aybs2060
Copy link

aybs2060 commented Dec 6, 2024

起始日期 | Start Date

12/06/2024

实现PR | Implementation PR

相关Issues | Reference Issues

摘要 | Summary

讓模型可以讀取類似pdf並理解pdf(或是office系列)
pdf裡面有圖有文字的那種 像是研究報告
希望有個範例給我參考

基本示例 | Basic Example

缺陷 | Drawbacks

未解决问题 | Unresolved questions

@aybs2060 aybs2060 added the question Further information is requested label Dec 6, 2024
@LDLINGLINGLING
Copy link
Collaborator

我的理解是,你可以先将pdf转换成jpg格式的图片

@aybs2060
Copy link
Author

你的意思是說 我需要一張圖片一張圖片丟嗎?
這樣會需要重新製作數據集嗎?
可是我需要類似LangChain 可以支援任何文件類型的

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants