[FEA]: Add support for table and chart extraction from images, docx files, and pptx files. #191

Open
drobison00 opened this issue Oct 24, 2024 · 0 comments · May be fixed by #217

@drobison00
Collaborator

drobison00 commented Oct 24, 2024

Is this a new feature, an improvement, or a change to existing functionality?

New Feature

How would you describe the priority of this feature request

Currently preventing usage

Please provide a clear description of problem this feature solves

The processing pathways for image, docx, and pptx files do not currently integrate the image-based extraction components for tables and charts. This issue tracks adding that support.

Describe the feature, and optionally a solution or implementation and any alternatives

The new functionality will update docx_helper.py and pptx_helper.py to add table and chart detection via yolox, followed by the downstream table/chart extractors. It will also add an extraction stage for jpg, jpeg, png, and svg elements that uses yolox and the downstream table/chart extractors.
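
As a rough illustration of the proposed image stage, the sketch below runs a detector over an image and passes each detected table or chart region to a label-specific extractor. Every name here (`Detection`, `ExtractedElement`, `extract_structured_elements`, the `detector` and `extractors` callables, and the `min_score` threshold) is hypothetical and is not taken from the project's code. The actual yolox integration and the extractor interfaces may look quite different.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Dict, List, Tuple

# NOTE: all names in this sketch are hypothetical; they only illustrate the
# detect-then-extract flow described in this issue, not the project's API.

BBox = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels

# File suffixes named in the issue for the new image extraction stage.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".svg"}


@dataclass
class Detection:
    label: str    # e.g. "table" or "chart"
    bbox: BBox
    score: float  # detector confidence in [0, 1]


@dataclass
class ExtractedElement:
    source: str
    label: str
    bbox: BBox
    content: str


# Stand-ins for a yolox-style detector and the downstream extractors.
Detector = Callable[[bytes], List[Detection]]
Extractor = Callable[[bytes, BBox], str]


def extract_structured_elements(
    path: str,
    image_bytes: bytes,
    detector: Detector,
    extractors: Dict[str, Extractor],
    min_score: float = 0.5,  # assumed threshold, not from the issue
) -> List[ExtractedElement]:
    """Detect table/chart regions and route each to its extractor."""
    if Path(path).suffix.lower() not in IMAGE_SUFFIXES:
        raise ValueError(f"unsupported image type: {path}")

    results: List[ExtractedElement] = []
    for det in detector(image_bytes):
        extractor = extractors.get(det.label)
        # Skip labels without an extractor and low-confidence detections.
        if extractor is None or det.score < min_score:
            continue
        content = extractor(image_bytes, det.bbox)
        results.append(ExtractedElement(path, det.label, det.bbox, content))
    return results
```

Under these assumptions, the docx and pptx helpers could call the same function on each embedded image, so the detection and routing logic would live in one place.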

Additional context

No response

@drobison00 drobison00 added the feature request New feature or request label Oct 24, 2024
@drobison00 drobison00 self-assigned this Oct 24, 2024
@drobison00 drobison00 linked a pull request Nov 5, 2024 that will close this issue
3 tasks