add marker and markitdown tools
noworneverev committed Dec 17, 2024
1 parent a2b9dfe commit d360f4e
Showing 2 changed files with 4 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
@@ -117,6 +117,8 @@
| [PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit) | A Comprehensive Toolkit for High-Quality PDF Content Extraction. | [![Stars](https://img.shields.io/github/stars/opendatalab/PDF-Extract-Kit?style=flat)](https://github.com/opendatalab/PDF-Extract-Kit/stargazers) |
| [grobid](https://github.com/kermitt2/grobid) | A machine learning software for extracting information from scholarly documents. | [![Stars](https://img.shields.io/github/stars/kermitt2/grobid?style=flat)](https://github.com/kermitt2/grobid/stargazers) |
| [GOT-OCR2.0](https://github.com/Ucas-HaoranWei/GOT-OCR2.0) | Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model. | [![Stars](https://img.shields.io/github/stars/Ucas-HaoranWei/GOT-OCR2.0?style=flat)](https://github.com/Ucas-HaoranWei/GOT-OCR2.0/stargazers) |
| [marker](https://github.com/VikParuchuri/marker) | Convert PDF to markdown + JSON quickly with high accuracy. | [![Stars](https://img.shields.io/github/stars/VikParuchuri/marker?style=flat)](https://github.com/VikParuchuri/marker/stargazers) |
| [markitdown](https://github.com/microsoft/markitdown) | Python tool for converting files and office documents to Markdown. | [![Stars](https://img.shields.io/github/stars/microsoft/markitdown?style=flat)](https://github.com/microsoft/markitdown/stargazers) |

## UI/Interface

2 changes: 2 additions & 0 deletions docs/awesome-rag/tools.md
@@ -121,6 +121,8 @@ sidebar_position: 1
| [PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit) | A Comprehensive Toolkit for High-Quality PDF Content Extraction. | [![Stars](https://img.shields.io/github/stars/opendatalab/PDF-Extract-Kit?style=flat)](https://github.com/opendatalab/PDF-Extract-Kit/stargazers) |
| [grobid](https://github.com/kermitt2/grobid) | A machine learning software for extracting information from scholarly documents. | [![Stars](https://img.shields.io/github/stars/kermitt2/grobid?style=flat)](https://github.com/kermitt2/grobid/stargazers) |
| [GOT-OCR2.0](https://github.com/Ucas-HaoranWei/GOT-OCR2.0) | Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model. | [![Stars](https://img.shields.io/github/stars/Ucas-HaoranWei/GOT-OCR2.0?style=flat)](https://github.com/Ucas-HaoranWei/GOT-OCR2.0/stargazers) |
| [marker](https://github.com/VikParuchuri/marker) | Convert PDF to markdown + JSON quickly with high accuracy. | [![Stars](https://img.shields.io/github/stars/VikParuchuri/marker?style=flat)](https://github.com/VikParuchuri/marker/stargazers) |
| [markitdown](https://github.com/microsoft/markitdown) | Python tool for converting files and office documents to Markdown. | [![Stars](https://img.shields.io/github/stars/microsoft/markitdown?style=flat)](https://github.com/microsoft/markitdown/stargazers) |

## UI/Interface

