tika

The simple monolithic application demonstrates: the extraction of the images of the PDF document pages using Apache Tika, the storage of the images files into the local filesystem, the display of the pages using the ngx-swiper-wrapper library.

pdf spring tika jhipster image-storage ngx-swiper-wrapper

Updated May 9, 2023
Java

sarbanandabhikkhu / DhammaChakka

Star

Early Buddhist texts from the Tipitaka (Tripitaka). Suttas (sutras) with the Buddha's teachings on mindfulness, insight, wisdom, and meditation.

tika buddhism pali tipitaka sutta vinaya dhammachakka abhidhamma atthakatha

Updated Jul 6, 2023
JavaScript

msafwankarim / lufin

Star

LuFIn (Lucene File Indexer)

tika lucene fileindex

Updated Nov 6, 2023
Java

skvkel / information-retrieval-system

Star

Information retrieval system for documents.

java information-retrieval tika apache lucene

Updated Feb 15, 2022
HTML

tirthmehta / Apache-Solr-based-Web-Search-Engine

Star

Deployment of a search engine utilizing Apache Solr, Apache Tika and spelling correction programs.

python java php solr tika

Updated Jul 28, 2017

sesam-community / content-extractor

Star

Extract textual information using the Apache Tika library from JSON streams

docker tika transform sesam

Updated Apr 25, 2017
Java

gcpetri / SiteMap-Python

Star

Extracts GPS coordinates from pdf files and Points/Polygons from kmz files to create a master kml file. 🌎

pyqt5 tika geolocation python3 geology

Updated Jul 7, 2021
HTML

AidaRosaCalvo / info-retrieval-system

Star

Este proyecto consiste en la construcción de un sistema de recuperación de información que puede manipular documentos de diferentes formatos provenientes de un repositorio de información. La aplicación utiliza herramientas como Lucene y Tika para indexar y extraer información de los documentos.

clustering tika javafx java-8 lucene kmeans-clustering linkage c-means-implementation

Updated Jun 23, 2024
Java

tusharkm / search_engine_using_lucene

Star

A Java application that uses Lucene and Tika to search document and display the document part in which the document is found.Along with precision and recall value

java search-engine tika lucenesearch

Updated Aug 20, 2017
Java

voltek62 / Rwahoo

Star

Create the ultimate scraper with Apache Tika for R

cran r tika

Updated Mar 23, 2018
R

wbicode / TikaService-Installer

Star

A Windows Installer (MSI) for the windows service wrapper of the tika JSR 311 network server.

installer tika wix-toolset tika-server msi-installer

Updated Feb 15, 2022
C#

stainlessai / grails-tika

Star

A plugin for using Apache Tika in Grails/Micronaut projects

tika micronaut grails4

Updated Nov 5, 2019
Groovy

frytoli / geotopic-parser-enabled-tika-docker

Star

Container-ized (Docker) GeoTopicParser-Enabled Apache Tika Server with Lucene Geo Gazetteer.

docker tika gazetteer tika-server geo-gazetteer

Updated Apr 5, 2021
Dockerfile

DDansAbelenda / doc-clusterizer

Star

DocClusterizer is a Java desktop application designed to analyze and cluster documents based on their content similarity. The application utilizes Lucene and Tika libraries to process various file extensions such as txt, pdf, docx, and pptx.

tika javafx java-8 lucene kmeans-clustering linkage document-clustering kmeans-algorithm lucene-analyzer unsupervised-clustering fuzzycmeans

Updated Apr 6, 2024
Java

Improve this page

Add a description, image, and links to the tika topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tika topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tika

Here are 145 public repositories matching this topic...

kairohm / tikatree

wbicode / TikaService

lguberan / LuceneFx

kressi / search-media

zhurlik / doc-search

dataiku / dss-plugin-nlp-extraction

contribution-jhipster-uga / sample-jhipster-docpreview

sarbanandabhikkhu / DhammaChakka

msafwankarim / lufin

skvkel / information-retrieval-system

tirthmehta / Apache-Solr-based-Web-Search-Engine

sesam-community / content-extractor

gcpetri / SiteMap-Python

AidaRosaCalvo / info-retrieval-system

tusharkm / search_engine_using_lucene

voltek62 / Rwahoo

wbicode / TikaService-Installer

stainlessai / grails-tika

frytoli / geotopic-parser-enabled-tika-docker

DDansAbelenda / doc-clusterizer

Improve this page

Add this topic to your repo