Skip to content
This repository was archived by the owner on Jun 22, 2018. It is now read-only.
thkoch2001 edited this page Sep 14, 2010 · 7 revisions

Welcome to the hbasene wiki!

HBasene is a scalable information retrieval engine, compatible with the Lucene library while using HBase as the store for the underlying TF-IDF representation.

Early results look good, with respect to scalability aspects. However there is room for improvement of the search latency. This is currently being worked on.

For those familiar with BigTable, the mapping between the tf-idf (as any good book on IR should give an introduction on), is available here at HBase TF-IDF index formats.

A beginner’s tutorial on the same, is available here at Hello World.

Meanwhile, there is a mailing list created here – [email protected], that should be used for discussions about HBasene

Clone this wiki locally