Skip to content

Install Guide

阪本 浩太郎 edited this page Jan 28, 2017 · 1 revision

Install the following softwares.

  • Programming
  • Oracle Java SE Development Kit 8
  • Scala 2.12.1
  • Simple Build Tool 0.13.13
  • Japanese Language Analyzer
  • MeCab 0.996
  • UniDic unidic-mecab 2.1.2
  • Search Engine
  • Indri 5.11

Git clone or download this repository.

Japanese Language Analyzer

  • MeCab
  • UniDic user dictionary (non-distribution)
  • IPA dictionary's user disctionary (non-distribution)

How to install

Installation of MeCab (Mandatory)
Download mecab-0.996.tar.gz from
http://taku910.github.io/mecab/#download

tar zxfv mecab-0.996.tar.gz
cd mecab-0.996
./configure --with-charset=utf8 --enable-utf8-only
make
make check
sudo make install

Installation of IPADic (Optional)
Download mecab-ipadic-2.7.0-20070801.tar.gz from
http://taku910.github.io/mecab/#download

tar zxfv mecab-ipadic-2.7.0-20070801.tar.gz
cd mecab-ipadic-2.7.0-20070801
./configure --with-charset=utf8
make
sudo make install

Installation of UniDic (Mandatory)
Download unidic-mecab-2.1.2_bin.zip from
https://ja.osdn.net/projects/unidic/releases/

unzip unidic-mecab-2.1.2_bin.zip
mv unidic-mecab-2.1.2_bin /usr/local/lib/mecab/dic/unidic

Edit UniDic's dicrc

sudo vi /usr/local/lib/mecab/dic/unidic/dicrc

To comment-out output-format-type, prepend a ';' to the following line

output-format-type = unidic

as below

;output-format-type = unidic

Search Engine

  • Indri

How to install

Installation of Indri (Mandatory)

cd "path to download"
svn checkout svn://svn.code.sf.net/p/lemur/code/ lemur-code
cd ./lemur-code/indri/trunk/
./configure
make
make install

How to index

See https://bitbucket.org/kotaro_sakamoto/indriindexinjapanese

Clone this wiki locally