How to use the project

Unfortunately, the data set can not be included as a submodule and needs to be downloaded manually. Download the full data set from here and place in standard-mining/data/million_post_corpus.

File structure:

standard-mining
+-- code  
|   +-- austriazismen.py  
|   +-- ...  
+-- data  
|   +-- million_post_corpus  
|   |   +-- corpus.sqlite3  
|   |   +-- database_schema.md  
|   |   +-- ...

How to use the Stanford Tagger

Download:

cd ~
wget http://nlp.stanford.edu/software/stanford-corenlp-full-2018-02-27.zip
unzip stanford-corenlp-full-2018-02-27.zip
cd stanford-corenlp-full-2018-02-27
wget http://nlp.stanford.edu/software/stanford-german-corenlp-2018-02-27-models.jar

Usage:

java -Xmx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer \
-serverProperties StanfordCoreNLP-german.properties \
-preload tokenize,ssplit,pos,ner,parse \
-status_port 9002  -port 9002 -timeout 15000

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
code		code
subprojects		subprojects
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to use the project

How to use the Stanford Tagger

About

Releases

Packages

Languages

mehrkonfetti/standard-mining

Folders and files

Latest commit

History

Repository files navigation

How to use the project

How to use the Stanford Tagger

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages