LANGUAGE UNDERSTANDING SYSTEM - SECOND PROJECT

Conditional Random Fields

Getting Started

For each experiment described in the report, it possibile to replicate them performing the following steps:

1 - Launch one of the four Init.sh script
2 - Use the newly generated training and test sets with the CRF++ commands crf_learn and crf_test
3 - Evaluate the performances with the conlleval.pl script

Cross-Validation

Cross-Validation is executable by launching the cross-validation.sh script. Every time the script is run, a series of question are asked to the user, in order to perform CV properly. Here's an example:

Enter k for cross-validation > 10
Enter the hyperparameter > 2
Entern number of threads > 20
Specify the template to be used > template.Baseline
Training set (file name)> train.Baseline
Test set (file name)> test.Baseline
What features strategy did you use? (1 - Baseline, 2 - Lemma, 3 - Suffix, 4 - Last 2 Chars)> 1
Do you want to keep shuffle? (y/n) > y
Do you want to keep shuffle? (y/n) > y
Do you want to keep shuffle? (y/n) > y
Do you want to keep shuffle? (y/n) > y
Do you want to keep shuffle? (y/n) > n

After some shuffles, the CV will be performed. An ouput similar to this one will be returned:

AVERAGE SCORES
Accuracy: 98.315%
Precision: 96.63%
Recall: 93.599%
F-1: 95.08%

UNBIASED VARIANCES
Accuracy: 8.57224%
Precision: 39.7916%
Recall: 156.189%
F-1: 90.7904%

The output generated by the CV on the folds and the final average scores and unbiased variances will be stored, respectively, in files having the following formats:

outputFold10.param2.strategy1
score.variancesFolds10Param2strategy1

Templates

Several templates are already available. It is reccommended to use them with respect to the features strategy adopted. The templates present are:

template.Baseline
template.Lemma
template.Suffix
template.Last2chars

Authors

Alessandro Rizzuto - ID 187156 - Balthus1989

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
NLSPARQL.test.data		NLSPARQL.test.data
NLSPARQL.test.feats.txt		NLSPARQL.test.feats.txt
NLSPARQL.train.data		NLSPARQL.train.data
NLSPARQL.train.feats.txt		NLSPARQL.train.feats.txt
README.md		README.md
clean.sh		clean.sh
conlleval.pl		conlleval.pl
cross-validation.sh		cross-validation.sh
gitkey		gitkey
gitkey.pub		gitkey.pub
init-Baseline.sh		init-Baseline.sh
init-Lemma.sh		init-Lemma.sh
init-Suffix.sh		init-Suffix.sh
init-last2chars.sh		init-last2chars.sh
model		model
output.Baseline		output.Baseline
shuffling.sh		shuffling.sh
sum.sh		sum.sh
template.Baseline		template.Baseline
template.Lemma		template.Lemma
template.last2chars		template.last2chars
template.suffix		template.suffix
total.dataset		total.dataset

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LANGUAGE UNDERSTANDING SYSTEM - SECOND PROJECT

Conditional Random Fields

Getting Started

Cross-Validation

Templates

Authors

About

Releases

Packages

Languages

Balthus1989/lus2ndproject

Folders and files

Latest commit

History

Repository files navigation

LANGUAGE UNDERSTANDING SYSTEM - SECOND PROJECT

Conditional Random Fields

Getting Started

Cross-Validation

Templates

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages