nlp-final-project

The repository contains the final project for the course "natural Language Processing" at RUG.
The project is base on esnli dataset and focuses on analysing the variants of explanation generation.

The dataset contains text pairs [premisis] and [hypothesis] and the [label] of the pair: entailment, contradiction or neutral. IN addition each pain acomplished with [explanation] on a natural language.
The task is to generate the explanation and label for the given pair.

The project is analysing three main approaches:

Generating explanation using saparate model and then predict label using the explanation
Jointly generate label and explanation using the same text generation model
Predict label using saparate model and then generate explanation having label as input

Installation

Note: python 3.9 is required

Create a virtual environment and install the requirements

python -m venv .venv
python -m pip install -r requirements.txt

Cluster

The High Performance Computing cluster Hábrók of the RUG was used for models training and experiments.

The .sh scripts for running the experiments on the cluster are located in each scripts/ subfolder, the setup instruction is available as scripts/README.md.

Structure

scripts/classification - contains training scripts for each any classification model used.
scripts/generation - contains training scripts for each any explanation generation model used.
scripts/pipeline - contains scripts to evaluate models than requires other models predictions as input.

Training scripts accept the following arguments:

--base-model - model to finetune available at Hugginface Hub (e.g. roberta-base)
--config-name - name of the config in params.json file (separate for each script)
generation_type/classification_type - type of trainin process, whatever use or not explanation or lable in input data

See more python [script_name].py --help

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
scripts		scripts
.gitignore		.gitignore
README.md		README.md
pipelines.png		pipelines.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nlp-final-project

Installation

Cluster

Structure

About

Releases

Packages

Contributors 2

Languages

lct-rug-2022/nlp-final-project

Folders and files

Latest commit

History

Repository files navigation

nlp-final-project

Installation

Cluster

Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages