Introduction

This repository holds a baseline model for programming project in Natural Language Processing 2022.

# Example
$ python eval_model_jsonl.py data/LecNLP_test_ja.jsonl --output_file outputs/LecNLP_test_ja_prediction.jsonl --lang ja
$ python eval_model_jsonl.py data/LecNLP_test_en.jsonl --output_file outputs/LecNLP_test_en_prediction.jsonl --lang en

# Example: Evaluate only 100 samples from test data (determined as the --sample option for faster development)
$ python eval_model_jsonl.py data/LecNLP_test_ja.jsonl --output_file outputs/LecNLP_test_ja_prediction.jsonl --lang ja --sample 100

Japanese: originally rinna Corporation's Japanese GPT model.

English: originally gpt-2-large model.

You can see the results in detail with the following command.

$ jq -s '.' outputs/LecNLP_test_ja_prediction.jsonl | less
$ jq -s '.' outputs/LecNLP_test_en_prediction.jsonl | less

Exercise Contents

Add sentences to the question text to create your own prompt. Modify add_prompt function in util.py.

# util.py

'''
Input: question
Output: prompt
Note:
    ・The model's answer is the content of the model's output 「」(Japanese) or [ ] (English)
    ・So, it is better to end the prompt with "「" (Japanese) or "[" (english)
'''
def add_prompt(question: str, lang: str, **kwargs: dict[str, Any]) -> str:
    # Add your prompt
    if lang == "en":
        prompt = f"Question: {question}? Answer: ["
    elif lang == "ja":
        prompt = f'質問：{question}? 回答：「'
    else:
        assert 0, lang
    return prompt


'''
Input: model prediction str
Output: answer str
Note:
    ・You can also change this part if you want.
'''
def extract_answer(output: str, lang: str) -> str:
    if lang == "en":
        return re.findall("\[(.*?)\]", output)[-1] # capture []
    elif lang == "ja":
        return re.findall("「(.*?)」", output)[-1] # capture 「」
    else:
        assert 0, lang

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
.gitignore		.gitignore
README.md		README.md
compare.py		compare.py
eval_model_jsonl.py		eval_model_jsonl.py
requirements.txt		requirements.txt
setup.sh		setup.sh
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Contents

Environment construction

Development & Test Data

Zero-shot inference

Evaluate Test Data

Japanese: originally rinna Corporation's Japanese GPT model.

English: originally gpt-2-large model.

Exercise Contents

About

Releases

Packages

Languages

cl-tohoku/AIO3_GPT_baseline_for_NLP

Folders and files

Latest commit

History

Repository files navigation

Introduction

Contents

Environment construction

Development & Test Data

Zero-shot inference

Evaluate Test Data

Japanese: originally rinna Corporation's Japanese GPT model.

English: originally gpt-2-large model.

Exercise Contents

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages