TWICE: What Advantages Can Low-Resource Domain-Specific Embedding Model Bring? - A Case Study on Korea Financial Texts
We introduce KorFinMTEB, a novel benchmark for the Korean financial domain, specifically tailored to reflect the linguistic and cultural characteristics of this low-resource language.
Domain specificity is critical for the effective performance of embedding models. However, existing benchmarks such as FinMTEB are primarily designed for high-resource languages, leaving low-resource settings such as Korean under-explored. Directly translating established English benchmarks often fails to capture the linguistic and cultural nuances present in low-resource domains. Our experimental results reveal that while models perform robustly on a translated version of FinMTEB, their performance on KorFinMTEB uncovers subtle yet critical discrepancies, especially in tasks requiring deeper semantic understanding, underscoring the limitations of direct translation.
The basic evaluation pipeline is built upon FinMTEB and MTEB.
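Because the pipeline follows the standard MTEB evaluation pattern, the core loop looks roughly like the sketch below. This is a minimal illustration assuming the public `mteb` and `sentence-transformers` APIs; the model and task names are placeholders, not actual KorFinMTEB identifiers.

```python
# Minimal MTEB-style evaluation sketch (illustrative only; not the repository's
# eval_KorFinMTEB.py). The model and task names below are placeholders.
from mteb import MTEB
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("BAAI/bge-m3")            # any embedding model to benchmark
evaluation = MTEB(tasks=["Banking77Classification"])  # replace with KorFinMTEB task names
results = evaluation.run(model, output_folder="results")
```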
```bash
conda create -n korfinmteb python=3.11
conda activate korfinmteb
git clone https://github.com/nmixx-fin/TWICE.git
cd TWICE
pip install -r requirements.txt
```
- Using a Python script:
```bash
python eval_KorFinMTEB.py --model_name_or_path KURE --task_type CLUSTERING
```
- Using a shell script (.sh) file:
  Before execution, specify the model and the task to be evaluated in the .sh file.
```bash
sh run.sh
```
Arguments:
- `--model_name_or_path` (str, default=`"BAAI/bge-large-zh"`):
  Path to the pre-trained model or a model identifier from the Hugging Face Model Hub.
- `--task_type` (str, default=`None`):
  Specifies the type of task to execute. Available options:
  - `CLASSIFICATION`
  - `RETRIEVAL`
  - `CLUSTERING`
  - `RERANKING`
  - `STS`
  - `SUM`
  - `PAIRCLASSIFICATION`
- `--add_instruction` (bool, default=`False`):
  If set, includes an additional instruction for the query. This is an optional flag.
- `--pooling_method` (str, default=`'cls'`):
  Defines the pooling method for the model; different models may require different pooling methods (see the sketch after this list).
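The `add_instruction` and `pooling_method` options correspond to common embedding-model conventions. The sketch below is a hedged illustration of how such options are typically applied when encoding text; the instruction wording and the `encode` helper are illustrative assumptions, not the repository's actual implementation.

```python
# Illustrative sketch of instruction prefixing and pooling (not eval_KorFinMTEB.py).
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "BAAI/bge-large-zh"  # matches the --model_name_or_path default
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME)

def encode(texts, add_instruction=False, pooling_method="cls"):
    # Hypothetical instruction wording; real instructions are model-specific.
    if add_instruction:
        texts = ["Represent this query for retrieval: " + t for t in texts]
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state      # (batch, seq_len, dim)
    if pooling_method == "cls":
        emb = hidden[:, 0]                             # [CLS] token embedding
    else:                                              # mean pooling over non-pad tokens
        mask = batch["attention_mask"].unsqueeze(-1).float()
        emb = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
    return torch.nn.functional.normalize(emb, p=2, dim=1)

print(encode(["금리 인상이 채권 가격에 미치는 영향"], add_instruction=True).shape)
```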
- For comparison with FinMTEB, only the datasets that directly correspond to FinMTEB (1:1 mapping) will be evaluated first.
- In the future, unique sub-tasks built with proprietary Korean data will also be added to the evaluation code.
- Each task evaluation includes the sub-tasks listed below.
- The task type can be set using the `task_type` argument.