This repository contains code for training deep Gaussian processes (GPs) for VaR prediction conditioned on LLM embeddings, e.g., LLaMa-3 finetuned with LoRA/QLoRA. Finetuning data is retrieved via the newsdata API. Combining finetuned LLMs with deep GPs is a good choice when:
(1) There is a reasonably large and up-to-date corpus for training and for generating RAG embeddings;
(2) The number of assets under consideration is relatively small, since exact GP inference scales as $\mathcal{O}(n^3)$ in the number of training points $n$;
(3) Robust uncertainty estimates are crucial to the success of the application (e.g., risk estimation with VaR).
Value at Risk (VaR) is a statistical measure that estimates the maximum potential loss an investment portfolio might experience within a defined timeframe and at a given confidence level, assuming stationary market conditions. VaR can be optimized for a portfolio with weights $w \in \mathbb{R}^d$ and assets $r_1, \dots, r_d$ represented as deep Gaussian processes:

$$r_i(x) \sim \mathcal{GP}\bigl(0,\; k_i(x, x')\bigr),$$
where the kernel is parameterized by a neural network $g_\theta$ [1],

$$k(x, x') = k_{\text{base}}\bigl(g_\theta(x),\; g_\theta(x')\bigr),$$

and the base kernel $k_{\text{base}}$ is, e.g., an RBF kernel:

$$k_{\text{base}}(z, z') = \sigma_f^2 \exp\!\left(-\frac{\lVert z - z' \rVert^2}{2\ell^2}\right).$$
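To make the deep-kernel construction concrete, here is a minimal NumPy sketch; the two-layer `feature_map` standing in for $g_\theta$ and all parameter shapes are hypothetical toy choices, not the network used in this repository.

```python
import numpy as np

def feature_map(X, W1, b1, W2, b2):
    """Toy two-layer network g_theta mapping inputs to kernel features."""
    H = np.tanh(X @ W1 + b1)
    return np.tanh(H @ W2 + b2)

def rbf_kernel(Z1, Z2, lengthscale=1.0, variance=1.0):
    """RBF base kernel evaluated on the learned features."""
    sq = ((Z1[:, None, :] - Z2[None, :, :]) ** 2).sum(-1)
    return variance * np.exp(-0.5 * sq / lengthscale**2)

def deep_kernel(X1, X2, params):
    """Deep kernel: RBF applied to neural-network features of the inputs."""
    Z1 = feature_map(X1, *params)
    Z2 = feature_map(X2, *params)
    return rbf_kernel(Z1, Z2)

rng = np.random.default_rng(0)
params = (rng.normal(size=(4, 8)), np.zeros(8),
          rng.normal(size=(8, 3)), np.zeros(3))
X = rng.normal(size=(5, 4))
K = deep_kernel(X, X, params)
print(K.shape)  # (5, 5)
```

In practice the network weights are trained jointly with the kernel hyperparameters by maximizing the GP marginal likelihood.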
Since GPs produce probabilistic outputs, the portfolio return $r_p = w^\top r$ has a Gaussian posterior with mean $\mu_p$ and variance $\sigma_p^2$, so we can evaluate the VaR of a trained GP at a user-specified risk level $\alpha \in (0, 1)$:

$$\mathrm{VaR}_\alpha = -\bigl(\mu_p + \sigma_p\, \Phi^{-1}(1 - \alpha)\bigr),$$

where $\Phi^{-1}$ is the inverse CDF of the standard normal distribution.
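As a concrete illustration, the closed-form VaR of a Gaussian posterior can be computed as follows; `gaussian_var` and the portfolio numbers are hypothetical, and the per-asset posteriors are assumed independent here for simplicity (the trained deep GP would supply the actual posterior moments).

```python
import numpy as np
from scipy.stats import norm

def gaussian_var(mu_p, sigma_p, alpha=0.95):
    """Parametric VaR of a Gaussian portfolio-return posterior.

    mu_p, sigma_p: posterior mean and std of the portfolio return
    alpha: confidence level, e.g. 0.95
    """
    return -(mu_p + sigma_p * norm.ppf(1.0 - alpha))

# Illustrative portfolio posterior (assumed independent assets).
w = np.array([0.6, 0.4])          # portfolio weights
mu = np.array([0.01, 0.02])       # per-asset posterior means
cov = np.diag([0.03, 0.05]) ** 2  # per-asset posterior covariance
mu_p = w @ mu
sigma_p = np.sqrt(w @ cov @ w)
print(gaussian_var(mu_p, sigma_p, alpha=0.95))
```

Note the sign convention: VaR is reported as a positive loss, so a higher posterior mean return lowers it while higher posterior variance raises it.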
- Install the requirements:

  ```shell
  pip install -r requirements.txt
  ```
- Collect text from the newsdata API by running:

  ```shell
  python grab_news_text.py
  ```

  The retrieved text can be filtered by country, category, or language via the `params` argument.
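For reference, a sketch of the kind of filtered query `grab_news_text.py` issues; the endpoint and parameter names follow the public newsdata.io API documentation, `build_params` is a hypothetical helper, and `NEWSDATA_API_KEY` is a placeholder for your own key.

```python
# Endpoint per the public newsdata.io API docs (an assumption about
# what grab_news_text.py calls under the hood).
API_URL = "https://newsdata.io/api/1/news"

def build_params(api_key, country=None, category=None, language=None):
    """Assemble the query parameters used to filter retrieved articles."""
    params = {"apikey": api_key}
    if country:
        params["country"] = country
    if category:
        params["category"] = category
    if language:
        params["language"] = language
    return params

params = build_params("NEWSDATA_API_KEY", country="us",
                      category="business", language="en")
# Fetch with, e.g., requests.get(API_URL, params=params) and a real key.
print(params)
```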
- Preprocess and filter the raw text data:

  ```shell
  python fineweb-2-pipeline.py --dataset="path/to/your/dataset"
  ```
- Finetune with torchtune:

  ```shell
  tune run lora_finetune_single_device --config ./8B_lora_newstext.yaml
  ```
- Train the deep GP:

  ```shell
  python dgp.py
  ```
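The predictive algebra behind the final step can be sketched with a minimal single-layer exact GP in NumPy; `gp_posterior` and `rbf` are illustrative stand-ins, not the actual implementation in `dgp.py`, which stacks such layers into a deep GP with the learned kernel.

```python
import numpy as np

def rbf(A, B, ls=0.2):
    """Plain RBF kernel between two sets of inputs."""
    sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq / ls**2)

def gp_posterior(X_train, y_train, X_test, kernel, noise=1e-2):
    """Exact GP posterior mean and variance at the test inputs."""
    K = kernel(X_train, X_train) + noise * np.eye(len(X_train))
    Ks = kernel(X_train, X_test)
    Kss = kernel(X_test, X_test)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = Ks.T @ alpha
    v = np.linalg.solve(L, Ks)
    var = np.diag(Kss) - (v ** 2).sum(0)
    return mean, var

# Fit a noisy sine and query two held-out points.
X = np.linspace(0.0, 1.0, 20)[:, None]
y = np.sin(2 * np.pi * X[:, 0])
Xs = np.array([[0.25], [0.75]])
mean, var = gp_posterior(X, y, Xs, rbf)
print(mean, var)
```

The posterior variance `var` is exactly the uncertainty estimate that feeds the VaR computation described above.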