According to our submission records, the performance of the model is as follows:
The code runs on Python 3.7 and requires the following packages:
pandas
numpy
matplotlib
sklearn
talib
xgboost
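For convenience, the dependency list above can be captured in a requirements file. This is an illustrative sketch, not shipped with the submission: versions are unpinned, and note that `sklearn` is installed under the name `scikit-learn`, while the `talib` Python wrapper (pip package `TA-Lib`) additionally needs the TA-Lib C library installed on the system first.

```
# requirements.txt (illustrative; versions not pinned in the original)
pandas
numpy
matplotlib
scikit-learn   # imported as sklearn
TA-Lib         # Python wrapper for talib; requires the TA-Lib C library
xgboost
```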
The code consists of 4 Python scripts; running main.py directly produces the test results, which are saved to the file "result_xgb_rf.csv".
In our codes, we train three prediction models for three time-horizons: 1-day, 20-day, and 60-day.
The 1-day prediction models are XGBoost models with different hyper-parameters for each metal.
The 20-day prediction models are either XGBoost or Random Forest, depending on the metal.
The 60-day prediction models are Random Forest models with different features and hyper-parameters.
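The horizon-to-model mapping above can be summarized in a small configuration table. The dictionary below is only a sketch of that mapping; the actual per-metal hyper-parameters live inside the scripts and are not reproduced here, so the field names are placeholders.

```python
# Illustrative mapping of prediction horizon (in days) to model family,
# mirroring the description above. Hyper-parameter details are placeholders.
HORIZON_MODELS = {
    1:  {"model": "XGBoost", "note": "different hyper-parameters per metal"},
    20: {"model": "XGBoost or RandomForest", "note": "chosen per metal"},
    60: {"model": "RandomForest", "note": "different features and hyper-parameters"},
}

def model_family(horizon_days):
    """Return the model family used for a given prediction horizon."""
    return HORIZON_MODELS[horizon_days]["model"]
```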
The major novelty of our method is making use of the predicted labels. For each time-horizon, we:
- First, extract useful but not leaky features from the competition dataset;
- Second, train a Random Forest on the training data, denoted $\sum_{t=\mathrm{window\_start}}^{\mathrm{window\_end}} \langle \mathrm{features}_t, \mathrm{label}_t \rangle$;
- Third, predict the label for validation day $T$, then the label for day $T+1$, and so on up to day $T+V$;
- Fourth, regard the predicted labels as real labels and retrain a Random Forest on the shifted training data $\sum_{t=\mathrm{window\_start}+V}^{\mathrm{window\_end}+V} \langle \mathrm{features}_t, \mathrm{label}_t \rangle$;
- Fifth, repeat the third and fourth steps until the last day of the validation data.
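The rolling pseudo-label loop above can be sketched in a few lines of Python. This is a minimal stand-alone illustration, not the submission code: a trivial mean-of-labels "model" stands in for the Random Forest so the sketch runs without scikit-learn, and the `window` and `V` values are arbitrary.

```python
# Sketch of the rolling pseudo-label retraining loop (steps 2-5).
# A mean-of-window-labels model stands in for the Random Forest.

def train(pairs):
    """'Fit' a model on <feature, label> pairs: here, just the mean label."""
    labels = [y for _, y in pairs]
    return sum(labels) / len(labels)

def rolling_pseudo_label(features, labels, window, V):
    """features covers training *and* validation days; labels covers only
    the training days. Returns predictions for every validation day."""
    data = list(zip(features[:len(labels)], labels))
    preds = []
    t = len(labels)                       # first validation day
    while t < len(features):
        model = train(data[-window:])     # steps 2/4: fit on current window
        block = features[t:t + V]
        block_preds = [model for _ in block]   # step 3: predict next V days
        data.extend(zip(block, block_preds))   # step 4: pseudo-labels join the window
        preds.extend(block_preds)
        t += V                            # step 5: slide forward by V days
    return preds
```

In the actual scripts, `train` fits a Random Forest on the window and `block_preds` comes from its `predict` call; the control flow is the same.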
On our computer (Intel(R) Core(TM) i7-7600U [email protected]), the full run takes about 2 minutes.
Please feel free to contact us if you have any trouble reproducing the results.
Contact Email: [email protected]