- Assignment
- Data Exploration
- Feature Engineering
- Classical Machine Learning Algorithms
- Deep Learning Algorithm
- Conclusion
The task is to predict the day price direction of Amazon.com, Inc. (AMZN). The goal is a binary classification of whether the next day's closing price will be higher than the opening price. The dataset spans from 1997 to 2020, split into training (1997-2016), validation (2016-2018), and testing (2018-2020) periods. Dataset files: AMZN_train.csv, AMZN_val.csv, and AMZN_test.csv. The stock market is very complex and highly volatile. In order to be profitable, we do not need to predict the correct price, but rather, the price direction: whether it will be higher or lower than the price that is today. If we predict it to be higher, we might as well buy some stocks, else, we should probably sell. Therefore, the target would be a binary classification of whether the next-day closing price will be higher than the opening price.
In our initial exploration, we load the datasets and analyze the data attributes. We plot stock prices over time and explore trends.
...
The gradient boosting classifier yielded the best AUC score on the validation set. The next section evaluates its performance on the test set and includes a feature importance analysis.