- Original file of the MCM problem Y
- A Chinese translation of the original file, produced with the DeepL translator
- The analysis of problem Y by George Liew
The Python programs, written entirely by George, that crawl sailboat data, together with the raw data before cleaning. The .ipynb files are used mainly for debugging, checking logic, and inspecting intermediate results. Once the code works in a notebook, it is organized into functions and run from .py files. The basic method is the same for the two cases; the main difference lies in parsing the different websites.
All the raw data in its original form. The sailboat-related data was obtained through Python programs, while the rest was collected manually from the Internet. I am mainly responsible for crawling information about the physical traits of sailboats from related websites.
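The notebook-to-script workflow described above can be sketched as follows. This is a minimal illustration, not the repository's actual code: the names `SpecTableParser`, `parse_specs`, `fetch_html`, and `crawl` are hypothetical, and the assumption that specifications appear as two-column table rows is made only for the example. Only the parsing step would differ between websites.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class SpecTableParser(HTMLParser):
    """Collect (label, value) pairs from two-column table rows.

    The two-column layout is an assumption for this sketch; a real site
    needs its own parsing rules.
    """

    def __init__(self):
        super().__init__()
        self._in_cell = False
        self._row = []
        self.specs = {}

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []          # start a fresh row
        elif tag in ("td", "th"):
            self._in_cell = True
            self._row.append("")    # start a fresh cell

    def handle_data(self, data):
        if self._in_cell and self._row:
            self._row[-1] += data   # accumulate text inside the current cell

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and len(self._row) == 2:
            label, value = (cell.strip() for cell in self._row)
            if label:
                self.specs[label] = value


def parse_specs(html):
    """Site-specific step: turn one page of HTML into a dict of specs."""
    parser = SpecTableParser()
    parser.feed(html)
    parser.close()
    return parser.specs


def fetch_html(url, timeout=10):
    """Shared step: download one page and return its text."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(urls, fetch=fetch_html, parse=parse_specs):
    """Shared driver: fetch and parse every URL, one dict per page."""
    return [parse(fetch(url)) for url in urls]
```

Passing `fetch` and `parse` as arguments keeps the driver identical for both sites and makes it easy to test the parser in a notebook on saved HTML before running the full crawl from a .py file.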
Crawling YachtWorld.com for more training samples; the site can provide information on over 8,000 used sailboats.
Crawling [SailboatData.com](https://sailboatdata.com/?paginate=25&page=1) for more sailboat specifications such as length, beam, and displacement. We can gather features for over 1,500 types of sailboat from the site. In practice, however, it is better to crawl the relevant information from the site's search results pages, using each sample's "Make", "Variant", and "Year" as keywords. This is the approach taken in the latest version of the code.
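As a rough sketch of the two ways of addressing the site mentioned above: the `paginate` and `page` query parameters are taken from the listing URL linked here, while the function names `listing_page_url` and `search_keywords` are hypothetical. How the keyword string is submitted to the site's search is not shown, since that depends on the site and is not described here.

```python
from urllib.parse import urlencode

BASE_URL = "https://sailboatdata.com/"


def listing_page_url(page, per_page=25):
    """Build the URL of one page of the full listing."""
    return BASE_URL + "?" + urlencode({"paginate": per_page, "page": page})


def search_keywords(make, variant, year=None):
    """Join a sample's Make, Variant, and Year into one keyword string.

    Missing (None) or blank fields are skipped, and repeated whitespace
    inside a field is collapsed to a single space.
    """
    fields = [make, variant, year]
    cleaned = [" ".join(str(f).split()) for f in fields if f is not None]
    return " ".join(c for c in cleaned if c)
```

For example, `search_keywords("Beneteau", " Oceanis  40 ", 2008)` yields `"Beneteau Oceanis 40 2008"`, which can then be used as the search term for that sample.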
- 2332034.pdf is the final version of the thesis we uploaded for problem Y.
- Notes in Chinese from the teammate responsible for testing how well the algorithm fits our training dataset.
- Other related files.
The pictures we used in our thesis.