This is the custom SoFIFA scraper built for our project of Big Data Computing 2021-22 at Sapienza University of Rome.
By Daniele Solombrino and Davide Quaranta.
scrapy 2.11.1
cd src
scrapy crawl sofifa
To save the output as CSV:
scrapy crawl sofifa -o out.csv:csv -a
To set a specific FIFA version to scrape:
scrapy crawl sofifa -o out.csv:csv -a fifa_version=230002
Finding the FIFA version key is simple: just go to the players page on SoFIFA and select an year/date, then note in the ?r=x
value in the URL. For example, ?r=220019
refers to Dec 9 2021 (FIFA 22).
Note that across different FIFA versions, players' fields may be different, hence it may be needed to manually change the fields to scrape.