Skip to content

sofifa.com scraper, built to scrape data needed for our project of Big Data Computing 2021-22 at Sapienza University of Rome

License

Notifications You must be signed in to change notification settings

m-kloeckner/sofifa-scraper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SoFIFA Scraper

This is the custom SoFIFA scraper built for our project of Big Data Computing 2021-22 at Sapienza University of Rome.

By Daniele Solombrino and Davide Quaranta.

Requirements

scrapy 2.11.1

Usage

cd src
scrapy crawl sofifa

To save the output as CSV:

scrapy crawl sofifa -o out.csv:csv -a

To set a specific FIFA version to scrape:

scrapy crawl sofifa -o out.csv:csv -a fifa_version=230002

Finding the FIFA version to scrape

Finding the FIFA version key is simple: just go to the players page on SoFIFA and select an year/date, then note in the ?r=x value in the URL. For example, ?r=220019 refers to Dec 9 2021 (FIFA 22).

Note that across different FIFA versions, players' fields may be different, hence it may be needed to manually change the fields to scrape.

About

sofifa.com scraper, built to scrape data needed for our project of Big Data Computing 2021-22 at Sapienza University of Rome

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 75.5%
  • Python 24.5%