Skip to content

Latest commit

 

History

History
82 lines (57 loc) · 2.17 KB

README.md

File metadata and controls

82 lines (57 loc) · 2.17 KB

BechdelAI

Measurement and automation of the Bechdel test, female (under)representation and representation inequalities in cinema and audiovisual media

This repo is under active development, the documentation may not be up to date

Structure of the repo

- bechdelai         ----- Python code as a library
- data              ----- Reusable datasets
- notebooks         ----- Experiments
    - video         ----- Experiments on video data
    - audio         ----- Experiments on audio data
    - posters       ----- Experiments on posters
    - nlp           ----- Experiments on nlp

Documentation

Get movies from Allociné / TMDB

Install

pip install -e .

Run with cli

$ scrap-movies --help
Usage: scrap-movies [OPTIONS]

  CLI for scraping movies from Allociné and TMDB.

  It follows these steps:

  1. Extract N movies from Allociné

  2. Find corresponding TMDB id for each movie

  3. Get data for each movie with TMDB API: metadata, crew and cast

  4. Save csv results

Options:
  -n, --movies-number TEXT  Number of movies wanted.  [required]
  -p, --path TEXT           Path where to save csv results.  [required]
  --sort-by TEXT            How to sort movies in Allociné pages. Available
                            choices: ['popularity', 'alphabetic',
                            'press_note', 'public_note']
  --genre TEXT              Filter for genre.
  --year TEXT               Filter for year.
  --country TEXT            Filter for country.
  -v, --verbose BOOLEAN     The person to greet.
  --help                    Show this message and exit.

Examples

# Scrap first current most popular movie in Allociné
scrap-movies -n 1 -p .
# Scrap top 15 most popular movies in Allociné in 2018 from France
scrap-movies -n 15 -p . --year 2018 --country France
# Scrap top 50 with the best public notes
scrap-movies -n 50 -p . --sort-by public_note