Scraper for Black Lives Matter posts. Uses: https://github.com/drawrowfly/tiktok-scraper
- Docker Desktop
- If using Linux make sure
docker
anddocker-compose
are available at command line
- Edit the
HASHTAGS
array inscrape.sh
to set the hashtags to scrape - Edit
NUM_POSTS
inscrape.sh
to specify number of posts to scrape. This is the number scraped per hashtag, not in total. - Run:
docker-compose up --build
- Combining CSVs and removing duplicate rows:
awk '(NR == 1) || (FNR > 1)' *.csv | awk '!v[$1]++' > combined.csv
Note: Need to fix this